ºÝºÝߣ

ºÝºÝߣShare a Scribd company logo
Next	Generation	Intelligent	APM:	
Pain	Points,	Trends	and	Solutions
Yuchen	Zhao,	Principal	Data	Scientist,	AppDynamics
Application		
Support
Application		
Development
IT		
Operations
DevOps		
Release
Application		
Owner
? an	APM	Company	
? a	Leader	in	Gartner's	Magic	Quadrant	
? highest	scores	in	every	APM	Use	Case	
(Gartner,	2016)
Why	APM?
Software	Is	Eating	The	World	
¡ª	Marc	Andreessen
Next Generation Intelligent APM: Pain Points, Trends and Solutions
Next Generation Intelligent APM: Pain Points, Trends and Solutions
It¡¯s	critical	to	make	sure	
the	apps	
are	working	properly
8/8/2016	Delta	Airlines	System	Outage
? Flights	canceled:	1,000+	
? Passengers	Affected:	hundreds	of	thousands	
? Days	to	recover:	4+	
? Loss:	millions	of	$$$
4	Pain	Points	
of	APM
Monitoring	should	not	be	too	hard,	right?
+
Well,	it	can	be	more	complex¡­
Or,	really	really	complex
recently,	micro-services	are	widely	adopted
Login
Flight Status
Search Flight
Purchase
Mobile
SOA
NOSQL
Cloud
Agile
Web
IOT
Next Generation Intelligent APM: Pain Points, Trends and Solutions
Pain	Point	1:	
App	complexity	increases	significantly
A	lot	of	things	can	go	wrong.	
Any	component	of	the	system	can	be	unhappy.
Countless	components/modules	need	to	be	monitored
End	User Backend
? mobile	
? browser	
? web	server	
? middleware	
? database
10+	tools!
? storage	
? cloud	
? network	
? log	
? IoT
A	typical	DevOps	Scenario
Pain	Point	2:	
APM	requires	investigation	of		
multiple	sources		
across	multiple	teams
Dashboards	and	alerts	are	our	friends
Let¡¯s	consider	a	simplified	made-up	example
Suppose	we	have		
10	dashboards	and	10	metrics		
for	each	server.
How	many	things	do	we	need	to	monitor	for	this?
#	of	servers:	3	
#	of	dashboards:	30	
#	of	alerts	to	set:	30
but,	how	about	this?
#	of	servers:	100+	
#	of	dashboards:	???	
#	of	alerts	to	set:	???
In	a	real	production	system¡­
How	to	watch	thousands	of	dashboards?
How	to	manually	set	the	alert	thresholds		
for	millions	of	metrics?
Even	all	alerts	are	set,	what	if		
your	business	grows	quickly	and		
previous	thresholds	no	longer	apply?
Next Generation Intelligent APM: Pain Points, Trends and Solutions
You	can	NOT	observe	everything		
if	there	are	too	many!
Pain	Point	3:	
data	volume	is	too	large	to	see
APM	data,	dashboards,	alerts,	etc.
APM	data	is	semi-structured
We	have	metrics	/	time	series:	
? CPU	usage	
? Java	JVM	metrics	
? network	latency	
? daily	active	user	
? hourly	sales	amount	
? ¡­
We	have	categorical	data:	
? node	name	
? service	name	
? region	ID	
? user	IP	
? payment	method	
? ¡­
We	have	hierarchical	trees:	
? modules	¡ª>	nodes	¡ª>	tiers	
? aggregated	stats	¡ª>	individual	level	stats	
? ¡­
We	have	graphs:
We	have	unstructured	data:	
? logs	
? error	messages	
? stack	traces	
? ¡­
? Different	data	types	require	various	
methods	to	retrieve,	discover	and	analyze.	
? It¡¯s	difficult	to	find	single	generic	approach	
to	handle	all.
Pain	Point	4:	
APM	data	is	heterogeneous	in	nature
Considering	all	these	pain	points,	
how	can	we	address	them?
Next Generation Intelligent APM: Pain Points, Trends and Solutions
See Act Know
APM	Work	Flow
See Act Know
Monitoring	today:	too	many	tools!
Unified	Monitoring	
one	scalable	data	platform,	end-to-end	visibility
Application Performance
Management
Database ?
Monitoring
Synthetic ?
Monitoring
Browser Real-User?
Monitoring
Mobile Real-User?
Monitoring
Server?
Monitoring
Flexible ?
Deployment
SaaS
On-Prem
Unified Monitoring
Auto	Discovery	
automatic	distributed	flow	discovery	for	micro-services
Application Performance
Management
Database ?
Monitoring
Synthetic ?
Monitoring
Browser Real-User?
Monitoring
Mobile Real-User?
Monitoring
Server?
Monitoring
Flexible ?
Deployment
SaaS
On-Prem
Database
API
Server
ApplicationWeb?
Server
Business?
Transaction
Browser
Mobile
Unified Monitoring
You	have	to	capture	everything,	
only	then	you¡¯ll	be	able	to	see.
See Act Know
With	all	extracted	data,	the	future	APM		
must	have	to	be	intelligent.
A	data-driven	machine	learning	powered	
analytical	platform
use	machine	learning	to	extract		
useful	info	and	insights	
from	massive	APM	data
See Act Know
Scenario	1:	Easy	Log	Monitoring
Deriving	insights	from	logs:	unstructured	¡ú	structured
Writing	regex:	
? requires	domain	knowledge	
? tedious	
? error-prone
Automate:	unstructured	¡ú	structured
machine		
learning
Scenario	2:	Smart	Alerting
Traditionally,	we	need	to	create	health	rules	and		
manually	set	alerting	thresholds.
Use	machine	learning:	
? Automatic	baseline	
? Dynamics	threshold	
? Seasonality	analysis	
Techniques:	
? auto	regression	
? PCA	
? wavelet	decomposition
See Act Know
Scenario	3:	Fast	Root	Cause	Analysis
Next Generation Intelligent APM: Pain Points, Trends and Solutions
Automatically	recommend	most	relevant	signals!
Scenario	4:	Data	Summarization
Next Generation Intelligent APM: Pain Points, Trends and Solutions
See Act Know
Intelligent	APM
APM Applications:
? relevant signal recommendation
? time series modeling
? metric prediction
? auto regex inference
? unstructured data clustering
? anomaly detection
? graph optimization
? ...
ML frequently applied to APM:
? clustering
? neural net / deep learning
? classification
? ranking
? recommendation
? graph/network analysis
? prediction
? ...
Reactive Real-time Proactive
fast	post-mortem	
root	cause	analysis	
low	MTTI
smart	alerting	
real-time	war	room	
automation
predictive	analytics	
anomaly	detection	
recommendation
Recap
APM	pain	points:	
? App	complexity	is	exploding	
? Investigation	of	multiple	sources	
? Data	is	too	big	
? Data	is	heterogeneous
The	next	generation	APM:	
? Unified	monitoring	
? Data-driven	intelligent	APM					
(see	¡ú	act	¡ú	know)	
? reactive	¡ú	real-time	¡ú	proactive
Thank	You

More Related Content

Next Generation Intelligent APM: Pain Points, Trends and Solutions