際際滷

際際滷Share a Scribd company logo
Monitoring Your Infrastructure the open source way
Kris Buytaert  Senior Linux and Open Source Consultant @inuits.be
 Infrastructure Architect
Linux since 0.98
OpenMosix, openQRM, ...
Early Adopter (Xen, MySQL Cluster)
Automating Large Scale Deployment , High Availability
Surviving the 10 th  floor test
http://www.krisbuytaert.be/blog/
http://www.virtualization.com/
Tom De Cooman Linux and Open Source Consultant @inuits.be Tom De Cooman has been a Linux user for over 8 years, and active in system's administration for about 4 years.  He is a  general Unix system administrator with focus/strong interest in monitoring, mail and virtualisation.  Previously he has been working mostly for System Integrators. He also has a lot of experience with SUN hardware  and software.
Do you know what your children do  at 5 am  in the morning ?  Are they asleep
Or Crashing at a party ?
Why are there cops at your front door ?
Did something happen to them ?
How long have they been gone already ?
Do you know what your servers are doing at 5 am  in the morning ?  You can't afford to be down
You can't afford to be slow
Systems grow and scale beyond manual/human capacity
Plan for growth
Good admins know how their systems behave
And what's abnormal systems behaviour
Monitoring  Check status Define Limits
Running ? How to check ? Script
Status File
Agent
SNMP
Active vs Passive Checks Active :  checks performed by the monitoring tool itself Http , ping , ... Passive :  checks performed and submitted by an external application snmptrap , syslog ,
Agent(less) Agent Based Impact on Measurement
More detailed information
Often Big performance penalty  Agent Less Non intrusive
Less detail SNMP
Alerts / Notifications Send a Warning Signal Email, SMS , xmpp , other Choose based on situation Based on time
Based on service
Based on state of system Escalation
SLA
Reporting Up /  down
Since
Graphical Overview
Summary
Lies, damn lies and statistics
Trending Chart the data
A Visionary approach
Find Anomalies
Plan for Growth
What do you want from a tool ? Easy to configure
Autodetection
Supporting Gui
Automatable
Consistent
SNMP Integration
Trending Included ? Agentless
Templates
Non Intrusive
Plenty of notification
Active community
Hackable
The Contenders Hyperic HQ
Zabbix
Zenoss
OpenNMS
Nagios
GroundWorks
Hobbit
...
Initial Experience First Phase
Setup Different Tools/Platforms
Initial Feeling
Installation Experience
Nagios The Standard
A zillion tools based on it
Awkward config for the newbie
Very configurable
Very Pluggable
Great ecosystem
Often integrated with Cacti
GroundWorks Claims to be Nagios ++
Be prepared to be spammed
Integrates 70+ tools
Worst Installation experience ever (twice) Installation failed multiple times

More Related Content

Monitoring shootout loadays

Editor's Notes

  1. An item has all the data to define how a check is to be performed on the host. ( important ones: a name for the item, a check type: info about what data we want and how to get it, a check interval). The result is that a 'key' is stored for a certain host. (eg FTP-key being 0 or 1, off or on) In Zabbix, we speak of several 'Check types' the most important ones being 'simple checks' and 'external checks'.
  2. Zabbix sender: command line util used to send perfdata to zabbix item: ftp on trigger: ftp down action: if ftpdown then mail system.cpu.load system.proc.mun Simple checks Agent SNMP Other Scripts Internal checks : used to monitor the inernals of zabbix Aggregated checks : direct datbase queries (calculate avg cpuload of a group)
  3. Applications: group that can contain all items related to smth mysql