The document provides an overview of Nagios, an open source network monitoring software. It discusses storage management challenges, what Nagios is, and provides tutorial topics on how to start a Nagios server, write storage service monitoring code, monitor local and remote storage, and handle events. The tutorial covers installing and configuring Nagios, defining hosts and services, writing check commands, installing NRPE for remote monitoring, and using event handlers to automate responses. Additional Nagios resources are also listed.
2. • Storage management challenges
• What is Nagios
• Tutorial topics:
  o How to start a Nagios server
  o Writing storage service monitoring code
  o Monitoring local & remote storage
  o Event handling
pgaref@ics.forth.gr
4. • A key measurement tool for actively monitoring the availability of devices and services.
• The most widely used open-source network monitoring software.
• Can support monitoring and management of thousands of devices and services.
17. o Remote host IP is: 139.91.70.76
o Your IP has to be added to nrpe.cfg (the allowed_hosts setting) on the remote host before running!
18. • Follow the instructions in "Enable NRPE Server-Ubuntu.txt" to install the NRPE server.
• You can skip the command and service definitions.
• You can check your connection by running the following command with the IP address of the remote box you want to monitor. You should get the response "NRPE v2.8.1" if all is working.
• Command: /usr/lib/nagios/plugins/check_nrpe -H 139.91.70.76
19. • We have an NFS server running on the remote host. A plugin for monitoring NFS, "check_nfsmount.pl", is included.
• We will modify the NRPE configuration on the server side so that check_nfs can be run remotely.
• Finally, test the command:
  o /usr/lib/nagios/plugins/check_nrpe -H 139.91.70.76 -c check_nfs
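When testing a check by hand, it helps to see the state Nagios would derive from the plugin's exit status. The sketch below relies on the standard plugin convention (0 = OK, 1 = WARNING, 2 = CRITICAL, 3 = UNKNOWN). The function names state_name and run_check are hypothetical helpers, not part of Nagios or NRPE.

```shell
#!/bin/sh
# Map a plugin exit status to the state name used by Nagios.
# Convention: 0=OK, 1=WARNING, 2=CRITICAL, anything else is treated as UNKNOWN.
state_name() {
    case "$1" in
        0) echo "OK" ;;
        1) echo "WARNING" ;;
        2) echo "CRITICAL" ;;
        *) echo "UNKNOWN" ;;
    esac
}

# Hypothetical helper: run any check command, then print its state and output.
run_check() {
    output=$("$@" 2>&1)   # capture plugin output (stdout and stderr)
    status=$?             # exit status of the check command itself
    echo "$(state_name "$status"): $output"
    return "$status"
}
```

For example, `run_check /usr/lib/nagios/plugins/check_nrpe -H 139.91.70.76 -c check_nfs` would print the state name in front of whatever the remote check reports.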
20. • Nagios can attempt to rectify a fault by running a script.
• We can use event handlers to take action when something goes wrong.
• Growing file example:
  o Print an error message
  o Compress the file
  o Truncate the file
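The three steps of the growing file example could be sketched as a small shell function. This is only an illustration under assumptions: the name handle_growing_file and the timestamped archive name are hypothetical, and gzip is assumed to be installed.

```shell
#!/bin/sh
# Hypothetical helper: archive and empty a file that has grown too large.
# Steps: print an error message, write a compressed copy, truncate the original.
handle_growing_file() {
    file=$1
    if [ ! -f "$file" ]; then
        echo "ERROR: $file not found" >&2
        return 1
    fi
    echo "ERROR: $file exceeded its size threshold, compressing and truncating"
    # -c writes the compressed data to stdout, so the original stays in place.
    gzip -c "$file" > "$file.$(date +%Y%m%d%H%M%S).gz" || return 1
    # Truncate in place so processes holding the file open keep a valid handle.
    : > "$file"
}
```

Anything written to the file between the compress and truncate steps is lost, so a real handler may need to pause or signal the writing process first.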
21. • We want to react to above-threshold growth of files.
• Copy myhandler.sh to libexec/eventhandlers
• Make sure the nagios user owns the script and can execute it!
• Add the following line to our service definition:
  o event_handler my_eventhandler!$SERVICESTATE$ $STATETYPE$ $SERVICEATTEMPT$
• Finally, add the command:
define command{
    command_name my_eventhandler
    command_line $USER1$/eventhandlers/myhandler.sh $ARG1$
}
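The handler script receives the service state, the state type and the attempt number as its arguments. A minimal sketch of how such a script might decide when to act is shown below. The function name decide_action, the returned action words, and the choice of acting on the third soft attempt are all assumptions for illustration, not taken from the original myhandler.sh.

```shell
#!/bin/sh
# Sketch of the decision logic in an event handler. Arguments arrive in the
# order given on the event_handler line:
#   $1 = $SERVICESTATE$   (OK, WARNING, UNKNOWN, CRITICAL)
#   $2 = $STATETYPE$      (SOFT or HARD)
#   $3 = $SERVICEATTEMPT$ (current check attempt number)
decide_action() {
    state=$1; state_type=$2; attempt=$3
    case "$state" in
        CRITICAL)
            case "$state_type" in
                SOFT)
                    # Assumption: act on the 3rd soft attempt, before the
                    # problem turns into a HARD state.
                    if [ "$attempt" -eq 3 ]; then echo "fix"; else echo "wait"; fi ;;
                HARD)
                    # Last chance: the problem is confirmed.
                    echo "fix" ;;
                *)
                    echo "none" ;;
            esac ;;
        *)
            # OK, WARNING and UNKNOWN need no corrective action in this sketch.
            echo "none" ;;
    esac
}
```

A real script would call something like `decide_action "$1" "$2" "$3"` and run the corrective step when the result is "fix".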
23. • Nagios is a very useful tool that saves administrators time, but it can appear very complex when you first look at it.
• My advice is:
  o Install it on your test node (though this may well end up as your master server)
  o Run a few check scripts by hand to get a feel for them
  o Set up a simple config file that runs a few checks on the local host
  o Install NRPE on the host, and NRPE and nagios-plugins on a remote host
  o Run check_nrpe by hand to get it working, then add a couple of simple checks on the remote host
  o Now add hosts and services until you run out, then write some more
24. • http://www.nagios.org Nagios web site
• http://sourceforge.net/projects/nagiosplug Nagios plugins site
• http://www.nagiosexchange.org Unofficial Nagios plugin site
• http://www.debianhelp.co.uk/nagios.htm A Debian tutorial on Nagios
• http://www.nagios.com/ Commercial Nagios support