Nagios FTW TriLUG 8/10/06 Presented by: Jason Faulkner Ian Kilgore.

Slides:



Advertisements
Similar presentations
How to monitor the $H!T out of Hadoop Developing a comprehensive open approach to monitoring hadoop clusters.
Advertisements

Nagios: An introduction and Brief Tutorial
Nagios on Tier1 farm Jonathan Wheeler RAL Tier1 Fabric Team 20 th June 2008.
Bangkok, Thailand An Introduction intERLab at AIT Network Management Workshop March – Bangkok, Thailand Hervey Allen & Phil Regnauld.
Network & System Monitoring with Nagios & Cacti Kevin Mueller.
Network Monitoring with Nagios and other tools Wednesday, 19 July 2006 Martin B. Smith
Network Monitoring with Nagios. Joint technical meeting #2 What is a network monitoring system? A combination of hardware and software used to administer.
1 Automating Monitoring with Puppet Chris Mague Moovweb May 23, 2012.
M. Bechtel, S. Blümel, A. Quignon1 Linux Network Server Group: Nagios Marc Bechtel Sebastian Blümel Alexandre Quignon.
Slide 1 of 9 Presenting 24x7 Scheduler The art of computer automation Press PageDown key or click to advance.
R. Lange, M. Giacchini: Monitoring a Control System Using Nagios Monitoring a Control System Using Nagios Ralph Lange, BESSY – Mauro Giacchini, LNL.
11 Distributed Monitoring and Cloud Scaling for Web Apps Fernando Hönig
Papeete, French Polynesia PacNOG 5 Papeete, French Polynesia 17 June 2009 Hervey Allen.
These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
Passive Monitoring with Nagios Jim Prins
Your university or experiment logo here Nagios: An introduction and Brief Tutorial Chris Brew SciTech/PPD.
Josh Riggs Utilizing Open Source Network Monitoring.
CENT 305 Information Systems Security Linux Introduction.
workshop eugene, oregon Nagios Network Design and Operations 24 July 2009
1. A key measurement tool for actively monitoring availability of devices and services. Possible the most used open source network monitoring software.
2010 These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
1 Apache. 2 Module - Apache ♦ Overview This module focuses on configuring and customizing Apache web server. Apache is a commonly used Hypertext Transfer.
10/13/2015 ©2006 Scott Miller, University of Victoria 1 Content Serving Static vs. Dynamic Content Web Servers Server Flow Control Rev. 2.0.
The Art and Zen of Managing Nagios with Puppet Michael Merideth - VictorOps
Introduction To Nagios A Linux-based Monitoring System.
11 Distributed Monitoring for Web Apps Fernando Hönig
Network Monitoring Manage your business without blowing your budget. Learn how the Calhoun ISD utilizes free “Open Source” tools for real-time monitoring.
Partner Logo German Cancio – WP4-install LCFG HOW-TO - n° 1 LCFGng configuration examples Updated 10/2002
Nagios Speaker: Yan-Shiang Wang Adviser: Quincy Wu Date: 2005/12/21.
Nagios The monitoring tool. Why ? Nagios is a powerful, modular network monitoring system that can be used to monitor many network services like smtp,
The HTTP is a standard that all Web browsers and Web servers must speak in order for the Web portion of the Internet to work.
NAGIOS 1. Introduction A key measurement tool for actively monitoring availability of devices and services. Possible the most used open source network.
2010 These materials are licensed under the Creative Commons Attribution-Noncommercial 3.0 Unported license (
1 Grid Monitoring with Nagios Aries Hung, Joanna Huang, Felix Lee, Min Tsai ASGC WLCG T2 Asia Workshop TIFR, Dec 2, 2006.
2010 NAGIOS APRICOT 2010 Kuala Lumpur, Malaysia.
Queensland University of Technology Nagios – an Open Source monitoring solution and it’s deployment at QUT.
ADMINISTRACIÓN DE SERVICIOS PARA LA WEB BRENDA GONZÁLEZ GÓMEZ
2008 Taipei, Taiwan An Introduction APRICOT 2008 Network Management Workshop February – Taipei, Taiwan Hervey Allen & Phil.
Nagios - introduction Dhruba Raj Bhandari ( CCNA ) p Additions by Phil Regnauld.
Monitoring Asterisk with Icinga2
CIS Host Manager Bryce Johnston CIS 597 May 8, 2009.
'08 Rabat An Introduction AfNOG 2008 Network Management Workshop June 1-2 – Rabat, Morocco Hervey Allen & Phil Regnauld.
Administering the SOWN Network David R Newman & Chris Malton.
Network Management Workshop March – Bangkok, Thailand
10 Minutes to Migrate Nagios to BMC TrueSight Operations Management
Introduction to Ansible
Advisor : Quincy Wu Speaker : Xang-Ting Date : 2010/06/08
Progress Apama Fundamentals
Essentials of UrbanCode Deploy v6.1 QQ147
Version Control with Subversion
Use of Nagios in Central European ROC
What is nagios? Version 2 8/ M.A.Newhall.
Process Creation Processes get created (and destroyed) all the time in a typical computer Some by explicit user command Some by invocation from other running.
Outline Introduction Characteristics of intrusion detection systems
NCAR-Developed Tools Bill Anderson and Marc Genty
The Linux Operating System
Campus Monitoring Service
Monitoring with Nagios
Nagios – Our Open Source Network Management Solution
INSTALLING AND SETTING UP APACHE2 IN A LINUX ENVIRONMENT
DHCP, DNS, Client Connection, Assignment 1 1.3
How to monitor the $H!T out of Hadoop
Nessus Vulnerability Scanning
Intro to Ethical Hacking
IS3440 Linux Security Unit 9 Linux System Logging and Monitoring
Intro to Ethical Hacking
Simple Socket Client Project 1.
Introduction to Ansible
Class Projects and Environment
SUSE Linux Enterprise Desktop Administration
Presentation transcript:

Nagios FTW TriLUG 8/10/06 Presented by: Jason Faulkner Ian Kilgore

What is Nagios? ● A network monitor for small to medium size networks ● Flexible/Featureful – Pluggable checks/notifications – Host/Service dependency support – Escalation framework – Web interface – Remote checks ● GPL software (yay! free!)

Basic Nagios Information ● (go to web interface)

Access Control ● The nagios web interface allows you to specify fine-grained access controls ● It uses the built-in Apache auth method. You can specify in cgi.cfg what users should have access to what. ● Say, for instance, you have a boss who wants to view status of all services, but you don't want him breaking things. Easy! Edit cgi.cfg to give him read access to just a few things.

Nagios 1.3 vs. Nagios 2.0 ● Bugfixes ● No more hostgroup escalations ● Passive host checks ● Service groups Conclusion: Nagios 2.0 is definitely an improvement over Nagios 1.3, but the basic syntax remains the same. We will be showing off Nagios 1.3 tonight, and consult your local man page for the subtle config differences.

Installing Nagios ● Distro packages available for most major Linux distributions – RHEL/CentOS – Fedora – Debian – Ubuntu – Gentoo ● What do you need to run nagios? – Web server (apache) – *nix server – (more required for plugins)

Basic Configuration ● Most distros organize configuration in a way that is difficult to scale. Feel free to use it, but be prepared to deal with hundred, maybe even thousand line configuration files. ● Is there a better way? ● YES! There is!

Super Secret Broadwick Nagios Configuration(TM) ● We separate config files into two basic groups: – Host/Service definitions – Everything else ● cfg_dir is your friend – Put all host/service definitions in /etc/nagios/hosts/ – Put other stuff (except cgi.cfg & resource.cfg) in /etc/nagios/conf.d/ ● In /etc/nagios/hosts there is one file for each host, which contains the relevant host and service defs

Why templates are good ● Nagios has a ton of individual options for a service/host – Check interval – Notification groups – Parents ● Use templates to avoid duplication

Basic Configuration Example Setup ● Download your own example config: – –

Host Template (in conf.d/templates.cfg) define host{ name critical-host notifications_enabled 1 register 0 check_command check-host-alive max_check_attempts 5 notification_interval 60 notification_options d,r }

Host Definition (in hosts/webserver.cfg) define host{ use critical-host host_name webserver alias Main Web Server address webserver.foo.com }

Service Template (in conf.d/templates.cfg) define service{ name critical-service notifications_enabled 1 max_check_attempts 4 normal_check_interval 5 retry_check_interval 1 contact_groups sysadmin notification_interval 60 notification_period 24x7 notification_options w,c,r register 0 }

Command Definition (in conf.d/checkcommands.cfg) define command{ command_name check_http command_line /usr/lib/nagios/plugins/check_http -H $HOSTADDRESS$ -S }

Service Definition (in hosts/webserver.cfg) define service { use critical-service host_name webserver service_description HTTP check_command check_http }

Remote Checks ● Check by SSH? – Unless you want to deploy ssh keys, and leave yourself wide open to attack. We dub it a Bad Idea(tm). ● statd? – We've never used it. ● NRPE – Pluggable – Secure

NRPE ● How does it work? – The nagios server connects to the NRPE daemon on the monitored host, and gives it a command. Commands are mapped to plugins in the NRPE configuration. ● Security – All NRPE transmissions use SSL, and require you to specify the hosts that are allowed to connect. – Do not enable arguments. These are substituted into the command line. (bad)

Basic NRPE setup ● On NRPE server (nrpe.cfg) – allowed_hosts= – command[check_users]=/usr/lib/nagios/plugins/check_users -w 5 -c 10 ● On Nagios host (hosts/webserver.cfg) – define service { use critical-service host_name webserver service_description USERS check_command check_nrpe_plain!check_users }

Advanced Configuration ● Dependencies – Because getting paged 100 times if the network goes out sucks ● Escalations – Because we don't always wake up when our pager goes off ● Custom plugins – For those of us with special needs

Advanced Configuration Scripts ● Services and Escalations setup on a service-by-service level – If you have a lot of services, this sucks ● Script it! ● Find the scripts at: – –

Service Dependency Script ● Makes all services on a host webserver depend on webserver's ping service ● Any host dependencies you have setup will carry over – host webserver depends on host router, therefore, services on webserver will depend on router's ping service ●

Service Dependency Script (cont) ● Usage is simple – Reads host and service info from STDIN – Outputs a properly formatted servicedeps.cfg to STDOUT – cat /etc/nagios/hosts/* |./servicedeps.pl > /etc/nagios/conf.d/servicedeps.cfg

Escalations Script ● Modify template to your liking ● Script plugs-in host and service names into template ● Downside: Requires you to actually answer pages so your boss doesn't get paged. ●

Escalations Script (cont) ● Usage is simple – Reads host and service info from STDIN – Outputs a properly formatted escalations.cfg to STDOUT – cat /etc/nagios/hosts/* |./escalations.pl > /etc/nagios/conf.d/escalations.cfg

Custom Nagios Plugins ● Custom nagios plugins are simply executable files on the nagios host. They have to output a short line of status info and an exit code – 0 = OK – 1 = Warning – 2 = Critical – >2 = Unknown ● Feel free to use our plugins as a template. The best way to write a plugin is to hack someone else's :) – –

Stuff we didn't cover ● Passive checks – Nagios will accept passive checks – We don't do this. ● Distributed monitoring – There are ways to make multiple nagios servers talk to each other – We don't do this, either.

Alternatives to Nagios? ● OpenNMS – Written in java ● Big Brother – Written in bash ● Angry Customers calling the support department – Written in blood ● Interns with Cellphones and a terminal – Written in India

Resources ● Scripts – – ● Example nagios configuration – – ● Custom Nagios plugins written by us – – ● This presentation – ● Nagios Documentation –

Questions?