Nagios – Our Open Source Network Management Solution


Presenter: Ling Zhang, LBLnet Services Group, Information Technologies and Services Division, LBNL

Contributors
- Nagios software design and development: Ethan Galstad (www.nagios.org)
- System integration, configuration, testing: Ling Zhang, Greg Bell, Harper Mann, Cedric Hui, Clark Wood, Mike Bennett
18 September 2018, ITSD/LBNL

Goals for this talk
- To explain:
  - LBLnet's point of view of a Network Management System (NMS)
  - the network monitoring problems we encountered
  - the design of our Nagios network monitoring system
- To discuss:
  - the benefits of the Nagios system
  - our future development goals

Our point of view of an NMS
- Proactive network management
- Alarm panel
  - Connectivity
  - Performance
- Fault isolation
- Trend analysis
- Capacity planning
- The notification
  - Precise
  - Fast

Background Information
Network monitoring tools we have tested and/or used before:
- Sun Net Manager
- Spectrum
- WhatsUp Gold
- Netmon
- SNMPc
- IPmonitor
- HP OpenView
- OpenNMS
- InCharge
- Home-grown scripts
- MRTG/RRDtool
- etc.

Background Information
Our fair share of problems with NMS:
- Notification storms: 65 notifications were received during a single router up/down event. The router has 20 active interfaces and 32 downstream monitored devices.
- False alarms
- Integration with existing systems (MRTG, trouble ticket system)
- Tech support: our longest outstanding ticket has been open 2 years and counting
- Budget

In Search of a Better NMS
- Accurate and efficient fault detection
- Good performance
- Extensible
- Can be integrated with our existing system
- Low maintenance
- Fits our budget

Features of Nagios
- Open source system that runs on most Unix systems
- Highly extensible
- Reliable dependency monitoring
- Excellent service monitoring capabilities
- Ability to schedule maintenance periods
- Flexible notification

Our Nagios Topology
[Diagram: LBLnet NMS]

Nagios Extensibility
- Plugins
- Event handlers
- External commands

Nagios Extensibility - Plugins
- Compiled executables or scripts (Perl, shell, etc.)
- Run by the Nagios process
- Check device or service status
- Example:

    define host {
        host_name       switch1
        address         1.2.3.4
        check_command   ping_switch
    }

    define service {
        host_name             switch1
        service_description   CPU Util
        check_command         get_cpu_util
    }
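The plugin contract can be sketched in a few lines. Nagios reads a plugin's exit code (0 OK, 1 WARNING, 2 CRITICAL, 3 UNKNOWN) and its first line of output, where text after `|` is treated as performance data. The sketch below is a hypothetical stand-in for the `get_cpu_util` command named on the slide: the thresholds, the function names, and the idea of passing the reading as an argument are all assumptions, and how the reading is actually obtained (for example via SNMP) is left out.

```python
# Standard Nagios plugin exit codes.
OK, WARNING, CRITICAL, UNKNOWN = 0, 1, 2, 3
LABELS = {OK: "OK", WARNING: "WARNING", CRITICAL: "CRITICAL", UNKNOWN: "UNKNOWN"}


def evaluate_cpu(cpu_percent, warn=80.0, crit=95.0):
    """Map a CPU utilization reading to (exit_code, status_line).

    The thresholds are illustrative defaults, not values from the talk.
    """
    if cpu_percent is None or not 0.0 <= cpu_percent <= 100.0:
        return UNKNOWN, "CPU UNKNOWN - no valid reading"
    if cpu_percent >= crit:
        code = CRITICAL
    elif cpu_percent >= warn:
        code = WARNING
    else:
        code = OK
    # Text after '|' is performance data: label=value;warn;crit
    line = "CPU %s - utilization %.1f%% | cpu=%.1f%%;%.1f;%.1f" % (
        LABELS[code], cpu_percent, cpu_percent, warn, crit)
    return code, line


def main(argv):
    """Hypothetical entry point: the reading is passed as the first argument."""
    try:
        reading = float(argv[0])
    except (IndexError, ValueError):
        reading = None
    code, line = evaluate_cpu(reading)
    print(line)
    return code

# A real plugin script would finish by exiting with main()'s return value,
# e.g. sys.exit(main(sys.argv[1:])), so that Nagios sees the status code.
```

Keeping the threshold logic in a pure function makes it easy to test without a device on the network.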

Services Monitored by Nagios
Nagios uses plugins to check service status:
- DHCP
- DNS
- FTP
- HTTP
- HTTPS
- IMAP
- NTP
- Radius
- SMTP
- SQL
- TFTP
- WINS
- etc.

Nagios Extensibility – Event Handlers
- Compiled executables or scripts
- Run by the Nagios process
- Triggered by a host or service status change
- Example:

    define service {
        host_name             somehost
        service_description   HTTP
        max_check_attempts    4
        check_command         check_http
        event_handler         restart-httpd
        ...other service variables...
    }
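What a handler such as `restart-httpd` might decide can be sketched as below. The slide does not show the handler itself, so this is an assumption throughout: it supposes the command definition passes the service state, state type (SOFT or HARD) and current attempt number as arguments, and the restart action is a placeholder callable rather than a real service restart.

```python
def should_restart(state, state_type, attempt, restart_on_attempt=3):
    """Decide whether the handler should attempt a restart.

    With max_check_attempts 4, attempt 3 is the last soft retry before
    Nagios would declare a hard CRITICAL state and notify.
    """
    if state != "CRITICAL":
        return False
    if state_type == "SOFT":
        # Act once, just before the problem would become a hard state.
        return attempt == restart_on_attempt
    # Hard state: the soft-state attempt did not help or was missed.
    return state_type == "HARD"


def handle_event(state, state_type, attempt, restart=lambda: None):
    """Run the (placeholder) restart action when the rules above say so."""
    if should_restart(state, state_type, attempt):
        restart()
        return "restart requested"
    return "no action"
```

Acting on a late soft state gives the service a chance to recover before anyone is paged.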

Nagios Extensibility – External Commands
- A predefined set of commands issued externally to control the behavior of Nagios
- Control notification, monitor scheduling, and program start/stop
- Issued by external applications (CGI, snmptrapd, etc.)
- Read in by the Nagios core process at run time
- Example:
  - A user disables monitoring of switch1 from the web interface
  - The CGI writes the command “disable monitor switch1” to the command file
  - The Nagios process reads this command and stops scheduling monitoring for switch1
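The command quoted on the slide is a paraphrase. On the wire, an external command is a single line of the form `[epoch_time] COMMAND_NAME;arg1;arg2` written to the command file. A minimal sketch, assuming the `DISABLE_HOST_CHECK` command is the one meant and that the caller supplies the command file path (it is set by `command_file` in the Nagios main configuration); the helper names are hypothetical:

```python
import time


def format_external_command(name, args=(), now=None):
    """Build one command-file line: [epoch] NAME;arg1;arg2"""
    timestamp = int(time.time() if now is None else now)
    fields = [name] + [str(a) for a in args]
    return "[%d] %s\n" % (timestamp, ";".join(fields))


def submit_external_command(command_file, name, args=(), now=None):
    """Write the command to the file Nagios reads.

    In a real installation command_file is a named pipe, so opening it
    for writing does not truncate anything.
    """
    line = format_external_command(name, args, now)
    with open(command_file, "w") as handle:
        handle.write(line)
    return line
```

For the switch1 example, `format_external_command("DISABLE_HOST_CHECK", ["switch1"])` yields the line the CGI would write.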

Monitoring Network Devices
- Ping
  - Measures system responsiveness via average RTT
- SNMP get
  - CPU
  - Temperature
  - Interface/port status
  - System up time
  - Power supply status
  - Throughput
  - Packet discard rate
  - etc.
- SNMP trap
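Throughput and packet discard rate are not read directly over SNMP; they are derived from two samples of interface counters taken some interval apart. The sketch below shows that arithmetic under stated assumptions: the counters are 32-bit (so they wrap at 2^32), at most one wrap happens between samples, and the packet counter counts delivered packets only. The function names are illustrative, not part of the talk.

```python
COUNTER32_MODULUS = 2 ** 32


def counter_delta(previous, current, modulus=COUNTER32_MODULUS):
    """Difference between two counter samples, allowing for one wrap."""
    return (current - previous) % modulus


def throughput_bps(prev_octets, curr_octets, interval_seconds):
    """Average bits per second between two octet-counter samples."""
    return counter_delta(prev_octets, curr_octets) * 8 / interval_seconds


def discard_rate(prev_discards, curr_discards, prev_packets, curr_packets):
    """Fraction of packets discarded during the sampling interval."""
    discards = counter_delta(prev_discards, curr_discards)
    delivered = counter_delta(prev_packets, curr_packets)
    total = discards + delivered
    return discards / total if total else 0.0
```

A plugin would compare these derived values against warning and critical thresholds rather than the raw counters.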

Nagios Trap Handling
- Requires Net-SNMP or another trap receiver daemon
- The trap receiver notifies Nagios about received traps via external commands
- Nagios calls event handlers and/or notifies the user
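One common way for a trap receiver to hand a trap to Nagios is a passive check result, submitted with the `PROCESS_SERVICE_CHECK_RESULT` external command. The sketch below only builds that command line. The trap-to-state mapping and the "Interface Status" service name are assumptions for illustration; the talk does not say which traps or services LBLnet used.

```python
# Hypothetical mapping from trap name to Nagios state code
# (0 = OK, 2 = CRITICAL); unknown traps map to 3 = UNKNOWN.
TRAP_TO_STATE = {"linkDown": 2, "linkUp": 0}


def trap_to_passive_result(host, trap_name, detail, now,
                           service="Interface Status"):
    """Turn a received trap into a passive service check result line."""
    code = TRAP_TO_STATE.get(trap_name, 3)
    # Keep the plugin output on a single line.
    output = "%s: %s" % (trap_name, detail.replace("\n", " "))
    return "[%d] PROCESS_SERVICE_CHECK_RESULT;%s;%s;%d;%s\n" % (
        int(now), host, service, code, output)
```

Because the result arrives as an ordinary state change, the usual event handlers and notifications apply to it.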

Dependency Configuration

    define host {
        use         switch-tmpl
        host_name   switch1
        address     1.2.3.10
        parents     router1
    }

    define host {
        host_name   switch2
        address     1.2.3.20
        parents     switch1
    }

    define host {
        host_name   switch3
        address     1.2.3.30
    }

    define host {
        host_name   switch4
        address     1.2.3.40
        parents     switch2
    }

[Diagram]
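The effect of the `parents` directive can be sketched as follows: a host that fails its check is DOWN if it has no parents or at least one parent is still up, and UNREACHABLE if every parent is itself down or unreachable. Notifying only for DOWN hosts is what avoids a notification storm. The parent map below copies the slide (switch3 has no parent in the transcript); the function names are hypothetical and this is a simplified model, not the Nagios implementation.

```python
PARENTS = {
    "router1": [],
    "switch1": ["router1"],
    "switch2": ["switch1"],
    "switch3": [],  # no parent appears for switch3 in the transcript
    "switch4": ["switch2"],
}


def classify(host, failed_checks, parents=PARENTS):
    """Return UP, DOWN or UNREACHABLE for a host (assumes no parent cycles)."""
    if host not in failed_checks:
        return "UP"
    host_parents = parents.get(host, [])
    if not host_parents:
        return "DOWN"
    if any(classify(p, failed_checks, parents) == "UP" for p in host_parents):
        return "DOWN"
    return "UNREACHABLE"


def hosts_to_notify(failed_checks, parents=PARENTS):
    """Only hosts that are really DOWN should generate a notification."""
    return sorted(h for h in failed_checks
                  if classify(h, failed_checks, parents) == "DOWN")
```

If switch1 fails, the checks for switch2 and switch4 fail too, but only switch1 is reported as DOWN.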

Nagios Notification
- Similar to event handlers
- Triggered by a host/service status change
- Calls third-party notification tools (sendmail, qpage, etc.)
- Supports email, page, instant messaging, etc.

Nagios Notification Format
- Email

    Subject: switch3 (1.2.3.30) DOWN
    Host: switch3
    Address: 1.2.3.30
    Date/Time: Thu Jul 15 14:03:37 PDT 2004
    Additional Info: (No Information Returned From Host Check)

- Page

    DOWN switch3(1.2.3.30)
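The two formats above can be produced from the same few fields. This is only a sketch of the text layout: in Nagios the text is normally assembled in a notification command definition using macros, and the function names here are hypothetical.

```python
def format_email(host, address, state, when, info):
    """Return (subject, body) in the layout shown on the slide."""
    subject = "%s (%s) %s" % (host, address, state)
    body = "\n".join([
        "Host: %s" % host,
        "Address: %s" % address,
        "Date/Time: %s" % when,
        "Additional Info: %s" % info,
    ])
    return subject, body


def format_page(host, address, state):
    """Short form for pagers: state first, so it survives truncation."""
    return "%s %s(%s)" % (state, host, address)
```

Putting the state first in the page text keeps the most important word visible on small displays.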

Maintenance Scheduling
- Schedule a maintenance window via the Nagios web interface
- Uses external commands
- Fixed window
- Floating window
- Dependency aware
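Behind the web interface, a maintenance window is an external command. The sketch below builds a `SCHEDULE_HOST_DOWNTIME` line, whose fields are host, start, end, fixed flag, trigger ID, duration, author and comment. A fixed window runs from start to end; a floating (flexible) window starts when the host actually goes down within that range and lasts `duration` seconds. The helper name, author and comment are hypothetical.

```python
def downtime_command(host, start, end, author, comment, issued_at,
                     fixed=True, duration=None):
    """Build a SCHEDULE_HOST_DOWNTIME external command line.

    start, end and issued_at are epoch seconds; duration is in seconds.
    """
    if end <= start:
        raise ValueError("end must be later than start")
    if fixed:
        duration = end - start
    elif duration is None:
        raise ValueError("a floating window needs a duration in seconds")
    fields = ["SCHEDULE_HOST_DOWNTIME", host, start, end,
              1 if fixed else 0,
              0,  # trigger ID 0: not triggered by another downtime entry
              duration, author, comment]
    return "[%d] %s\n" % (int(issued_at), ";".join(str(f) for f in fields))
```

The resulting line is written to the Nagios command file like any other external command.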

Monitoring Subnet with Redundant Network Connections
- Challenge:
  - Monitoring interface status and standby status at the same time
- Solution:
  - Monitor interface up/down status via ping
  - Monitor HSRP status via the HSRP MIB
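Combining the HSRP states of a redundant router pair into one Nagios status could look like the sketch below. It assumes the state numbering of the Cisco HSRP MIB as commonly documented (5 = standby, 6 = active), which should be checked against the MIB in use, and it assumes the states have already been fetched over SNMP, with `None` standing for a router that did not answer. The policy (exactly one active router, at least one standby) is an illustration, not the rule used at LBLnet.

```python
# Assumed HSRP state codes; verify against the MIB on your devices.
STANDBY, ACTIVE = 5, 6


def redundancy_status(states):
    """Map {router_name: hsrp_state_or_None} to (exit_code, message)."""
    active = sorted(r for r, s in states.items() if s == ACTIVE)
    standby = sorted(r for r, s in states.items() if s == STANDBY)
    if len(active) != 1:
        return 2, "CRITICAL - expected exactly one active router, found %d" % len(active)
    if not standby:
        return 1, "WARNING - %s is active but no standby router is ready" % active[0]
    return 0, "OK - %s active, %s standby" % (active[0], ", ".join(standby))
```

The WARNING case is the one a plain ping check misses: the subnet still works, but redundancy is gone.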

Performance of Nagios
- False alarms
  - False positive
  - False negative
  - Unnecessary
- Notification delay
  - Before: 303 sec
  - After: 221 sec

Money and Time Saved
- Software package cost
  - InCharge ($$$)
  - IPmonitor ($1500)
  - Nagios ($0)
- Software maintenance contract cost
  - InCharge (>$15,000)
  - IPmonitor ($500)
- Time saved from fewer unnecessary alarms (compared to IPmonitor)
  - 20 man-hours/month

Future Development of Nagios
- Performance monitoring
  - Network element out of resources
  - Interface buffer drops
  - Duplex mismatch
  - Has to be done by inference
    - Assume heterogeneous network equipment
    - No use of host SNMP
    - Derive from a combination of interface error types and rates
- Integrating with other NMS elements
  - Syslog
  - MRTG/RRDtool
  - Trouble ticket system
  - Database
- Topology discovery

Conclusion
Nagios fits our network management needs because of:
- Accurate and efficient fault detection
- Extensibility
- Easy integration with our existing system
- Low maintenance
- A cost that fits our budget

Thanks!
- We are happy to share
- Send questions and comments to lblnet@lbl.gov