Lemon Tutorial Sensor How-To Miroslav Siket, Dennis Waldron CERN-IT/FIO-FD.


09/10/2006 Lemon Tutorial

Outline
- Terminology
- Examples of existing sensors
- Considerations
- Live examples
  - Hello World
  - Service-based monitoring
- Do's and Don'ts

Terminology

Sensor:
- A process or script which is connected to the lemon-agent via a bi-directional pipe and collects information on behalf of the agent. Sensors implement metric classes.

Metric class:
- The equivalent of a class in OOP (Object-Oriented Programming).

Metric instance:
- An instance (an object) of a metric class, which has its own configuration data.

Metric ID:
- A unique identifier associated with a particular metric instance of a particular metric class.
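The OOP analogy above can be made concrete with a small Python sketch. It is only a model of the terminology, not Lemon code: the `MetricInstance` type, the metric class name `file.size`, the IDs 9001 and 9002 and the configuration keys are all hypothetical names chosen for illustration.

```python
from dataclasses import dataclass, field


@dataclass
class MetricInstance:
    """One configured instance of a metric class (hypothetical model)."""
    metric_id: int       # unique identifier for this instance
    metric_class: str    # name of the metric class it instantiates
    config: dict = field(default_factory=dict)  # per-instance configuration


def describe(instance: MetricInstance) -> str:
    """Return a one-line summary: ID, class name, sorted configuration."""
    return f"{instance.metric_id} {instance.metric_class} {sorted(instance.config.items())}"


# Two instances of the same (hypothetical) metric class. Each one has its
# own configuration data and its own metric ID.
messages_size = MetricInstance(9001, "file.size", {"path": "/var/log/messages"})
secure_size = MetricInstance(9002, "file.size", {"path": "/var/log/secure"})

print(describe(messages_size))
print(describe(secure_size))
```

Both objects share one metric class but differ in configuration and ID, which is the relationship the three terms describe.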

Existing sensors

At CERN:
- Approximately 40 active sensors are defined, providing 264 metrics and 227 exceptions.
- The default installation of the Lemon agent comes with three sensors:
  - MSA (built in): self-monitoring of the agent.
  - Linux: performance, file system and process monitoring.
  - File: file tests, e.g. size, mtime, ctime.
- Together they provide 135 metrics (51% of all CERN metrics).
- Other officially distributed sensors include:
  - exception: correlation sensor for generating alarms.
  - remote: provides ping and HTTP web server checks.
  - oracle: Oracle database statistics monitoring.
  - parselog: log file parsing sensor.
- All are available from the Lemon software repository.
- Other contributed sensors are available from CVS.

Considerations

Question: What is your goal? How do you intend to use the monitoring information you collect? Is it for:
- Pure data collection?
  - OK.
- Graphs displayed on the Lemon status pages?
  - Just because you've collected data doesn't give you graphs immediately! This is not automatic!
- Information to be alarmed?
  - Make sure the structure of the data you collect can be alarmed!

Data that cannot be alarmed:
- Timestamps as strings: NO
- Timestamps as numbers: NO
- Parsing of complex strings: NO

Considerations (II) - Use Case: Grid Certificate Expiry

Outline: you wish to be notified, or raise an alarm, if the Grid certificate on a machine will expire in the next two weeks.

You need 1 metric and 1 exception:
- The metric will record the expiry time of the certificate.
- The exception will check the metric and decide whether the certificate expires in the next two weeks.

The metric needs to be structured in such a way that the correlation unit of the exception sensor can understand it.

Can I record the data as a:
- String, e.g. “Sun Oct 8 16:05: ”? NO (cannot be converted to a number).
- UnixTime? NO (the correlation unit doesn't understand time, yet!).

Solution:
- Record the number of seconds until the certificate expires.
- A value in seconds (e.g. the equivalent of 3 weeks) can be alarmed mathematically: if metric < (2 weeks in seconds) then raise alarm.
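The arithmetic behind this solution can be sketched in a few lines of Python. This is an illustration of the idea only, not the Lemon exception sensor or its correlation syntax: the function names and the example dates are assumptions, and in a real sensor the expiry time would be read from the certificate itself.

```python
from datetime import datetime, timezone

TWO_WEEKS_S = 14 * 24 * 3600  # alarm threshold, in seconds


def seconds_until_expiry(not_after: datetime, now: datetime) -> int:
    """Whole seconds remaining until not_after (negative once expired)."""
    return int((not_after - now).total_seconds())


def should_alarm(seconds_left: int, threshold: int = TWO_WEEKS_S) -> bool:
    """Plain numeric comparison: the kind of test an exception can express."""
    return seconds_left < threshold


# Hypothetical example: the certificate expires exactly three weeks from "now".
now = datetime(2006, 10, 9, tzinfo=timezone.utc)
expiry = datetime(2006, 10, 30, tzinfo=timezone.utc)
left = seconds_until_expiry(expiry, now)

print(left, should_alarm(left))
```

Because the metric is already a duration, the alarm condition needs no notion of dates or time zones, only a comparison against a constant.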

Considerations (III)

Misconception:
- That in Lemon a metric has to be related to one and only one distinct piece of information (a 1-to-1 mapping).

Not true:
- A metric can be associated with multiple values and can have multiple rows, with each row identified by a unique key.

Considerations (IV) - Use Case: Recording partition information

Outline: you would like to know the total size, the space used in megabytes, the space used as a percentage and the mount options of all mounted partitions on a machine.
- Under the idea of a 1-to-1 mapping, that's 4 metrics per partition. An average machine may have 7 partitions (4 x 7 = 28 metrics in total).
- Why not convert the data into a multi-valued metric? That gives 7 metrics, each reporting 4 values. So:
  - Metric 1: total_space
  - Metric 2: space_used_mb
  - Metric 3: space_used_perc
  - Metric 4: mount_options
  Becomes:
  - Metric A: total_space space_used_mb space_used_perc mount_options
- Go one step further: convert the data into a multi-valued, multi-rowed metric, i.e. 1 metric reporting the values for all mount points. So:
  - Metric A: total_space space_used_mb space_used_perc mount_options
  Becomes:
  - Metric B: mountname1 total_space space_used_mb space_used_perc mount_options
  - Metric B: mountname2 total_space space_used_mb space_used_perc mount_options
  - ...
- Benefits:
  - Monitoring of new mount points is dynamic: no need for reconfiguration, and no need to go through a registration process to get new metric IDs.
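The shape of "Metric B" can be sketched as follows. This is a Python illustration of the data layout only, not the Lemon storage API: the `PartitionRow` type, the helper functions and the sample partition figures are all hypothetical.

```python
from typing import NamedTuple


class PartitionRow(NamedTuple):
    """One row of a multi-valued, multi-rowed metric, keyed by mount name."""
    mountname: str          # unique key identifying the row
    total_space: int        # total size in MB
    space_used_mb: int      # space used in MB
    space_used_perc: float  # space used as a percentage
    mount_options: str


def build_rows(partitions: dict) -> list:
    """Turn {mountname: (total_mb, used_mb, options)} into one row per mount."""
    rows = []
    for mountname, (total_mb, used_mb, options) in sorted(partitions.items()):
        perc = round(100.0 * used_mb / total_mb, 1) if total_mb else 0.0
        rows.append(PartitionRow(mountname, total_mb, used_mb, perc, options))
    return rows


def format_row(row: PartitionRow) -> str:
    """Render a row as space-separated values, key first."""
    return " ".join(str(value) for value in row)


# Hypothetical sample data; a real sensor would discover the mounted partitions.
sample = {"/": (10000, 2500, "rw"), "/var": (20000, 5000, "rw,noatime")}
for row in build_rows(sample):
    print(format_row(row))
```

Adding a new mount point to the input simply produces one more row under the same metric, which is the dynamic behaviour listed under the benefits.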

Example 1 - Hello World

Objective: to create a Perl sensor which records the value “Hello World” into Lemon.

A simple sensor to demonstrate:
- The generic build framework for sensors.
- How to register your Perl module with the API.
- How to register the metric classes that your module provides.
- How to store the text “Hello World” into Lemon for the machine on which the sensor runs.
- Running and debugging your sensor on the command line.

Functions used:
- registerVersion()
- registerMetric()
- storeSample01()

Documented at:

Example 2 - Service Monitoring

Objective: to check whether a web page is available on a remote web server and record the HTTP response code under a service name.

Demonstrates:
- The basics of on-behalf reporting.
- The ability to parse configuration arguments.
- The ability to log messages.

Functions used:
- registerMetric()
- getParam()
- log()
- storeSample03()
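The check itself, independent of the Lemon Perl API, might look like the Python sketch below. It does not use registerMetric(), getParam(), log() or storeSample03(), whose signatures are not given here. The function names, the convention of returning 0 when no response is obtained, and the `opener` parameter (present only so the function can be exercised without a network) are all assumptions.

```python
import urllib.error
import urllib.request


def http_status(url: str, timeout: float = 5.0, opener=urllib.request.urlopen) -> int:
    """Return the HTTP status code for url, or 0 if no response was obtained."""
    try:
        with opener(url, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # The server answered, but with an error status such as 404 or 500.
        return err.code
    except (urllib.error.URLError, OSError):
        # DNS failure, refused connection, timeout and similar.
        return 0


def report(service_name: str, url: str, **kwargs) -> tuple:
    """Pair the result with a service name rather than the local host name."""
    return (service_name, http_status(url, **kwargs))
```

A call such as `report("web-frontend", "http://example.org/")` would return the service name together with the status code. Recording the value under a service name instead of the machine running the sensor is the essence of on-behalf reporting, and the explicit timeout keeps the check from blocking indefinitely.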

Do's and Don'ts

Don't:
- Call die() or exit() from inside your sensor.
- Open or write to files in locations writeable by non-root users, such as /tmp/.
- Read from filehandles (e.g. sockets) that may block. This will make your sensor unresponsive to requests from the agent.
- Rely on, or have dependencies on, files on remote file systems such as AFS (Andrew File System). Your sensor should aim to have as few dependencies as possible.

Do:
- Document your sensor. Refer to the sensor tutorial to see how this can be done automatically for you.
- If you have the ability to use a timeout around calls to databases and services like LSF, use it!
- Make your metric classes configurable; avoid hard-coded paths to non-standard files.
- Try to make your sensors as generic as possible so that others can benefit from your work.
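Two of the points above, using a timeout around external calls and never exiting from inside the sensor, can be sketched in Python as follows. This is an illustration under assumptions and not part of the Lemon API: `run_with_timeout` is a hypothetical helper, and the commands shown are stand-ins for a slow external tool.

```python
import subprocess
import sys


def run_with_timeout(cmd: list, timeout_s: float):
    """Run an external command; return its stdout, or None on any failure.

    Failures (timeout, missing binary, non-zero exit status) are reported
    to the caller as None instead of terminating the calling process.
    """
    try:
        done = subprocess.run(cmd, capture_output=True, text=True, timeout=timeout_s)
    except subprocess.TimeoutExpired:
        return None  # the child process is killed by subprocess.run
    except OSError:
        return None  # e.g. the command does not exist
    if done.returncode != 0:
        return None
    return done.stdout.strip()


# A fast command succeeds; a slow one is cut off instead of blocking.
fast = run_with_timeout([sys.executable, "-c", "print('ok')"], 30)
slow = run_with_timeout([sys.executable, "-c", "import time; time.sleep(5)"], 0.5)
print(fast, slow)
```

Returning a sentinel value leaves the decision about what to record with the caller, and keeps the process alive to answer the next request from the agent.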