EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Stuart Kenny and Stephen Childs Trinity.

Slides:



Advertisements
Similar presentations
Geoff Quigley, Stephen Childs and Brian Coghlan Trinity College Dublin
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Operations Dashboard Workplan Cyril.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
Global Customer Partnership Council Forum | 2008 | November 18 1IBM - GCPC MeetingIBM - GCPC Meeting IBM Lotus® Sametime® Meeting Server Deployment and.
INFSO-RI Enabling Grids for E-sciencE Status of LCG-2 porting Stephen Childs, Brian Coghlan and Eamonn Kenny Grid-Ireland/EGEE October.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What GGUS can do for you JRA1 All hands.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Spyros Kopsidas Center for Research and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks perfSONAR deployment over Spanish LHC Tier.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
1 The new Fabric Management Tools in Production at CERN Thorsten Kleinwort for CERN IT/FIO HEPiX Autumn 2003 Triumf Vancouver Monday, October 20, 2003.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Extensions to the ETICS Build System Client.
05/29/2002Flavia Donno, INFN-Pisa1 Packaging and distribution issues Flavia Donno, INFN-Pisa EDG/WP8 EDT/WP4 joint meeting, 29 May 2002.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks C. Loomis (CNRS/LAL) M.-E. Bégin (SixSq.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stephen Childs Trinity College Dublin &
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks WMSMonitor: a tool to monitor gLite WMS/LB.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios for Grid Services E. Imamagic, SRCE.
INFSO-RI Enabling Grids for E-sciencE Experience with monitoring of Prague T2 site Tomáš Kouba NEC 2007, Varna, Bulgaria
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
INFSO-RI Enabling Grids for E-sciencE Strategy for gLite multi-platform support Author:Eamonn Kenny Meeting:SA3 All Hands Meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite Build Programme and Multi-Platform.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Site Monitoring with Nagios E. Imamagic,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks, Novelties and Features around the GridWay.
Lemon Monitoring Miroslav Siket, German Cancio, David Front, Maciej Stepniewski CERN-IT/FIO-FS LCG Operations Workshop Bologna, May 2005.
Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Usage of virtualization in gLite certification Andreas Unterkircher.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stephen Childs Trinity College Dublin &
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Design of an Expert System for Enhancing.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
WLCG infrastructure monitoring proposal Pablo Saiz IT/SDC/MI 16 th August 2013.
INFSO-RI Enabling Grids for E-sciencE GridICE: Grid and Fabric Monitoring Integrated for gLite-based Sites Sergio Fantinel INFN.
QWG Errata Management Framework Ian Collier 10 th Quattor Workshop Rutherford Appleton Laboratory October 2010.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Progress on first user scenarios Stephen.
EGEE-II INFSO-RI Enabling Grids for E-sciencE GStat Work Plans for EGEE-III Joanna Huang, ASGC/OPS EGEE SA1 F2F Meetings, Abingdon.
INFSO-RI Enabling Grids for E-sciencE Grid-wide Intrusion Detection Stuart Kenny*, Brian Coghlan Dept. of Computer Science Trinity.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks John Gordon SA1 Face to Face CERN, June.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Communication tools between Grid Virtual.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
Disk Server Deployment at RAL Castor F2F RAL - Feb 2009 Martin Bly.
Grid testing using virtual machines Stephen Childs*, Brian Coghlan, David O'Callaghan, Geoff Quigley, John Walsh Department of Computer Science Trinity.
INFSO-RI Enabling Grids for E-sciencE Installing & configuring Joachim Flammer Integration Team, CERN EMBRACE Tutorial, Clermont-Ferrand.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Patch Preparation SA3 All Hands Meeting.
TCD Site Report Stuart Kenny*, Stephen Childs, Brian Coghlan, Geoff Quigley.
Grid-Ireland test facilities Stephen Childs Dept. of Computer Science Trinity College Dublin.
EGEE-II TCD 22 nd -25 th May 2007 Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Experiences with a distributed.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stephen Childs Trinity College Dublin &
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
StratusLab is co-funded by the European Community’s Seventh Framework Programme (Capacities) Grant Agreement INFSO-RI Demonstration StratusLab First.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Mining Job Monitoring Data Automatic Error.
INFSO-RI Enabling Grids for E-sciencE GOCDB Requirements John Gordon, STFC.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarksEGEE-III INFSO-RI MPI on the grid:
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyEGEE David Horat (
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
NGI and Site Nagios Monitoring
Quattor Usage at Nikhef
EDT-WP4 monitoring group status report
Presentation transcript:

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Stuart Kenny and Stephen Childs Trinity College Dublin & Grid-Ireland 7 th quattor workshop March Monitoring templates in QWG

Enabling Grids for E-sciencE 7th quattor workshop, London, March Monitoring tools status LEMON –Components: ncm-fmonagent, ncm-oramonserver –Templates: standard/monitoring/lemon –Configuration:  client, server, metrics, etc. via component  Web front-end via filecopy Nagios –Components: ncm-nagios, ncm-ncg –Templates: standard/monitoring/nagios, standard/monitoring/nagios3 (!) –Configuration: client?, server via component Ganglia –Components: N/A –Templates: standard/monitoring/ganglia –Configuration: client via filecopy, server & web front-end manual MonAMI –Components: –Templates: standard/monitoring/monami –Configuration: client via filecopy

Enabling Grids for E-sciencE 7th quattor workshop, London, March Ideal model Hierarchical site model should be defined once –Host: Machine and associated personality (could be multiple)  e.g. wn001.example.org is a cluster node  e.g. server.example.org is an NFS server and a web server –Cluster: List of hosts (e.g. “cluster nodes”, “NFS servers”  Combination of automatic generation (from “personality”) and explicit config  e.g. CLUSTERS[“SE”] = find_se_nodes()  e.g. CLUSTERS[“bad_nodes”] = list(“wn05.example.org”,”mon.example.org”) –Super-cluster: List of clusters (e.g. Grid machines, support services)  e.g. SUPERCLUSTER[“SUPPORT”]=list(“NFS”,”WEB”) All monitoring tools’ config should be generated from site model –Which sensors are on machine X? –Which machines’ data should I aggregate in cluster Y?

Enabling Grids for E-sciencE 7th quattor workshop, London, March How far are we from the ideal? Hierarchical site model should be defined once – Host: Hosts assigned node type by reg exp match on hostname (DB_MACHINE)  e.g., CE, WN, SE_DISK –Cluster: Lists of hosts referenced by cluster based on node type  e.g., NODES_WN, NODES_GATEWAY, NODES_CE –Super-cluster: Lists of clusters referenced by super-clusters  e.g., GRID_GATEWAY = nlist(“GRID_GATEWAY”,list(“GATEWAY”,”CE”,”SE_DISK”,”MON”)); All monitoring tools’ config should be generated from site model –Lemon variables re-used to create Nagios hostgroups: e.g., “gridservers/alias” = “TCD Grid Servers”; “gridservers/members” = { lis = LEMON_CLUSTERS[‘GRID_SERVERS’];…  Services then assigned to hostgroups –Could be used for Ganglia data_source, but TBD

Enabling Grids for E-sciencE 7th quattor workshop, London, March LEMON Current status in Grid-Ireland –Client  ncm-fmonagent Client still edg-fmon-agent, need to alter client configuration templates –Server  ncm-oramonserver Only used to generate database metadata Lemon database creation done by hand using lemon-admin  Some oracle XE environment setup included in templates Issues –Server profiles only for sl4 i386  Installing on SL5 x86_64 –Documentation not up to date with latest release  In particular for lemon-web, still lrf –Oracle XE very unstable, had to install Enterprise Edition –Nearly all of server configuration done by filecopy  Some variables, mainly for Oracle connection: username, password etc.  Mostly had to remove sections of templates e.g., oramon service no longer used (lemon-server)

Enabling Grids for E-sciencE 7th quattor workshop, London, March Nagios Current status in Grid-Ireland –Using ncm-nagios to configure server  Had to make some local changes after upgrade to nagios v3 Removing some lines from configuration file  Hosts created from hardware db  Services defined as separate templates in standard/monitoring/nagios/services Added to NAGIOS_SERVICE_TEMPLATES variable  Variables for other config files, e.g., servicegroups, hostgroups Issues –Wanted to deploy WLCG nagios service checks  Initially creating service definitions in ncg_services.tpl Lists defined services to create, VOs, host lists etc., oe.g., SAM_TESTS, SAM_VOS added to NAGIOS_EXPLICIT_SERVICES  Difficult to maintain, lots of services created Nagios3 templates (new?) oServices defined in template ncgservices.tpl (1742 lines!)

Enabling Grids for E-sciencE Nagios NCG component –WLCG already have nagios configuration generator (NCG)  Controlled by configuration file –Create configuration file using Quattor  Component calls ncg.pl to create service definitions –Easy to maintain –Multisite configuration possible –Always up to date service definitions Schema –Work in progress –Need full description of NCG configuration file –Example: “/software/components/ncg/configGen/nagios/PROBES_TYPE” = “all”; “/software/components/ncg/configGen/nagios/NRPE_UI” = “gridui.cs.tcd.ie”; –Created output files included in NAGIOS_EXTERNAL_FILES  /etc/nagios/wlcg.d/commands.cfg, /etc/nagios/wlcg.d/csTCDie/services.cfg, /etc/nagios/wlcg.d/cpDIASie/services.cfg….

Enabling Grids for E-sciencE 7th quattor workshop, London, March ganglia Client (gmond) –Well-defined config file format –Should be generated based on “site model” and machine type (i.e. “Which cluster am I in?”) Server (gmetad) –Well-defined config file format –Should be generated from QWG “site model” (i.e. which machines in which clusters) Web front-end –Config in PHP file

Enabling Grids for E-sciencE 7th quattor workshop, London, March MonAMI Tool is of minority interest? Main use is nice Torque/Maui/DPM graphs in ganglia Probably OK to stick with filecopy for now Configuration also needed on ganglia web front-end # Monitor torque and maui [torque] cache = 60 [maui] cache = 60 # write to ganglia [sample] read = maui,torque write = ganglia interval = 1m [ganglia] Client config

Enabling Grids for E-sciencE Grid-Ireland Monitoring