Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.

Presentation transcript:

Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI

CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/

Outline
monitoring and operations tools
  – SFT
  – SFT Admin Pages
  – Gstat
  – GOCDB
  – CIC Dashboard
  – FCR
tools in development
  – SAM
  – FCR (new version)

SFT (CERN)
Site Functional Tests
site (CE) usability from the user's point of view
constant re-certification, spotting and debugging problems
testing different aspects of the CE:
  – job submission, replica management, LCG version, R-GMA, CA rpms, etc.
official SFT submission from CERN
  – submitted for the dteam VO
  – every 3 hours
  – to Certified, Production, and Monitored sites
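The official submission rule on this slide (dteam VO, every 3 hours, Certified, Production, and Monitored sites) can be sketched in Python as follows. This is only an illustration, not the SFT code: the `Site` record, its boolean flags, and both function names are hypothetical, and the sketch assumes a site must carry all three flags to be targeted.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta

# Values taken from the slide.
SUBMISSION_INTERVAL = timedelta(hours=3)
TEST_VO = "dteam"


@dataclass(frozen=True)
class Site:
    """Hypothetical site record; the field names are assumptions."""
    name: str
    certified: bool
    production: bool
    monitored: bool


def official_targets(sites):
    """Return the sites that would receive the official submission.

    Assumption: a site qualifies only if it is Certified AND Production
    AND Monitored.
    """
    return [s for s in sites if s.certified and s.production and s.monitored]


def next_submission(last_run: datetime) -> datetime:
    """Return the time of the next official run, 3 hours after the last."""
    return last_run + SUBMISSION_INTERVAL
```

A scheduler built on this would call `official_targets` before each run, so that a site whose status changes is picked up or dropped at the next 3-hour cycle.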

The SFT Portal

SFT Admin Pages (Poznan)
on-demand SFT submission
easy-to-use target site selection
submission possible to non-certified sites
used by:
  – ROCs: certification of a site
  – ROCs, site admins, GOoDs: speed up debugging

SFT Admin portal

gstat (Sinica)
Information System (BDII) monitoring
response time, consistency, completeness
aggregated and detailed views
plots (history)
  – CPU availability, storage space, running jobs, etc.
refreshed every 5 minutes (non-intrusive)

gstat Portal

GOCDB (RAL)
central database to store static site information
all LCG/EGEE sites have to register
  – contact, security contact, certification status, site type
scheduled maintenance
used by
  – monitoring tools: SFT + gstat (via R-GMA), SAM (future)
  – script that generates the top-level BDII config file
  – operations management tools: On Duty Dashboard
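The idea of a script that turns GOCDB site records into a top-level BDII configuration can be illustrated with a short Python sketch. Everything concrete here is an assumption made for the example: the `SiteRecord` fields, the rule that only certified sites are listed, the port 2170 default, and the "name plus LDAP URL" line layout. The slides do not describe the actual script or file format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class SiteRecord:
    """Hypothetical subset of the static information held per site."""
    name: str
    bdii_host: str   # assumed field: host name of the site-level BDII
    certified: bool  # stands in for the "certification status" on the slide


def bdii_config_lines(records, port=2170):
    """Build one config line per certified site, sorted by site name.

    Assumptions: uncertified sites are left out, and each line has the
    form "<site> ldap://<host>:<port>/mds-vo-name=<site>,o=grid".
    """
    lines = []
    for rec in sorted(records, key=lambda r: r.name):
        if not rec.certified:
            continue
        url = f"ldap://{rec.bdii_host}:{port}/mds-vo-name={rec.name},o=grid"
        lines.append(f"{rec.name} {url}")
    return lines
```

Generating the file from the central database means a site that registers, or loses its certification, is reflected in the information system without manual edits.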

GOCDB Portal

On Duty Dashboard (IN2P3)
summary of necessary monitoring information + tools for ticket processing
GOoD ticket linked to the corresponding GGUS ticket
information from GOCDB
SFT + gstat results
ticket creation and management tool
tools for e-mailing concerned sites and ROCs

On Duty Dashboard

GGUS (FZK)
Global GRID User Support
ticketing system for the GRID
based on Remedy
tickets created by
  – individual users
  – automatically (GOoD Operations)
provides links to documentation and monitoring information

GGUS Portal

Connection between tools
[Diagram; labels: CIC dashboard, gstat, Monitoring tools, GGUS, Problem reporting and tracking, fix, Modifications on the tickets, Sites Admins, sft, Grid operator, test results]

FCR (CERN)
Freedom of Choice for Resources
critical test and resource selection for VOs
by manipulating top-level BDII information
selection on CEs and SEs
goal is to be able to
  – select which aspects of site functionality are important for the VO
  – blacklist unreliable sites
  – always use stable, "important" sites
  – less reliable sites based on SFT results
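The selection goals listed on this slide can be pictured as a filter over per-site test results. The Python sketch below is not the FCR implementation; the function name, the data layout, the test names used in examples, and the precedence rule (blacklist first, then the always-use list, then critical test results) are all assumptions chosen for illustration.

```python
def select_sites(results, critical_tests, blacklist=frozenset(),
                 always_use=frozenset()):
    """Return the sorted names of sites a VO would keep.

    results:        {site name: {test name: True if passed}} (assumed layout)
    critical_tests: test names the VO marked as critical
    blacklist:      sites the VO never wants (assumed to take precedence)
    always_use:     stable, "important" sites kept regardless of results
    """
    selected = []
    for site in sorted(results):
        if site in blacklist:
            continue
        if site in always_use:
            selected.append(site)
            continue
        outcomes = results[site]
        # A missing result is treated as a failure (an assumption).
        if all(outcomes.get(test, False) for test in critical_tests):
            selected.append(site)
    return selected
```

For example, a VO that marks only a hypothetical `job-submit` test as critical keeps every site that passes it, even when other tests fail there; adding a second critical test narrows the list further.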

FCR Portal

Connection between tools
[Diagram; labels: FCR, VO BDII, configuration filter, Sites, Site Admins, VO user, jobs, sft, VO manager, test results, VO RB, GOCDB, site list, site info]

SAM
Service Availability Monitoring
monitoring framework for GRID services
"evolution of SFT"
services involved:
  – CE, SE, BDII, RB, etc.
development of the framework at CERN
sensor development distributed
  – CERN, RAL, Sinica
web services + Oracle DB

SAM Portal - main

SAM - sensor page

FCR new version
integrated with SAM
new features
  – for every service, the VO can select which tests are critical
  – definition of the core services
  – site status information pages for users
web services, Oracle
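The per-service critical test selection mentioned above can be sketched as a small status function in Python. This is an illustration under stated assumptions, not the FCR or SAM code: the function name, the dictionary layout, the "ok"/"failed" labels, and the test names in the usage note are hypothetical.

```python
def service_status(service_type, test_results, critical_by_service):
    """Evaluate one service instance against a VO's critical tests.

    service_type:        e.g. "CE" or "SE"
    test_results:        {test name: True if passed} for this instance
    critical_by_service: {service type: set of critical test names}
                         as chosen by the VO (assumed layout)

    Returns ("ok", []) or ("failed", [sorted failed critical tests]).
    Assumptions: a missing result counts as a failure, and a service
    type with no critical tests is reported as "ok".
    """
    critical = critical_by_service.get(service_type, set())
    failed = sorted(t for t in critical if not test_results.get(t, False))
    if failed:
        return ("failed", failed)
    return ("ok", [])
```

With a VO choice such as `{"CE": {"job-submit", "ca-rpms"}}`, a CE that fails only a non-critical test is still reported as "ok", which is the point of letting each VO decide what matters to it.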