Presentation is loading. Please wait.

Presentation is loading. Please wait.

Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.

Similar presentations


Presentation on theme: "Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI."— Presentation transcript:

1 Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI

2 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 2 Outline monitoring and operations tools –SFT –SFT Admin Pages –Gstat –GOCDB –CIC Dashboard –FCR tools in development –SAM –FCR (new version)

3 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 3 SFT (CERN) Sites Functional Tests https://lcg-sft.cern.ch:9443/sft/lastreport.cgi site (CE) usability from the users point of view constant re-certification, spotting and debugging problems testing different aspects of CE: –job submission, replica management, LCG version, rgma, CA rpms, etc. official SFT submission from CERN –submitted for dteam VO –in every 3 hours –to Certified, Production, and Monitored sites

4 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 4 The SFT Portal

5 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 5 SFT Admin Pages (Poznan) https://monitoring.egee.man.poznan.pl/admin2 on-demand SFT submission easy to use target site selection submission possible to non-certified sites used by: –ROCs: certification of a site –ROCs, site admins, GOoDs: speed up debugging

6 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 6 SFT Admin portal

7 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 7 gstat (Sinica) http://goc.grid.sinica.edu.tw/gstat/ Information System (BDII) monitoring response time, consistency,completeness aggregated and detailed views plots (history) –CPU availability, storage space, running jobs, etc. refreshed in every 5 mins (non-intrusive)

8 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 8 gstat Portal

9 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 9 GOCDB (RAL) https://goc.grid-support.ac.uk/gridsite/gocdb2/index.php central database to store static site information all LCG/EGEE sites have to register –contact, security contact, certification status, site type scheduled maintainance used by –monitoring tools SFT + gstat (via RGMA), SAM (future) –script that generates top-level BDII config file –operations management tools On Duty Dashboard

10 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 10 GOCDB Portal

11 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 11 On Duty Dashboard (IN2P3) summary of necessary monitoring information + tools for ticket processing GOoD ticket linked to corresponding GGUS ticket information from GOCDB SFT + gstat results ticket creation and management tool tools for e-mailing concerned sites and ROCs

12 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 12 On Duty Dashboard

13 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 13 GGUS (FZK) Global GRID User Support http://ggus.org ticketing system for the GRID based on Remedy tickets created by –individual users –automatically (GOoD Operations) provides links to documentation, monitoring infos

14 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 14 GGUS Portal

15 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 15 Connection between tools CIC dashboard gstat Monitoring tools GGUS Problem reporting and tracking fix Modifications on the tickets Sites Admins email sft Grid operator test results

16 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 16 FCR (CERN) Freedom of Choice for Resources https://goc.grid-support.ac.uk/gridsite/bdii/site-apps/FCR-cgi/fcr.cgi critical test and resource selection for VOs by manipulating top-level BDII information selection on CEs and SEs goal is to be able to –select which aspects of site functionality are important for the VO –blacklist unreliable sites –always use stable, "important" sites –less reliable sites based on SFT results

17 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 17 FCR Portal

18 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 18 Connection between tools FCR VO BDII configuration filter Sites Site Admins VO user jobs sft VO manager test results VO RBGOCDB site listsite info

19 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 19 SAM Service Availability Monitoring https://lcg-sam.cern.ch:8443/sam/sam.cgi monitoring framework for GRID services "evolution of SFT " services involved: –CE, SE, BDII, RB, etc. development of the framework at CERN sensor development distributed –CERN, RAL, Sinica web services + Oracle DB

20 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 20 SAM Portal - main

21 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 21 SAM - sensor page

22 CE EGEE-2/SEE-GRID-2 Summer School, Budapest, 06/07/2006 22 FCR new version integrated with SAM new features –for every service VO can select which test are critical –definition of the core services –site status information pages for users web services, Oracle


Download ppt "Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI."

Similar presentations


Ads by Google