Using Gstat 2 to Check your Published Information Stephen Burke RAL.

Slides:



Advertisements
Similar presentations
LCG WLCG Operations John Gordon, CCLRC GridPP18 Glasgow 21 March 2007.
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Operations Dashboard Workplan Cyril.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
Africa & Arabia ROC tutorial The GSTAT2 Grid Monitoring tool Mario Reale GARR - Italy ASREN-JUNET Grid School - 24 November 2011 Africa & Arabia ROC Tutorial.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
Monitoring the Grid at local, national, and Global levels Pete Gronbech GridPP Project Manager ACAT - Brunel Sept 2011.
Monitoring in EGEE EGEE/SEEGRID Summer School 2006, Budapest Judit Novak, CERN Piotr Nyczyk, CERN Valentin Vidic, CERN/RBI.
1 1 Service Composition for LHC Computing Grid Monitoring Beob Kyun Kim e-Science Division, KISTI
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Migration to the GLUE 2.0 information schema in the LCG/EGEE/EGI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
WP3 Information and Monitoring Steve Fisher / RAL 23/9/2003.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Grid Monitoring Tools Alexandre Duarte CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
Towards a Global Service Registry for the World-Wide LHC Computing Grid Maria ALANDES, Laurence FIELD, Alessandro DI GIROLAMO CERN IT Department CHEP 2013.
Report on Installed Resource Capacity Flavia Donno CERN/IT-GS WLCG GDB, CERN 10 December 2008.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
Grid Security Vulnerability Group Linda Cornwall, GDB, CERN 7 th September 2005
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
Grid Deployment Enabling Grids for E-sciencE BDII 2171 LDAP 2172 LDAP 2173 LDAP 2170 Port Fwd Update DB & Modify DB 2170 Port.
Site Validation Session Report Co-Chairs: Piotr Nyczyk, CERN IT/GD Leigh Grundhoefer, IU / OSG Notes from Judy Novak WLCG-OSG-EGEE Workshop CERN, June.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Progress on first user scenarios Stephen.
EGEE-II INFSO-RI Enabling Grids for E-sciencE GStat Work Plans for EGEE-III Joanna Huang, ASGC/OPS EGEE SA1 F2F Meetings, Abingdon.
EMI INFSO-RI Argus Policies in Action Valery Tschopp (SWITCH) on behalf of the Argus PT.
Storage dashboard Status report A.Baranovski 12/10/07.
INFSO-RI Enabling Grids for E-sciencE ARDA Experiment Dashboard Ricardo Rocha (ARDA – CERN) on behalf of the Dashboard Team.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Jan 2010 OSG Update Grid Deployment Board, Feb 10 th 2010 Now having daily attendance at the WLCG daily operations meeting. Helping in ensuring tickets.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI How to integrate portals with the EGI monitoring system Dusan Vudragovic.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
LCG WLCG Accounting: Update, Issues, and Plans John Gordon RAL Management Board, 19 December 2006.
GridView - A Monitoring & Visualization tool for LCG Rajesh Kalmady, Phool Chand, Kislay Bhatt, D. D. Sonvane, Kumar Vaibhav B.A.R.C. BARC-CERN/LCG Meeting.
SAM Database and relation with GridView Piotr Nyczyk SAM Review CERN, 2007.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Information System Tutorial Laurence Field.
The GridPP DIRAC project DIRAC for non-LHC communities.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Co-ordination & Harmonisation of Advanced e-Infrastructures for Research and Education Data Sharing Research Infrastructures Grant Agreement n
Probes Requirement Review OTAG-08 03/05/ Requirements that can be directly passed to EMI ● Changes to the MPI test (NGI_IT)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
WLCG Information System Status Maria Alandes Pradillo, CERN CERN IT Department, Support for Distributed Computing Group GDB 9 th September 2015.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Update on Service Availability Monitoring (SAM) Marian Babik, David Collados,
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI Services for Distributed e-Infrastructure Access Tiziana Ferrari on behalf.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
Implementation of GLUE 2.0 support in the EMI Data Area Elisabetta Ronchieri on behalf of JRA1’s GLUE 2.0 Working Group INFN-CNAF 13 April 2011, EGI User.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Information system workshop Stephen Burke egi.eu EGI TF Madrid September.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
NGI and Site Nagios Monitoring
Evolution of SAM in an enhanced model for monitoring the WLCG grid
SRM2 Migration Strategy
Quality Control in the dCache team.
Stephen Burke egi.eu EGI TF Prague September 20th 2012
EGEE Operation Tools and Procedures
Information System (BDII)
Site availability Dec. 19 th 2006
Information Services Claudio Cherubino INFN Catania Bologna
Presentation transcript:

Using Gstat 2 to Check your Published Information Stephen Burke RAL

HEPSYSMAN - June 11 th 2010 gstat 2 2 Overview Why gstat 2? What does it display? –Demo … What to look for –and how to fix/report problems GLUE 2 –With help from: Joanna Huang (ASGC) Laurence Field (CERN-IT) David Horat (CERN-IT)

HEPSYSMAN - June 11 th 2010 gstat 2 3 Motivations For Version 2 The old gstat pages are now too cluttered –The EGEE Grid has grown to 320+ sites –1 CE at CERN to 20+ CEs at CERN It is a single, centralized instance –EGI would like de-centralized, regional-based operations tools The information checks are not easily reusable –Difficult for use by sys admins and for software certification Tight coupling with SAM and GOCDB –Requires high-availability operations and notifications High-maintenance backend –Due to the gradual evolution of the code base

HEPSYSMAN - June 11 th 2010 gstat 2 4 Design Goals Consolidate the existing code base –To give a low-maintenance solution Isolate the testing component –To ensure that the tests are reusable Remove the dependency on the GOC database –To enable de-centralized deployment Bootstrapping should be achieved by querying a BDII Redesign the displays to address specific use cases –Generally improve the presentation Ensure that components are modular –And that GStat is extensible

HEPSYSMAN - June 11 th 2010 gstat 2 5 Design Choices Nagios –Manages the execution of tests and test results –Probes can be re-used by other OAT applications –Used for: Information Content Validation Tests BDII Service Monitor Django –Web application framework to simplify page generation –Object-relational mapper simplifies database integration –Significant experience already exists within the OAT –Used for: Snapshot and topology import scripts Web page rendering and management

HEPSYSMAN - June 11 th 2010 gstat 2 6 GStat 2.0 Framework

HEPSYSMAN - June 11 th 2010 gstat 2 7 Content Validation How to obtain the information content? –By querying a BDII How to ensure the information integrity and quality? –Do different checks based on the different entries Does the information agree with the schema? Are the entities we expect to see published? Are there things published that we don’t expect? Is the information logically self consistent? –Is the number of free CPUs less than or equal to the TotalCPUs? Is there agreement with external information sources? –Is the host registered in DNS? –Does it match the GOC DB? Is there conformance with extra project constraints? –Valid gLite version? –LCG installed capacity documentinstalled capacity

HEPSYSMAN - June 11 th 2010 gstat 2 8 gstat Filters Drop-down lists to filter by Grid, Country, VO, EGEE_ROC, WLCG_TIER –Currently one name space so watch for clashes, e.g. VO == Grid –No registry for most of these names, just convention –No EGI-related info defined yet Information source is mostly your published GlueSite info (VOs excepted) – so make sure it’s right …right ldapsearch -x -h lcg-bdii.cern.ch -p b o=grid gluesitename=UKI-SOUTHGRID-RALPP GlueSiteLongitude: GlueSiteLatitude: GlueSiteWeb: GlueSiteLocation: Oxfordshire, UK GlueSiteOtherInfo: EGEE_ROC=UK/I GlueSiteOtherInfo: EGEE_SERVICE=prod GlueSiteOtherInfo: GRID=EGEE GlueSiteOtherInfo: GRID=WLCG GlueSiteOtherInfo: GRID=SOUTHGRID GlueSiteOtherInfo: GRID=GRIDPP GlueSiteOtherInfo: WLCG_NAME=UK-SouthGrid GlueSiteOtherInfo: WLCG_PARENT=UK-T1-RAL GlueSiteOtherInfo: WLCG_TIER=2

HEPSYSMAN - June 11 th 2010 gstat 2 9 Using gstat 2 or –Also links from the gstat 1 page, and information on the gstat web sitegstat 1web site Geo View – plots sites on a map –Not that interesting, but check your co-ordinates –You can also jump to the site view – click on a site for a popup LDAP View – LDAP browser –See directly what your site is publishing –Need to pick a top-level BDII – but in theory they should all be the same! –URL includes the BDII (and base DN) – so can be bookmarked, and can query a site BDII directly Service View – shows top and site BDII monitoring status VO View – tree view of jobs and storage by VO

HEPSYSMAN - June 11 th 2010 gstat 2 10 Site View Site View – starts with a whole-Grid summary, drill down to your site –Can filter the table on various criteria –Popup help over the field labels Select your site to get a detailed view –URL can be bookmarked Overall summary of your site information –Does it look right? –BDII name is checked for DNS aliases –VO tabs show resources per VO Storage not yet separated by space token, but I have suggested that as an enhancement

HEPSYSMAN - June 11 th 2010 gstat 2 11 Tree View Most links on the site view take you to a tree view of your site services –Can get graphs of measured quantities Look for graph icon –“GLUE” link displays raw published info, “LDAP” link goes to LDAP browser BDII content validation – click BDII name for details, then click to expand each test section –“WARNING” means “probably wrong”, “CRITICAL” means “definitely wrong” – may or may not be serious –Most DPM sites get a lot of warnings for non-compliant “legacy SAs” - will be off by default in the next DPM release And can be turned off by hand now –Some other things may be due to middleware bugs, but should mostly be fixed AssignedJobSlots = 0 –Please look at what is being flagged and either: 1.Fix it 2.Report a bug in the middleware 3.Report a bug in the gstat test

HEPSYSMAN - June 11 th 2010 gstat 2 12 Reporting Problems GGUS ticket or –Developers are responsive –Relatively few problems reported so far – either gstat is perfect (unlikely!) or people aren’t looking much yet Daniela and Duncan reported problems Can also ask for enhancements – further development will be driven by demand You can look at the source code for the information system checks to see what it’s doingsource code –And indeed run it yourself

HEPSYSMAN - June 11 th 2010 gstat 2 13 Summary GStat 2.0 is in production – –It can replace the functionality of the original version –gstat 1 will be switched off on June 17 th Please –Provide feedback, both problems and suggestions –Look at the views important to you! Take ownership of the information you see! –Submit a GGUS ticket if there is something not right!

HEPSYSMAN - June 11 th 2010 gstat 2 14 And now for something slightly different …

HEPSYSMAN - June 11 th 2010 gstat 2 15 GLUE 2 Just a quick overview –For more info see my talks at CHEP09 and EGEE09CHEP09EGEE09 Abstract schema was approved in March 2009schema –Complete redesign, not backward compatible, hence must deploy in parallel –All services published in the same framework, much more flexible LDAP rendering defined and implemented in the BDII in glite 3.2 update 5 (September 09) –Query on the same port (2170) but base DN is o=glue Generic service info provider has been written –First to production is CREAM, update 12 (May 2010) –More to follow

HEPSYSMAN - June 11 th 2010 gstat 2 16 GLUE 2 – Next Steps Site BDII needs to aggregate from resource BDIIs and add the site (AdminDomain) info –Already certified, but unrelated deployment changes were controversial and have been rolled back – release is imminent Top BDII needs to aggregate info from site BDIIs –To follow soon –Can then see the whole Grid in GLUE 2 –gstat will be extended to monitor it Storage info providers need to come from SRM developers –No timescale yet CREAM info provider needs to be extended –Probably it will appear incrementally – end of the year? Client tools –Service discovery on the way, lcg-utils and WMS not started