First Ops Tools Long Term Sustainability F2F
David Collados
EGI-InSPIRE RI-261323, www.egi.eu

Presentation transcript:

Title slide: First Ops Tools Long Term Sustainability F2F, David Collados

Contents
  I. Tool and PT Description
  II. DoW Checkpoint
  III. RT Checkpoint
  IV. Effort Evaluation
  V. Future Involvement
  VI. Future Evolution

Tool and PT Description 1/2
SAM overview
  - Monitoring and reporting system for large-scale production grids
  - PT composition:
      Development: CERN, SRCE (NCG)
      Operations:
        - CERN (MyEGI, DBs, Msg brokers)
        - SRCE (Msg broker)
        - AUTH (Msg broker)

Tool and PT Description 2/2
SAM overview
  - Allocated effort in PMs for the 4 years:

      Task                                                                      CERN   SRCE
      TSA1.4E  Infrastructure for Grid Management                               59     11.2
      TJRA1.2  Maintenance and development of the deployed operational tools    12     12
      TJRA1.3  Supporting National Deployment models                            6      3

  - Allocated effort in PMs for PY3:

      Task                                                                      CERN   SRCE
      TSA1.4E  Infrastructure for Grid Management                               14.8   2.8
      TJRA1.2  Maintenance and development of the deployed operational tools    3      3

Tool Components
  - ATP: SQL, Python, Oracle, MySQL
  - POEM: SQL, Python, Oracle, MySQL
  - NCG: Perl
  - MRS: SQL, Python, Oracle, MySQL
  - ACE: SQL, Python, Oracle
  - MyEGI: Python, Django
  - Libs & clients for messaging & probes: Python, Perl

  Statistics (LOC):

      Total    SQL     Python   Shell   Perl    HTML    Javascript
      200 K    42 K    76 K     20 K    24 K    20 K    18 K
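The statistics row lists seven figures against six language headers. A minimal sketch of the consistency check, assuming the leading 200 K is the overall total (the variable names are illustrative, not part of the slides):

```python
# LOC figures in thousands, as read from the statistics row of the slide.
# Assumption: the leading "200 K" is the overall total, not a per-language value.
loc_k = {
    "SQL": 42,
    "Python": 76,
    "Shell": 20,
    "Perl": 24,
    "HTML": 20,
    "Javascript": 18,
}
reported_total_k = 200

# Sum the per-language figures and derive each language's share of the code base.
total_k = sum(loc_k.values())
shares = {lang: round(100 * k / total_k, 1) for lang, k in loc_k.items()}

# Print the languages from largest to smallest.
for lang, k in sorted(loc_k.items(), key=lambda item: item[1], reverse=True):
    print(f"{lang:<10} {k:>3} K  {shares[lang]:>5.1f} %")
print(f"{'Total':<10} {total_k:>3} K (reported: {reported_total_k} K)")
```

Under that reading the six per-language figures add up to the reported total, with Python the largest single share.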

DoW Checkpoint 1/2
DoW requirements met
  WP4 TSA1.4
    - migration of monitoring infrastructure to a regionalized model
    - deploy and run central instances for monitoring (MyEGI and SAM central DBs)
  WP7 TJRA1.2.6
    - MyEGI portal (moving from ROCs to NGIs, maintenance of portal integrating new resources and middleware components)
    - NCG: maintain the Nagios configurator to deal with new probes

DoW Checkpoint 2/2
  - DoW reqs under development: None
  - DoW reqs that can be met by EOP: None
  - DoW reqs that won't be met by EOP: None

RT Checkpoint
  - Main RT requirements met
      RT-2792: Multi VO SAM Nagios instance
      RT-2793: SAM Run Custom Probes
  - Under development
      RT-79: System to monitor operation tool availability
  - Can be met by the end of the project
  - Need discussion

RT Checkpoint
  - Probably won't be met by the end of the project
      RT-988: Handling virtual sites for SAM
      RT-502: SAM/GOCDB/OPSPORTAL: Handling virtual sites
      RT-2791: SAM to monitor services and sites not in GOCDB

Effort
Effort Evaluation and Splitting
  - Development vs Maintenance effort
      Dev (0.75) + Maint (2.85) = 3.6 FTE
  - Effort to run the service
      2.4 FTE
  - Effort needed to address main RT or DoW requirements that cannot be met due to lack of effort
      Handling Virtual Sites: 28 PMs
      Monitor Services and Sites not in GOCDB: 24 PMs
      Support Glue 2.0 and Multiple Serv. Endpoints in GOCDB: 12 PMs
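The figures on this slide can be added up as a quick check. A minimal sketch, using only the numbers quoted above (the variable names are illustrative, and `Decimal` is used simply to avoid floating-point rounding noise):

```python
from decimal import Decimal

# FTE figures quoted on the slide.
dev_fte = Decimal("0.75")
maint_fte = Decimal("2.85")
run_service_fte = Decimal("2.4")

# Development plus maintenance, then the overall yearly effort
# once running the service is included.
dev_maint_fte = dev_fte + maint_fte
total_fte = dev_maint_fte + run_service_fte

# Person-months estimated for the requirements that cannot be met.
unmet_pm = {
    "Handling Virtual Sites": 28,
    "Monitor Services and Sites not in GOCDB": 24,
    "Support Glue 2.0 and Multiple Serv. Endpoints in GOCDB": 12,
}
unmet_total_pm = sum(unmet_pm.values())

print(f"Dev + Maint: {dev_maint_fte} FTE")
print(f"Dev + Maint + running the service: {total_fte} FTE")
print(f"Unmet requirements: {unmet_total_pm} PMs")
```

The sum of development, maintenance, and service operation comes to 6 FTE, and the three unmet requirements together come to 64 PMs.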

Effort
Effort Evaluation and Splitting
  - Deviations between the provided effort and the InSPIRE funded effort
      InSPIRE funded effort (totals over 4 PYs):
        - 77 PMs: SA1 (59), JRA1 (18)
      Provided effort: ~6 FTE per year
        - ~288 PMs total over 4 PYs
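The deviation can be worked through explicitly. A minimal sketch, assuming 12 PMs per FTE-year (the conversion implied by ~6 FTE per year giving ~288 PMs over 4 PYs); the variable names are illustrative:

```python
# Assumption: 1 FTE working for one project year corresponds to 12 person-months.
PM_PER_FTE_YEAR = 12
PROJECT_YEARS = 4

# InSPIRE funded effort over the 4 project years, in PMs.
funded_pm = {"SA1": 59, "JRA1": 18}
funded_total_pm = sum(funded_pm.values())

# Provided effort: roughly 6 FTE per year, converted to PMs over the project.
provided_fte_per_year = 6
provided_total_pm = provided_fte_per_year * PM_PER_FTE_YEAR * PROJECT_YEARS

# Effort provided beyond the funded amount, and the funded fraction.
unfunded_pm = provided_total_pm - funded_total_pm
funded_share = funded_total_pm / provided_total_pm

print(f"Funded:   {funded_total_pm} PMs")
print(f"Provided: {provided_total_pm} PMs")
print(f"Provided beyond funding: {unfunded_pm} PMs")
print(f"Funded share of provided effort: {funded_share:.0%}")
```

On these approximate figures, the funded effort covers roughly a quarter of the effort actually provided.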

Future Involvement
Involvement after EGI-InSPIRE
  - Is current PT interested in continuing developing/maintaining the tool after EGI-InSPIRE?
      Yes, the complete tool
  - If you have an estimation of the effort considered minimal to continue the development/maintenance please report it
      6 FTE for development/maintenance/support/coordination without new requirements

Future Evolution
Evolution after EGI-InSPIRE
  - How would you like to evolve the tool?
      Probe execution:
        - Target other granularities than service endpoints (space tokens, workflows)
        - VO probes to focus more on VO meta-services/activities rather than services, site usability, etc.
      Results aggregation:
        - Accept test results produced by external monitoring systems through messaging
      Results computation:
        - Correlating final status, availability, and reliability figures with data from other systems or probes, such as job submission, storage elements, data transfer services, etc.

Future Evolution
Evolution after EGI-InSPIRE
  - How would you like to evolve the tool?
      Results computation:
        - Full regionalisation, including availability computations
      Results visualization:
        - Common pluggable visualization interfaces
      Site Monitoring:
        - Common multi-VO SAM for sites to locally understand site performance

Future Evolution
Evolution after EGI-InSPIRE
  - Describe any major changes that you consider advisable for the tool in order to improve usability, availability, sustainability
      Technology changes: No
      Components removal/addition: None
  - Think about evolution and uptake from other projects: EC will not fund what is already developed