Operations Coordination Team Maria Girone, CERN IT-ES GDB, 11 July 2012.

Slides:



Advertisements
Similar presentations
Operations Coordination Team Maria Girone, CERN IT-ES GDB 10 th October 2012.
Advertisements

Operations Coordination Team Maria Girone, CERN IT-ES Kick-off meeting 24 th September 2012.
WLCG Operations and Tools TEG Monitoring – Experiment Perspective Simone Campana and Pepe Flix Operations TEG Workshop, 23 January 2012.
Integrating Network and Transfer Metrics to Optimize Transfer Efficiency and Experiment Workflows Shawn McKee, Marian Babik for the WLCG Network and Transfer.
Assessment of Core Services provided to USLHC by OSG.
Solution Overview for NIPDEC- CDAP July 15, 2005.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES WLCG operations: communication channels Andrea Sciabà WLCG operations.
OSG Area Coordinators Meeting Operations Rob Quick 2/22/2012.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
Term 2, 2011 Week 3. CONTENTS The physical design of a network Network diagrams People who develop and support networks Developing a network Supporting.
Presentation to: Name: Date: ICAO Asia-Pacific AMHS Activities & Status ICAO Asia-Pacific AMHS Activities & Status ATS Message Handling System (AMHS )
OSG Area Coordinators Meeting Operations Rob Quick 2/22/2012.
Integration and Sites Rob Gardner Area Coordinators Meeting 12/4/08.
LCG and HEPiX Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002.
Georgia Institute of Technology CS 4320 Fall 2003.
Workshop summary Ian Bird, CERN WLCG Workshop; DESY, 13 th July 2011 Accelerating Science and Innovation Accelerating Science and Innovation.
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF TEG Workshop, 7 th February 2012.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
WLCG operations A. Sciabà, M. Alandes, J. Flix, A. Forti WLCG collaboration workshop July , Barcelona.
CERN-IT Oracle Database Physics Services Maria Girone, IT-DB 13 December 2004.
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
Consultant Advance Research Team. Outline UNDERSTANDING M&E DATA NEEDS PEOPLE, PARTNERSHIP AND PLANNING 1.Organizational structures with HIV M&E functions.
Handling ALARMs for Critical Services Maria Girone, IT-ES Maite Barroso IT-PES, Maria Dimou, IT-ES WLCG MB, 19 February 2013.
Report from the WLCG Operations and Tools TEG Maria Girone / CERN & Jeff Templon / NIKHEF WLCG Workshop, 19 th May 2012.
Ian Bird GDB CERN, 9 th September Sept 2015
Jan 2010 OSG Update Grid Deployment Board, Feb 10 th 2010 Now having daily attendance at the WLCG daily operations meeting. Helping in ensuring tickets.
EMI INFSO-RI SA1 Session Report Francesco Giacomini (INFN) EMI Kick-off Meeting CERN, May 2010.
WLCG Technical Evolution Group: Operations and Tools Maria Girone & Jeff Templon Kick-off meeting, 24 th October 2011.
Security Policy Update WLCG GDB CERN, 14 May 2008 David Kelsey STFC/RAL
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
1 Proposal for a technical discussion group  Because...  We do not have a forum where all of the technical people discuss the critical.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
WLCG Technical Evolution Group: Operations and Tools Maria Girone & Jeff Templon GDB 12 th October 2011, CERN.
Installation and Maintenance of Health IT Systems Unit 8a Troubleshooting; Maintenance and Upgrades; and Interaction with Vendors, Developers, and Users.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
ATLAS Distributed Computing ATLAS session WLCG pre-CHEP Workshop New York May 19-20, 2012 Alexei Klimentov Stephane Jezequel Ikuo Ueda For ATLAS Distributed.
Evolution of WLCG infrastructure Ian Bird, CERN Overview Board CERN, 30 th September 2011 Accelerating Science and Innovation Accelerating Science and.
Monitoring the Readiness and Utilization of the Distributed CMS Computing Facilities XVIII International Conference on Computing in High Energy and Nuclear.
WLCG Operations Coordination and Commissioning Maria Girone, CERN IT On behalf of the Operations Coordination Team 11 th March OSG All Hands Meeting,
Ian Bird, CERN 1 st February Dec 2015
LHCOPN operational model Guillaume Cessieux (CNRS/FR-CCIN2P3, EGEE SA2) On behalf of the LHCOPN Ops WG GDB CERN – November 12 th, 2008.
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
Grid Deployment Technical Working Groups: Middleware selection AAA,security Resource scheduling Operations User Support GDB Grid Deployment Resource planning,
WLCG Information System Status Maria Alandes Pradillo, CERN CERN IT Department, Support for Distributed Computing Group GDB 9 th September 2015.
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
Accounting Review Summary and action list from the (pre)GDB Julia Andreeva CERN-IT WLCG MB 19th April
WLCG Accounting Task Force Introduction Julia Andreeva CERN 9 th of June,
PerfSONAR operations meeting 3 rd October Agenda Propose changes to the current operations of perfSONAR Discuss current and future deployment model.
Ian Bird, CERN WLCG Project Leader Amsterdam, 24 th January 2012.
WLCG IPv6 deployment strategy
Roles and Responsibilities
Communication, Communication, Communication
COMP532 IT INFRASTRUCTURE
BA Continuum India Pvt Ltd
Ian Bird GDB Meeting CERN 9 September 2003
Leveraging the Power of Collaboration
Taming the protocol zoo
Proposal for obtaining installed capacity
IEEE Std 1074: Standard for Software Lifecycle
The CCIN2P3 and its role in EGEE/LCG
School of EE and Computer Science
Input on Sustainability
Leigh Grundhoefer Indiana University
Ian Bird LCG Project - CERN HEPiX - FNAL 25-Oct-2002
WLCG Collaboration Workshop: Outlook for 2009 – 2010
Finance & Planning Committee of the San Francisco Health Commission
OU BATTLECARD: WebLogic Server 12c
Presentation transcript:

Operations Coordination Team Maria Girone, CERN IT-ES GDB, 11 July 2012

Background WLCG Operations has helped in the successful delivery of the WLCG service – Essential to process and analyze data at unprecedented speed Operations costs are still high – and many changes are underway / in the pipeline Ops & Tools TEG provided a very good forum for discussions and came up with a number of concrete recommendations – Today we discuss the proposed mandate for the Operations Coordination Team – Addresses needs identified in the WLCG Service Coordination recommendations (R1) + Commissioning (R2) of OPS & Tools TEG Maria Girone, CERN2

Long-Term Recommendations: Operations R1: WLCG Service Coordination: improve the computing service(s) provided by the sites – Establish a coordination team with contributions from experiments, sites, and projects. Persistent effort. Monitors and directs service commissioning effort – Address specific Tier-2 communication needs Dedicated service coordination meetings – Evolve to “Computing as a Service at Tier-2s” less experiment-specific services and interactions organize with EGI, NDGF and OSG common site administrator training R2: WLCG Service Commissioning: establish core teams of experts (from sites and experiments) to validate, commission and troubleshoot services – Dedicated work groups (task forces) created dynamically on specific topics 3TEG Workshop, February 2012

Team Goals WLCG will evolve rapid over the next three years, by the end of LS1 we should be able to operate in a Computing as a service model but only if we are willing to evolve – Within the guidance of the WLCG-MB, the service and capacity providers and the experiments, the team will understand what services are actually needed; monitor health; negotiate the configuration, upgrade and roll-back; commission new services help in transition when services are decommissioned Will help in facing the significant reduction of personnel resources from sites and experiments 4Maria Girone, CERN

Computing as a Service Would like to arrive at a point where – A small number of well-defined common services would be needed per site; – Installing, configuring and upgrading these would be “trivial” – All services would comply to standards, e.g. for error messages, monitoring; – Services would be resilient to glitches and highly available; – In case of load (or unexpected “user behavior”) they would react gracefully; – In case of problems, diagnosis and remedy should be straight- forward and rapid. A point where sites provide a defined service and experiments use it – Increased expectations on the stability and quality of the service, but lower expectations on the need for customization and interaction 5Maria Girone, CERN

Team Roles Key body: core members + targeted experts when required – Need representation / knowledge of sites / regions, experiments and services – Like Networking, this will be a persistent WG with both long term and short term goals Relates to existing structures such as daily OPS Re-tasks WLCG T1SCM as principle communication / coordination meeting (see next slides) 6Maria Girone, CERN

Team Communication Integrates long-term goals with short-term task forces to address specific deployment / de-commissioning issues – Ensures and strengthens communication to sites (Tier1 and Tier2) – Recommends to the MB specific solutions to specific problems (based on operations experience and on its 'expert team' investigations) Interacts with other WGs – Via representation of team members e.g. data federations, networking, information system, security, … 7Maria Girone, CERN

Meetings Daily Operations – Some members from the OPS team will be also Service Coordinator On Duty – SCOD (meeting chair, report to the MB) – SCODs from sites are very welcome – Will run daily also during LS1 Fortnightly Operations Coordination – Monitors and Coordinates on-going operations – Replaces the T1SCM Quarterly Operations Planning – Reviews needs from experiments and sites – Prepares plans and proposes them to the MB – Creates and dissolves internal ops task forces 8Maria Girone, CERN

Ideas for Task Forces CVMFS deployment completion – see today’s GDB: OPS TEG members involved Perfsonar deployment gLExec deployment completion … 9Maria Girone, CERN

Fortnightly Coordination Meeting Target: 1.5 hour, short minutes and action list OCT Task Force Reports (Relevant) External WG News – IS, network, monitoring, security, … Operational Issues Review – Unresolved issues from the daily – Data and Storage Management – CE, batch – Other Baseline Services Operational Plans from Experiments (LHC Machine planning for WLCG ops perspective) 10Maria Girone, CERN

Moving to an Operations Focus In the T1SCM now storage is handled from a versioning and technology perspective rather than an operations and service functionality perspective – In the new operations meeting tracks well defined metrics on the agreed baseline services 11Maria Girone, CERN

Conclusions Operations team will ensure stable operations and coordinate the technical transition guidance and direction of the WLCG-MB We would like to start as soon as possible and finalize the membership of the team – This is a call for help from the sites Regular meetings as main communication channel to sites, experiments, service providers – With minutes and action list Internal expert-driven task forces for technical work plan definition and deployment – With established and documented procedures, twikis, … 12Maria Girone, CERN