EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.

Slides:



Advertisements
Similar presentations
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI EGI - Identity Management Steven Newhouse Director, EGI.eu Federated Identity.
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE-III Program of Work Erwin Laure EGEE-II / EGEE-III Transition Meeting CERN,
EGI: A European Distributed Computing Infrastructure Steven Newhouse Interim EGI.eu Director.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Future support of EGI services Tiziana Ferrari/EGI.eu Future support of EGI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PoW for the second year Transition to EGI.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director.
EMI INFSO-RI SA2 - Quality Assurance Alberto Aimar (CERN) SA2 Leader EMI First EC Review 22 June 2011, Brussels.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
The EGI Blueprint: Grid Operations and Security Migration to the next grid operations era Tiziana Ferrari (Istituto Nazionale di Fisica Nucleare)
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
UK NGI Operations John Gordon 15 th May NGS continuation NGI Security Monitoring VOMS Helpdesk I am reacting to some issues highlighted by Jeremy.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse Technical Director CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
RI EGI-InSPIRE RI EGI Future activities Peter Solagna – EGI.eu.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Bob Jones EGEE project director CERN.
INFSO-RI Enabling Grids for E-sciencE External Projects Integration Summary – Trigger for Open Discussion Fotis Karayannis, Joanne.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
Your university or experiment logo here The European Landscape John Gordon GridPP24 RHUL 15 th April 2010.
EGEE-III-INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-III All Activity Meeting Brussels,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse Technical Director CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
EMI INFSO-RI SA1 Session Report Francesco Giacomini (INFN) EMI Kick-off Meeting CERN, May 2010.
INFSO-RI Enabling Grids for E-sciencE An overview of EGEE operations & support procedures Jules Wolfrat SARA.
Ian Bird LCG Project Leader On the transition to EGI – Requirements from WLCG WLCG Workshop 24 th April 2008.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell NA3 Activity Manager 28.
Security Policy: From EGEE to EGI David Kelsey (STFC-RAL) 21 Sep 2009 EGEE’09, Barcelona david.kelsey at stfc.ac.uk.
WLCG Laura Perini1 EGI Operation Scenarios Introduction to panel discussion.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
AEGIS Academic and Educational Grid Initiative of Serbia Antun Balaz (NGI_AEGIS Technical Manager) Dusan Vudragovic (NGI_AEGIS Deputy.
EGEE is a project funded by the European Union under contract IST Roles & Responsibilities Ian Bird SA1 Manager Cork Meeting, April 2004.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks User Support for Distributed Computing Infrastructures.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Grid Oversight in Service Level Agreement environment Małgorzata Krakowian,
Components Selection Validation Integration Deployment What it could mean inside EGI
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse Technical Director CERN.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
Ian Bird LCG Project Leader Status of EGEE  EGI transition WLCG LHCC Referees’ meeting 21 st September 2009.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Plans for PY2 Steven Newhouse Project Director, EGI.eu 30/05/2011 Future.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI UMD Roadmap Steven Newhouse 14/09/2010.
Resource Provisioning EGI_DS WP3 consolidation workshop, CERN Fotis Karayannis, GRNET.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
EMI INFSO-RI Testbed for project continuous Integration Danilo Dongiovanni (INFN-CNAF) -SA2.6 Task Leader Jozef Cernak(UPJŠ, Kosice, Slovakia)
NGI_TR Emrah Akkoyun TR-Grid Operational Center EGI-InSPIRE – SA1 Kickoff Meeting1.
SAFE SSCs for A&A, Fusion and ES Coordinator: Claudio Vuerli, INAF, Italy.
EGI Process Assessment and Improvement Plan – EGI core services – Tiziana Ferrari FedSM project 1EGI Process Assessment and Improvement Plan (Core Services)
EGI-InSPIRE Project Overview1 EGI-InSPIRE Overview Activities and operations boards Tiziana Ferrari, EGI.eu Operations Unit Tiziana.Ferrari at egi.eu 1.
Setting up NGI operations Ron Trompert EGI-InSPIRE – ROD teams workshop1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Storage Accounting John Gordon, STFC OMB August 2013.
Grid Deployment Technical Working Groups: Middleware selection AAA,security Resource scheduling Operations User Support GDB Grid Deployment Resource planning,
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Regional tools use cases overview Peter Solagna – EGI.eu On behalf of the.
EGI-InSPIRE EGI-InSPIRE RI The European Grid Infrastructure Steven Newhouse Director, EGI.eu Project Director, EGI-InSPIRE 29/06/2016CoreGrid.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI A pan-European Research Infrastructure supporting the digital European Research.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IT ROC: Vision for EGEE III Tiziana Ferrari.
EGI InSPIRE Report to the EGI Council Steven Newhouse On behalf of the Editorial Board.
Bob Jones EGEE Technical Director
Regional Operations Centres Core infrastructure Centres
Ian Bird GDB Meeting CERN 9 September 2003
POW MND section.
Networking support (SA2) tasks for EGI
NA3: User Community Support Team
Maite Barroso, SA1 activity leader CERN 27th January 2009
Connecting the European Grid Infrastructure to Research Communities
Solutions for federated services management EGI
Presentation transcript:

EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009

EGI-InSPIRE Objectives Transition to a federated European e- infrastructure built from national resources Deliver production quality services to all its user communities Provide a scalable user support model for international research communities within Europe and their worldwide collaborators Interoperation and interoperability with other e-infrastructures Steven’s

Who does what? Internal National Infrastructure & Activities –Infrastructure that is not visible internationally –Activities with no external dependency NGI International Tasks –The ‘service’ interface to your national grid –Community defined that EGI will enforce EGI Global Tasks –Run by EGI.eu through community partners –Services needed for the whole EGI community

Steven’s Operations Providing a Reliable Grid Infrastructure Availability of Skilled Support Teams Service Deployment Validation Providing a secure Infrastructure Accounting for resource usage Helpdesk Infrastructure Infrastructure for Grid Management Regionalisation of the operational tools

SA1 Operations Activity TSA1.1 Activity Management TSA1.2 Providing a Reliable Grid Infrastructure TSA1.3 Availability of Support Teams TSA1.4 Service Deployment Validation TSA1.5 Secure Infrastructure TSA1.6 Accounting TSA1.7 Helpdesk Infrastructure TSA1.8 Infrastructure for Grid Management TSA1.9 Transition

Background EGI-DS Blueprint has 17 Global Operations Tasks and 9 International Tasks. Too many simply to decompose the operations activity into those tasks. Grouped them together into broader meta- tasks associated with second-level objectives for EGI. Did not achieve a layered set of tasks, but some are lower level and others are built on top of them.

SA1 Task Dependencies TSA1.4 Service Deployment Validation TSA1.2 Reliable Infrastructure TSA1.3 Support Teams TSA1.5 Secure Infrastructure TSA1.6 Accounting TSA1.7 Helpdesk TSA1.8 Grid Management Infrastructure

TSA1.5 Secure Infrastructure Address the various security-related risks and maintain the availability of EGI services. All aspects of operational security aimed at achieving « A secure infrastructure » Security Operations (O-E- 16, O-N-9a), IGTF and EUGridPMA support and participation (O-E-15), Security Vulnerability Handling, Security Coordination, GOCDB Security contact information (O-E-1, O-N-1) Security Monitoring from TSA1.8 Security Policy (O-E-15), ->NA2

TSA1.7 Helpdesk Infrastructure This task will provide a network of helpdesks in NGIs all interacting with a central global instance through agreed interfaces, standard procedures for handling tickets, passing them between helpdesks, escalating them. It will also interact with other activities and projects to ensure that requirements are gathered from as wide a constituency as possible, so key to the operation of the infrastructure is the helpdesk. Helpdesk (O-E-6, O-N-6), Service Requirements Capture (O-E-8)

TSA1.4 Service Deployment Validation Ensure that new software releases can be deployed safely and reliably without any degradation of service to the production grid infrastructure. Complementary to SA2 The task includes operational tools, global and site services, and testing of interoperation with other grids infrastructures. There will be management of phased middleware roll-out and deployment including testbeds for middleware testing by end-users. Similarly, a test environment will be developed for new versions of the operational tools. Coordination of middleware roll- out and deployment (O-E-9, O-N- 9b) –including testbeds for certification middleware testing by end-users. SA1.8 Infrastructure for Grid Management Operational tools (O-E-17) Interoperability (O-E-11, O-N-9d))

TSA1.8 Infrastructure for Grid Management Deployment of the infrastructure for Grid management consisting of a set of services and tools needed by the NGI/EIRO Operations Centres for the running of the Grid software services for Grid monitoring (including SLA and security monitoring), and ongoing Grid management. At the core of this infrastructure is a set of monitoring tools to be deployed in all NGIs to monitor their sites. Above this will sit higher level monitoring of global services and automated measurement of various service and site reliability metrics. Monitoring infrastructure, Nagios, SAM, and other tools (FTM, GridMap, Gridview, WMS monitor) (O-E-3, O-N-3) end-to-end network performance monitoring infrastructure and its support, part of (O-E-12) GOCDB knowledge of topology and configuration, downtime schedule (O-E-1, O-N-1) CIC Portal and NGI dashboard. (O-E-4, O-N-4) Security monitoring infrastructure (currently a development task JRA1) SLA portal (currently a development task JRA1) TSA1.4 Service Deployment Validation (development of tools is in JRA1)

TSA1.6 Accounting Provide a reliable record of the usage of the infrastructure for users, VOs, NGI and EGI management. Access to data will be restricted according to agreed policies and NGI/EIRO privacy laws. This task will provide: securely and reliably run accounting repositories at NGI and EGI level; a portal to provide on- demand visualisation and/or data download. It requires grid topology information and a secure infrastructure (provided by other tasks) Accounting Repositories (O-E- 2, O-N-2), Accounting Portal (O-E-2) GOCDB topology information (O-E-1, O-N-1) TSA1.4 A Secure Infrastructure

TSA1.3 Support Teams Bring together the various teams of people handling support issues for users, sites and the network. It will not merge them into a common team as the skills required differ but it will make sure the infrastructure is in place and the teams are trained and resourced and all the required documentation is in place. TSA1.8 Infrastructure for Grid Management Helpdesk Triage Teams (O- E-7), National User Support Teams (O-N-7), COD Support Teams, central and national (cCOD, rCOD), (O-E-5, O-N-5) TSA1.6 Helpdesk Infrastructure Resource Allocation (O-E- 10, O-N-9c) Network Support (O-E-12)

TSA1.2 Providing a Reliable Grid Infrastructure This task is to ensure that sites and operational and middleware services are functional, reliable, responsive. It will achieve this through subtasks on: production grid services, interoperability, best practices and service level agreements. It also has dependencies on other subtasks which manage the human support teams, security, helpdesks, and the monitoring and management infrastructure. Grid Oversight, (O-E-5, O-N-5) TSA1.3 Support Teams TSA1.5 A Secure Infrastructure TSA1.7 Helpdesk Infrastructure TSA1.8 Infrastructure for Grid Management Production Core Grid Services (O- E-14, O-N-8) Interoperability (O-E-11, O-N-9d)) Best Practices (O-E-13, O-N-9d) Supervision of SLAs etc (as part of (O-E-3, O-N-3))

TSA1.9 Transition The transition from EGEE to EGI will be more complicated for some regions than others. The complex regions with expertise centred in a few countries will need to do a lot of knowledge exchange for the first year. This task will support some NGIs to continue with a regional role as they migrate expertise to nearby NGIs as well as perform their own NGI roles. Subtasks include the designing and deployment of operational services inside new NGIs, the preparation of ready to deploy solutions (e.g. a helpdesk integrated with central instance), knowledge sharing between NGIs, the definition of clear guidelines and procedures; the definition of a roadmap and the monitoring of the related progress for new NGIs joining operations, the gathering and addressing of requirements that come from the new NGIs about operational processes. SA1.9.1: support to NGIs during the transition process, the designing and deployment of operational services inside new NGIs, and the preparation of ready to deploy solutions (e.g. a helpdesk integrated with central instance), knowledge sharing between NGIs, the definition of clear guidelines and procedures; SA1.9.2: the definition of a roadmap and the monitoring of the related progress for new NGIs joining operations, the gathering and addressing of requirements that come from the new NGIs about operational processes

TSA1.1Activity Management COO, Activity Leader and Task Leaders EGI Operations Board involving all NGIs A wider forum involving NGI management, NGI sites, VOs, SSCs, middleware providers, as applicable Working groups, task forces Consultation and Advisory bodies

Issues Have we covered everything in the Blueprint? –A few appear in multiple metatasks possibly leading to management problems Existing Global Tasks which were missed from the Blueprint? (e.g. Vulnerability Group) New Global Tasks which have emerged since the Blueprint was written (e.g. MPI) New Middleware/Tools not yet part of production EGEE stack/toolset (e.g. Intrusion detection) Decide who performs Global Tasks (Thursday) then work on transitions and work plans Distributing effort for International Tasks to NGIs Recognising additional contributions from NGI Flexibility to cope with new tasks during the project Absorbing new NGIs, EIROs Milestones/deliverables –Too many BP tasks to manage separate milestones/deliverables for each