Nordic ROC Organization

Slides:



Advertisements
Similar presentations
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
Advertisements

Enabling Grids for E-sciencE COD 19 meeting, Bologna Nordic ROD experiences Michaela Lechner COD-19, Bologna.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Romanian SA1 report Alexandru Stanciu ICI.
CERN, BalticGrid Project Rolandas Naujikas [rolandas nawyeekas] CERN IT/GD Vilnius University Lithuania.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
The EGI Blueprint: Grid Operations and Security Migration to the next grid operations era Tiziana Ferrari (Istituto Nazionale di Fisica Nucleare)
Responsibilities of ROC and CIC in EGEE infrastructure A.Kryukov, SINP MSU, CIC Manager Yu.Lazin, IHEP, ROC Manager
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROD model assessment ROC UKI John Walsh.
EGEE is a project funded by the European Union under contract IST User support in EGEE Alistair Mills Torsten Antoni EGEE-3 Conference 20 April.
INFSO-RI Enabling Grids for E-sciencE User Support in EGEE Torsten Antoni, FZK
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES GGUS Overview ROC_LA CERN
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
INFSO-RI Enabling Grids for E-sciencE EGEE SA1 in EGEE-II – Overview Ian Bird IT Department CERN, Switzerland EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Torsten.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGI Operations Tiziana Ferrari EGEE User.
EGI-InSPIRE Steven Newhouse Interim EGI.eu Director EGI-InSPIRE Project Director Technical Director EGEE-III 1GDB - December 2009.
INFSO-RI Enabling Grids for E-sciencE An overview of EGEE operations & support procedures Jules Wolfrat SARA.
Ian Bird LCG Project Leader On the transition to EGI – Requirements from WLCG WLCG Workshop 24 th April 2008.
Operations Working Group Summary Ian Bird CERN IT-GD 4 November 2004.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Robin McConnell NA3 Activity Manager 28.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Communication tools between Grid Virtual.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
AEGIS Academic and Educational Grid Initiative of Serbia Antun Balaz (NGI_AEGIS Technical Manager) Dusan Vudragovic (NGI_AEGIS Deputy.
1Maria Dimou- cern-it-gd LCG GDB May 2008 USAG and direct GGUS ticket routing to Sites Grid Deployment.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The EGEE User Support Infrastructure Alistair.
EGEE-III INFSO-RI Enabling Grids for E-sciencE COD20. June 2009 Helsinki R-COD in UKI Claire Devereux, Jeremy Coles & Co. COD-20,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks User Support for Distributed Computing Infrastructures.
Kati Lassila-Perini EGEE User Support Workshop Outline: – CMS collaboration – User Support clients – User Support task definition – passive support:
INFSO-RI Enabling Grids for E-sciencE User and Virtual Organisation Support in EGEE Flavia Donno, CERN Torsten Antoni, FZK Alistair.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
Operations model Maite Barroso, CERN On behalf of EGEE operations WLCG Service Workshop 11/02/2006.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operational Procedures (Contacts, procedures,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks COD-16 (Transition to EGEE-III) Report to.
Scuola Grid - Martina Franca, Thursday 08 November Il Sistema di Supporto INFNGrid & GGUS ( Global Grid User.
INFSO-RI Enabling Grids for E-sciencE The role of the Virtual Organization Ticket Processing Manager Guido Negri INFN – CNAF Italy.
Enabling Grids for E-sciencE EGEE-II INFSO-RI ROC managers meeting at EGEE 2007 conference, Budapest, October 1, 2007 Admin Matters Vera Hanser.
May pre GDB WLCG services post EGEE Josva Kleist Michael Grønager Software Coord NDGF Director CERN, May 12 th 2009.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI COD activity in EGI-InSPIRE Marcin Radecki CYFRONET, Poland & COD Team 9/29/2016.
CERN WLCG Grid Storage Systems Deployment Flavia Donno, CERN 6 November 2007 Organization of Storage Support through GGUS Flavia Donno CERN/IT-GD CERN.
Documentation, Best Practices and Procedures: Roadmap
Il Sistema di Supporto INFNGrid & GGUS (Global Grid User Support )
THE GISELA PROJECT Herbert Hoeger WP2 Manager - ULA (Venezuela)
Regional Operations Centres Core infrastructure Centres
Il sistema di supporto di INFNGRID e GGUS
SA1 Status Report EGEE Grid Operations & Management
SA1 Execution Plan Status and Issues
User Support Workflow in EGEE
PL-Grid – an example of NGI support structure Marcin Radecki
Ian Bird GDB Meeting CERN 9 September 2003
Brief overview on GridICE and Ticketing System
ATLAS support in LCG.
Sistemi di monitoraggio e allarmistica
LCG/EGEE Incident Response Planning
Helene Cordier, CNRS-IN2P3 Villeurbanne, France
Service Level Agreement/Description between CE ROC and Sites
NGI Operations readiness report
Report on SLA progress Ioannis Liabotis <ilaboti at grnet.gr>
Infrastructure Support
The CCIN2P3 and its role in EGEE/LCG
Romain Wartel EGEE08 Conference, Istanbul, 23rd September 2008
Maite Barroso, SA1 activity leader CERN 27th January 2009
ROD model assessment ROC FR
NE-ROC Nordics Operations
BalticGrid Operations
LCG Operations Workshop, e-IRG Workshop
Presentation transcript:

Nordic ROC Organization Gert Svensson, PDC, KTH, Nordic ROC Manager Nordic ROC and Baltic Grid meeting - Helsinki- 15 June 2009

ROC Duties Provide Help Desk facilities (first-level support). Provide second-level support by helping in the resolution of advanced and specialized operational problems that cannot be solved by site administrators. If necessary, the ROC will propagate and follow-up problems with higher-level operational or development teams. Ticket follow-up (ensure that sites work on tickets opened against them). Respond to tickets from sites in a timely manner. Manage and support the deployment of gLite middleware on sites. Registering new sites. Follow-up on accounting. Nordic ROC & BG – Helsinki, 15 June 2009

Functions and tools Functional Areas Operational tools Ticketing System

Regionalized model What is our target model? In EGI as much as possible will be regionalized based on NGI:s National Grid Initiatives ROC:s are responsible for day to day operations, with a minimal organization overseeing them More efficient Several NGI:s can have one ROC Nordic ROC & BG – Helsinki, 15 June 2009

Current operational model Nordic ROC & BG – Helsinki, 15 June 2009

Transition r-COD COD Duties to be performed all the time Duties to be performed periodically Look at the whole infrastructure r-COD Duties to be performed all the time Only look at sites in own region Nordic ROC & BG – Helsinki, 15 June 2009

Another view Nordic ROC & BG – Helsinki, 15 June 2009

Site responsibilty Adhere to the Operations Procedures Manual Maintain accurate information in GOCDB Adhere to the Grid Site Operations Policy Adhere to the Security and Availability Policy document Adhere to Service Level Description (SLD) Deploy supported versions of gLite (or compatible) middleware Respond to tickets in a timely manner Nordic ROC & BG – Helsinki, 15 June 2009

First line user support - TPM Provides 1st line support for users together with VO experts Assigns tickets to appropriate support units Monitor longstanding open unchanged tickets Is at the time being a central task More tickets will be sent directly to ROC in the future Only cases without natural region will be handled centrally Follow-up will stay central Nordic ROC & BG – Helsinki, 15 June 2009

First line support function First-line support in GGUS is called Ticket Process Management (TPM). The TPM duty is to assign tickets to the right Support Unit (SU). Assignment must be done in less than one working hour. TPMs only see 'normal' submitted tickets, i.e. those not assigned automatically (to the ROCs or a few VOs today). TPMs should recuperate 'forgotten' tickets. TPMs are notified for action on 2nd and 3rd level of ticket escalation. TPMs should open savannah entries for middleware problems submitted to GGUS. Function and models' details in http://edms.cern.ch/document/1000210 Antoni | Bosio | Dimou - SA1 F2F Meeting | CERN | 09/06/09

User support workflow User Support Ticket Processing Managers (TPM) analyse the problems reported and assign them to the correct second-level support units. VOs have support infrastructures to help their users with VO-specific problems. These infrastructures are under their own control. Usually, they are using other tools to support this effort. The Regional Operations Centres are responsible for dealing with problems arising in their associated resource centres GGUS benefits from experts spread all over the world for solving issues related to grid security, to networks, and to the interfaces with other grids.

User Support Workflow contd. For VO users and VO specific problems Mail to <VO>-user-support@ggus.org - Solves - Classifies - Monitors Automatic Ticket Creation TPM Grid+VO experts VO-specific Central Application (GGUS) VO Support Units Middleware Support Units Deployment Operations Support ROC Network Other Grids Nordic ROC & BG – Helsinki, 15 June 2009

Multi level monitoring framework

Terminology SNIC - Swedish National Infrastructure for Computing Organizing high-performance computing in Sweden Joint Research Unit in the EGEE III project NDGF – Nordic Data Grid Facility an organization for Grids set up by the Nordic Countries runs the Nordic Tier-1 distributed over 9 sites develops ARC middleware most staff distributed in the Nordic countries Nordic ROC & BG – Helsinki, 15 June 2009

NE ROC organisation Two federations: Nordic, Benelux One distributed ROC Three sites in Sweden, one in Finland (SNIC + NGDF) and three in Netherlands GGUS handling and 1:st line support – Regional Operator on Duty (ROD): Nordic handles Nordic Sites + Baltic Grid Collaboration between SNIC ROC and NDGF Netherlands handles the Benelux sites Rotated among the sites in weekly shifts TPM duty rotated between all ROC:s Rotated between sites in the NE ROC Nordic ROC & BG – Helsinki, 15 June 2009

Challenges Distributed ROC Distributed Tier-1 Knowledge of ARC and gLite by different groups Nordic ROC & BG – Helsinki, 15 June 2009

Regular meetings Meetings Nordic ROC meeting Thursday 10.00 Phone EGEE & WLCG Joint Operations meeting Monday 16.00 Phone EGEE SA1 meeting each second Tuesday 10.00 Phone NGDF meeting Friday 10.00 Jabber Nordic ROC & BG – Helsinki, 15 June 2009

Things to discuss How do we improve communication? Contact site directly, operations directors etc? Provide more help SLA – EGEE requires Service Level Agreements How do we handle that? 75 % availability over each month What should we do when site doesn’t respond? Nordic ROC & BG – Helsinki, 15 June 2009