EGEE-III INFSO-RI-222667 Enabling Grids for E-sciencE www.eu-egee.org EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08.

Slides:



Advertisements
Similar presentations
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROD model assessment ROC SEE By E. Atanassov,
Advertisements

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Operations Dashboard Workplan Cyril.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite Release Process Maria Alandes Pradillo.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks From ROCs to NGIs The pole1 and pole 2 people.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Romanian SA1 report Alexandru Stanciu ICI.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PoW for the second year Transition to EGI.
EGI: SA1 Operations John Gordon EGEE09 Barcelona September 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Simply monitor a grid site with Nagios J.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Steven Newhouse EGEE’s plans for transition.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The network monitoring in grid context Operations.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks PPS All sites Meeting: Introduction & Agenda.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROD model assessment ROC UKI John Walsh.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Deployment and Support in EGEE.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GStat 2.0 Joanna Huang (ASGC) Laurence Field.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Monitoring and enforcement of Service Level Agreements John Shade EGEE-II / EGEE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Multi-level monitoring - an overview James.
EGEE-III INFSO-RI Enabling Grids for E-sciencE COD21 22 Sept 2009 Forum & COD-22 since COD21 until EGI Hélène Cordier COD-22, CNRS-IN2P3,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Service Availability Monitoring – Status.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Site Monitoring with Nagios E. Imamagic,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE-EGI Grid Operations Transition Maite.
CERN IT Department CH-1211 Geneva 23 Switzerland t GDB CERN, 4 th March 2008 James Casey A Strategy for WLCG Monitoring.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1: Grid Operations Maite Barroso (CERN)
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Wojciech Lapka SAM Team CERN EGEE’09 Conference,
EGEE-III INFSO-RI Enabling Grids for E-sciencE Antonio Retico CERN, Geneva 19 Jan 2009 PPS in EGEEIII: Some Points.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Pre-production in EGEEIII Operation principles Antonio Retico EGEE-II / EGEE II SA1.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Ricardo Rocha CERN (IT/GS) EGEE’08, September 2008, Istanbul, TURKEY Experiment.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation in EGEE-III What does.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MSG - A messaging system for efficient and.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks DSA1.4 – Objectives and Status Ioannis Liabotis.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Dashboard Cyril L’Orphelin - CNRS/IN2P3.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Using GStat 2.0 for Information Validation.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Standard network trouble tickets exchange.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Communication tools between Grid Virtual.
PIC port d’informació científica EGEE – EGI Transition for WLCG in Spain M. Delfino, G. Merino, PIC Spanish Tier-1 WLCG CB 13-Nov-2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Monitoring Tools E. Imamagic, SRCE CE.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Deliverable DSA1.4 Jules Wolfrat ARM-9 –
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Update Maria Alandes Pradillo.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks COD-17
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Regional Nagios Emir Imamagic /SRCE EGEE’09,
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team Kickoff Meeting.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ian Bird All Activity Meeting, Sofia
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Patch Preparation SA3 All Hands Meeting.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA1 & SA2-ENOC Interactions status and plans.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Pole 2 : Restructuration of the OPS Manual.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Configuration Data or “What should be.
CERN - IT Department CH-1211 Genève 23 Switzerland t IT-GD-OPS attendance to EGEE’09 IT/GD Group Meeting, 09 October 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks NA3 Resources Robin McConnell.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks What all NGIs need to do: Helpdesk / User.
INFSO-RI Enabling Grids for E-sciencE File Transfer Software and Service SC3 Gavin McCance – JRA1 Data Management Cluster Service.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks ROC model assessment AP ROC ShuTing Liao.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The Dashboard for Operations Cyril L’Orphelin.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks CYFRONET site report Marcin Radecki CYFRONET.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks COD-16 (Transition to EGEE-III) Report to.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks COD-17
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations automation team presentazione.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Towards an Information System Product Team.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks GOCDB4 Gilles Mathieu, RAL-STFC, UK An introduction.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks IT ROC: Vision for EGEE III Tiziana Ferrari.
Enabling Grids for E-sciencE EGEE-II INFSO-RI ROC managers meeting at EGEE 2007 conference, Budapest, October 1, 2007 Admin Matters Vera Hanser.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operational Tools M2 Update James Casey.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Status of the SAM/Nagios/GSTAT Components.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks MyEGEE David Horat (
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Nagios Grid Monitor E. Imamagic, SRCE OAT.
Transition to EGI PSC-06 Istanbul Ioannis Liabotis Greece GRNET
Introduction to OAT presentations
Evolution of SAM in an enhanced model for monitoring the WLCG grid
Grid Service Monitoring Working Group
Maite Barroso, SA1 activity leader CERN 27th January 2009
Presentation transcript:

EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Operations Automation Team James Casey EGEE’08 SA1 management meeting 23 rd September 2008 Istanbul, Turkey

Enabling Grids for E-sciencE EGEE-III INFSO-RI Achievements MSA1.1 – Operations Automation Strategy – –Highlights  Adopt messaging as integration middleware  Use Nagios for site and regional monitoring  Move responsibility for reliability of sites to the sites With support from their ROC  Communication strategy to help sharing of operational tools –Internal milestones defined in this document  Messaging Infrastructure deployment  Multi-level monitoring development, packaging and deployment  Move of ROCs to the new operational model  Tutorial at EGEE’08 (Tomorrow) To change: View -> Header and Footer 2

Enabling Grids for E-sciencE EGEE-III INFSO-RI Achievements Multi-level monitoring –Nagios for site monitoring  Yaim packaging complete (effort from CERN + DAE/India)  Tutorials at EGEE’08 (tomorrow) –Regional monitoring  First version of multi-site monitoring released And used by several regions ( IT, SW, CE, …)  No connection yet to site monitoring On plan for end of year Communication –OAT is good forum for teams to talk and share ideas  Good work on technical + information architecture for MSA1.1 –Leveraging the SA1 quarterly meeting for OAT F2Fs To change: View -> Header and Footer 3

Enabling Grids for E-sciencE EGEE-III INFSO-RI Issues Per-tool workplans not yet produced –This was an ‘future plan’ from transition meeting –But tools are working on the plans  Operations dashboard, GOCDB both presented first roadmaps at EGEE’08 sessions Effort –WBS contributions are only from ‘known effort’  E.g. teams already working – SAM, Operations dashboard, GOCDB, Nagios, SAMAP, … –New projects ‘appearing’ in the scope of OAT  More in futures… –Where will this effort come from?  Try to raise interest in main SA1 session To change: View -> Header and Footer 4

Enabling Grids for E-sciencE EGEE-III INFSO-RI Issues - Messaging Messaging Infrastructure –First solution deployed  Failover Activemq broker pair at CERN –Running into some bugs in production  Bugs are in area of management + stability Core functions are ok and work as needed  Tracking the next release of Activemq  Working around bugs in production Failover gives us the ability to workaround –Track alternative implementations  Always was in the plan since we used interop protocols for communication and shield clients with our own API –For now we keep going as planned To change: View -> Header and Footer 5

Enabling Grids for E-sciencE EGEE-III INFSO-RI Issues - Communication Trying ‘Lightning talks’ at EGEE’08 –Not very strong take-up (5 proposals) –Thanks to IT-ROC – most contributions Documentation –Still a weak area –Need to do much more work here Not everyone has heard/understood the strategy –We need to evangelize a bit more –Talk in main EGEE-SA1 session  Theme will be “What does the OAT mean for you ?” To change: View -> Header and Footer 6

Enabling Grids for E-sciencE EGEE-III INFSO-RI Plans Operations Dashboard –Regional dashboard is under design (FR) –Prototype by end of year –Q: Alarms database foreseen at ROC, and link to regional ticketing system – not sure who will do this work GOCDB –First roadmap presented yesterday at GOCDB advisory meeting  Includes programmatic interface and updated schema SAM –Moving probes to use messaging system as first step  In validation cluster –Gaining Nagios experience –Working on ‘ how to do alarming’ with operations database To change: View -> Header and Footer 7

Enabling Grids for E-sciencE EGEE-III INFSO-RI Plans Gridview –Regional summarization to be discussed in F2F with gridview in Oct’08  Part of ‘SLA portal’ ? Nagios –Push to have connected site and regional monitoring by end of year SLA portal –New item requested by ROCs –Under discussion in team –still no clear idea on feature set –Also no effort tasked yet To change: View -> Header and Footer 8

Enabling Grids for E-sciencE EGEE-III INFSO-RI Plans Metrics Collection –New item requested by OCC –How to gather and report on MSA1.3 metrics –Effort still to be identified EGEE-SA1 tools –Have repository for tools at Manchester (yum) –Using it for messaging + nagios  Packaging tools via yaim –Will try with a few management tools to add them to the repository and get sites using them –Possible tools:  Wiatg, WMSMon, FTS tools, dpm-admin tools… To change: View -> Header and Footer 9

Enabling Grids for E-sciencE EGEE-III INFSO-RI Summary Writing strategy document took a lot of our effort –Need to make sure everyone reads and understands it  Down to the site level Teams working well on their tools/projects –And communicating together Biggest outstanding issues are –External communication (awareness/publicity/documentation) –Finding effort To change: View -> Header and Footer 10