Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.

Slides:



Advertisements
Similar presentations
Applications Area Issues RWL Jones GridPP13 – 5 th June 2005.
Advertisements

GLite Status Stephen Burke RAL GridPP 13 - Durham.
LCG Tiziana Ferrari - SC3: INFN installation status report 1 Service Challenge Phase 3: Status report Tiziana Ferrari on behalf of the INFN SC team INFN.
1 User Analysis Workgroup Update  All four experiments gave input by mid December  ALICE by document and links  Very independent.
Accounting Update Dave Kant Grid Deployment Board Nov 2007.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
Summary of issues and questions raised. FTS workshop for experiment integrators Summary of use  Generally positive response on current state!  Now the.
Stefano Belforte INFN Trieste 1 CMS SC4 etc. July 5, 2006 CMS Service Challenge 4 and beyond.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
CERN - IT Department CH-1211 Genève 23 Switzerland t LCG Deployment GridPP 18, Glasgow, 21 st March 2007 Tony Cass Leader, Fabric Infrastructure.
Computing Infrastructure Status. LHCb Computing Status LHCb LHCC mini-review, February The LHCb Computing Model: a reminder m Simulation is using.
WLCG Service Report ~~~ WLCG Management Board, 24 th November
Ian Bird LCG Project Leader LHCC Referee Meeting Project Status & Overview 22 nd September 2008.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks JRA1 summary Claudio Grandi EGEE-II JRA1.
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
UK middleware deployment GridPP27 - CERN 15 th September 2011 GridPP27 - CERN 15 th September 2011 Status & plans Jeremy Coles.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
LCG Service Challenges: Planning for Tier2 Sites Update for HEPiX meeting Jamie Shiers IT-GD, CERN.
LCG Service Challenges: Planning for Tier2 Sites Update for HEPiX meeting Jamie Shiers IT-GD, CERN.
LCG EGEE is a project funded by the European Union under contract IST LCG PEB, 7 th June 2004 Prototype Middleware Status Update Frédéric Hemmer.
Stefano Belforte INFN Trieste 1 Middleware February 14, 2007 Resource Broker, gLite etc. CMS vs. middleware.
GDB March User-Level, VOMS Groups and Roles Dave Kant CCLRC, e-Science Centre.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
1 User Analysis Workgroup Discussion  Understand and document analysis models  Best in a way that allows to compare them easily.
WLCG Grid Deployment Board, CERN 11 June 2008 Storage Update Flavia Donno CERN/IT.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Priorities update Andrea Sciabà IT/GS Ulrich Schwickerath IT/FIO.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE Site Architecture Resource Center Deployment Considerations MIMOS EGEE Tutorial.
Site Validation Session Report Co-Chairs: Piotr Nyczyk, CERN IT/GD Leigh Grundhoefer, IU / OSG Notes from Judy Novak WLCG-OSG-EGEE Workshop CERN, June.
BNL Service Challenge 3 Status Report Xin Zhao, Zhenping Liu, Wensheng Deng, Razvan Popescu, Dantong Yu and Bruce Gibbard USATLAS Computing Facility Brookhaven.
Enabling Grids for E-sciencE gLite for ATLAS Production Simone Campana, CERN/INFN ATLAS production meeting May 2, 2005.
The CMS Top 5 Issues/Concerns wrt. WLCG services WLCG-MB April 3, 2007 Matthias Kasemann CERN/DESY.
LCG Report from GDB John Gordon, STFC-RAL MB meeting February24 th, 2009.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
Last update 29/01/ :01 LCG 1Maria Dimou- cern-it-gd Maria Dimou IT/GD CERN VOMS server deployment LCG Grid Deployment Board
Plans for Service Challenge 3 Ian Bird LHCC Referees Meeting 27 th June 2005.
Data Transfer Service Challenge Infrastructure Ian Bird GDB 12 th January 2005.
Baseline Services Group Report LHCC Referees Meeting 27 th June 2005 Ian Bird IT/GD, CERN.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks APEL CPU Accounting in the EGEE/WLCG infrastructure.
LCG User Level Accounting John Gordon CCLRC-RAL LCG Grid Deployment Board October 2006.
Criteria for Deploying gLite WMS and CE Ian Bird CERN IT LCG MB 6 th March 2007.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
Enabling Grids for E-sciencE CMS/ARDA activity within the CMS distributed system Julia Andreeva, CERN On behalf of ARDA group CHEP06.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
LCG Issues from GDB John Gordon, STFC WLCG MB meeting September 28 th 2010.
8 August 2006MB Report on Status and Progress of SC4 activities 1 MB (Snapshot) Report on Status and Progress of SC4 activities A weekly report is gathered.
LHCC Referees Meeting – 28 June LCG-2 Data Management Planning Ian Bird LHCC Referees Meeting 28 th June 2004.
CERN - IT Department CH-1211 Genève 23 Switzerland t Grid Reliability Pablo Saiz On behalf of the Dashboard team: J. Andreeva, C. Cirstoiu,
EGEE-II INFSO-RI Enabling Grids for E-sciencE middleware status and plans Claudio Grandi (INFN and CERN) John White.
WLCG Status Report Ian Bird Austrian Tier 2 Workshop 22 nd June, 2010.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
Ian Bird LCG Project Leader Status of EGEE  EGI transition WLCG LHCC Referees’ meeting 21 st September 2009.
ARDA Massimo Lamanna / CERN Massimo Lamanna 2 TOC ARDA Workshop Post-workshop activities Milestones (already shown in December)
ALICE Physics Data Challenge ’05 and LCG Service Challenge 3 Latchezar Betev / ALICE Geneva, 6 April 2005 LCG Storage Management Workshop.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Job Management Claudio Grandi.
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
Outcome should be a documented strategy Not everything needs to go back to square one! – Some things work! – Some work has already been (is being) done.
LCG Introduction John Gordon, STFC-RAL GDB June 11 th, 2008.
LCG Service Challenge: Planning and Milestones
Olof Bärring LCG-LHCC Review, 22nd September 2008
Short update on the latest gLite status
TCG Discussion on CE Strategy & SL4 Move
Data Management cluster summary
LHC Data Analysis using a worldwide computing grid
The LHCb Computing Data Challenge DC06
Presentation transcript:

Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007

October 7, LHCC Referees Meeting; January 29 th 2007 Deployment History  gLite-3.0 was delivered in May 2006  Rapidly deployed to Tier 1 sites  After 1 st update (3.0.1) full deployment across EGEE  Two full update releases  3.0.1, delivered in June, August  Change to incremental updates (move away from big-bang releases)  12 updates to – rapidly deployed by all sites  Anticipate major releases only for major changes:  e.g will be SLC4  But even then will avoid major functional or behavioural changes

October 7, LHCC Referees Meeting; January 29 th 2007 Comments  No real distinction between what was LCG-2.7 and gLite-3.0  Mainly evolved versions of existing services  Some “new” services – they existed before but are now in use (e.g. VOMS)  The introduction of gLite-3.0 was not disruptive to the production service  Although it did cost effort from the sites!  Most gLite-3.0 services are deployed only in EGEE; exceptions are:  FTS deployed also in US Tier 1s and NDGF  WMS/LB used by CMS to submit work across EGEE and OSG  VOMS

October 7, LHCC Referees Meeting; January 29 th 2007 Status of major components  VOMS:  VOMS service in full production; old ldap-based VO services stopped  VOMS roles and groups:  FTS already supports roles and groups  DPM supports roles, groups and ACLs  dCache 1.7 supports roles, groups at disk-pool level; ACLs mid-year  Castor – no real estimate yet  Job priorities: VOViews and batch system support being tested now; supported by gLite WMS  R-GMA:  Used as back-end of APEL (accounting)  Used as monitoring transport mechanism  … and hence used by dashboards

October 7, LHCC Referees Meeting; January 29 th 2007 Status – 2  FTS:  Used by all experiments; deployed at Tier 0 and all Tier 1s  Rapid cycle of fixes for issues found in Service Challenges and ongoing use  Most major issues (e.g. fat clients) have been addressed  Version 2.0 support for SRM2.2  LFC:  In production, used by ATLAS, LHCb  Deployed as both central and local file catalogues  Major issues addressed:  python API problems  Bulk queries (can now achieve 300 Hz)  GFAL/lcg-utils: main SRM clients  Used by all experiments  Updated to support SRM v2.2

October 7, LHCC Referees Meeting; January 29 th 2007 Status – 3  WMS/LB:  ATLAS and CMS rely on gLite WMS functionality – particularly the job collections  CMS can still use LCG-RB for MC production; not an option for ATLAS  LHCb and ALICE use LCG-RB or gLite WMS as basic submission tool  Major testing effort with CMS (and ATLAS participation) in Q306 to get WMS to state to be used in CSA06  Testing showed that rates of ~26k jobs/day are feasible from a single node in quiet conditions:  Now see that extended testing shows memory consumption limits to ~10k jobs/day  In CSA06 CMS achieved workloads of 5-8k jobs/day on each of 2 WMS nodes, but limited by bottlenecks in CMS components (later fixed)  In use by ATLAS MC production since July at rates up to 4k jobs/day on single WMS node  Major issues now are:  Reliability of service is not adequate for service managers or production managers  Starting second phase of testing now, with full ATLAS & CMS participation  ATLAS more sensitive to reliability issues in their production  Aim is  reliable (equiv to LCG RB stability) operation  with 50K jobs/day Q207 and K jobs/day on <10 nodes by end of 2007

October 7, LHCC Referees Meeting; January 29 th 2007 Status – 4  CE:  Not widely deployed, but testing has shown it to be now reasonably reliable  There were a number of problems initially  BUT:  Condor bug limited #jobs in a batch system to 100 (!)  Now have a fix for this, serious scale testing is starting  Phasing out the LCG CE is not so urgent:  Experiments want stability  gLite CE brings limited additional functionality (pass of job resource requirements to batch system)  Need to avoid porting LCG-CE to SLC4 if possible

October 7, LHCC Referees Meeting; January 29 th 2007 Summary  gLite-3.0 is the production middleware on EGEE  Some services used elsewhere also  All services are in production use  Testing efforts for WMS/LB and CE still needed to get to desired performance and reliability levels