UK middleware deployment GridPP27 - CERN 15 th September 2011 GridPP27 - CERN 15 th September 2011 Status & plans Jeremy Coles.

Slides:



Advertisements
Similar presentations
The Middleware Readiness Working Group LHCb Computing Workshop LHCb Computing Workshop Maria Dimou IT/SDC 2014/05/22.
Advertisements

New VOMS servers campaign GDB, 8 th Oct 2014 Maarten Litmaath IT/SDC.
London Tier 2 Status Report GridPP 13, Durham, 4 th July 2005 Owen Maroney, David Colling.
London Tier 2 Status Report GridPP 12, Brunel, 1 st February 2005 Owen Maroney.
LCG Milestones for Deployment, Fabric, & Grid Technology Ian Bird LCG Deployment Area Manager PEB 3-Dec-2002.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Unified Middleware Distribution (UMD): SW provisioning to EGI Mario David.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Core operations Jeremy Coles GridPP28 17 th April 2012 Jeremy Coles GridPP28 17 th April 2012 a b.
Marian Babik, Luca Magnoni SAM Test Framework. Outline  SAM Test Framework  Update on Job Submission Timeouts  Impact of Condor and direct CREAM tests.
EMI INFSO-RI EMI Structure, Plans, Deliverables Alberto Di Meglio (CERN) Project Director ATLAS Software & Computing Week Geneva, 21 July 2011.
Ian Bird LCG Project Leader LHCC Referee Meeting Project Status & Overview 22 nd September 2008.
EMI 1 Release The EMI 1 (Kebnekaise) release features for the first time a complete and consolidated set of middleware components from ARC, dCache, gLite.
Your university or experiment logo here GridPP Storage Future Jens Jensen GridPP workshop RHUL, April 2010.
Maarten Litmaath (CERN), GDB meeting, CERN, 2006/02/08 VOMS deployment Extent of VOMS usage in LCG-2 –Node types gLite 3.0 Issues Conclusions.
MW Readiness Verification Status Andrea Manzi IT/SDC 21/01/ /01/15 2.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
JRA Execution Plan 13 January JRA1 Execution Plan Frédéric Hemmer EGEE Middleware Manager EGEE is proposed as a project funded by the European.
London Tier 2 Status Report GridPP 11, Liverpool, 15 September 2004 Ben Waugh on behalf of Owen Maroney.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
Grid Security Vulnerability Group Linda Cornwall, GDB, CERN 7 th September 2005
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
Experiment Operations: ALICE Report WLCG GDB Meeting, CERN 14th October 2009 Patricia Méndez Lorenzo, IT/GS-EIS.
WLCG Middleware Support II Markus Schulz CERN-IT-GT May 2011.
UKI-SouthGrid Overview and Oxford Status Report Pete Gronbech SouthGrid Technical Coordinator HEPSYSMAN – RAL 10 th June 2010.
EMI INFSO-RI SA1 Session Report Francesco Giacomini (INFN) EMI Kick-off Meeting CERN, May 2010.
WLCG Software Lifecycle First ideas for a post EMI approach 0.
LCG Report from GDB John Gordon, STFC-RAL MB meeting February24 th, 2009.
The GridPP DIRAC project DIRAC for non-LHC communities.
EMI INFSO-RI European Middleware Initiative (EMI) Alberto Di Meglio (CERN)
CERN IT Department CH-1211 Geneva 23 Switzerland t WLCG Operation Coordination Luca Canali (for IT-DB) Oracle Upgrades.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks SA3 partner collaboration tasks & process.
Report from GSSD Storage Workshop Flavia Donno CERN WLCG GDB 4 July 2007.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Middleware Update Maria Alandes Pradillo.
1 Update at RAL and in the Quattor community Ian Collier - RAL Tier1 HEPiX FAll 2010, Cornell.
CERN IT Department CH-1211 Genève 23 Switzerland t SL(C) 5 Migration at CERN CHEP 2009, Prague Ulrich SCHWICKERATH Ricardo SILVA CERN, IT-FIO-FS.
RI EGI-InSPIRE RI UMD 2 Decommissioning Status Cristina Aiftimiei EGI.eu.
EMI INFSO-RI SA1 – Maintenance and Support Francesco Giacomini (INFN) EMI First EC Review Brussels, 22 June 2011.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
Grid Deployment Board 5 December 2007 GSSD Status Report Flavia Donno CERN/IT-GD.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
The GridPP DIRAC project DIRAC for non-LHC communities.
WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons Maarten Litmaath On behalf of the WG participants GDB 09/09/2015.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
Ian Bird LCG Project Leader Status of EGEE  EGI transition WLCG LHCC Referees’ meeting 21 st September 2009.
J Jensen/J Gordon RAL Storage Storage at RAL Service Challenge Meeting 27 Jan 2005.
The Great Migration: From Pacman to RPMs Alain Roy OSG Software Coordinator.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI GLUE 2: Deployment and Validation Stephen Burke egi.eu EGI OMB March 26 th.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
WLCG Information System Status Maria Alandes Pradillo, CERN CERN IT Department, Support for Distributed Computing Group GDB 9 th September 2015.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
EGI-InSPIRE RI EGI-InSPIRE RI EGI-InSPIRE Software provisioning and HTC Solution Peter Solagna Senior Operations Manager.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Middleware updates Peter Solagna – EGI.eu OMB – 16/07/ /3/
WLCG IPv6 deployment strategy
EGEE Middleware Activities Overview
gLite->EMI2/UMD2 transition
CREAM Status and Plans Massimo Sgaravatto – INFN Padova
EGI UMD Storage Software Repository (Mostly former EMI Software)
DPM releases and platforms status
TCG Discussion on CE Strategy & SL4 Move
EMI: dal Produttore al Consumatore
Francesco Giacomini – INFN JRA1 All-Hands Nikhef, February 2008
UMD 2 / EMI 2 Decommissioning Status
UMD 2 Decommissioning Status
UMD 2 Decommissioning Status
Presentation transcript:

UK middleware deployment GridPP27 - CERN 15 th September 2011 GridPP27 - CERN 15 th September 2011 Status & plans Jeremy Coles

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 Overview Baselines and recommended versions The UK situation Moving away from gLite Issues and concerns Discussion? 2 UMD

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 Baselines (are we even up-to-date with gLite?) There is a WLCG wiki page listing some useful information and it has been updated recently. In what follows the figure in (brakets) refers to the minimum recommended version listed here: is a WLCG wiki page listing some useful information and it has been updated recently. In what follows the figure in (brakets) refers to the minimum recommended version listed here: Note that the recommended versions do not necessarily reflect the latest versions of packages available in the gLite/UMD/EMI/... repositories are versions fixing significant bugs or introducing important features. Versions newer than those indicated are assumed to be at least as good, unless otherwise indicated. Also note that support for versions changes in October: Information in the following slides was drawn from site inputs here: Some sites usefully provided more details than others and may be highlighted more often but all those in the list are likely to be in a similar position! 3

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 gLite 3.1 LCG-CE ( ) Brunel; QMUL*; ECDF*; Lancs(3.1.40); Liv(3.1.46);Cam(3.1 condor); T1; UCL WMS ( ) – still best option IC; Glas(3.1.31); T1(3.1); Ox (for gridppnagios) BDII-site( ) Brunel; UCL UI ( ) Liv(3.1.45) VOMS() Glas(2.0.15) 4

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 gLite 3.2 WNs ( ) Brunel; IC; QMUL ( tarball); RHUL (3.2.11); UCL (3.2); Lancs (3.2.9); Liv (3.2.9); Man (3.2.10); ECDF (3.2.10); Glas ( ); Bham; Bris; Cam; Ox; RALPP; T1 (3.2.7) BDII-site ( ) Brunel; IC; QMUL (probs->openldap2.4); RHUL (3.2.11); Lancs ( ); Liv (3.2.11); Man ( ); ECDF (3.2.11); Glas (3.2.9); Bham; Bris; Cam; Ox; RALPP (+VM); T1 ( ) BDII-top ( ) Man( ); T1( ) 5

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 gLite 3.2 cont CREAM CE( (1.6.5) ) IC(SGE issues*); QMUL (SGE issues); RHUL (3.2.10); UCL; Lancs(3.2.10); Liv(3.2.11); Man( & ); ECDF(SGE issues); Glas( ); Bham; Bris; Cam; Ox; RALPP; T1( & 3.2.6) UI( ) Brunel; IC; RHUL; Lancs; Liv(3.2.10); Man(3.2.8); Glas(3.2.8); Bham; Bris; Cam; Ox; RALPP; T1(3.2.10) ARGUS( (1.2)) RHUL( ); Liv( ); Man( ); Bham; Ox; RALPP; T1 + Glasgow(SCAS) Glexec_wn( ) Brunel; Liv( ); Man; Glas; Bham; Ox; RALPP; T1( ) 6 *SGE issues – see (eg. Deadlocks; CreamDB (and InnoDB) setup… timeouts (so change purge_interval in blah.config ); tomcat processes survive gLite restart)

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 gLite 3.2 cont.2 VOMS (1.9.19?) Glas FTS (2.2.4) T1( ) LFC( ) T1( ) VOBOX( ) T1(3.2.11); Bham 7 Note: Frontier(3.2.4?) not covered Frontier/Squid Launchpad 2.7.STABLE9-3.7 ? Not covered. Not present: ARGUS/glexec: IC; QMUL; UCL; Lancs; ECDF -> relocatable install not yet available. Bris;Cam UI: QMUL; ECDF(T3) No entries for Durham or EFDA-JET

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 Storage SE_dpm_disk/mysql SL4 ( ) ECDF; UCL SE_dpm_disk/mysql SL5 ( ) Brunel; Bham; Shef; RHUL(1.8.1); Man(1.8.1); Lancs( ); Liv( ); Glas(1.8.0); Cam(1.7.4); Ox(1.7.?) dCache(1.9.5) IC( ); RALPP(1.9.5) Storm(1.5.6) Bris(1.3); QMUL(1.7) CASTOR( ) T1( ) 8

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 The last words of Mr Fix It – why move!? 9 Bug fixes Security updates Those maintaining the middleware have other communities to satisfy too meaning new functionality has to find a way in… The underlying operating systems (hardware) evolve and the middleware has to be updated. “Life cycle” RHEL4 to February 29, 2012 (SL ) RHEL5 to March 31, 2014 (SL ) RHEL6 from November 10, 2010 ref: Other SL3 EGEE EDG

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 (Expected) end of support For gLite 3.1: The LCG-CE (which means TORQUE_utils,SGE_utilsandLSF_utils and glite-CLUSTER) and WMS are fully supported until 31 st October Security update for the FTS stops on 7 th October. For gLite 3.2: Bug fixes and minor functionality updates of a certain priority continue until 31 st October 2011.Data management services will be supported until April Security updates for most components (not ARGUS) carry on until 30 th April 2012 (the time EMI-2 is released). Track the patches via For LCG priorities and news check:

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 UMD/EMI The Unified Middleware Distribution (UMD) is the integrated set of software components that EGI makes available from technology providers within the EGI Community. These components are packaged to provide an integrated offering for deployment on the EGI production infrastructure EMI-1 (Kebnekaise) released on 12 May 2011 EGI early adopter (staged rollout) sites took up EMI-1 e e UMD-1 released July 2011

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 So “what next” advice When moving towards EMI- 1 based middleware the recommendation is to use the UMD repository. gLite 3.2 to UMD 1.x: services should be migrated either at a convenient time, when the service is moved to new hardware, or when sites or users will benefit significantly. gLite 3.1 to UMD 1.x: move from gLite-3.1 directly to UMD/EMI-1. If a required service hasn't passed the transition, it is advisable to wait for the service to pass the UMD validation. New services: should use UMD if staged rollout is complete and the UMD repository has been updated. services with little persistent state should move to UMD as soon as they move to new hardware or whenever a re-installation of the node is scheduled. Verify that your local fabric management and monitoring are aware of the path changes that come with the improved structure of EMI-1. Storage and catalogue services that are in operation should not move to UMD in the near future. If taking releases directly from providers check with them for advice. gLite-3.2 clients for the workload management and data management are compatible with the UMD versions of services. Some problems have been spotted on the EMI-1 WN and UI related to SAM tests and lcg_util clients. Need more experiment usage/feedback. Given the sensitivity to correct library and binary paths and the complexity of the configuration, the changes to those locations in EMI-1 might have an impact. Until it has been verified that there aren't problems, sites should stay with the gLite-3.2 WN and UI. 12

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 UK EMI/UMD 1.0 deployment progress so far ARGUS: Brunel; ECDF(soon) CE: Brunel; Lancs(soon); Shef(soon); Glasgow(in test); Ox(in test); RALPP(UMD1.1) WMS: IC (test) DPM: ECDF(soon); Glas(test) Storm: QMUL(1.7.0/1.7.1) Cluster publisher: RALPP(UMD1.1) ARC CE: Glas(test) Do we have comments from these sites on their experiences? Staged rollout: 13 Global CREAM 6 th Sept

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 What next? “We’ll move when the benefits outweigh the risks” “What are the plans for the experiment nodes?” “Will there be access to installation and maintenance recipes from the developers?” We “… understand the UMD WN has problems!” The site plans to “virtualize additional grid services” 14 Example questions: -What is the main driver for sites? -When will the SL6 middleware be released? -What currently ‘stops’ us tracking the baseline? -When is the best time to transition? (Accounting is now continuous) -Can we really plan in any detail when EMI/LCG/experiment plans are not clear!?

GridPP27 – Middleware deployment Jeremy Coles – GridPP27 – 15/09/2011 Summary & ‘strategy’ 15 Stability of releases EOL gLite SL4->SL5 -> SL6 Experience with UMD - Experiments & sites Keeping site available 1)Confirm UMD validations 2)Where possible install all new hardware with UMD 3)Recheck experiment plans 4)Storage and catalogues – test but do not move existing services yet. Upgrades to 1.8 should happen. Storage sort of decoupled. 5)Stay with gLite-3.2 WN and UI until WLCG verification. Migrate resources in stages. 6)Those involved with early adoption lead the migration Current spread