WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014.

Slides:



Advertisements
Similar presentations
Operations Coordination Team Maria Girone, CERN IT-ES GDB 10 th October 2012.
Advertisements

Operations Coordination Team Maria Girone, CERN IT-ES Kick-off meeting 24 th September 2012.
The Middleware Readiness Working Group LHCb Computing Workshop LHCb Computing Workshop Maria Dimou IT/SDC 2014/05/22.
Jan 2010 Current OSG Efforts and Status, Grid Deployment Board, Jan 12 th 2010 OSG has weekly Operations and Production Meetings including US ATLAS and.
New VOMS servers campaign GDB, 8 th Oct 2014 Maarten Litmaath IT/SDC.
Experiment Support CERN IT Department CH-1211 Geneva 23 Switzerland t DBES News on monitoring for CMS distributed computing operations Andrea.
LHC Experiment Dashboard Main areas covered by the Experiment Dashboard: Data processing monitoring (job monitoring) Data transfer monitoring Site/service.
WLCG Operations Coordination report Maria Dimou / CERN With input and on behalf of the WLCG Operations Coordination team May 2015 GDB CERN indico event.
LHCC Comprehensive Review – September WLCG Commissioning Schedule Still an ambitious programme ahead Still an ambitious programme ahead Timely testing.
Status of WLCG Tier-0 Maite Barroso, CERN-IT With input from T0 service managers Grid Deployment Board 9 April Apr-2014 Maite Barroso Lopez (at)
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
CERN IT Department CH-1211 Genève 23 Switzerland t EIS section review of recent activities Harry Renshall Andrea Sciabà IT-GS group meeting.
LCG Introduction John Gordon, SFTC GDB December 2 nd 2009.
GGUS summary ( 4 weeks ) VOUserTeamAlarmTotal ALICE ATLAS CMS LHCb Totals 1.
WLCG Service Report ~~~ WLCG Management Board, 24 th November
PanDA Multi-User Pilot Jobs Maxim Potekhin Brookhaven National Laboratory Open Science Grid WLCG GDB Meeting CERN March 11, 2009.
WLCG Service Report ~~~ WLCG Management Board, 1 st September
Enabling Grids for E-sciencE System Analysis Working Group and Experiment Dashboard Julia Andreeva CERN Grid Operations Workshop – June, Stockholm.
MW Readiness Verification Status Andrea Manzi IT/SDC 21/01/ /01/15 2.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
CERN11 th February WLCG Ops Coordination [GDB Report] Josep Flix (PIC/CIEMAT) On behalf of the WLCG Operations Coordination Team GDB – CERN.
WLCG Service Report ~~~ WLCG Management Board, 9 th August
WLCG operations A. Sciabà, M. Alandes, J. Flix, A. Forti WLCG collaboration workshop July , Barcelona.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons 10/12/2014.
1 LHCb on the Grid Raja Nandakumar (with contributions from Greig Cowan) ‏ GridPP21 3 rd September 2008.
Julia Andreeva, CERN IT-ES GDB Every experiment does evaluation of the site status and experiment activities at the site As a rule the state.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
Information System Status and Evolution Maria Alandes Pradillo, CERN CERN IT Department, Grid Technology Group GDB 13 th June 2012.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Priorities update Andrea Sciabà IT/GS Ulrich Schwickerath IT/FIO.
LCG Introduction John Gordon, STFC GDB June 8 th 2011.
SAM Sensors & Tests Judit Novak CERN IT/GD SAM Review I. 21. May 2007, CERN.
WLCG Service Report ~~~ WLCG Management Board, 7 th September 2010 Updated 8 th September
LCG Report from GDB John Gordon, STFC-RAL MB meeting February24 th, 2009.
LCG Support for Pilot Jobs John Gordon, STFC GDB December 2 nd 2009.
Julia Andreeva on behalf of the MND section MND review.
The GridPP DIRAC project DIRAC for non-LHC communities.
December GDB Brief summary – J Coles. Meetings January meeting moved to 15 th 2014 events created. Check March meeting outside CERN. Copenhagen workshop.
NIKHEF11 th March WLCG Operational Costs M. Dimou, J. Flix, A. Forti, A. Sciabà WLCG Operations Coordination Team GDB – NIKHEF [11 th March.
WLCG ‘Weekly’ Service Report ~~~ WLCG Management Board, 5 th August 2008.
Enabling Grids for E-sciencE INFSO-RI Enabling Grids for E-sciencE Gavin McCance GDB – 6 June 2007 FTS 2.0 deployment and testing.
WLCG Operations Coordination Andrea Sciabà IT/SDC 10 th July 2013.
WLCG critical services update Andrea Sciabà WLCG operations coordination meeting December 18, 2014.
December GDB Summary See also: Jeremy’s notes.
LCG Issues from GDB John Gordon, STFC WLCG MB meeting September 28 th 2010.
8 August 2006MB Report on Status and Progress of SC4 activities 1 MB (Snapshot) Report on Status and Progress of SC4 activities A weekly report is gathered.
News and Status WLCG Information System Task Force Maria Alandes IT/SDC.
The Grid Storage System Deployment Working Group 6 th February 2007 Flavia Donno IT/GD, CERN.
The GridPP DIRAC project DIRAC for non-LHC communities.
LCG Pilot Jobs + glexec John Gordon, STFC-RAL GDB 7 December 2007.
MW Readiness WG Update Andrea Manzi Maria Dimou Lionel Cons Maarten Litmaath On behalf of the WG participants GDB 09/09/2015.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
SAM Status Update Piotr Nyczyk LCG Management Board CERN, 5 June 2007.
WLCG Service Report ~~~ WLCG Management Board, 17 th February 2009.
WLCG Service Report ~~~ WLCG Management Board, 10 th November
WLCG Operations Coordination and Commissioning Maria Girone, CERN IT On behalf of the Operations Coordination Team 11 th March OSG All Hands Meeting,
WLCG Operations Coordination news and meeting restructuring Maria Alandes Pradillo Josep Flix Alessandra Forti Andrea Sciabà WLCG operations coordination.
News from the HEPiX IPv6 Working Group David Kelsey (STFC-RAL) HEPIX, BNL 13 Oct 2015.
GGUS summary (3 weeks) VOUserTeamAlarmTotal ALICE7029 ATLAS CMS LHCb Totals
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
WLCG Accounting Task Force Update Julia Andreeva CERN GDB, 8 th of June,
WLCG Operations Coordination report Maria Dimou Andrea Sciabà IT/SDC On behalf of the WLCG Operations Coordination team GDB 12 th November 2014.
Site notifications with SAM and Dashboards Marian Babik SDC/MI Team IT/SDC/MI 12 th June 2013 GDB.
Accounting Update John Gordon. Outline Multicore CPU Accounting Developments Cloud Accounting Storage Accounting Miscellaneous.
CMS Multicore jobs at RAL Andrew Lahiff, RAL WLCG Multicore TF Meeting 1 st July 2014.
Maria Alandes Pradillo, CERN Training on GLUE 2 information validation EGI Technical Forum September 2013.
WLCG Operations Coordination Andrea Sciabà IT/SDC GDB 11 th September 2013.
Operations Coordination Team Maria Girone, CERN IT-ES GDB, 11 July 2012.
WLCG IPv6 deployment strategy
WLCG Operations Coordination
Summary from last MB “The MB agreed that a detailed deployment plan and a realistic time scale are required for deploying glexec with setuid mode at WLCG.
Presentation transcript:

WLCG Operations Coordination report Maria Alandes, Andrea Sciabà IT-SDC On behalf of the WLCG Operations Coordination team GDB 9 th April 2014

Outline  Previous report on March 12 th  News  Experiments  Oracle updates  Status of task forces  Conclusions WLCG Operations Coordination – M. Alandes and A. Sciabà 2

News  Simone Campana is the new ATLAS Distributed Computing coordinator and Andrea Sciabà and Maria Alandes are the new WLCG operations officers  Regular T0 report at the Operations Coordination meeting  CVMFS 2.0 has reached end of life  2.0 clients won’t be able to access 2.1 server  Almost all sites have now the 2.1 client  Best effort support for 2.0 server (used to operate Stratum 0)  New release of GGUS on 26 th March containing some new features:  Notify multiple sites with one ticket  GGUS ALARMs for Russian T1  Savannah-GGUS-CMS, new functionality for CMS to replace the Savannah-GGUS bridge WLCG Operations Coordination – M. Alandes and A. Sciabà 3

Recent and future WLCG Operations Coordination meetings  Next Planning Meeting on April 17th  Until July:  May 8, 22 (shifted due to May 1)  June 5, 19  July 7-9 (WLCG Workshop, Barcelona)  July 24 (shifted due to workshop) WLCG Operations Coordination – M. Alandes and A. Sciabà 4

Experiment news (1/2)  NOTE: Official report on the Wigner efficiency studies expected after April 14th  ALICE  Problems at KIT with job efficiencies. Investigations ongoing  ATLAS  Rucio migration ongoing  Slow transfers (0.5MB/s) noticed between some sites like Cambridge-BNL, Nikhef-TRIUMF and few others. WLCG Operations Coordination – M. Alandes and A. Sciabà 5

Experiment news (2/2)  CMS  FTS2 decommissioning OK, sites should switch Phedex debug agents to use RAL FTS3  CVMFS switch at CERN on April 14th  Continue testing multicore in T1s  LHCb  LSF job inefficiencies under investigation at CERN WLCG Operations Coordination – M. Alandes and A. Sciabà 6

Oracle upgrades  Updates since last GDB 12 th March  CERN  LCGR updated to (grid services)  LHCBR updated to (LHCb LFC and Dirac bookkeeping)  ATLR, ADCR updated to (ATLAS conditions and computing services)  See WLCG Operations Coordination minutes for more details WLCG Operations Coordination – M. Alandes and A. Sciabà 7

gLExec  Still 16 tickets to sites open (no changes since last GDB)  Review gLexec deployment to understand whether TF could end WLCG Operations Coordination – M. Alandes and A. Sciabà 8

SHA-2  SHA-2 user certificates used by all 4 experiments  EGI Operations Portal VO cards have been updated with the details of the future VOMS servers  Campaign to recognise future VOMS servers launched on 17 th March   VOMS servers action is not SHA-2 specific  Future of the TF to be discussed at the next Planning meeting WLCG Operations Coordination – M. Alandes and A. Sciabà 9

perfSONAR  Deadline for perfSONAR has passed (April 1 st )  9 sites out of 111 missing   2 sites no reply  4 sites installing or procuring HW  1 site with no HW  1 site with no resources to deploy perfSONAR  1 site installed and configured but not on OIM  Many sites need attention due to wrong config/old release/firewall  Future of the TF to be discussed at the next Planning meeting WLCG Operations Coordination – M. Alandes and A. Sciabà 10

FTS-3  New FTS3 version deployed on pilot  Features requested in the FTS3 TF during the last 3 months covering different sites, experiments and FTS service manager requests.  Please, check the complete list of release notes for more details:  WLCG Operations Coordination – M. Alandes and A. Sciabà 11

Machine/job features  See Stefan’s talk this afternoon WLCG Operations Coordination – M. Alandes and A. Sciabà 12

Multicore deployment  Testing is ongoing  So far only ATLAS running MC jobs but CMS will start tests soon  ATLAS is implementing a way to pass parameters to batch systems more easily as asked by sites  At most sites it is not possible to pass job requirements through CREAM on to the batch system.  This is because the CREAM mechanism for passing requirements requires an additional script as a plugin.  For SGE there is a standard script but not for other batch systems.  Hence for other sites it only works if the site implements this script; not many sites have done so  Reviewing of sites/batch system experiences with multicore jobs continue  Next step: review experience at CMS and ATLAS shared sites when handling multicore jobs from both VOs WLCG Operations Coordination – M. Alandes and A. Sciabà 13

WMS decommissioning  CERN WMS instances for experiments being drained on 1 st April  No issues reported  SAM instances have their own timeline WLCG Operations Coordination – M. Alandes and A. Sciabà 14

Middleware readiness  List of confirmed volunteer sites will be prepared for MB on April 15 th  Next meeting on 15 th May at 10h30 CEST  Different readiness verification approaches across VOs  WLCG MW Officer role  Versions at the volunteers sites WLCG Operations Coordination – M. Alandes and A. Sciabà 15

Conclusions  Some TF to be closed or goals to be reviewed.  To be confirmed after next Planning meeting on April 17 th  Oracle upgrades progressing as planned  Preparation of the WLCG workshop WLCG Operations Coordination – M. Alandes and A. Sciabà 16