PDC’06 – production status and issues Latchezar Betev TF meeting – May 04, 2006.

Slides:



Advertisements
Similar presentations
Status GridKa & ALICE T2 in Germany Kilian Schwarz GSI Darmstadt.
Advertisements

CERN LCG Overview & Scaling challenges David Smith For LCG Deployment Group CERN HEPiX 2003, Vancouver.
Operating Systems An operating system is a set of programs that controls how the hardware of a computer works. An operating system provides a means of.
CREAM: Update on the ALICE experiences WLCG GDB Meeting Patricia Méndez Lorenzo (IT/GS) CERN, 11th March 2009.
ALICE Operations short summary and directions in 2012 Grid Deployment Board March 21, 2011.
ALICE Operations short summary and directions in 2012 WLCG workshop May 19-20, 2012.
Patricia Méndez Lorenzo (IT/GS) ALICE Offline Week (18th March 2009)
Summary of issues and questions raised. FTS workshop for experiment integrators Summary of use  Generally positive response on current state!  Now the.
Production test on EDG-1.4 Goal 1: simulate and reconstuct 5000 Pb-Pb central events 1 job/event Output size: about 1.8 GB/event, so 9 TB Job duration:
Status of PDC’07 Latchezar Betev TF meeting – April 5, 2007.
LCG Plans for Chrsitmas Shutdown John Gordon, STFC-RAL GDB December 10 th, 2008.
DDM-Panda Issues Kaushik De University of Texas At Arlington DDM Workshop, BNL September 29, 2006.
Status of the production and news about Nagios ALICE TF Meeting 22/07/2010.
CERN – Alice Offline – Thu, 03 Feb 2005 – Marco MEONI - 1/18 Monitoring of a distributed computing system: the AliEn Grid Alice Offline weekly meeting.
Sejong STATUS Chang Yeong CHOI CERN, ALICE LHC Computing Grid Tier-2 Workshop in Asia, 1 th December 2006.
Panda Grid Status Kilian Schwarz, GSI on behalf of PANDA GRID Group (slides to a large extend from Radoslaw Karabowicz)
Status of PDC’06 Latchezar Betev TF meeting – September 28, 2006.
WLCG GDB, CERN, 10th December 2008 Latchezar Betev (ALICE-Offline) and Patricia Méndez Lorenzo (WLCG-IT/GS) 1.
Offline report – 7TeV data taking period (Mar.30 – Apr.6) ALICE SRC April 6, 2010.
CERN Using the SAM framework for the CMS specific tests Andrea Sciabà System Analysis WG Meeting 15 November, 2007.
The ALICE Distributed Computing Federico Carminati ALICE workshop, Sibiu, Romania, 20/08/2008.
Status of PDC’07 and user analysis issues (from admin point of view) L. Betev August 28, 2007.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
CERN – Alice Offline – Thu, 20 Mar 2008 – Marco MEONI - 1 Status of Cosmic Reconstruction Offline weekly meeting.
Rutherford Appleton Lab, UK VOBox Considerations from GridPP. GridPP DTeam Meeting. Wed Sep 13 th 2005.
Site Report: Prague Jiří Chudoba Institute of Physics, Prague WLCG GridKa+T2s Workshop.
Phase 2 of the Physics Data Challenge ‘04 Latchezar Betev ALICE Offline week Geneva, September 15, 2004.
Plans for Service Challenge 3 Ian Bird LHCC Referees Meeting 27 th June 2005.
LCG CERN David Foster LCG WP4 Meeting 20 th June 2002 LCG Project Status WP4 Meeting Presentation David Foster IT/LCG 20 June 2002.
Data Transfer Service Challenge Infrastructure Ian Bird GDB 12 th January 2005.
AliEn central services Costin Grigoras. Hardware overview  27 machines  Mix of SLC4, SLC5, Ubuntu 8.04, 8.10, 9.04  100 cores  20 KVA UPSs  2 * 1Gbps.
+ AliEn site services and monitoring Miguel Martinez Pedreira.
Update of SAM Implementation ALICE TF Meeting 18/10/07.
Large scale data flow in local and GRID environment Viktor Kolosov (ITEP Moscow) Ivan Korolko (ITEP Moscow)
Production Activities and Results by ALICE Patricia Méndez Lorenzo (on behalf of the ALICE Collaboration) Service Challenge Technical Meeting CERN, 15.
A. Gheata, ALICE offline week March 09 Status of the analysis framework.
AliRoot survey: Analysis P.Hristov 11/06/2013. Are you involved in analysis activities?(85.1% Yes, 14.9% No) 2 Involved since 4.5±2.4 years Dedicated.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks The LCG interface Stefano BAGNASCO INFN Torino.
Patricia Méndez Lorenzo (CERN, IT/GS-EIS) ċ. Introduction  Welcome to the first ALICE T1/T2 tutorial  Delivered for site admins and regional experts.
Christmas running post- mortem (Part III) ALICE TF Meeting 15/01/09.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
ALICE experiences with CASTOR2 Latchezar Betev ALICE.
Status of AliEn2 Services ALICE offline week Latchezar Betev Geneva, June 01, 2005.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
Service Challenge Report Federico Carminati GDB – January 11, 2006.
Data transfers and storage Kilian Schwarz GSI. GSI – current storage capacities vobox LCG RB/CE GSI batchfarm: ALICE cluster (67 nodes/480 cores for batch.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
PDC’06 - status of deployment and production Latchezar Betev TF meeting – April 27, 2006.
ALICE Grid operations +some specific for T2s US-ALICE Grid operations review 7 March 2014 Latchezar Betev 1.
Status of gLite-3.0 deployment and uptake Ian Bird CERN IT LCG-LHCC Referees Meeting 29 th January 2007.
GRID interoperability and operation challenges under real load for the ALICE experiment F. Carminati, L. Betev, P. Saiz, F. Furano, P. Méndez Lorenzo,
Phase 2 of the Physics Data Challenge ‘04 Peter Hristov For the ALICE DC team Russia-CERN Joint Group on Computing CERN, September 20, 2004.
ALICE Physics Data Challenge ’05 and LCG Service Challenge 3 Latchezar Betev / ALICE Geneva, 6 April 2005 LCG Storage Management Workshop.
The ALICE Production Patricia Méndez Lorenzo (CERN, IT/PSS) On behalf of the ALICE Offline Project LCG-France Workshop Clermont, 14th March 2007.
Pledged and delivered resources to ALICE Grid computing in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
ALICE WLCG operations report Maarten Litmaath CERN IT-SDC ALICE T1-T2 Workshop Torino Feb 23, 2015 v1.2.
Monthly video-conference, 18/12/2003 P.Hristov1 Preparation for physics data challenge'04 P.Hristov Alice monthly off-line video-conference December 18,
Availability of ALICE Grid resources in Germany Kilian Schwarz GSI Darmstadt ALICE Offline Week.
The ALICE Christmas Production L. Betev, S. Lemaitre, M. Litmaath, P. Mendez, E. Roche WLCG LCG Meeting 14th January 2009.
INFNGRID Technical Board, Feb
ALICE internal and external network
Service Operations at the T0/T1 for the ALICE Experiment
Summary on PPS-pilot activity on CREAM CE
Data Challenge with the Grid in ATLAS
INFN-GRID Workshop Bari, October, 26, 2004
MC data production, reconstruction and analysis - lessons from PDC’04
Torrent-based software distribution
Simulation use cases for T2 in ALICE
AliEn central services (structure and operation)
The LHCb Computing Data Challenge DC06
Presentation transcript:

PDC’06 – production status and issues Latchezar Betev TF meeting – May 04, 2006

2 PDC’06 – production status Running status  Central services – all OK, no intervention necessary  With the exception of the ProxyServer – solution is being discussed (Andreas, Pablo, Predrag)  Site services – all OK (on running sites)  Running standard production jobs, old AliRoot  Job duration – 8 hours  Job output – CERN storage still firewalled: prevents us from storing data at CERN  Stable running since 25/04 – 9 days  Currently 15 sites (2 T1s, 13 T2s)

3 PDC’06 – production status Site profiles  Average 520 jobs, max 1180 jobs

4 PDC’06 – production status Site profiles (2)  Job statistics

5 PDC’06 – production status Site profiles (CERN)  Periodical drop in jobs accepted by LCG

6 PDC’06 – production status Site profiles (T2s)  Uneven job acceptance, no method yet to track and enforce ALICE resources share

7 PDC’06 – production status Repartition of done jobs  Approximately 40/60 % repartition T2/T1. Muenster (Opteron) is boosting the T2 share, T1s are underrepresented.

8 PDC’06 – production status Issues  Storage at CERN – still unresolved  Monitoring and submitting jobs at sites:  Sites typically advertise 0 free CPUs  Current system is auto-calculating the number of jobs to submit – penalizing ALICE  Have to go back to the AliEn system of deterministic values for number of CPUs and number of submitted job agents, irrespective of advertised resources.  Job communication (Proxy) with the central services

9 PDC’06 – production status Loss of connectivity with CS  Simultaneous occurrence in sites, correlated with ERROR_S, ERROR_IB

10 PDC’06 – production status Issues (2)  Deployment of VO-boxes:  Process is steadily ongoing, however not as fast as we would like it to be  Mix of problems – some LCG, some AliEn services related.  Deployment experts are working around the clock  Hopefully after the initial setup phase, further updates will be much faster  Hope that gLite 3.0 is not going to change the rules completely  List of sites – to be discussed after this presentation