Production Resources & Issues p20.09 MC-data Regeneration

Slides:



Advertisements
Similar presentations
O. Stézowski IPN Lyon AGATA Week September 2003 Legnaro Data Analysis – Team #3 ROOT as a framework for AGATA.
Advertisements

Large scale data flow in local and GRID environment V.Kolosov, I.Korolko, S.Makarychev ITEP Moscow.
The D0 Monte Carlo Challenge Gregory E. Graham University of Maryland (for the D0 Collaboration) February 8, 2000 CHEP 2000.
JetWeb on the Grid Ben Waugh (UCL), GridPP6, What is JetWeb? How can JetWeb use the Grid? Progress report The Future Conclusions.
High Energy Physics At OSCER A User Perspective OU Supercomputing Symposium 2003 Joel Snow, Langston U.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
Building a distributed software environment for CDF within the ESLEA framework V. Bartsch, M. Lancaster University College London.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Jean-Yves Nief CC-IN2P3, Lyon HEPiX-HEPNT, Fermilab October 22nd – 25th, 2002.
CERN IT Department CH-1211 Genève 23 Switzerland t Internet Services Job Monitoring for the LHC experiments Irina Sidorova (CERN, JINR) on.
- Iain Bertram R-GMA and DØ Iain Bertram RAL 13 May 2004 Thanks to Jeff Templon at Nikhef.
11/30/2007 Overview of operations at CC-IN2P3 Exploitation team Reported by Philippe Olivero.
SAMGrid as a Stakeholder of FermiGrid Valeria Bartsch Computing Division Fermilab.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
GridPP18 Glasgow Mar 07 DØ – SAMGrid Where’ve we come from, and where are we going? Evolution of a ‘long’ established plan Gavin Davies Imperial College.
DØ Computing Model & Monte Carlo & Data Reprocessing Gavin Davies Imperial College London DOSAR Workshop, Sao Paulo, September 2005.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
June 10, D0 Use of OSG D0 relies on OSG for a significant throughput of Monte Carlo simulation jobs, will use it if there is another reprocessing.
1 LCG-France sites contribution to the LHC activities in 2007 A.Tsaregorodtsev, CPPM, Marseille 14 January 2008, LCG-France Direction.
EGEE-III INFSO-RI Enabling Grids for E-sciencE Overview of STEP09 monitoring issues Julia Andreeva, IT/GS STEP09 Postmortem.
GridPP11 Liverpool Sept04 SAMGrid GridPP11 Liverpool Sept 2004 Gavin Davies Imperial College London.
1 Andrea Sciabà CERN Critical Services and Monitoring - CMS Andrea Sciabà WLCG Service Reliability Workshop 26 – 30 November, 2007.
High Energy FermiLab Two physics detectors (5 stories tall each) to understand smallest scale of matter Each experiment has ~500 people doing.
May Donatella Lucchesi 1 CDF Status of Computing Donatella Lucchesi INFN and University of Padova.
UTA MC Production Farm & Grid Computing Activities Jae Yu UT Arlington DØRACE Workshop Feb. 12, 2002 UTA DØMC Farm MCFARM Job control and packaging software.
Run II Review Closeout 15 Sept., 2004 FNAL. Thanks! …all the hard work from the reviewees –And all the speakers …hospitality of our hosts Good progress.
Pavel Nevski DDM Workshop BNL, September 27, 2006 JOB DEFINITION as a part of Production.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI VO auger experience with large scale simulations on the grid Jiří Chudoba.
1 A Scalable Distributed Data Management System for ATLAS David Cameron CERN CHEP 2006 Mumbai, India.
CMS: T1 Disk/Tape separation Nicolò Magini, CERN IT/SDC Oliver Gutsche, FNAL November 11 th 2013.
Jiri Chudoba for the Pierre Auger Collaboration Institute of Physics of the CAS and CESNET.
INFSO-RI Enabling Grids for E-sciencE DGAS, current status & plans Andrea Guarise EGEE JRA1 All Hands Meeting Plzen July 11th, 2006.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
July 26, 2007Parag Mhashilkar, Fermilab1 DZero On OSG: Site And Application Validation Parag Mhashilkar, Fermi National Accelerator Laboratory.
Acronyms GAS - Grid Acronym Soup, LCG - LHC Computing Project EGEE - Enabling Grids for E-sciencE.
Joe Foster 1 Two questions about datasets: –How do you find datasets with the processes, cuts, conditions you need for your analysis? –How do.
LCG Accounting Update John Gordon, CCLRC-RAL 10/1/2007.
EGI-InSPIRE RI EGI-InSPIRE EGI-InSPIRE RI Pierre Auger Observatory Jiří Chudoba Institute of Physics and CESNET, Prague.
Enabling Grids for E-sciencE LRMN ThIS on the Grid Sorina CAMARASU.
DØ Computing Model and Operational Status Gavin Davies Imperial College London Run II Computing Review, September 2005.
DØ Grid Computing Gavin Davies, Frédéric Villeneuve-Séguier Imperial College London On behalf of the DØ Collaboration and the SAMGrid team The 2007 Europhysics.
5/12/06T.Kurca - D0 Meeting FNAL1 p20 Reprocessing Introduction Computing Resources Architecture Operational Model Technical Issues Operational Issues.
Vendredi 27 avril 2007 Management of ATLAS CC-IN2P3 Specificities, issues and advice.
U.S. ATLAS Grid Production Experience
Monte Carlo Production and Reprocessing at DZero
GWE Core Grid Wizard Enterprise (
Bulk production of Monte Carlo
Data Challenge with the Grid in ATLAS
EGEE VO Management.
ALICE Physics Data Challenge 3
DØ Computing & Analysis Model
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
CRAB Server CRAB (CMS Remote Analysis Builder)
CC IN2P3 - T1 for CMS: CSA07: production and transfer
Job workflow Pre production operations:
US CMS Testbed.
Status of MC production on the grid
Chapter 2: Database System Concepts and Architecture
N. De Filippis - LLR-Ecole Polytechnique
Institut de Physique Nucléaire de Lyon
Pierre Girard ATLAS Visit
 YongPyong-High Jan We appreciate that you give an opportunity to have this talk. Our Belle II computing group would like to report on.
TeraScale Supernova Initiative
DØ MC and Data Processing on the Grid
Gridifying the LHCb Monte Carlo production system
Status and plans for bookkeeping system and production tools
IPv6 update Duncan Rand Imperial College London
The LHCb Computing Data Challenge DC06
Presentation transcript:

Production Resources & Issues p20.09 MC-data Regeneration MC-Production Status Production Resources & Issues Production Status p20.09 MC-data Regeneration Other Issues Conclusions Tibor Kurča Institut de Physique Nucléaire de Lyon DØ France June 23, 2008 Grenoble June 23, 2008 T.Kurca - D0 France

MC processing resources Grid flavours: 1. Native SAMGrid (SAM+grid tools) 2. Open Science Grid (OSG) 3. LHC computing grid (LCG) - forwarding nodes needed to translate between SAMgrid and OSG/LCG grids Nongrid site CC-IN2P3 - Goal: maximal usage of local & remote resources - Each system has its (dis)advantages June 23, 2008 T.Kurca - D0 France

Prod systems: Pros(Cons) Grid flavours: + access to additional resources - low efficiency (system too complicated) - opportunistic usage  single requests running weeks….. Nongrid site CC-IN2P3 + high efficiency: dependence only on the local resources & separation of production and SAM-related tasks (they are independent) + local storage of the results + local bookkeeping system + flexibility for non-standard requests - manpower intensive - HPSS dependance - LCG dependance June 23, 2008 T.Kurca - D0 France

MC-Production Status Last 12 months 600 M events 50M/month 1.7M/day Sept05-June08 : 1.14 B evts Nice, but …. - goals are much higher: 70M/month, 2.5M/day - too optimistic !!! June 23, 2008 T.Kurca - D0 France

Weekly MC-Production 24M weekly record Exception ! 15M@CC (+d0gstar reprocessing) 17.5M Feb08 short term goal:17.5M/w …. We are far away…. Why ? - Grid efficiency not improving - LCG not operational - CC-worsenning June 23, 2008 T.Kurca - D0 France

MC-Production Rates Optimistic assumptions per week: OSG 4M, SG 4M, CC 5M, LCG 1M max 56M events /month June 23, 2008 T.Kurca - D0 France

ZeroBias Problem p20.09 MC with wrong ZeroBias data overlay - processed with too tight 2.5 Zero-supression  p17 and p20 JES are not the same reconstructed jet energies are lower by 1.5-2.0 % independent of pT Long term solution : regenerate p20.09.02 MC-data with correct ZeroBias running in parallel with new requests… currently done ~186M / 277M events  67% Some good news: all data produced at CCIN2P3 were redone very quickly - all d0gstar files are stored in HPSS (145 TB, 55M events) - about factor 3-4 faster then full simulation chain  done in about 55 days Some bad news : decision to store all p20.09 d0gstar files at FNAL also until new ZeroBias available : …. But it turned out that Grid production is not able to start from d0gstar file June 23, 2008 T.Kurca - D0 France

Other Issues Streamlining the MC-Request System - current procedure is rather time consuming (labour-intensive) & error prone! Goal: reduce manpower intensity of the submission process  keep up with increasing volume of request  build in automatic crosschecks  web page based request system Status: work in progress in collaboration with REX group Access to MC-data - rather painful –> old & new tools (re)appearing June 23, 2008 T.Kurca - D0 France

Acces to MC-Data more tools around - each has some (dis)advantages 2 basic categories of searches - detailed information about the given request ID (Monte Carlo Catalog, request.py script of R. Herber) - list of requests based on the physics constraints like production and/or decay (request.py of R. Herber, script of S. Muanza, CAF-Trees list of F.Couderc) for more details see: http://www-d0.fnal.gov/computing/mcprod/mcc.html  MC-Data   How to find your request IDs Monte Carlo Catalog is being updated for v7 (C.Biscarat, S.Muanza) June 23, 2008 T.Kurca - D0 France

Conclusions Production rate  short term goal 2.5 M events/day (Feb 2008) … But due to persisting/new problems  even 2M/day too optimistic !!! p20.09 MC-data regeneration done at 67% Problems with efficiency of the grid production Production on LCG still down Production @CCIN2P3 slowing down June 23, 2008 T.Kurca - D0 France

Announcements Regeneration of p20.09.02 … from the list by request IDs  tell us if you need higher priority for certain requests MC-production on LCG (private) is possible: thanks to L. Duflot - without SAMGrid interface prerequisites: - production tarballs and ZeroBias data known to LCG (stored at SE) - you belong to the VO enabled on the site Work on streamlining the request system (slow progress) June 23, 2008 T.Kurca - D0 France

MC-Production Status 2 June 23, 2008 T.Kurca - D0 France