DZero Monte Carlo Production Ideas for CMS Greg Graham Fermilab CD/CMS 1/16/01 CMS Production Meeting.

Slides:



Advertisements
Similar presentations
Configuration management
Advertisements

RunJob in CMS Greg Graham Discussion Slides. RunJob in CMS RunJob is an Application Configuration and Job Creation Tool –RunJob uses metadata to abstract.
1 OBJECTIVES To generate a web-based system enables to assemble model configurations. to submit these configurations on different.
CMS Applications Towards Requirements for Data Processing and Analysis on the Open Science Grid Greg Graham FNAL CD/CMS for OSG Deployment 16-Dec-2004.
Runjob: A HEP Workflow Planner for Production Processing Greg Graham CD/CMS Fermilab CD Project Status Meeting 13-Nov-2003.
Grid and CDB Janusz Martyniak, Imperial College London MICE CM37 Analysis, Software and Reconstruction.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
GRID Workload Management System Massimo Sgaravatto INFN Padova.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL June 23, 2003 GAE workshop Caltech.
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
CLEO’s User Centric Data Access System Christopher D. Jones Cornell University.
SCRAM Software Configuration, Release And Management Background SCRAM has been developed to enable large, geographically dispersed and autonomous groups.
SOFTWARE.
The D0 Monte Carlo Challenge Gregory E. Graham University of Maryland (for the D0 Collaboration) February 8, 2000 CHEP 2000.
Framework for Automated Builds Natalia Ratnikova CHEP’03.
What is Sure BDCs? BDC stands for Batch Data Communication and is also known as Batch Input. It is a technique for mass input of data into SAP by simulating.
MCRunjob: An HEP Workflow Planner for Grid Production Processing Greg Graham CD/CMS Fermilab Iain Bertram and Dave Evans Lancaster university CHEP 2003.
The SAMGrid Data Handling System Outline:  What Is SAMGrid?  Use Cases for SAMGrid in Run II Experiments  Current Operational Load  Stress Testing.
 To explain the importance of software configuration management (CM)  To describe key CM activities namely CM planning, change management, version management.
Remote Production and Regional Analysis Centers Iain Bertram 24 May 2002 Draft 1 Lancaster University.
3 Sept 2001F HARRIS CHEP, Beijing 1 Moving the LHCb Monte Carlo production system to the GRID D.Galli,U.Marconi,V.Vagnoni INFN Bologna N Brook Bristol.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
Grid Job and Information Management (JIM) for D0 and CDF Gabriele Garzoglio for the JIM Team.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
November 7, 2001Dutch Datagrid SARA 1 DØ Monte Carlo Challenge A HEP Application.
Configuration Management (CM)
Geant4 Acceptance Suite for Key Observables CHEP06, T.I.F.R. Mumbai, February 2006 J. Apostolakis, I. MacLaren, J. Apostolakis, I. MacLaren, P. Mendez.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
Distribution After Release Tool Natalia Ratnikova.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
INFSO-RI Module 01 ETICS Overview Etics Online Tutorial Marian ŻUREK Baltic Grid II Summer School Vilnius, 2-3 July 2009.
ATLAS and GridPP GridPP Collaboration Meeting, Edinburgh, 5 th November 2001 RWL Jones, Lancaster University.
Data Grid projects in HENP R. Pordes, Fermilab Many HENP projects are working on the infrastructure for global distributed simulated data production, data.
Status of the LHCb MC production system Andrei Tsaregorodtsev, CPPM, Marseille DataGRID France workshop, Marseille, 24 September 2002.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
The ALICE short-term use case DataGrid WP6 Meeting Milano, 11 Dec 2000Piergiorgio Cerello 1 Physics Performance Report (PPR) production starting in Feb2001.
Dzero MC production on LCG How to live in two worlds (SAM and LCG)
LHCb Software Week November 2003 Gennady Kuznetsov Production Manager Tools (New Architecture)
Heather Kelly PPA Scientific Computing Apps LAT was launched as part of the Fermi Gamma-ray Space Telescope on June 11 th 2008.
Alexander Richards, UCL 1 Atlfast and RTT (plus DCube) Christmas Meeting 18/12/2007.
National Center for Supercomputing ApplicationsNational Computational Science Grid Packaging Technology Technical Talk University of Wisconsin Condor/GPT.
Metadata Mòrag Burgon-Lyon University of Glasgow.
GridPP11 Liverpool Sept04 SAMGrid GridPP11 Liverpool Sept 2004 Gavin Davies Imperial College London.
4 December 2003 Natalia Ratnikova Natalia Ratnikova 1 Software Packaging with DAR Natalia Ratnikova, Anzar Afaq, Greg Graham Fermilab Tony Wildish, Veronique.
Application Software System Software.
9/12/99R. Moore1 Level 2 Trigger Software Interface R. Moore, Michigan State University.
1 Update on tt-bar signal and background simulation Stan Bentvelsen.
Analysis Tools at D0 PPDG Analysis Grid Computing Project, CS 11 Caltech Meeting Lee Lueking Femilab Computing Division December 19, 2002.
Krzysztof Genser/Fermilab For the Fermilab Geant4 Performance Team.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
GlueX Computing GlueX Collaboration Meeting – JLab Edward Brash – University of Regina December 11 th -13th, 2003.
RefDB: The Reference Database for CMS Monte Carlo Production Véronique Lefébure CERN & HIP CHEP San Diego, California 25 th of March 2003.
D0 File Replication PPDG SLAC File replication workshop 9/20/00 Vicky White.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
Alien and GSI Marian Ivanov. Outlook GSI experience Alien experience Proposals for further improvement.
20 October 2005 LCG Generator Services monthly meeting, CERN Validation of GENSER & News on GENSER Alexander Toropin LCG Generator Services monthly meeting.
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
Simulation Production System Science Advisory Committee Meeting UW-Madison March 1 st -2 nd 2007 Juan Carlos Díaz Vélez.
ATLAS Physics Analysis Framework James R. Catmore Lancaster University.
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
DZero Monte Carlo Status, Performance, and Future Plans Greg Graham U. Maryland - Dzero 10/16/2000 ACAT 2000.
CDF SAM Deployment Status Doug Benjamin Duke University (for the CDF Data Handling Group)
Shahkar/MCRunjob: An HEP Workflow Planner for Grid Production Processing Greg Graham CD/CMS Fermilab GruPhyN 15 October 2003.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL May 19, 2003 BNL Technology Meeting.
Simulation Production System
Belle II Physics Analysis Center at TIFR
Existing Perl/Oracle Pipeline
Remote SAM Initiative (RSI) – Proposed Work Plan
Presentation transcript:

DZero Monte Carlo Production Ideas for CMS Greg Graham Fermilab CD/CMS 1/16/01 CMS Production Meeting

1/16/01Greg Graham - CMS Production2 Monte Carlo Production Systems Monte Carlo Verification I - Code –Patches, Versions, Executables RED : Runtime Environment Distribution Monte Carlo Verification II - Executables –Check representative output from all platforms Monte Carlo Specification –Physics Process (H), Request specification (H) Control and Tracking : The Grid –Condor pools, Globus –Production

1/16/01Greg Graham - CMS Production3 Monte Carlo Verification Before anything is distributed, make sure : –Correct versions of external code (eg - Pythia) are being used. –All needed patches have been applied –The output looks good at the build site

1/16/01Greg Graham - CMS Production4 Runtime Environment Distribution Dzero : the “mini-tarfiles” –Offline production executables from official builds –Packaged with needed.so and databases –Installed on top of each other in regular way so as not to destroy previous installations It is not optimal to distribute the build environments and compile everywhere –missing patches, different static libs, (?) compilers, etc. –  MC-site ? : systematic error from site to site MC variation

1/16/01Greg Graham - CMS Production5 Monte Carlo Verification II After executables have been distributed, make sure : –standard samples are run at different sites –output is compared

1/16/01Greg Graham - CMS Production6 Physics Process Specification Schema part of the SAM system –Keeps track of cardfiles Cardfiles required to be stored in CVS –Keeps track of cardfile elements production, decay, PDF, etc. –Attributes (eg Pt limit, Eta limit) and cardfile elements can be added dynamically –Attributes plus cardfiles define what we mean by Physics Process

1/16/01Greg Graham - CMS Production7 Physics Process Specification Ttbar CTEQ5 W->jets Production Process PDF Decay Process Pythia, ver 6.134Cvs module : Cards cardfile : mycards.pythia version : v0.1 Pt Limits Eta Limits Pt > 5.0 GeV |Eta| < 3.0

1/16/01Greg Graham - CMS Production8 Request Specification Schema part of the SAM system –Keeps track of processing requests List can grow dynamically; keeps track of whole processing chain –Keeps track of groups of input/output files SAM “projects” are define on the input/ output –Requests can be split into site-specific “work-requests” –Appearance of files with the correct parentage in SAM constitutes job tracking

1/16/01Greg Graham - CMS Production9 Automatic Request Processing Schema

1/16/01Greg Graham - CMS Production10 Monte Carlo Job Specification mc_runjob : A D0 package written in Python –Driven by macro scripts Macro scripts serve to specify processing steps in a regular way –Runs most D0 Offline Executables All in principle … –Chains executables together Run generation through ntuple in one step Handles naming of input and output files Handles seed generation, run numbers, etc.

1/16/01Greg Graham - CMS Production11 Mc_runjob –“Configurators” Run and configure each D0 Offline executable –The “Linker” Links Configurators Collects shell scripts that actually do the work –Macro scripts The Linker and Configurators obey a regular set of macro commands in macro scripts Can easily be extended !

1/16/01Greg Graham - CMS Production12 Inheritance Hierarchy –Inheritance is used to incorporate D0 specific configuration (ie - RCP, runtime environment, naming, SAM metadata) –Other special purpose configurators can handle the batch system, manage a list of files, etc. Can easily be extended !

1/16/01Greg Graham - CMS Production13 Graphical Support –While primarily a macro script driven tool, the mc_runjob inheritance hierarchy also provides core tools with GUI methods that build a GUI from information contained in the configurators themselves. No special porting task for GUI !

1/16/01Greg Graham - CMS Production14 Automatic Request Processing System

1/16/01Greg Graham - CMS Production15 Remote Processing Centers Monte Carlo simulation, digitization, and eventually some MC reconstruction will be handled offsite. Currently operating RPCs –UT Arlington; IN2P3, Lyon; NIKHEF, Amsterdam; Prague Future planned RPCs –Lancaster; Rio de Janeiro; TIFR Generated events will be stored with SAM We expect better than 100K events/day

1/16/01Greg Graham - CMS Production16 References G.E. Graham “The Dzero Monte Carlo Challenge” Proceedings, CHEP 2000 G.E. Graham “Dzero Monte Carlo” Proceedings, ACAT 2000 (to appear soon)