INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org CRAB: a tool for CMS distributed analysis in grid environment Federica Fanzago INFN PADOVA.

Presentation transcript:

INFSO-RI Enabling Grids for E-sciencE CRAB: a tool for CMS distributed analysis in grid environment Federica Fanzago INFN PADOVA

Federica Fanzago INFN-PADOVA EGEE User Forum 01 March Enabling Grids for E-sciencE INFSO-RI
Introduction
CMS (“Compact Muon Solenoid”) is one of the four particle physics experiments that will collect data at the LHC (“Large Hadron Collider”) at CERN, starting in 2007.
CMS will produce a large amount of data (events) that should be made available for analysis to physicists distributed world-wide.
CMS will produce
–~2 PB of events/year (assuming startup luminosity 2x10^33 cm^-2 s^-1)
All events will be stored in files
–O(10^6) files/year
Files will be grouped in Fileblocks
–O(10^3) Fileblocks/year
Fileblocks will be grouped in Datasets
–O(10^3) Datasets (total after 10 years of CMS) – TB
A “bunch crossing” every 25 ns
100 “triggers” per second
Each triggered event is ~1 MB in size
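The numbers on this slide can be cross-checked with a little arithmetic. The sketch below uses the trigger rate, event size and bunch-crossing spacing quoted above; the number of data-taking seconds per year is not given in the slides and is an assumption chosen only for illustration.

```python
# Back-of-the-envelope check of the yearly CMS data volume.
TRIGGER_RATE_HZ = 100         # from the slide: 100 triggers per second
EVENT_SIZE_MB = 1.0           # from the slide: ~1 MB per triggered event
BUNCH_SPACING_NS = 25         # from the slide: one bunch crossing every 25 ns
LIVE_SECONDS_PER_YEAR = 2e7   # ASSUMPTION: not stated in the slides


def crossing_rate_hz(spacing_ns):
    """Bunch-crossing rate in Hz for a given spacing in nanoseconds."""
    return 1e9 / spacing_ns


def yearly_volume_pb(rate_hz, event_size_mb, live_seconds):
    """Yearly data volume in petabytes, taking 1 PB = 1e9 MB."""
    return rate_hz * event_size_mb * live_seconds / 1e9


if __name__ == "__main__":
    crossings = crossing_rate_hz(BUNCH_SPACING_NS)
    print("bunch crossings per second:", crossings)
    print("crossings per recorded event:", crossings / TRIGGER_RATE_HZ)
    print("PB per year:",
          yearly_volume_pb(TRIGGER_RATE_HZ, EVENT_SIZE_MB, LIVE_SECONDS_PER_YEAR))
```

With the assumed live time the estimate is of the same order as the ~2 PB/year quoted on the slide; a different live time scales the result linearly.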

Issues and help
How to manage and where to store this huge quantity of data?
How to assure data access to physicists of the CMS collaboration?
How to have enough computing power for processing and data analysis?
How to ensure resources and data availability?
How to define local and global policy about data access and resources?
→ CMS will use a distributed architecture based on grid infrastructure
Tools for accessing distributed data and resources are provided by WLCG (Worldwide LHC Computing Grid), in two main flavours
–LCG/gLite in Europe, OSG in the US

CMS computing model
The CMS offline computing system is arranged in four Tiers and is geographically distributed. Remote data are accessible via grid.
[Figure: tier hierarchy. Online system → offline farm at the CERN computer center (Tier 0, recorded data) → regional centers in Italy, at Fermilab and in France (Tier 1) → Tier 2 centers → institutes A, B, ... and workstations (Tier 3).]

Analysis: what happens in a local environment...
The user writes his own analysis code and configuration parameter card
–Starting from CMS-specific analysis software
–Builds executable and libraries
He applies the code to a given number of events, whose location is known, splitting the load over many jobs
–But generally he is allowed to access only local data
He writes wrapper scripts and uses a local batch system to exploit all the computing power
–Comfortable as long as the data you are looking for are sitting right by your side
Then he submits everything by hand and checks the status and overall progress
Finally he collects all output files and stores them somewhere

... and in a distributed grid environment
Distributed analysis is a more complex computing task because it assumes the user knows:
–which data are available
–where data are stored and how to access them
–which resources are available and able to comply with the analysis requirements
–grid and CMS infrastructure details
But users don't want to deal with these kinds of problems
Users want to analyze data in “a simple way”, as in a local environment

Distributed analysis chain...
To allow analysis in a distributed environment, the CMS collaboration is developing tools interfaced with grid services, which include:
Installation of CMS software via grid on remote resources
Data transfer service: to move and manage a large flow of data among tiers
Data validation system: to ensure data consistency
Data location system: to keep track of the data available at each site and to allow data discovery, composed of
–Central database (RefDB) that knows what kind of data (datasets) have been produced in each Tier
–Local database (PubDB) in each Tier, with info about where data are stored and their access protocol
CRAB: CMS Remote Analysis Builder...
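The two-level data location system described above can be pictured as a lookup in two steps: ask the central catalogue which sites produced a dataset, then ask each site's local catalogue where the data are and how to access them. The sketch below is only an illustration of that idea. The dictionaries, dataset names, site names, paths and protocol labels are all hypothetical stand-ins, since the slides do not describe the real RefDB or PubDB schemas or query interfaces.

```python
# Hypothetical stand-in for the central catalogue: dataset -> sites.
REFDB = {
    "dataset_A": ["site_1", "site_2"],
    "dataset_B": ["site_2", "site_3"],
}

# Hypothetical stand-in for the per-site catalogues:
# site -> dataset -> storage location and access protocol.
PUBDBS = {
    "site_1": {
        "dataset_A": {"location": "/data/dataset_A", "protocol": "proto_x"},
    },
    "site_2": {
        "dataset_A": {"location": "/storage/dataset_A", "protocol": "proto_y"},
        "dataset_B": {"location": "/storage/dataset_B", "protocol": "proto_y"},
    },
    # site_3 is listed centrally for dataset_B but has not published it locally.
    "site_3": {},
}


def discover(dataset, refdb, pubdbs):
    """Return {site: access_info} for every site that publishes the dataset."""
    locations = {}
    for site in refdb.get(dataset, []):
        info = pubdbs.get(site, {}).get(dataset)
        if info is not None:
            locations[site] = info
    return locations


if __name__ == "__main__":
    for site, info in discover("dataset_B", REFDB, PUBDBS).items():
        print(site, info["location"], info["protocol"])
```

A site that appears in the central catalogue but has no local entry is simply skipped here; how the real system treats that case is not stated in the slides.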

... and CRAB role
CRAB is a user-friendly tool that aims to simplify the work of users with no knowledge of grid infrastructure when they create, submit and manage analysis jobs in grid environments.
–Written in Python and installed on the UI (grid user access point)
Users have to develop their analysis code in an interactive environment and decide which data to analyse. They have to provide to CRAB:
–Dataset name, number of events
–Analysis code and parameter card
–Output files and handling policy
CRAB handles data discovery, resource availability, job creation and submission, status monitoring and output retrieval

How CRAB works
Job creation: crab -create N (or all)
–data discovery: sites storing the data are found by querying RefDB and the local PubDBs
–packaging of user code: creation of a tgz archive with the user code (bin, lib and data)
–wrapper script (sh) for the real user executable
–JDL file, the script which drives the real job towards the “grid”
–splitting: according to user request (number of events per job and in total)
Job submission: crab -submit N (or all) -c
–jobs are submitted to the Resource Broker using BOSS, the submission and tracking tool interfaced with CRAB
–jobs are sent to those sites which host the data
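The splitting step lends itself to a short illustration. The sketch below shows one plausible way to turn a total number of events and a number of events per job into per-job event ranges. It is not CRAB's actual code: the function name, its parameters and the choice to give the last job the remainder are assumptions made for the example.

```python
def split_events(total_events, events_per_job, first_event=0):
    """Split a request into (first_event, n_events) ranges, one per job.

    Hypothetical helper: the last job receives whatever is left over when
    total_events is not a multiple of events_per_job.
    """
    if total_events < 0:
        raise ValueError("total_events must be non-negative")
    if events_per_job <= 0:
        raise ValueError("events_per_job must be positive")

    jobs = []
    start = first_event
    remaining = total_events
    while remaining > 0:
        n_events = min(events_per_job, remaining)
        jobs.append((start, n_events))
        start += n_events
        remaining -= n_events
    return jobs


if __name__ == "__main__":
    # 2500 events at 1000 events per job gives three jobs.
    for number, (first, count) in enumerate(split_events(2500, 1000), start=1):
        print(f"job {number}: first event {first}, {count} events")
```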

How CRAB works (2)
Job monitoring: crab -status (n_of_job)
–the status of all submitted jobs is checked using BOSS
Job output management: crab -getoutput (n_of_job)
–following the user request, CRAB can copy the output files back to the UI...
–... or copy them to a Storage Element
Job resubmission: crab -resubmit n_of_job
–if the job suffers a grid failure (aborted or cancelled status)
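The resubmission rule above (resubmit only jobs that ended in an aborted or cancelled state) can be expressed as a simple filter over the job statuses. This is a minimal sketch: the status strings and the dictionary layout are hypothetical, and the real status values reported by BOSS or the grid middleware may be spelled differently.

```python
# ASSUMPTION: status labels are illustrative, not the middleware's exact strings.
RESUBMITTABLE = {"aborted", "cancelled"}


def jobs_to_resubmit(statuses):
    """Return the sorted job numbers whose status marks a grid failure.

    statuses maps a job number to a status string.
    """
    return sorted(
        job for job, status in statuses.items()
        if status.lower() in RESUBMITTABLE
    )


if __name__ == "__main__":
    statuses = {1: "done", 2: "Aborted", 3: "running", 4: "cancelled"}
    print(jobs_to_resubmit(statuses))
```

Jobs that are still running or that finished normally are left alone, so repeated calls do not create duplicate submissions for healthy jobs.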

CRAB workflow: today

CRAB experience
Used by tens of users to access remote MC data for Physics TDR analysis
~7000 Datasets available, for O(10^8) total events (full MC production)
CMS users, via CRAB, use two dedicated Resource Brokers (at CERN and at CNAF) that know all CMS sites
CRAB proves that CMS users are able to use the available grid services and that the full analysis chain works in a distributed environment!

CRAB usage
[Figures: top 20 datasets/owners requested by users; top 20 CEs where CRAB jobs run]
CRAB is currently used to analyse data for the CMS Physics TDR (being written now…)
More than 300,000 jobs were submitted to the grid by users through CRAB during the second half of last year.

CRAB future
As the CMS analysis framework and grid middleware evolve:
–CRAB has to adapt to these changes while always guaranteeing its usability, and thus remote data access, to users
  • New data discovery components (DBS, DLS) that will replace RefDB and PubDB
  • New Event Data Model (as analysis framework)
  • gLite, new middleware for grid computing
Open issues to be resolved (the number of users and submitted jobs is increasing…)
–Job policies and priorities at VO level: for example
  • for the next three weeks, Higgs group users have priority over other groups
  • tracker alignment jobs performed by user xxx must start immediately
–Bulk submission: handle 1000 jobs as a single task, with just one submission/status/...
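The two policy examples above (a group with temporary priority, and one user's jobs that must start immediately) amount to an ordering rule on the job queue. The slides list this as an open issue and do not say how it would be implemented, so the sketch below is purely illustrative: the job fields, the priority table and the function name are all hypothetical.

```python
# ASSUMPTION: lower number means the job runs sooner; values are illustrative.
GROUP_PRIORITY = {"higgs": 0, "default": 10}


def order_jobs(jobs, group_priority, urgent_users=()):
    """Return job ids in the order a VO-level policy would schedule them.

    jobs is a list of dicts with 'id', 'group' and 'user' keys, given in
    submission order. Jobs of urgent users come first, then jobs ranked by
    group priority; ties keep their submission order.
    """
    def sort_key(indexed_job):
        index, job = indexed_job
        if job["user"] in urgent_users:
            rank = -1
        else:
            rank = group_priority.get(job["group"], group_priority["default"])
        return (rank, index)

    return [job["id"] for _, job in sorted(enumerate(jobs), key=sort_key)]


if __name__ == "__main__":
    queue = [
        {"id": 1, "group": "susy", "user": "a"},
        {"id": 2, "group": "higgs", "user": "b"},
        {"id": 3, "group": "tracker", "user": "xxx"},
        {"id": 4, "group": "higgs", "user": "c"},
    ]
    print(order_jobs(queue, GROUP_PRIORITY, urgent_users={"xxx"}))
```

A time-limited rule such as "for the next three weeks" would only change which priority table is passed in on a given day; the ordering logic stays the same.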

CRAB future (2)
CRAB will be split into two components to minimize the user effort needed to manage analysis jobs and obtain their results. Some user actions will be delegated to services that do not depend on the user, which follow the job evolution on the grid, get the results and return them to the user.
The Me/MyFriend idea:
–Me: the user desktop (laptop or shell), where the working environment is and where the user can work interactively. For user operations such as:
  • job creation
  • job submission
–MyFriend: a set of robust services running 24x7 to guarantee the execution of:
  • job tracking
  • resubmission
  • output retrieval

Conclusion
CRAB was born in April ’05
A big effort has been made to understand user needs and how best to use the services provided by the grid
A lot of work has been done to make it robust, flexible and reliable
Users appreciate the tool and are asking for further improvements
–CRAB has been used by many CMS collaborators to analyze remote data for the CMS Physics TDR that were otherwise not accessible
–CRAB is used to continuously test CMS Tiers to prove the robustness of the whole infrastructure
The use of CRAB proves that the complete computing chain for distributed analysis works for a generic CMS user!

Back-up slide

Statistics with CRAB (1)
The weekly rate of the CRAB-jobs flow
[Figures: number of jobs per week; share of jobs on LCG and OSG (%) per week]

Statistics with CRAB (2)
Efficiency: % of jobs which reach the WN (remote CE) and run
[Figures: efficiency for INFN CEs and for all CEs]

CRAB flow: the future

CMS