Experience in ALICE – Analysis Framework and Train


Experience in ALICE – Analysis Framework and Train
Andreas Morsch, CERN

Analysis in ALICE
Three main analysis modes:
- Prompt data processing (calibration, alignment, reconstruction, analysis) at CERN with PROOF
- Analysis with local PROOF clusters
- Batch analysis on the GRID infrastructure

Plans for Scheduled Analysis
Scheduled analysis is the most efficient way for many analysis tasks to read and process the full data set, in particular if resources are scarce.
- Optimises the CPU/IO ratio
It also:
- Helps to develop a common, well-tested framework for analysis
- Develops a common knowledge base and terminology
- Helps to document the analysis procedure and makes results reproducible

Plans for scheduled analysis: Analysis train producing AODs
[Diagram: inputs ESD/AOD and Monte Carlo Truth; train of TASK 1, TASK 2, TASK …, TASK N; Acceptance and Efficiency Correction Services; output AOD]

What the Analysis Framework does in ALICE
- Transparent access to all resources with the same code. Usage: local, AliEn grid, CAF/PROOF
- Transparent access to different inputs: ESD, AOD, kinematics tree (MC truth)
- Allows for "scheduled" analysis: a common and well-tested environment to run several tasks

Solutions
- Transparent access to computing resources: hide computing-scheme-dependent code in one manager class
- Transparent access to data: make intensive use of interfaces (VEventHandler, VEvent, VTrack)
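The "intensive use of interfaces" point can be illustrated without AliRoot. The following is a minimal sketch in standard C++; every class and function name in it (VTrackSketch, EsdTrackSketch, AodTrackSketch, SumPt) is a hypothetical stand-in chosen for this illustration, not the actual VTrack/VEvent API. It only shows how user code written against an abstract track type stays unchanged when the input format changes.

```cpp
#include <cassert>
#include <cmath>
#include <memory>
#include <vector>

// Hypothetical stand-in for a virtual track interface (not the AliRoot class).
class VTrackSketch {
public:
    virtual ~VTrackSketch() = default;
    virtual double Px() const = 0;
    virtual double Py() const = 0;
    // Transverse momentum is defined once, in terms of the interface.
    double Pt() const { return std::hypot(Px(), Py()); }
};

// First concrete format: stores Cartesian momentum components.
class EsdTrackSketch : public VTrackSketch {
public:
    EsdTrackSketch(double px, double py) : fPx(px), fPy(py) {}
    double Px() const override { return fPx; }
    double Py() const override { return fPy; }
private:
    double fPx, fPy;
};

// Second concrete format: stores pt and azimuthal angle instead.
class AodTrackSketch : public VTrackSketch {
public:
    AodTrackSketch(double pt, double phi) : fPt(pt), fPhi(phi) {}
    double Px() const override { return fPt * std::cos(fPhi); }
    double Py() const override { return fPt * std::sin(fPhi); }
private:
    double fPt, fPhi;
};

// User code: written once against the interface, unaware of the input format.
inline double SumPt(const std::vector<std::unique_ptr<VTrackSketch>>& tracks) {
    double sum = 0.0;
    for (const auto& t : tracks) {
        assert(t != nullptr);
        sum += t->Pt();
    }
    return sum;
}
```

SumPt accepts any mixture of the two track types; supporting a further input format would mean adding one more derived class and leaving the user code untouched.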

AliAnalysis… Framework
- Data-oriented model composed of independent tasks
- Task execution is triggered by data readiness
- Tasks are owned and managed by AliAnalysisManager
- Parallel execution and the event loop are done via TSelector functionality (mandatory for usage with PROOF)
[Diagram: AliAnalysisTask with slots INPUT 0, INPUT 1 and OUTPUT 0, and containers CONT 0, CONT 1, CONT 2]
N.B.: The analysis framework itself has a very general design, not bound to ALICE software. (A. Gheata)
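"Task execution triggered by data readiness" can be sketched as follows. This is an illustration in standard C++ only: ContainerSketch, TaskSketch and RunWhenReady are hypothetical names, and the scheduling loop is an assumption about how such a model could work, not the AliAnalysisManager implementation.

```cpp
#include <cassert>
#include <cstddef>
#include <functional>
#include <string>
#include <vector>

// Hypothetical data container: holds one value plus a "ready" flag.
struct ContainerSketch {
    double value = 0.0;
    bool ready = false;
};

// Hypothetical task: reads input containers, fills one output container.
struct TaskSketch {
    std::string name;
    std::vector<std::size_t> inputs;  // indices into the container list
    std::size_t output = 0;           // index of the output container
    std::function<double(const std::vector<double>&)> exec;
    bool done = false;
};

// Runs each task as soon as all of its inputs are ready.
// Returns the task names in the order in which they were executed.
inline std::vector<std::string> RunWhenReady(std::vector<ContainerSketch>& containers,
                                             std::vector<TaskSketch>& tasks) {
    std::vector<std::string> order;
    bool progress = true;
    while (progress) {
        progress = false;
        for (auto& task : tasks) {
            if (task.done) continue;
            bool allReady = true;
            std::vector<double> in;
            for (std::size_t idx : task.inputs) {
                assert(idx < containers.size());
                allReady = allReady && containers[idx].ready;
                in.push_back(containers[idx].value);
            }
            if (!allReady) continue;  // inputs not yet available
            assert(task.output < containers.size());
            containers[task.output].value = task.exec(in);
            containers[task.output].ready = true;  // may unblock other tasks
            task.done = true;
            order.push_back(task.name);
            progress = true;
        }
    }
    return order;
}
```

The order in which tasks are registered does not matter in this sketch: a task whose input is produced by another task simply waits until that container has been filled.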

AliAnalysisManager and PROOF: The Manager in Disguise
[Diagram: TSelector, AliAnalysisSelector and AliAnalysisManager, linked by relations labelled "Creates" and "Delegates"]

AliAnalysisManager – PROOF mode
[Diagram: CLIENT side: MyAnalysis.C sets up the analysis manager (AM) with task1, task2, task3, … taskN, the input chain and the outputs, and calls AM->StartAnalysis("proof"). PROOF side: AliAnalysisSelector (a TSelector) runs on the master and the workers; the AM is shipped in the input list; each worker goes through SlaveBegin(), Process() and SlaveTerminate() and produces outputs O1, O2, … On; the output list is returned and Terminate() is called.]
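The split-and-merge pattern behind the PROOF mode can be emulated in a few lines. The sketch below is standard C++ and runs sequentially; HistSketch, ProcessEvent, RunLocal and RunSplit are hypothetical names, and the round-robin distribution of events is an assumption made for brevity (PROOF itself assigns work packets dynamically). It shows why per-worker outputs have to be mergeable and that the merged result equals the local one.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

// Hypothetical per-worker output: a histogram with unit-width bins from 0.
struct HistSketch {
    std::vector<long> bins;
    explicit HistSketch(std::size_t n) : bins(n, 0) {}
    void Fill(double x) {
        // Underflow and overflow entries are ignored in this sketch.
        if (x < 0.0 || x >= static_cast<double>(bins.size())) return;
        ++bins[static_cast<std::size_t>(x)];
    }
    // Merging adds bin contents; this is what makes the output mergeable.
    void Merge(const HistSketch& other) {
        assert(other.bins.size() == bins.size());
        for (std::size_t i = 0; i < bins.size(); ++i) bins[i] += other.bins[i];
    }
};

// The same per-event code is used in both modes.
inline void ProcessEvent(double pt, HistSketch& out) { out.Fill(pt); }

// "Local" mode: one loop over all events.
inline HistSketch RunLocal(const std::vector<double>& events, std::size_t nBins) {
    HistSketch h(nBins);
    for (double e : events) ProcessEvent(e, h);
    return h;
}

// "Worker" mode, emulated sequentially: events are dealt out round-robin,
// each worker fills its own output, and the outputs are merged at the end.
inline HistSketch RunSplit(const std::vector<double>& events, std::size_t nBins,
                           std::size_t nWorkers) {
    assert(nWorkers > 0);
    std::vector<HistSketch> workerOut(nWorkers, HistSketch(nBins));
    for (std::size_t i = 0; i < events.size(); ++i)
        ProcessEvent(events[i], workerOut[i % nWorkers]);
    HistSketch merged(nBins);
    for (const auto& w : workerOut) merged.Merge(w);
    return merged;
}
```

Because Merge is a plain bin-wise sum, the number of workers and the way events are distributed do not affect the final histogram.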

Tasks and event loop
[Diagram: AliAnalysisManager holds TObjArray *fTasks and TObjArray *fContainers. EVENT LOOP (AliAnalysisSelector, Chain->Process()): ESD chain, top container, top-level tasks and containers (the "train"): task1, task2, output1, output2. POST EVENT LOOP: fit task, task4, result, result.]

Tasks and Common I/O
[Diagram: AliAnalysisManager with its event handlers (AliVEventHandler): AliESDInputHandler (AliAODInputHandler), AliMCEventHandler and AliAODHandler (output). Events (AliVEvent): AliESDEvent (AliAODEvent), AliMCEvent, AliAODEvent. Particles (AliVParticle): AliESDtrack, AliMCParticle, AliAODtrack. Tasks (AliAnalysisTask): data I/O via slots.]

AliAnalysisTask
- ConnectInputData(): define which data is connected to which slot
- CreateOutputObjects(): create histograms
- Init(), LocalInit(): optional, e.g. read parameters
- Exec(): the event loop
- Terminate(): called at the end; can, for example, draw a histogram
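The task life cycle listed on this slide can be mimicked with a small stand-in base class. The sketch below is standard C++; TaskBaseSketch, CountingTaskSketch and RunTask are hypothetical names, and the call order used by RunTask simply follows the order in which the slide lists the methods, which is an assumption rather than a statement about the real framework.

```cpp
#include <cassert>
#include <string>
#include <vector>

// Hypothetical stand-in for a task base class; only the call order is modelled.
class TaskBaseSketch {
public:
    virtual ~TaskBaseSketch() = default;
    virtual void ConnectInputData() = 0;     // bind data to input slots
    virtual void CreateOutputObjects() = 0;  // book histograms and other outputs
    virtual void LocalInit() {}              // optional, e.g. read parameters
    virtual void Exec(double event) = 0;     // called once per event
    virtual void Terminate() = 0;            // called once at the end
};

// Example user task: sums a per-event quantity and logs the call sequence.
class CountingTaskSketch : public TaskBaseSketch {
public:
    void ConnectInputData() override { fLog.push_back("connect"); }
    void CreateOutputObjects() override {
        fSum = 0.0;
        fCreated = true;
        fLog.push_back("create");
    }
    void LocalInit() override { fLog.push_back("init"); }
    void Exec(double event) override {
        assert(fCreated);  // outputs must exist before the event loop starts
        fSum += event;
        ++fEvents;
    }
    void Terminate() override { fLog.push_back("terminate"); }

    double Sum() const { return fSum; }
    int Events() const { return fEvents; }
    const std::vector<std::string>& Log() const { return fLog; }

private:
    double fSum = 0.0;
    int fEvents = 0;
    bool fCreated = false;
    std::vector<std::string> fLog;
};

// Hypothetical driver: calls the hooks in the order the slide lists them.
inline void RunTask(TaskBaseSketch& task, const std::vector<double>& events) {
    task.ConnectInputData();
    task.CreateOutputObjects();
    task.LocalInit();
    for (double e : events) task.Exec(e);
    task.Terminate();
}
```

Initialising all data members in the class definition, as done here, also addresses the "correct data member initialization" concern raised later in the talk for code that is copied to workers.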

Common ESD Access Handling
[Diagram: AliAnalysisManager, AliVirtualEventHandler, AliESDInputHandler, AliESDEvent]
    AliESDInputHandler* inpHandler = new AliESDInputHandler();
    inpHandler->SetInactiveBranches(" Calo FMD ");
    AliAnalysisManager *mgr = new AliAnalysisManager("Analysis Train", "Test");
    mgr->SetInputEventHandler(inpHandler);

Common ESD Access Handling
    void AliAnalysisTaskXYZ::ConnectInputData(Option_t* option)
    {
        // Connect the input data
        AliESDInputHandler* esdH = (AliESDInputHandler*)
            ((AliAnalysisManager::GetAnalysisManager())->GetInputEventHandler());
        fESD = esdH->GetEvent();
    }

Common AOD Access Handling
[Diagram: AliAnalysisManager, AliVirtualEventHandler, AliAODHandler, AliAODEvent]
    AliAODHandler* aodHandler = new AliAODHandler();
    aodHandler->SetOutputFileName("aod.root");
    AliAnalysisManager *mgr = new AliAnalysisManager("Analysis Train", "Test");
    mgr->SetOutputEventHandler(aodHandler);
    AliAnalysisDataContainer *coutput1 =
        mgr->CreateContainer("AODTree", TTree::Class(),
                             AliAnalysisManager::kOutputContainer, "default");

User Analysis Code: Output Data
    void AliAnalysisTaskXYZ::CreateOutputObjects()
    {
        // Create the output container
        //
        // Default AOD
        AliAODHandler* handler = (AliAODHandler*)
            ((AliAnalysisManager::GetAnalysisManager())->GetOutputEventHandler());
        fAOD = handler->GetAOD();
    }

[Diagram: AliAnalysisManager, AliVirtualEventHandler, AliVEvent, AliMCEventHandler, AliMCEvent]
    AliMCEventHandler* mcHandler = new AliMCEventHandler();
    AliAnalysisManager *mgr = new AliAnalysisManager("Analysis Train", "Test");
    mgr->SetMCtruthEventHandler(mcHandler);

User Analysis Code: MC truth
    void AliAnalysisTaskXYZ::Exec(Option_t* option)
    {
        // During analysis
        // Global analysis manager
        AliAnalysisManager* mgr = AliAnalysisManager::GetAnalysisManager();
        AliMCEvent* mc = mgr->GetMCEventHandler()->MCEvent();
        Int_t ntrack = mc->GetNumberOfTracks();
        // Loop over the MC particles of this event
        for (Int_t i = 0; i < ntrack; i++) {
            AliVParticle* particle = mc->GetTrack(i);
            Double_t pt = particle->Pt();
        }
    }

(Recommended) Integration of User Analysis Code
[Diagram: AliAnalysisTask; AliAnalysisUserTask (labels: "Steers", "Delegates"); user analysis code (labels: "Implements Interface", "Deals with AliAODEvent", "Documents selection and analysis parameters"); Factory]

Example: Gamma Analysis Task
[Class diagram: AliAnalysisGammaTask; AliAnaGamma; AliAnaGammaDirect; AliAnaGammaCorrelation; AliAnaGammaParton, AliAnaGammaHadron, AliAnaGammaJetLeadCone, AliAnaGammaJetFinder; AliGammaReader; AliGammaDataReader, AliGammaMCReader, AliGammaMCDataReader]

CAF Related Issues
- We produce large output trees, which are currently memory resident on the worker side.
- We urgently need the TProofFile/TFileMerger mechanism to handle file-resident trees.
Future challenges:
- Event mixing with nested event loops
- Repeating loops (calibration, see M. Ivanov)

Integration of User Tasks
- Relatively smooth so far
- Needs user support to scrutinize (in particular for CAF/PROOF):
  - Memory requirements (leaks)
  - Correct data member initialization, on client and workers

Analysis train producing AOD
Tested in LOCAL, GRID and PROOF modes on p-p and Pb-Pb events.
[Diagram: inputs Monte Carlo Truth and ESD/AOD; tasks ESD FILTERING, JET ANALYSIS, GAMMA TASK, others to come …; output AOD]

Summary
- ALICE Offline has developed an analysis framework that hides computing-scheme dependences from the user. The same user code runs on a local PC, on CAF/PROOF and on the Grid.
- The framework manages a list of independent tasks:
  - Execution is triggered by data readiness
  - Sequential execution of the top-level tasks (train) is driven by the input chain
- Common I/O is managed by event handlers
- Run-time configuration of tasks and handlers

Thanks to …
A. Gheata, M. Gheata, J.-F. Grosse-Oetringhaus, Ch. Klein-Boesing, M. Oldenburg, F. Carminati, Y. Schutz, G. Conesa and many others