ALICE Offline Tutorial Markus Oldenburg – CERN May 15, 2007 – University of Sao Paulo.

Slides:



Advertisements
Similar presentations
PowerPoint Tutorial 1 Creating a Presentation
Advertisements

Word Tutorial 2 Editing and Formatting a Document
Excel Tutorial 3 Working with Formulas and Functions
Objectives Create an action query to create a table
TeraGrid's GRAM Auditing & Accounting, & its Integration with the LEAD Science Gateway Stuart Martin Computation Institute, University of Chicago & Argonne.
Measurement of the Hard photodisintegration of a proton-pair in the 3 He nucleus. Running now in JLab, hall A E03-101:GamPP n p p Ishay Pomerantz, Tel-Aviv.
May 17 and 22, ALICE Offline Tutorial Markus Oldenburg – CERN May 17 and 22, 2007 – University of Sao Paulo.
May 23, ALICE Offline Tutorial Markus Oldenburg – CERN May 23, 2007 – University of São Paulo.
June 1, ALICE Offline Tutorial Markus Oldenburg – CERN June 1, 2007 – University of São Paulo.
3 october Brown easyBorrow (beta) Brown University Library October 2007.
Word Tutorial 1 Creating a Document
V3.0 12/04/ For Coral Systems With PRI Services Presented by Education Technology Services.
Italian white certificates scheme
Objectives Explore a structured range of data Freeze rows and columns
March 8, Dynamic Fault Tree analysis using Input/Output Interactive Markov Chains Hichem Boudali, Pepijn Crouzen, and Mariëlle Stoelinga. Formal.
Word Tutorial 5 Working with Templates and Outlines
Helping Online Students Do Better Academically with Useful Learning & Study Strategies. Some recommended instructor activities to help students succeed.
Proof/Proof-on-Demand at Atlas T3 Shuwei Ye (BNL) Aug 20, 2014.
PROOF and AnT in PHOBOS Kristjan Gulbrandsen March 25, 2004 Collaboration Meeting.
Statistics of CAF usage, Interaction with the GRID Marco MEONI CERN - Offline Week –
PROOF: the Parallel ROOT Facility Scheduling and Load-balancing ACAT 2007 Jan Iwaszkiewicz ¹ ² Gerardo Ganis ¹ Fons Rademakers ¹ ¹ CERN PH/SFT ² University.
PROOF - Parallel ROOT Facility Kilian Schwarz Robert Manteufel Carsten Preuß GSI Bring the KB to the PB not the PB to the KB.
The ALICE Analysis Framework A.Gheata for ALICE Offline Collaboration 11/3/2008 ACAT'081A.Gheata – ALICE Analysis Framework.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Interactive Data Analysis with PROOF Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers CERN.
1 Part III: PROOF Jan Fiete Grosse-Oetringhaus – CERN Andrei Gheata - CERN V3.2 –
1 Marek BiskupACAT2005PROO F Parallel Interactive and Batch HEP-Data Analysis with PROOF Maarten Ballintijn*, Marek Biskup**, Rene Brun**, Philippe Canal***,
PROOF Cluster Management in ALICE Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
ROOT for Data Analysis1 Intel discussion meeting CERN 5 Oct 2003 Ren é Brun CERN Distributed Data Analysis.
ROOT and Federated Data Stores What Features We Would Like Fons Rademakers CERN CC-IN2P3, Nov, 2011, Lyon, France.
ROOT-CORE Team 1 PROOF xrootd Fons Rademakers Maarten Ballantjin Marek Biskup Derek Feichtinger (ARDA) Gerri Ganis Guenter Kickinger Andreas Peters (ARDA)
Introduction to the PROOF system Ren é Brun CERN Do-Son school on Advanced Computing and GRID Technologies for Research Institute of.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
March, PROOF - Parallel ROOT Facility Maarten Ballintijn Bring the KB to the PB not the PB to the KB.
Super Scaling PROOF to very large clusters Maarten Ballintijn, Kris Gulbrandsen, Gunther Roland / MIT Rene Brun, Fons Rademakers / CERN Philippe Canal.
Postgraduate Computing Lectures PAW 1 PAW: Physicist Analysis Workstation What is PAW? –A tool to display and manipulate data. Learning PAW –See ref. in.
1 Status of PROOF G. Ganis / CERN Application Area meeting, 24 May 2006.
Alberto Colla - CERN ALICE off-line week 1 Alberto Colla ALICE off-line week Cern, May 31, 2005 Table of contents: ● Summary of requirements ● Description.
Computing for Alice at GSI (Proposal) (Marian Ivanov)
A. Gheata, ALICE offline week March 09 Status of the analysis framework.
Gustavo Conesa ALICE offline week Gamma and Jet correlations analysis framework Short description, Status, HOW TO use and TO DO list 1/9.
ALICE Offline Week October 4 th 2006 Silvia Arcelli & Chiara Zampolli TOF Online Calibration - Strategy - TOF Detector Algorithm - TOF Preprocessor.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Analysis experience at GSIAF Marian Ivanov. HEP data analysis ● Typical HEP data analysis (physic analysis, calibration, alignment) and any statistical.
Go4 Workshop J.Adamczewski-Musch, S.Linev Go4 advanced features.
AliRoot Classes for access to Calibration and Alignment objects Magali Gruwé CERN PH/AIP ALICE Offline Meeting February 17 th 2005 To be presented to detector.
ROOT and PROOF Tutorial Arsen HayrapetyanMartin Vala Yerevan Physics Institute, Yerevan, Armenia; European Organization for Nuclear Research (CERN)
ALICE Offline Tutorial PART 3: PROOF Alice Core Offline 5 th June, 2008.
Some topics for discussion 31/03/2016 P. Hristov 1.
Joe Foster 1 This talk extends the one I gave in 2006 called Visualizing Data with ROOT. –
PROOF on multi-core machines G. GANIS CERN / PH-SFT for the ROOT team Workshop on Parallelization and MultiCore technologies for LHC, CERN, April 2008.
AAF tips and tricks Arsen Hayrapetyan Yerevan Physics Institute, Armenia.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
HYDRA Framework. Setup of software environment Setup of software environment Using the documentation Using the documentation How to compile a program.
Lyon Analysis Facility - status & evolution - Renaud Vernet.
The ALICE Analysis -- News from the battlefield Federico Carminati for the ALICE Computing Project CHEP 2010 – Taiwan.
Compute and Storage For the Farm at Jlab
PROOF integration in FAIRROOT
Experience of PROOF cluster Installation and operation
Report PROOF session ALICE Offline FAIR Grid Workshop #1
Status of the Analysis Task Force
Status of the CERN Analysis Facility
PROOF – Parallel ROOT Facility
Experience in ALICE – Analysis Framework and Train
Analysis framework - status
G. Ganis, 2nd LCG-France Colloquium
Kristjan Gulbrandsen March 25, 2004 Collaboration Meeting
Support for ”interactive batch”
PROOF - Parallel ROOT Facility
Presentation transcript:

ALICE Offline Tutorial Markus Oldenburg – CERN May 15, 2007 – University of Sao Paulo

ALICE Offline Tutorial F.Carminati, P.Christakoglou, J.F.Grosse-Oetringhaus, P.Hristov, A.Peters, P.Saiz April 13, 2007 – v1.3 based on:

Part III: PROOF available online at: -> Seminarshttp://cern.ch/Oldenburg

May 15, PROOF Parallel ROOT Facility Interactive parallel analysis on a local cluster PROOF itself is not related to Grid Can be used in the Grid Can access Grid files The usage of PROOF is transparent The same code can be run locally and in a PROOF system (certain rules have to be followed) PROOF is part of ROOT

May 15, root Remote PROOF Cluster Data proof Client - Local PC $ root ana.C stdout/result node1 node2 node3 node4 $ root root [0] tree->Process(ana.C) $ root root [0] tree->Process(ana.C) root [1] gROOT->Proof(remote) $ root root [0] tree->Process(ana.C) root [1] TProof::Open(remote) root [2] chain->Process(ana.C) ana.C proof PROOF Schema Data master slave

May 15, Terminology Client Your machine running a ROOT session that is connected to a PROOF master Master PROOF machine coordinating work between Slaves Slave PROOF machine that processes data Query A job submitted from the client to the PROOF system. A query consists of a selector and a chain Selector A class containing the analysis code (more details later) Chain A list of files (trees) to process (more details later)

May 15, TTree A tree is a container for data storage with disk overspill It consists of several branches These can be in one or several files Branches are stored contiguously (split mode) When reading a tree, certain branches can be switched off speed up of analysis when not all data is needed Tree Branch

May 15, TTree #include "TTree.h" #include "TFile.h" #include "TRandom.h" class point { public: void Set() {x=gRandom->Rndm();y=gRandom->Rndm();z=gRandom->Rndm();} private: Float_t x, y, z; ClassDef(point, 1) }; Int_t t() { point *pp = new point(); TTree *tree = new TTree("Test","Test Tree",99); TFile *file = new TFile("test.root","recreate"); tree->Branch("point",&pp); for(Int_t i=0; i<100; ++i) { pp->Set(); tree->Fill();} tree->Write(); file->Close(); // file=new TFile("test.root","read"); tree->Print(); // return 0; }

May 15, TTree (2) ****************************************************************************** *Tree :Test : Test Tree * *Entries : 100 : Total = 4090 bytes File Size = 0 * * : : Tree compression factor = 1.00 * ****************************************************************************** *Branch :point * *Entries : 100 : BranchElement (see below) * * * *Br 0 :x : * *Entries : 100 : Total Size= 1006 bytes One basket in memory * *Baskets : 0 : Basket Size= bytes Compression= 1.00 * * * *Br 1 :y : * *Entries : 100 : Total Size= 1006 bytes One basket in memory * *Baskets : 0 : Basket Size= bytes Compression= 1.00 * * * *Br 2 :z : * *Entries : 100 : Total Size= 1006 bytes One basket in memory * *Baskets : 0 : Basket Size= bytes Compression= 1.00 * * * point xyzxyz xxxxxxxxxxyyyyyyyyyyzzzzzzzzzz Branches File

May 15, How to use PROOF Files to be analyzed are put into a chain ( TChain) Analysis written as a selector ( TSelector, AliSelector, AliSelectorRL) Input/Output is sent using dedicated lists If additional libraries are needed, these have to be distributed as a package Analysis (TSelector) Input Files (TChain) Output (TList) Input (TList)

May 15, TChain A chain is a list of trees (in several files) Normal TTree functions can be used Draw(...), Scan(...) these iterate over all elements of the chain Selectors can be used with chains Process(const char* selectorFileName) After using SetProof() these calls are run in PROOF Chain Tree1 (File1) Tree2 (File2) Tree3 (File3) Tree4 (File3) Tree5 (File4)

May 15, once on your client once on each Slave TSelector for each tree for each event Classes derived from TSelector can run locally and in PROOF Begin() SlaveBegin() Init(TTree* tree) Process(Long64_t entry) SlaveTerminate() Terminate()

May 15, Input / Output The TSelector class has two members of type TList: fInput, fOutput These are used to get input data or put output data Input list Before running a query the input list is populated proof->AddInput(myObj) In the selector (Begin, SlaveBegin) the object is retrieved: fInput->FindObject(myObject)

May 15, Input / Output (2) Output list After processing, the output has to be added to the output list on each Slave (in SlaveTerminate) fOutput->Add(fResult) PROOF merges the results from each query automatically (see next slide) On your client (in Terminate) you retrieve the object and save it, display it,... fOutput->FindObject(myResult)

May 15, Input / Output (3) Merging Objects are identified by name Standard merging implementation for histograms available Other classes need to implement Merge(TCollection*) When no merging function is available all the individual objects are returned Result from Slave 1 Result from Slave 2 Final result Merge()

May 15, Chain Tree1 (File1) Tree2 (File2) Tree3 (File3) Tree4 (File3) Tree5 (File4) Workflow Summary Analysis (TSelector) Input (TList) proof

May 15, Workflow Summary Analysis (TSelector) Input (TList) proof Output (TList) Output (TList) Output (TList) Merged Output

May 15, Packages PAR files: PROOF ARchive. Like Java jar Gzipped tar file PROOF-INF directory BUILD.sh, building the package, executed per Slave SETUP.C, set environment, load libraries, executed per Slave API to manage and activate packages UploadPackage(package.par) EnablePackage(package)

May 15, Accessing ESD Use local ROOT To access AliESDs.root, the ESD.par package has to be uploaded into the PROOF environment Selector derives from AliSelector (in STEER) Access to data by member: fESD TSelector AliSelector

May 15, Accessing the RunLoader Use local AliRoot Access to Kinematics, Clusters, etc. requires access to the RunLoader Therefore (nearly) full AliRoot needs to be loaded A AliRoot version is already deployed on the CAF test system and can be enabled by a 3 line macro (part of the tutorial files, see later) ESD package is not allowed to be loaded Selector derives from AliSelectorRL (in STEER) GetStack(), GetRunLoader(), GetHeader() TSelector AliSelector AliSelectorRL

May 15, CERN Analysis Facility The CERN Analysis Facility (CAF) will run PROOF for ALICE Prompt analysis of pp data Pilot analysis of PbPb data Calibration & Alignment Available to the whole collaboration but the number of users will be limited for efficiency reasons Design goals 500 CPUs 100 TB of selected data locally available

May 15, Evaluation of PROOF Test setup since May machines, 2 CPUs each, 200 GB disk Tests performed Usability tests Simple speedup plot Evaluation of different query types Evaluation of the system when running a combination of query types Goal: Realistic simulation of users using the system

May 15, Query Type Cocktail A realistic stress test consists of different users that submit different types of queries 4 different query types 20% very short queries 40% short queries 20% medium queries 20% long queries User mix 33 nodes available for the test Maximum average speedup for 10 users = 6.6 (33 nodes = 66 CPUs)

May 15, Relative Speedup