1 Status of PROOF G. Ganis / CERN Application Area meeting, 24 May 2006.

Slides:



Advertisements
Similar presentations
Database System Concepts and Architecture
Advertisements

CHEP 2012 – New York City 1.  LHC Delivers bunch crossing at 40MHz  LHCb reduces the rate with a two level trigger system: ◦ First Level (L0) – Hardware.
1 PROOF & GRID Update Fons Rademakers. 2 Parallel ROOT Facility The PROOF system allows: parallel execution of scripts parallel analysis of trees in a.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
June 21, PROOF - Parallel ROOT Facility Maarten Ballintijn, Rene Brun, Fons Rademakers, Gunter Roland Bring the KB to the PB.
New CERN CAF facility: parameters, usage statistics, user support Marco MEONI Jan Fiete GROSSE-OETRINGHAUS CERN - Offline Week –
1 Status of the ALICE CERN Analysis Facility Marco MEONI – CERN/ALICE Jan Fiete GROSSE-OETRINGHAUS - CERN /ALICE CHEP Prague.
Chapter 3 Operating Systems Introduction to CS 1 st Semester, 2015 Sanghyun Park.
DIANE Overview Germán Carrera, Alfredo Solano (CNB/CSIC) EMBRACE COURSE Monday 19th of February to Friday 23th. CNB-CSIC Madrid.
PROOF: the Parallel ROOT Facility Scheduling and Load-balancing ACAT 2007 Jan Iwaszkiewicz ¹ ² Gerardo Ganis ¹ Fons Rademakers ¹ ¹ CERN PH/SFT ² University.
PROOF - Parallel ROOT Facility Kilian Schwarz Robert Manteufel Carsten Preuß GSI Bring the KB to the PB not the PB to the KB.
The ALICE Analysis Framework A.Gheata for ALICE Offline Collaboration 11/3/2008 ACAT'081A.Gheata – ALICE Analysis Framework.
The Next Generation Root File Server Andrew Hanushevsky Stanford Linear Accelerator Center 27-September-2004
PROOF Status and Perspectives G. GANIS CERN / LCG VII ROOT Users workshop, CERN, March 2007.
Test Of Distributed Data Quality Monitoring Of CMS Tracker Dataset H->ZZ->2e2mu with PileUp - 10,000 events ( ~ 50,000 hits for events) The monitoring.
Grid Job and Information Management (JIM) for D0 and CDF Gabriele Garzoglio for the JIM Team.
D0 SAM – status and needs Plagarized from: D0 Experiment SAM Project Fermilab Computing Division.
Interactive Data Analysis with PROOF Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers CERN.
3rd June 2004 CDF Grid SAM:Metadata and Middleware Components Mòrag Burgon-Lyon University of Glasgow.
DIANE Project CHEP 03 DIANE Distributed Analysis Environment for semi- interactive simulation and analysis in Physics Jakub T. Moscicki,
1 DIRAC – LHCb MC production system A.Tsaregorodtsev, CPPM, Marseille For the LHCb Data Management team CHEP, La Jolla 25 March 2003.
Can we use the XROOTD infrastructure in the PROOF context ? The need and functionality of a PROOF Master coordinator has been discussed during the meeting.
Evolution of Parallel Programming in HEP F. Rademakers – CERN International Workshop on Large Scale Computing VECC, Kolkata.
D C a c h e Michael Ernst Patrick Fuhrmann Tigran Mkrtchyan d C a c h e M. Ernst, P. Fuhrmann, T. Mkrtchyan Chep 2003 Chep2003 UCSD, California.
Introduction to dCache Zhenping (Jane) Liu ATLAS Computing Facility, Physics Department Brookhaven National Lab 09/12 – 09/13, 2005 USATLAS Tier-1 & Tier-2.
1 Marek BiskupACAT2005PROO F Parallel Interactive and Batch HEP-Data Analysis with PROOF Maarten Ballintijn*, Marek Biskup**, Rene Brun**, Philippe Canal***,
PROOF Cluster Management in ALICE Jan Fiete Grosse-Oetringhaus, CERN PH/ALICE CAF / PROOF Workshop,
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
And Tier 3 monitoring Tier 3 Ivan Kadochnikov LIT JINR
ROOT for Data Analysis1 Intel discussion meeting CERN 5 Oct 2003 Ren é Brun CERN Distributed Data Analysis.
Operating Systems David Goldschmidt, Ph.D. Computer Science The College of Saint Rose CIS 432.
ROOT and Federated Data Stores What Features We Would Like Fons Rademakers CERN CC-IN2P3, Nov, 2011, Lyon, France.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
1 PROOF The Parallel ROOT Facility Gerardo Ganis / CERN CHEP06, Computing in High Energy Physics 13 – 17 Feb 2006, Mumbai, India Bring the KB to the PB.
ROOT-CORE Team 1 PROOF xrootd Fons Rademakers Maarten Ballantjin Marek Biskup Derek Feichtinger (ARDA) Gerri Ganis Guenter Kickinger Andreas Peters (ARDA)
PROOF in Atlas Tier 3 model Sergey Panitkin 1 BNL.
ARDA Prototypes Andrew Maier CERN. ARDA WorkshopAndrew Maier, CERN2 Overview ARDA in a nutshell –Experiments –Middleware Experiment prototypes (basic.
Introduction to the PROOF system Ren é Brun CERN Do-Son school on Advanced Computing and GRID Technologies for Research Institute of.
CASTOR evolution Presentation to HEPiX 2003, Vancouver 20/10/2003 Jean-Damien Durand, CERN-IT.
Summary Distributed Data Analysis Track F. Rademakers, S. Dasu, V. Innocente CHEP06 TIFR, Mumbai.
H.G.Essel: Go4 - J. Adamczewski, M. Al-Turany, D. Bertini, H.G.Essel, S.Linev CHEP 2003 GSI Online Offline Object Oriented Go4.
PROOF and ALICE Analysis Facilities Arsen Hayrapetyan Yerevan Physics Institute, CERN.
A prototype for an extended PROOF What is PROOF ? ROOT analysis model … … on a multi-tier architecture Status New development Prototype based on XRD Demo.
Online Monitoring System at KLOE Alessandra Doria INFN - Napoli for the KLOE collaboration CHEP 2000 Padova, 7-11 February 2000 NAPOLI.
March, PROOF - Parallel ROOT Facility Maarten Ballintijn Bring the KB to the PB not the PB to the KB.
Super Scaling PROOF to very large clusters Maarten Ballintijn, Kris Gulbrandsen, Gunther Roland / MIT Rene Brun, Fons Rademakers / CERN Philippe Canal.
Data Transfer Service Challenge Infrastructure Ian Bird GDB 12 th January 2005.
ANALYSIS TOOLS FOR THE LHC EXPERIMENTS Dietrich Liko / CERN IT.
March 13, 2006PROOF Tutorial1 Distributed Data Analysis with PROOF Fons Rademakers Bring the KB to the PB not the PB to the KB.
September, 2002CSC PROOF - Parallel ROOT Facility Fons Rademakers Bring the KB to the PB not the PB to the KB.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
Latest Improvements in the PROOF system Bleeding Edge Physics with Bleeding Edge Computing Fons Rademakers, Gerri Ganis, Jan Iwaszkiewicz CERN.
03/09/2007http://pcalimonitor.cern.ch/1 Monitoring in ALICE Costin Grigoras 03/09/2007 WLCG Meeting, CHEP.
Sept. 2000CERN School of Computing1 PROOF and ROOT Grid Features Fons Rademakers.
The ALICE data quality monitoring Barthélémy von Haller CERN PH/AID For the ALICE Collaboration.
ROOT and PROOF Tutorial Arsen HayrapetyanMartin Vala Yerevan Physics Institute, Yerevan, Armenia; European Organization for Nuclear Research (CERN)
Claudio Grandi INFN Bologna Virtual Pools for Interactive Analysis and Software Development through an Integrated Cloud Environment Claudio Grandi (INFN.
PROOF on multi-core machines G. GANIS CERN / PH-SFT for the ROOT team Workshop on Parallelization and MultiCore technologies for LHC, CERN, April 2008.
ANALYSIS TRAIN ON THE GRID Mihaela Gheata. AOD production train ◦ AOD production will be organized in a ‘train’ of tasks ◦ To maximize efficiency of full.
Lyon Analysis Facility - status & evolution - Renaud Vernet.
Report PROOF session ALICE Offline FAIR Grid Workshop #1
PROOF – Parallel ROOT Facility
PROOF in Atlas Tier 3 model
Ákos Frohner EGEE'08 September 2008
Partner: LMU (Atlas), GSI (Alice)
G. Ganis, 2nd LCG-France Colloquium
Computing Infrastructure for DAQ, DM and SC
DUCKS – Distributed User-mode Chirp-Knowledgeable Server
Support for ”interactive batch”
PROOF - Parallel ROOT Facility
Presentation transcript:

1 Status of PROOF G. Ganis / CERN Application Area meeting, 24 May 2006

2 Outline G. Ganis, AA, 24 May 2006  Reminder about PROOF  Recent developments and status  Near future  Testing at CAF  Quick Demo

3 Outline G. Ganis, AA, 24 May 2006  Reminder about PROOF  Recent developments and status  Near future  Testing at CAF  Quick Demo

4 The ROOT data model: Trees & Selectors G. Ganis, AA, 24 May 2006 Begin() Create histos, … Define output list Process() preselection analysis Terminate() Final analysis (fitting, …) output list Selector loop over events OK event branch leaf branch leaf 12 n last n read needed parts only Chain branch leaf

5 The PROOF approach G. Ganis, AA, 24 May 2006 catalog Storage PROOF farm scheduler query  farm perceived as extension of local PC  same macro, syntax as in local session  more dynamic use of resources  real time feedback  automated splitting and merging MASTER PROOF query: data file list, myAna.C files final outputs (merged) feedbacks (merged)

6 PROOF – Multi-tier Architecture G. Ganis, AA, 24 May 2006 good connection ? VERY importantless important Optimize for data locality or efficient data server access adapts to cluster of clusters or wide area virtual clusters Geographically separated domains; heterogenous machine types proofd

7 PROOF – Scalability G. Ganis, AA, 24 May 2006  CAF, 4 dual Xeon machines  CMS selector, 120 MB data (290 files), distributed locally  Strictly concurrent user sessions (100% CPU used):  No inefficiency introduced by PROOF internals 1 user 2 users 4 users 8 users

8 Outline G. Ganis, AA, 24 May 2006  Reminder about PROOF  Recent developments and status  Near future  Testing at CAF  Quick Demo

9 Recent developments G. Ganis, AA, 24 May 2006  Goals:  support for interactive-batch mode  stateless connection  multi-sessions  user-friendliness  management tools  GUI

10 Sample of analysis activity G. Ganis, CHEP06, 13 Feb 2006 AQ1: 1s query produces a local histogram AQ2: a 10mn query submitted to PROOF1 AQ3->AQ7: short queries AQ8: a 10h query submitted to PROOF2 BQ1: browse results of AQ2 BQ2: browse temporary results of AQ8 BQ3->BQ6: submit 4 10mn queries to PROOF1 CQ1: Browse results of AQ8, BQ3->BQ6 Monday at 10h15 ROOT session on my laptop Monday at 16h25 ROOT session on my laptop Wednesday at 8h40 Browse from any web browser

11 New connection layer based on XROOTD G. Ganis, AA, 24 May 2006  Interactive batch requires a coordinator on the server side  Candidate: XROOTD  light weight top component (networking, protocol handler)  new protocol implemented as a plug-in to launch and control PROOF server sessions  Non-destructive disconnections handled naturally  stateless connection  Xrd/olbd control network can be exploited to circulate information  Can use same daemon for data serving and PROOF serving

12 PROOF management tools G. Ganis, AA, 24 May 2006  Data sets  distribution of data files on local worker pools  by direct upload  by staging out from a mass storage (e.g. CASTOR)  Query results  classification and handling tools  retrieve, archive  Packages  optimized upload of additional libraries needed by the analysis

13 GUI controller G. Ganis, AA, 24 May 2006 Allows full on-click control on everything  define a new session  submit a query, execute a command  query editor  execute macro to define or pick up a TChain  browse directories with selectors  online monitoring of feedback histograms  browse folders with results of query  retrieve, delete, archive functionality  start viewer for fast TChain browsing

14 Outline G. Ganis, AA, 24 May 2006  Reminder about PROOF  Recent developments and status  Near future  Testing at CAF  Quick Demo

15 Near Future Plans G. Ganis, AA, 24 May 2006  Data access (see next)  Multi-user scheduling (see next)  Packetizer optimizations  re-assignment of being-processed packets to fast idle workers  Dynamic cluster configuration  come-and-go functionality for worker nodes  olbd network to get info about the load on the cluster  Improve handling of error conditions  identify cases hanging the system, improve error logging, …  exploit olbd control network for better overview of the cluster  Testing and consolidation  Monitoring of cluster behaviour  MonAlisa: allows definition of ad hoc parameters, e.g. I/O / node / query

16 PROOF: data access issues G. Ganis, AA, 24 May 2006  Low latency in data access is essential  File opening overhead  minimized using asynchronous open techniques  Data retrieval  caching, asynchronous pre-fetching of data segments to be analyzed  Asynchronous features supported by XROOTD / XrdClient  Plan of work to fully exploit this in TFile / TSelector using the knowledge available in TTree (L. Franco)

17 PROOF: multi-user scheduling issues G. Ganis, AA, 24 May 2006  New scheduler component being developed to control the use of available resources in multi-user environments (J. Iwaszkiewicz)  Decisions taken on per query base following a metric based on:  load of the cluster  resources need by the query  user history and priorities  …  Requires support for dynamic re-configuration of worker assigments (prototype being tested)  Generic interface to external schedulers planned  Condor, LSF, …

18 Outline G. Ganis, AA, 24 May 2006  Reminder about PROOF  Recent developments and status  Near future  Testing at CAF  Quick Demo

19 Testing at CAF G. Ganis, AA, 24 May 2006  CAF  40 dual Xeon 2.8 GHz machines, 4 GB RAM, GB/s Ethernet  215 GB disk pool  SLC4  lxcafdev: 5 machines  testing developments  lxcaf: 35 machine tested by ALICE (J.F. Grosse-Oetringhaus)  stress testing functionality  performance tests

20 Quick demo G. Ganis, AA, 24 May 2006  CMS test analysis selector  290 files (115 MB) distributed on lxcafdev  Local run accessing files from lxcafdev  PROOF run with 8 workers

21 A real PROOF session - query definition and running G. Ganis, AA, 24 May 2006 name Execute to create chain Select chain Choose selector Feedback histograms Processing information

22 Typical end-user job-length distribution G. Ganis, CHEP06, 13 Feb 2006 Interactive analysis using local resources, e.g. - end-analysis calculations - visualization Analysis jobs with well defined algorithms (e.g. production of personal trees) Medium term jobs, e.g. analysis design and development using also non-local resources Goal: bring these to the same level of perception

23 A real PROOF session - connection G. Ganis, CHEP06, 13 Feb 2006 Predefined session Define new session Session startup progress bar Session startup status ready

24 A real PROOF session - package manager G. Ganis, CHEP06, 13 Feb 2006 Package tab PAR (Proof ARchive)  ROOT-INF directory, BUILD.sh, SETUP.C  Control setup of each worker

25 A real PROOF session: query browsing and finalization G. Ganis, CHEP06, 13 Feb 2006 Details about the query Folder with output objects raw histogram finalization

26 A real PROOF session: disconnection / reconnection G. Ganis, CHEP06, 13 Feb 2006  Running sessions kept alive by server side coordinator  Reconnection is much faster: no process to fork disconnect reconnect The query is now terminated

27 A real PROOF session: chain viewer G. Ganis, CHEP06, 13 Feb 2006 Chain viewer Right-click

28 XrdProofd basics G. Ganis, CHEP06, 13 Feb 2006  Prototype based on XROOTD XROOTD links XrdXrootdProtocol files MT stuff user 1 XPD links XrdProofdProtocol proofserv  XrdProofdProtocol: client gateway to proofserv  static area for all client information and its activities static area MT stuff Worker servers user 1 user 2 … …

29 … client xc slave n XrdProofd XS slave 1 XrdProofd XS master XrdProofd XS xc XRD links TXSocket xc proofserv xc fork() proofslave xc fork() proofslave xc fork() XrdProofd communication layer G. Ganis, CHEP06, 13 Feb 2006

30 Workflow: Pull Architecture G. Ganis, CHEP06, 13 Feb 2006 dynamic load balancing naturally achieved

31 AliEn: command-line session G. Ganis, CHEP06, 13 Feb 2006  TGrid : abstract interface for all services // Connect TGrid *alien = TGrid::Connect(“alien://”); // Query TString path= “/alice/cern.ch/user/p/peters/analysis/miniesd/”; TGridResult *res = alien->Query(path, ”*.root“); // Create chain from list of files TChain chain(“Events", “session“, res->GetFileInfoList()); // Open a PROOF session TProof *proof = TProof::Open(“proofmaster”); // Process your query chain.Process(“selector.C”);

32 GRID G. Ganis, CHEP06, 13 Feb 2006 PROOFMASTERSERVER USER SESSION Guaranteed site access through PROOF Sub-Masters calling out to Master (agent technology) Grid/Root Authentication Grid Access Control Service TGrid UI/Queue UI Proofd Startup GRID Service Interfaces Grid File/Metadata Catalogue Client retrieves list of logical files (LFN + MSN) Slave servers access data via xrootd from local disk pools PROOF SLAVE SERVERS PROOFSUB-MASTERSERVER PROOFSUB-MASTERSERVER PROOFSUB-MASTERSERVERROOT Demo’ed by ALICE at SC03, SC04, …