PROOF and AnT in PHOBOS
Kristjan Gulbrandsen
March 25, 2004, Collaboration Meeting


What is PROOF?
- A system integrated into ROOT that allows interactive analysis of large data sets using parallel processing and I/O
- Transparent: the difference between running a local session and running over multiple computers is minimal
- Adaptable: can react to network conditions, system performance, and multiple architectures
- Scalable: no manifest limitations on size

PROOF Architecture
- The client connects to a master server local to the cluster
- The master server talks to slaves on the nodes where (ideally) the data is located
- Slaves run in parallel
- The master server collects the results, minimizing slow interaction with the client
[Diagram labels: User, Internet, Master, Slave]
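The division of labor described on this slide can be illustrated with a small stand-alone sketch. It does not use ROOT or PROOF: threads stand in for slave servers, a vector of counts stands in for a histogram, and the names `Histogram`, `processOnSlave`, and `runOnCluster` are hypothetical. The point is only that each slave fills its own partial result from the data it holds, and the master merges the partial results so that a single small object goes back to the client.

```cpp
#include <cassert>
#include <cstddef>
#include <thread>
#include <vector>

// Hypothetical stand-in for a histogram: one counter per bin.
using Histogram = std::vector<long>;

// Work done by one slave: fill a private histogram from its own share of
// the data. The value range [0, 1) is an assumption made for this sketch.
Histogram processOnSlave(const std::vector<double>& localData, std::size_t nBins) {
    Histogram h(nBins, 0);
    for (double x : localData) {
        if (x < 0.0 || x >= 1.0) continue;  // ignore out-of-range values
        std::size_t bin = static_cast<std::size_t>(x * static_cast<double>(nBins));
        if (bin >= nBins) bin = nBins - 1;  // guard against rounding at the edge
        h[bin] += 1;
    }
    return h;
}

// Work done by the master: start one slave per data partition, wait for all
// of them, then merge the partial histograms into a single result.
Histogram runOnCluster(const std::vector<std::vector<double>>& partitions,
                       std::size_t nBins) {
    assert(nBins > 0);
    std::vector<Histogram> partial(partitions.size());
    std::vector<std::thread> slaves;
    for (std::size_t i = 0; i < partitions.size(); ++i) {
        // Each thread writes only to its own slot, so no locking is needed.
        slaves.emplace_back([&partial, &partitions, nBins, i] {
            partial[i] = processOnSlave(partitions[i], nBins);
        });
    }
    for (std::thread& t : slaves) t.join();

    Histogram merged(nBins, 0);
    for (const Histogram& h : partial)
        for (std::size_t b = 0; b < nBins; ++b) merged[b] += h[b];
    return merged;  // only the merged result would travel back to the client
}
```

Calling `runOnCluster` with three partitions and two bins starts three threads and returns one two-bin histogram, regardless of how the input values were distributed across the partitions.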

TSelector Interface
    class TSelector {
        Begin()
        SlaveBegin()
        Process()
        SlaveTerminate()
        Terminate()
    }
- Client: Begin(), Terminate()
- (n) Slaves: SlaveBegin(), Process() … Process() … Process() …, SlaveTerminate()
- If a tree exists, tree->MakeSelector() creates a skeleton class deriving from TSelector
- A copy of each object exists in each slave
[Diagram annotations: "Create histograms"; "code normally in for loops" (bracketing the repeated Process() calls)]
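The call order on this slide can be made concrete with a stand-alone mock. The class below is not ROOT's TSelector: `SelectorSketch`, `CallLogger`, and `runLocally` are hypothetical names, the signatures are simplified, and a single process plays both the client and one slave. It only shows which hook runs when.

```cpp
#include <cassert>
#include <cstddef>
#include <string>
#include <vector>

// Hypothetical interface mirroring the five hooks named on the slide.
// This is NOT ROOT's TSelector; signatures are simplified for illustration.
class SelectorSketch {
public:
    virtual ~SelectorSketch() = default;
    virtual void Begin() {}                       // client, before processing starts
    virtual void SlaveBegin() {}                  // each slave, before its first entry
    virtual void Process(std::size_t entry) = 0;  // each slave, once per entry
    virtual void SlaveTerminate() {}              // each slave, after its last entry
    virtual void Terminate() {}                   // client, after all slaves finish
};

// Example selector that records the order in which its hooks are called.
class CallLogger : public SelectorSketch {
public:
    std::vector<std::string> calls;
    void Begin() override { calls.push_back("Begin"); }
    void SlaveBegin() override { calls.push_back("SlaveBegin"); }
    void Process(std::size_t entry) override {
        calls.push_back("Process " + std::to_string(entry));
    }
    void SlaveTerminate() override { calls.push_back("SlaveTerminate"); }
    void Terminate() override { calls.push_back("Terminate"); }
};

// Single-process driver: plays the client and one slave in turn.
void runLocally(SelectorSketch& sel, std::size_t nEntries) {
    sel.Begin();
    sel.SlaveBegin();
    for (std::size_t i = 0; i < nEntries; ++i) sel.Process(i);  // the "for loop" body
    sel.SlaveTerminate();
    sel.Terminate();
}
```

With two entries, `runLocally` produces six calls: Begin, SlaveBegin, Process 0, Process 1, SlaveTerminate, Terminate. In a parallel run the three middle stages would be repeated on every slave, each with its own copy of the selector's objects.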

Using PROOF
- Call gROOT->Proof("proof:// ") to begin a PROOF session
- A set of file names must be added to a TDSet, similar to adding files to a TChain
- Call TDSet->Process( ), whose argument contains the TSelector code
- Additional supporting files/libraries can be used by creating PAR files

PROOF Execution
Local session (Local PC):
    $ root
    root [0] .x ana.C
PROOF session (Local PC connected to the Remote PROOF Cluster):
    $ root
    root [0] tree->Process("ana.C")
    root [1] gROOT->Proof("remote")
    root [2] dset->Process("ana.C")
Cluster configuration:
    #proof.conf
    slave node1
    slave node2
    slave node3
    slave node4
[Diagram: ana.C is sent from the local root session to the cluster; stdout/obj are returned. One proof process acts as the master server and one proof process per node (node1 to node4) acts as a slave server. The *.root files are read through TFile and TNetFile.]

PROOF in PHOBOS
- PROOF is installed on the Pharm cluster
- The newest ROOT version (4.00/03) is needed and exists in /usr/local/root
- Proofserver is compiled with libnew (for now) to allow PhatII classes to be used without modification
- The PhatII structure is ideal for transferring individual libraries among slave nodes

AnT Trees
- A tree format has been created to hold summary information for analyses
- Trees are designed to hold the basic summary information used for analyses and to allow pieces of data to be ignored (not read), decreasing I/O
- TRefs allow partial information to be read in while maintaining the ability to cross-reference information (e.g. tracks referring to their hits)
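The idea of referring to data without reading it can be sketched without ROOT. The code below is not TRef: `HitStore`, `Track`, `sumMomentumX`, and `sumHitEnergy` are hypothetical, and an in-memory copy stands in for a disk read. Tracks hold only indices into a separate hit store, so an analysis that uses track quantities alone never triggers the hit read, while one that follows the references loads the hits once on first access.

```cpp
#include <cassert>
#include <cstddef>
#include <utility>
#include <vector>

struct Hit {
    int layer;
    double dE;
};

// Hypothetical stand-in for a block of hits that is expensive to read.
// The "read" is simulated by copying stored_ into cache_ on first access.
class HitStore {
public:
    explicit HitStore(std::vector<Hit> hits) : stored_(std::move(hits)) {}

    const Hit& get(std::size_t index) {
        if (!loaded_) {       // first access: perform the (simulated) read
            cache_ = stored_;
            loaded_ = true;
            ++reads_;
        }
        assert(index < cache_.size());
        return cache_[index];
    }

    int reads() const { return reads_; }  // number of simulated reads so far

private:
    std::vector<Hit> stored_;
    std::vector<Hit> cache_;
    bool loaded_ = false;
    int reads_ = 0;
};

// A track keeps references (indices) to its hits instead of copies of them.
struct Track {
    double mom[3];
    std::vector<std::size_t> hitRefs;
};

// Uses track-level data only, so the hit store is never touched.
double sumMomentumX(const std::vector<Track>& tracks) {
    double sum = 0.0;
    for (const Track& t : tracks) sum += t.mom[0];
    return sum;
}

// Follows the references, which triggers the hit read on first use.
double sumHitEnergy(const Track& track, HitStore& hits) {
    double sum = 0.0;
    for (std::size_t ref : track.hitRefs) sum += hits.get(ref).dE;
    return sum;
}
```

After a call to `sumMomentumX`, `reads()` is still 0. It becomes 1 after the first call to `sumHitEnergy` and stays at 1 for later calls.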

AnT Structure
- EventInfo: Run, Seq, Ev_No, Date, Time, Polarity, Prim_vtx->
- Tracks[]: PID, Charge, MeandE, SigmadE, Prob, Chi^2, Xprod[3], Mom[3], HitArray[]->
- TriggerInfo: IsCol, L0, L1, EOct, ERing, TrgT_Extra[], TrgE_Extra[]
- Paddle: TruncMeanP, TruncMeanN, SumP, SumN, TDiff
- Vertex[]: Status, ID, Prob, Pos[3], Sigma[3]
- Hits[]: Layer, SensorLabel, dE, Pos[3], Pad[2]
- ZDC: SumP, SumN, TZDCP, TZDCN
- TOF: Info?
- PCAL: Info?
HitArrays are being developed

Current AnT Trees
- Prototype AnT trees currently exist on Pharm (10 runs, 56 Seqs) and can be used
- Analysis personnel are needed to use the trees and to report which additions would make them useful for many analyses

Analysis using AnT/PROOF
- AnT/PROOF has been used to generate p_T distributions from current data
- Using AnT/PROOF speeds up the analysis from an hour to a minute
- Disabling hit read-in speeds up processing by more than a factor of 10

Summary
- PROOF is ready for use on Pharm
- Simple example macros exist explaining how to use PROOF
- AnT trees have been created for quick analysis of large data sets in conjunction with PROOF
- Users are needed to test both PROOF and AnT, to provide feedback on the data format, and to stress the PROOF architecture