ATLAS Distributed Analysis
David Adams, BNL
September 30, 2004
CHEP2004 Track 5: Distributed Computing Systems and Experiences


Contents
  Goals
  Key concepts
  Datasets
  Transformations
  Jobs
  AJDL
  Service architecture
  Analysis services
    – DIAL
    – ATPROD
    – ARDA
  Catalog services
  Data management services
  Clients
  Status
  ARDA
  Conclusions
  Contributors
  More information

Goals
Provide to globally distributed users:
  Access to globally distributed data that
    – Is comprehensible
    – Enables selection of relevant data
    – Enables sensible placement of data
  Means to perform globally distributed processing on this data
    – High-level view that hides details of underlying middleware
    – But enables monitoring and debugging
    – Automatic, complete and accurate provenance
All the above must be easy to use
  Well-integrated with analysis environments
    – ROOT, Python, etc.
  Graphical views where appropriate
    – Browse and examine data
    – Monitor jobs, …

Key concepts
Dataset
  Describes a collection of data
    – E.g. a collection of reconstructed events
    – A collection of histograms, …
Transformation
  Defines an operation to be performed on the data
  Dataset → Dataset
  Application + task (user configuration of application)
Job
  Instance of a transformation
  Typical user request processed as a collection of sub-jobs
    – Same transformation acting on sub-datasets
    – Plus dataset splitting of input and merging of output
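
The relationship between the three concepts can be sketched in a few lines of Python. This is only an illustration: the class names, fields and the `select_every_nth` application are hypothetical and are not the AJDL or DIAL classes.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass
class Dataset:
    """Describes a collection of data, e.g. events or histograms."""
    dataset_id: str
    content: List[int]  # hypothetical: a plain list of event IDs


@dataclass
class Transformation:
    """Application (shared code) plus task (user configuration)."""
    application: Callable[[List[int], Dict[str, int]], List[int]]
    task: Dict[str, int]

    def apply(self, dataset: Dataset, output_id: str) -> Dataset:
        # Dataset -> Dataset: a new dataset is produced, the input is untouched.
        return Dataset(output_id, self.application(dataset.content, self.task))


@dataclass
class Job:
    """An instance of a transformation acting on one input dataset."""
    transformation: Transformation
    input_dataset: Dataset

    def run(self) -> Dataset:
        output_id = self.input_dataset.dataset_id + ".out"
        return self.transformation.apply(self.input_dataset, output_id)


def select_every_nth(event_ids: List[int], task: Dict[str, int]) -> List[int]:
    """Hypothetical application: keep every n-th event, n taken from the task."""
    return event_ids[:: task["n"]]


job = Job(Transformation(select_every_nth, {"n": 2}), Dataset("evts", [1, 2, 3, 4, 5]))
result = job.run()
print(result.dataset_id, result.content)
```

The same application can be reused with a different task (for example `{"n": 3}`), which is the point of separating the two.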

Key concepts (cont)
[Figure not included in the transcript]

Datasets
Dataset includes
  Identifier
  Location of data, e.g. list of logical files
    – Absent for virtual datasets
  Content (i.e. description of the content)
    – E.g. list of event IDs and the type of data for each event
    – Or a list of histogram names
  List of constituent datasets
    – Usually their IDs
    – When dataset is composite, access to location and content may require use of the constituent datasets
Dataset selection catalog holds metadata
Dataset replica catalog holds replica mapping
  1 virtual → N concrete dataset mapping
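
A minimal sketch of a composite dataset whose location is found through its constituents. All names here (`Dataset`, `resolve_files`, the `lfn:` strings, the catalog dictionaries) are assumptions made for illustration, not the real dataset classes.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Dataset:
    dataset_id: str
    # None means the dataset has no location of its own, e.g. it is virtual
    # or its data is held entirely by its constituents.
    logical_files: Optional[List[str]] = None
    content: List[str] = field(default_factory=list)
    constituent_ids: List[str] = field(default_factory=list)


def resolve_files(dataset_id: str, repository: Dict[str, Dataset]) -> List[str]:
    """Collect logical files for a dataset, descending into its constituents."""
    dataset = repository[dataset_id]
    files = list(dataset.logical_files or [])
    for child_id in dataset.constituent_ids:
        files.extend(resolve_files(child_id, repository))
    return files


repository = {
    "run1": Dataset("run1", ["lfn:run1.a", "lfn:run1.b"], ["evt 1", "evt 2"]),
    "run2": Dataset("run2", ["lfn:run2.a"], ["evt 3"]),
    "all": Dataset("all", constituent_ids=["run1", "run2"]),
}

# Replica catalog: one virtual dataset ID maps to N concrete dataset IDs.
replica_catalog = {"all-virtual": ["all"]}

files = resolve_files(replica_catalog["all-virtual"][0], repository)
print(files)
```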

Datasets (cont)
For ATLAS data, we identify
  Types of data
    – Used to define dataset categories
    – Category will be part of the content specification
  Types of datasets
    – Currently C++ classes with XML data representation
    – Third column indicates if this class exists
    – Likely will move to XML schema as the primary definition
See the following table

Datasets (cont)
Name    Type                          Exists?  Description
EVIDS   EventDataset                  ×        List of event IDs
EVGEN   AtlasPoolEventDataset         ×        From event generator
HITS    AtlasPoolEventDataset         ×        Hits, e.g. from GEANT
DIGITS  AtlasPoolEventDataset         ×        Digitization of hits
RAW     AtlasByteStreamEventDataset            Raw data
ESD     AtlasPoolEventDataset         ×        Event summary data
AOD     AtlasPoolEventDataset         ×        Analysis oriented data
TAG     AtlasPoolTagEventDataset               Event metadata
NTUP    RootNtupleDataset                      Ntuples
HISTO   RootHistogramDataset          ×        Histograms
CBNT    CbntDataset                   ×        DC1 combined ntuples
TEXT    TextDataset                            Text data, e.g. log files

Transformations
Transformation
  Describes an operation to act on a dataset to produce a new dataset
  Has two components
    – Application = code shared by multiple transformations
      > Usually scripts to locate and run code in software packages
    – Task = user-supplied configuration (parameters or code)
Task
  List of files
    – Presently embedded in task
    – Later could also be logical files
  Named parameters
    – Add this soon
  Typically created by user submitting the job

Transformations (cont)
Application
  Two entry points (presently scripts)
    – build_task to fetch task files, compile, etc.
    – run creates output dataset from input dataset and built task
  Typically created by application developer
Software package management
  Need an interface to enable build_task and run scripts to locate software on any machine
  E.g. “locate mypkg 1.2.3” returns /usr/contrib/mypkg/1.2.3/rh73_gcc73
  Also support querying and installation
  Implement as thin layer on existing package management systems
    – Pacman, RPM, local build, …
  Use service to handle installation and removal of packages
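
The two entry points and the package lookup might fit together roughly as follows. This is a sketch under stated assumptions: `PackageLocator`, its `install` and `locate` methods, and the task keys `package` and `cut` are hypothetical. In the talk the entry points are scripts, and a real locator would delegate to Pacman, RPM or a local build rather than to an in-memory table.

```python
import posixpath
from typing import Dict, List, Tuple


class PackageLocator:
    """Thin lookup layer standing in for an existing package manager."""

    def __init__(self, platform: str) -> None:
        self._platform = platform
        self._roots: Dict[Tuple[str, str], str] = {}

    def install(self, name: str, version: str, root: str) -> None:
        # Only records where the package lives; nothing is really installed.
        self._roots[(name, version)] = root

    def locate(self, name: str, version: str) -> str:
        try:
            root = self._roots[(name, version)]
        except KeyError:
            raise LookupError(f"{name} {version} is not installed") from None
        return posixpath.join(root, name, version, self._platform)


def build_task(task_files: Dict[str, str], locator: PackageLocator) -> Dict[str, str]:
    """Entry point 1: fetch the task files and prepare them for running."""
    name, version = task_files["package"].split()
    return {"package_dir": locator.locate(name, version), "cut": task_files["cut"]}


def run(built_task: Dict[str, str], input_events: List[float]) -> List[float]:
    """Entry point 2: create the output dataset from the input and built task."""
    threshold = float(built_task["cut"])
    return [value for value in input_events if value > threshold]


locator = PackageLocator(platform="rh73_gcc73")
locator.install("mypkg", "1.2.3", "/usr/contrib")

built = build_task({"package": "mypkg 1.2.3", "cut": "20"}, locator)
output = run(built, [5.0, 25.0, 40.0])
print(built["package_dir"], output)
```

Keeping `locate` behind one small interface lets the same `build_task` and `run` code work on any machine, whichever package manager is used underneath.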

Transformations (cont)
For ATLAS we identify the above transformations
  Characterized by input and output dataset categories
  Most common ones listed; others are possible

Jobs
A job is an instance of a transformation acting on a dataset
  Output result is another dataset
  Partial result may be available before job is complete
Typical user-submitted job is split into sub-jobs
  By splitting input dataset and applying the same transformation to each sub-dataset
  Strategies for splitting and merging results must be provided
Provenance
  Dataset provenance is specified by recording the input dataset and transformation
  More complete information is available from the job:
    – Site, CPU, submission, start and stop times, …
    – Log files maintained for some period, perhaps as datasets
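
Splitting, per-sub-job processing and merging can be illustrated with a toy histogramming job. The function names and the fixed-size splitting strategy are assumptions chosen for the example; the talk only states that splitting and merging strategies must be provided.

```python
from collections import Counter
from typing import Callable, Dict, List


def split(events: List[int], max_events: int) -> List[List[int]]:
    """Split an input dataset into sub-datasets of at most max_events events."""
    return [events[i:i + max_events] for i in range(0, len(events), max_events)]


def merge(partials: List[Counter]) -> Counter:
    """Merge sub-job outputs; for histograms this is a bin-by-bin sum."""
    total: Counter = Counter()
    for partial in partials:
        total.update(partial)
    return total


def histogram(events: List[int]) -> Counter:
    """Hypothetical transformation: histogram values in bins of width 10."""
    return Counter(value // 10 * 10 for value in events)


def run_job(dataset_id: str, events: List[int],
            transformation: Callable[[List[int]], Counter],
            max_events: int) -> Dict:
    # The same transformation is applied to every sub-dataset.
    sub_results = [transformation(chunk) for chunk in split(events, max_events)]
    return {
        "result": merge(sub_results),
        # Provenance: record the input dataset and the transformation.
        "provenance": {"input": dataset_id, "transformation": transformation.__name__},
        "sub_jobs": len(sub_results),
    }


events = [3, 12, 17, 25, 8, 31, 14]
job = run_job("demo-events", events, histogram, max_events=3)
print(job["sub_jobs"], dict(job["result"]))
```

The merged result equals what the transformation would give on the whole dataset, which is the property a splitting and merging strategy has to guarantee.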

AJDL
AJDL = Abstract Job Definition Language
Components are representations of
  Dataset
  Transformation = Application + Task
  Job
  JobPreferences
  File
  Identifiers for all the above
Presently defined as C++ classes
  With methods to write to and read from XML
    – Different for each subclass of Dataset
    – Same for subclasses of Job
  XML specified in DTD files
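
The idea of a component that can write itself to XML and be read back can be sketched as below. The element and attribute names (`Task`, `File`, `id`, `name`) are invented for the example and do not reproduce the AJDL DTDs, and the real classes are C++ rather than Python.

```python
import xml.etree.ElementTree as ET
from dataclasses import dataclass, field
from typing import Dict


@dataclass
class Task:
    task_id: str
    files: Dict[str, str] = field(default_factory=dict)  # file name -> embedded text

    def to_xml(self) -> str:
        """Write the task, with its embedded files, to an XML string."""
        root = ET.Element("Task", {"id": self.task_id})
        for name, text in self.files.items():
            file_element = ET.SubElement(root, "File", {"name": name})
            file_element.text = text
        return ET.tostring(root, encoding="unicode")

    @classmethod
    def from_xml(cls, xml_text: str) -> "Task":
        """Rebuild a task from the XML produced by to_xml."""
        root = ET.fromstring(xml_text)
        files = {f.get("name"): f.text or "" for f in root.findall("File")}
        return cls(root.get("id"), files)


task = Task("task-001", {"cuts.txt": "pt > 20", "README": "demo task"})
xml_text = task.to_xml()
restored = Task.from_xml(xml_text)
print(xml_text)
print(restored == task)
```

Because the XML string carries the whole object, it can be stored in a repository at one site and turned back into an object at another.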

AJDL (cont)
Look at moving to XML schema
  Automatically derive classes from XML definitions
    – Automatic support for other languages (Python, Java, …)
  In collaboration with GANGA and others
At the same time
  Try to find one representation for all datasets
  Introduce separate type for event ID lists
    – Often too large to carry around in a dataset
Also interested in specifying interfaces for AJDL services
  Those that operate on AJDL components
  Services listed later
  Interested in working with others on these specifications

Service architecture
ADA itself is distributed
  Allows data access and job management to be distributed
    – Important for scaling to a large number of users
  Collection of web services
    – Analysis service for job processing
    – Job monitoring
    – Catalog services
      > Metadata
      > Repository
      > Replica (not only for files)
  Users interact through clients
    – ROOT client from DIAL
    – Python client from GANGA

Service architecture
[Figure not included in the transcript]

DIAL analysis service
Two instances running at BNL
  Long-running jobs using Condor job submission
  Interactive response using fast LSF queue
Working to improve interactive response
  Submit jobs to perform result merging
    – Presently done on service host
  Use parallel jobs for merging
  Long term, look at the use of job agents
    – Possibly as part of ARDA
Add service to act as switch
  Delegate jobs based on
    – Job requirements
    – Desired response time
    – Resource availability
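
A switch of this kind could apply the three criteria roughly as in the sketch below. Everything in it is an assumption: the class names, the fields, the numbers, and in particular the tie-break rule of preferring the eligible service with the most free slots, which the talk does not specify.

```python
from dataclasses import dataclass
from typing import List, Optional, Set


@dataclass
class AnalysisService:
    name: str
    applications: Set[str]      # job requirements the service can satisfy
    typical_response_s: float   # expected turnaround for a sub-job
    free_slots: int             # current resource availability


@dataclass
class JobRequest:
    application: str
    max_response_s: float       # desired response time


def choose_service(job: JobRequest,
                   services: List[AnalysisService]) -> Optional[AnalysisService]:
    """Return the service to delegate to, or None if no service qualifies."""
    eligible = [
        service for service in services
        if job.application in service.applications
        and service.typical_response_s <= job.max_response_s
        and service.free_slots > 0
    ]
    if not eligible:
        return None
    # Hypothetical policy: among eligible services, use the least loaded one.
    return max(eligible, key=lambda service: service.free_slots)


services = [
    AnalysisService("interactive", {"cbnt-hist"}, 10.0, 4),
    AnalysisService("batch", {"cbnt-hist", "atlas-reco"}, 3600.0, 200),
]
quick = choose_service(JobRequest("cbnt-hist", 60.0), services)
patient = choose_service(JobRequest("cbnt-hist", 86400.0), services)
nothing = choose_service(JobRequest("atlas-reco", 60.0), services)
print(quick.name, patient.name, nothing)
```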

ATPROD analysis service
Enable submission to the existing ATLAS production system
  At least for user-level production
Strategy
  Split input dataset
  Make an entry in the production catalog for each sub-job
  Monitor catalog and gather and merge results as jobs finish
  Same for the other analysis services
Not yet implemented
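
The strategy can be sketched with an in-memory stand-in for the production catalog. The `ProductionCatalog` interface, the `AtprodJob` class and the use of a simple sum as the merge step are all hypothetical; the slide itself notes that the service was not yet implemented.

```python
from typing import Dict, List, Set


class ProductionCatalog:
    """In-memory stand-in for the production catalog."""

    def __init__(self) -> None:
        self._entries: Dict[str, Dict] = {}

    def add(self, job_id: str, events: List[int]) -> None:
        self._entries[job_id] = {"events": events, "status": "pending", "result": None}

    def complete(self, job_id: str, result: int) -> None:
        # In a real system the production system itself would update the entry.
        self._entries[job_id].update(status="done", result=result)

    def finished(self) -> Dict[str, int]:
        return {job_id: entry["result"] for job_id, entry in self._entries.items()
                if entry["status"] == "done"}


class AtprodJob:
    """Splits the input, registers sub-jobs, then merges results as they finish."""

    def __init__(self, catalog: ProductionCatalog, events: List[int],
                 chunk_size: int) -> None:
        self._catalog = catalog
        self._merged_ids: Set[str] = set()
        self.partial_result = 0
        self.sub_job_ids: List[str] = []
        for index in range(0, len(events), chunk_size):
            job_id = f"sub-{index // chunk_size}"
            catalog.add(job_id, events[index:index + chunk_size])
            self.sub_job_ids.append(job_id)

    def poll(self) -> bool:
        """Merge newly finished sub-jobs; return True once all are merged."""
        for job_id, result in self._catalog.finished().items():
            if job_id in self.sub_job_ids and job_id not in self._merged_ids:
                self.partial_result += result
                self._merged_ids.add(job_id)
        return len(self._merged_ids) == len(self.sub_job_ids)


catalog = ProductionCatalog()
job = AtprodJob(catalog, events=[1, 2, 3, 4, 5], chunk_size=2)  # three sub-jobs
catalog.complete("sub-0", 3)   # pretend the production system ran sub-0
first_poll = job.poll()        # a partial result exists, the job is not complete
catalog.complete("sub-1", 7)
catalog.complete("sub-2", 5)
second_poll = job.poll()
print(first_poll, second_poll, job.partial_result)
```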

ARDA analysis service
Enable submission to the gLite WMS
  Let EGEE do the work of matchmaking, brokering, job tracking, monitoring, error reporting, …
There is a service to submit to the existing prototype system
Expect first release of gLite next month
  Quickly deploy an analysis service based on this
  Make regular updates taking advantage of more gLite features

Catalog services
Goals of ADA cataloging:
  Provide a repository for AJDL objects indexed by ID
    – Insert at site A and extract with ID at site B
  Enable users to assign metadata to objects and retrieve with queries
  Record dataset provenance
  Provide job monitoring
Identify three types of catalogs
  Repository
    – Map ID to XML string
  Metadata catalog
    – Map ID to named attributes
  Replica catalog
    – Map ID to a list of IDs
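
The three catalog types differ only in what an ID maps to, which a small in-memory sketch makes concrete. The class and method names are hypothetical, as are the example IDs and attributes; the real catalogs are web services.

```python
from typing import Dict, List


class Repository:
    """Maps an ID to the XML string of an AJDL object."""

    def __init__(self) -> None:
        self._xml: Dict[str, str] = {}

    def insert(self, object_id: str, xml: str) -> None:
        self._xml[object_id] = xml

    def extract(self, object_id: str) -> str:
        return self._xml[object_id]


class MetadataCatalog:
    """Maps an ID to named attributes and answers equality queries."""

    def __init__(self) -> None:
        self._attributes: Dict[str, Dict[str, str]] = {}

    def assign(self, object_id: str, **attributes: str) -> None:
        self._attributes.setdefault(object_id, {}).update(attributes)

    def query(self, **wanted: str) -> List[str]:
        return sorted(
            object_id
            for object_id, attributes in self._attributes.items()
            if all(attributes.get(key) == value for key, value in wanted.items())
        )


class ReplicaCatalog:
    """Maps an ID to a list of IDs."""

    def __init__(self) -> None:
        self._replicas: Dict[str, List[str]] = {}

    def add(self, object_id: str, replica_id: str) -> None:
        self._replicas.setdefault(object_id, []).append(replica_id)

    def replicas(self, object_id: str) -> List[str]:
        return list(self._replicas.get(object_id, []))


repository = Repository()
metadata = MetadataCatalog()
replicas = ReplicaCatalog()

repository.insert("ds-1", '<Dataset id="ds-1"/>')
metadata.assign("ds-1", category="AOD", project="dc2")
metadata.assign("ds-2", category="ESD", project="dc2")
replicas.add("ds-1", "ds-1@siteA")
replicas.add("ds-1", "ds-1@siteB")

print(metadata.query(category="AOD"), replicas.replicas("ds-1"))
```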

Catalog services (cont)
Required global catalog instances
  Repositories for Dataset, Application, Task, Job
  Metadata catalog for Dataset
    – Same as that used for production?
  Replica catalog for Dataset
  More later
First choice is to host these in AMI (soon)
Next add local job catalog to record analysis service state
  So service can be restarted without losing jobs
Later look at issues such as
  Distributed cataloging
  Private catalogs
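
One simple way a local job catalog could let a service restart without losing jobs is to persist every state change to disk. The sketch below uses a JSON file for this; the `LocalJobCatalog` class, the file format and the status strings are assumptions, and the talk does not say how the catalog would be stored.

```python
import json
import os
import tempfile
from typing import Dict


class LocalJobCatalog:
    """Records job state in a JSON file so it survives a service restart."""

    def __init__(self, path: str) -> None:
        self._path = path
        self._jobs: Dict[str, str] = {}
        if os.path.exists(path):
            with open(path, encoding="utf-8") as handle:
                self._jobs = json.load(handle)

    def set_status(self, job_id: str, status: str) -> None:
        self._jobs[job_id] = status
        # Write to a temporary file and rename it, so that a crash in the
        # middle of a write cannot leave a truncated catalog behind.
        temporary = self._path + ".tmp"
        with open(temporary, "w", encoding="utf-8") as handle:
            json.dump(self._jobs, handle)
        os.replace(temporary, self._path)

    def status(self, job_id: str) -> str:
        return self._jobs[job_id]


with tempfile.TemporaryDirectory() as directory:
    path = os.path.join(directory, "jobs.json")
    before = LocalJobCatalog(path)
    before.set_status("job-1", "running")
    after_restart = LocalJobCatalog(path)  # simulates restarting the service
    recovered = after_restart.status("job-1")
print(recovered)
```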

Data management services
DQ (Don Quijote) was developed as part of production
  Provides access to file replica catalogs from all three grids
  Enables file movement including between grids
  ADA will adopt this for replica management and movement
ATLAS has a plan to add a file transfer service
  Adopt this as well when available
SRM provides file management at the site level
  ATLAS expects sites to deploy this service
  DQ and ADA will use this as it is deployed
gLite has a suite of data management services
  Including SRM
  Rest of service model is complex; hide it behind DQ
    – Already have DQ interface to AliEn file catalog

Clients
DIAL provides a ROOT client
  ACLiC used to build dictionaries for DIAL classes
    – All DIAL classes available on the ROOT command line
    – Enables catalog browsing, job submission, monitoring, etc.
GANGA provides a Python client
  PyLCGDict used to build Python wrappers for DIAL classes
    – All DIAL classes available on the Python command line
  Later build Python-only client
    – Restricted functionality but
    – Greater portability
GUI
  GANGA is developing a GUI
    – Data browsing
    – Configure, submit and monitor jobs

Status
Present system includes
  ROOT and Python command line clients
  DIAL analysis services running
    – Interactive service at BNL
    – Batch service at BNL
  Datasets
    – Classes for combined ntuples, ATLAS-POOL event collections
    – All DC1 CBNT data
    – Few DC2 samples
  Transformations
    – DC1 CBNT → histograms
    – DIGI: atlasdigi
    – RECO: atlas-reco-8.x.0 (x = 3, 4, 5)

ARDA
ATLAS-ARDA prototype
  ARDA is a CERN project to deliver prototype distributed analysis systems for the LHC experiments
    – Based on gLite (EGEE middleware)
  The ATLAS ARDA prototype makes use of the components shown in the figure
  Expect functional system this year

Conclusions
Status
  ADA is coming together but there is still much to do
  Still in demo mode; for serious use we must add
    – Dataset description of DC2 data
    – Repositories for applications, tasks, datasets and jobs in AMI
    – Dataset selection catalog in AMI
    – Dataset replica catalogs in AMI
    – Transformations for the full DC2 production/analysis chain
    – Means to move output data to a storage element
  Expect all this year
Future developments (beyond those above)
  Update AJDL, moving to XML schema and adding WSDL
  GUI (expect this soon)
  ATPROD service to access more compute resources
  ARDA service to try out EGEE middleware
  Improvements to DIAL service to improve interactive response

Contributors
DIAL
  D. Adams, W. Deng, V. Sambamurthy, N. Chetan, C. Kannan
GANGA
  K. Harrison, C. Tan, A. Soroko
ARDA
  D. Liko, F. Orellana
AMI
  S. Albrand, J. Fulachier
ATLAS
  C. Haeberli, J. Bahilo, F. Fassi, G. Rybkine, M. Branco
Many useful discussions
  All the above and PPDG, GAG, gLite, …

More information
For more information on ADA, see the home page
  Includes status of subprojects, relevant talks and documents, and links to associated projects
To try it out, run root demo 3 in the latest DIAL release
See the ADA paper in the CHEP2004 proceedings