Presentation is loading. Please wait.

Presentation is loading. Please wait.

David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.

Similar presentations


Presentation on theme: "David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN."— Presentation transcript:

1 David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN

2 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20032 Contents DAC mandate Scope Strategy Scenario for first release Plan for the first release Deliverables for first release Conclusions

3 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20033 DAC Mandate Distributed Analysis Coordinator Is responsible for coordinating the development of software tools for distributed analysis and their integration into the ATLAS software environment Start with the analysis of existing tools such as GANGA, DIAL, AtCom… Provide users with transparent access to metadata of different sorts as well as to event data in all stages of processing Participate actively in the definition of LCG projects such as ARDA Is a member of relevant LCG committees and working groups

4 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20034 Scope Analysis (not necessarily distributed) Supports the manipulation and extraction of summary data (e.g. histograms) from any type of event data –AOD, ESD, … Supports user-level production of event data –e.g. MC generation, simulation and reconstruction Distributed analysis Extends the extraction and production support to include distributed processing and distributed data Natural extension of non-distributed analysis Easily invoked from any ATLAS analysis environment –including Python, ROOT, command line –easily ported to any future environment (e.g. JAS)

5 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20035 Strategy Implement ADA as a collection of grid services As described in ARDA document Use ARDA components where possible Add missing and ATLAS-specific pieces Provide clients for ATLAS analysis environments Python, ROOT, command line Interface similar to that of DIAL –See figures Regular releases Perhaps for each SW week and ATLAS X.0 Expand functionality with each release

6 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20036 User Analysis Job 1 Job 2 ApplicationTask Dataset 1 Scheduler 1. Create or locate 2. select3. Create or select 4. select 5. submit(app,tsk,ds) 6. split Dataset Dataset 2 7. create e.g. ROOT e.g. athena Result 9. fill 10. gather Result 9. fill ResultCode ADA/DIAL user interface

7 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20037 High level JDL as a bridge Initial ADA

8 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20038 Strategy (cont) Look to common projects for most of the pieces ARDA, GANGA, DIAL, … Share as much as possible with ATLAS production –Also distributed –Similar interfaces and code for bulk and user-level production ADA must identify these pieces and tie them together Deployment ADA services must be deployed –Interactive service at one or two sites with data Provide testing and monitoring of these services Work with facilities to deploy and maintain –Also to develop facility-specific features –Looking for 1 or 2 initial sites for interactive service

9 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 20039 Scenario for first release Here is a scenario for user interaction with the first release of ADA Authenticate –Proxy from authentication service Choose application –E.g. PAW to process DC1 ntuples or –Athena to process DC2 AOD or –Athena reconstruction Define task –Analysis: provide code to define and fill histograms –Production: athena job options, maybe code –Perhaps select starting point from task catalog Select input dataset –From dataset (metadata) catalog service

10 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200310 Scenario for first release (cont) Create job configuration –Response time, role, optional splitter,… Locate processing service Submit job –Application, task, dataset, configuration While job is running –Query service for status and partial results –Examine partial results (e.g. histograms) –Kill job if results are bad When job is finished –Examine complete result –Modify task or select new dataset and repeat

11 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200311 Plan for first release Schedule Implement and deploy in advance of March 2004 software workshop Might slip to May meeting Building blocks Code and developers in GANGA and DIAL LCG project following from ARDA –Just starting; so don’t wait but –Stay closely coupled to that project Open to contributions (especially effort) from others

12 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200312 Deliverables for first release Comments Goal is to support the scenario outlined earlier Build on current GANGA and DIAL implementations and plans Emergence of ARDA project may change plans Add more tasks if more ideas and effort are found

13 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200313 Deliverables for first release (cont) Authentication service GSI based Support both EDG and US certificates High-level JDL Start from current DIAL interface Incorporate ideas from PPDG, ARDA, … –If available in time This defines the interface (WSDL) for the following analysis and production services Clients for analysis environments Command line, GANGA and ROOT

14 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200314 Deliverables for first release (cont) Interactive analysis service Goal is “interactive” response time Initial implementation at one or two sites Build on existing DIAL scheduler service –Add authentication –Deploy as web or grid service Application/task/dataset –PAW with fortran task to fill histos from combined ntuples –Add ROOT with C++ task to fill from ROOT ntuples? –Add athena with C++ task to fill from AOD?

15 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200315 Deliverables for first release (cont) Batch analysis service Batch-like response Processing distributed over grid Start from ATLAS production supervisor/executor –See figure Support athena tasks –Fill histograms from AOD –Reconstruction

16 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200316 Possible connections to ATLAS production Possible ADA Scheduler services Production system Different flavors for different grids

17 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200317 Deliverables for first release (cont) Catalog services Catalog tasks, datasets, results and jobs Dataset catalog functionality: –Means for users to select an input dataset –Means for production to register output dataset –Means for system (e.g. DIAL scheduler) to turn dataset specification into accessible physical files Host in AMI –Add grid service interface Try to share with production File catalog and replication services –Use DMS?

18 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200318 Conclusions Distributed analysis is a new project for ATLAS Philosophy Tightly integrate with non-distributed analysis Be neutral  use client-server mechanism to support different analysis environments and different processing systems Be flexible  capabilities (and hence demands) will change as technology evolves Be responsive to evolving user requirements Build on existing ideas and projects including GANGA, DIAL, ATLAS production, ARDA, GriPhyN/IVDGL, PPDG, …

19 David Adams ATLAS ADA Plans ATLAS SW – Summary sessionDecember 5, 200319 Conclusions (cont) Plan of action Define interface (high-level JDL) Quickly implement clients –command line, GANGA, ROOT Quickly implement services –authentication and authorization –interactive and batch analysis/production –catalogs Expose to users, learn lessons and re-implement Repeat More information Web site coming soon Mail to dladams@bnl.gov


Download ppt "David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN."

Similar presentations


Ads by Google