Grid User Interface:Ganga Farida Fassi Master de Physique Informatique Rabat, Maroc 24-17 th, May, 2011.

Slides:



Advertisements
Similar presentations
ATLAS/LHCb GANGA DEVELOPMENT Introduction Requirements Architecture and design Interfacing to the Grid Ganga prototyping A. Soroko (Oxford), K. Harrison.
Advertisements

GANGA Overview Germán Carrera, Alfredo Solano (CNB/CSIC) EMBRACE COURSE Monday 19th of February to Friday 23th. CNB-CSIC Madrid.
Computing Lectures Introduction to Ganga 1 Ganga: Introduction Object Orientated Interactive Job Submission System –Written in python –Based on the concept.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Introduction to GANGA Hurng-Chun Lee 27 Feb.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL March 25, 2003 CHEP 2003 Data Analysis Environment and Visualization.
Israel Cluster Structure. Outline The local cluster Local analysis on the cluster –Program location –Storage –Interactive analysis & batch analysis –PBS.
DIRAC API DIRAC Project. Overview  DIRAC API  Why APIs are important?  Why advanced users prefer APIs?  How it is done?  What is local mode what.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL July 15, 2003 LCG Analysis RTAG CERN.
David Adams ATLAS ATLAS Distributed Analysis David Adams BNL March 18, 2004 ATLAS Software Workshop Grid session.
K. Harrison CERN, 20th April 2004 AJDL interface and LCG submission - Overview of AJDL - Using AJDL from Python - LCG submission.
03/27/2003CHEP20031 Remote Operation of a Monte Carlo Production Farm Using Globus Dirk Hufnagel, Teela Pulliam, Thomas Allmendinger, Klaus Honscheid (Ohio.
Distributed Analysis using Ganga I.Ideas behind Ganga II.Getting started III.Running ATLAS applications Distributed Analysis Tutorial ATLAS Computing &
Belle MC Production on Grid 2 nd Open Meeting of the SuperKEKB Collaboration Soft/Comp session 17 March, 2009 Hideyuki Nakazawa National Central University.
F. Fassi, S. Cabrera, R. Vives, S. González de la Hoz, Á. Fernández, J. Sánchez, L. March, J. Salt, A. Lamas IFIC-CSIC-UV, Valencia, Spain Third EELA conference,
DIRAC Review (13 th December 2005)Stuart K. Paterson1 DIRAC Review Exposing DIRAC Functionality.
Stuart Wakefield Imperial College London Evolution of BOSS, a tool for job submission and tracking W. Bacchi, G. Codispoti, C. Grandi, INFN Bologna D.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Ganga A quick tutorial Asterios Katsifodimos Trainer, University of Cyprus Nicosia, Feb 16, 2009.
David Adams ATLAS ADA, ARDA and PPDG David Adams BNL June 28, 2004 PPDG Collaboration Meeting Williams Bay, Wisconsin.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Introduction to Ganga Karl Harrison (University of Cambridge) ATLAS Distributed Analysis Tutorial Milano, 5-6 February 2007
ARDA Prototypes Andrew Maier CERN. ARDA WorkshopAndrew Maier, CERN2 Overview ARDA in a nutshell –Experiments –Middleware Experiment prototypes (basic.
T3 analysis Facility V. Bucard, F.Furano, A.Maier, R.Santana, R. Santinelli T3 Analysis Facility The LHCb Computing Model divides collaboration affiliated.
Ganga Status Update Will Reece. Will Reece - Imperial College LondonPage 2 Outline User Statistics User Experiences New Features in Upcoming Features.
A PanDA Backend for the Ganga Analysis Interface J. Elmsheuser 1, D. Liko 2, T. Maeno 3, P. Nilsson 4, D.C. Vanderster 5, T. Wenaus 3, R. Walker 1 1: Ludwig-Maximilians-Universität.
D. Adams, D. Liko, K...Harrison, C. L. Tan ATLAS ATLAS Distributed Analysis: Current roadmap David Adams – DIAL/PPDG/BNL Dietrich Liko – ARDA/EGEE/CERN.
Ganga 4 Basics - Tutorial Jakub T. Moscicki ARDA/LHCb Ganga Tutorial, November 2005.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 – The Ganga Evolution Andrew Maier.
Ganga 4 Basics - Tutorial Jakub T. Moscicki ARDA/LHCb Ganga Tutorial, September 2006.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ganga Tutorial From: Jakub T. Moscicki (CERN)
Distributed Computing and Ganga Karl Harrison (University of Cambridge) 3rd LHCb-UK Software Course National e-Science Centre, Edinburgh, 8-10 January.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Ganga User Interface EGEE Review Jakub Moscicki.
1 DIRAC Job submission A.Tsaregorodtsev, CPPM, Marseille LHCb-ATLAS GANGA Workshop, 21 April 2004.
Distributed Data Analysis with GANGA (Tutorial) Alexander Zaytsev Budker Institute of Nuclear Physics (BudkerINP), Novosibirsk On the basis of GANGA EGEE.
Using Ganga for physics analysis Karl Harrison (University of Cambridge) ATLAS Distributed Analysis Tutorial Milano, 5-6 February 2007
2 June 20061/17 Getting started with Ganga K.Harrison University of Cambridge Tutorial on Distributed Analysis with Ganga CERN, 2.
David Adams ATLAS ATLAS-ARDA strategy and priorities David Adams BNL October 21, 2004 ARDA Workshop.
ATLAS-specific functionality in Ganga - Requirements for distributed analysis - ATLAS considerations - DIAL submission from Ganga - Graphical interfaces.
INFSO-RI Enabling Grids for E-sciencE Using of GANGA interface for Athena applications A. Zalite / PNPI.
Distributed Analysis Tutorial Dietrich Liko. Overview  Three grid flavors in ATLAS EGEE OSG Nordugrid  Distributed Analysis Activities GANGA/LCG PANDA/OSG.
David Adams ATLAS ATLAS Distributed Analysis (ADA) David Adams BNL December 5, 2003 ATLAS software workshop CERN.
David Adams ATLAS ATLAS Distributed Analysis and proposal for ATLAS-LHCb system David Adams BNL March 22, 2004 ATLAS-LHCb-GANGA Meeting.
INFSO-RI Enabling Grids for E-sciencE Ganga 4 Technical Overview Jakub T. Moscicki, CERN.
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
ATLAS Physics Analysis Framework James R. Catmore Lancaster University.
Geant4 GRID production Sangwan Kim, Vu Trong Hieu, AD At KISTI.
Seven things you should know about Ganga K. Harrison (University of Cambridge) Distributed Analysis Tutorial ATLAS Software & Computing Workshop, CERN,
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
David Adams ATLAS DIAL Distributed Interactive Analysis of Large datasets David Adams BNL May 19, 2003 BNL Technology Meeting.
SEE-GRID-SCI WRF-ARW model: Grid usage The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures.
Progress on Design and Implement of Job Management System Suo Bing, Yan Tian, Zhao Xianghu
ATLAS TAG Services Jack Cranshaw with support from Thomas Doherty, Julius Hrivnac, Marcin Nowak.
ATLAS Distributed Analysis S. González de la Hoz 1, D. Liko 2, L. March 1 1 IFIC – Valencia 2 CERN.
Distrubuited Analysis using GANGA
L’analisi in LHCb Angelo Carbone INFN Bologna
Dan van der Ster for the Ganga Team
Work report Xianghu Zhao Nov 11, 2014.
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Introduction to Grid Technology
LHCb Computing Model and Data Handling Angelo Carbone 5° workshop italiano sulla fisica p-p ad LHC 31st January 2008.
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Job workflow Pre production operations:
Jakub T. Moscicki (KUBA) CERN
The Ganga User Interface for Physics Analysis on Distributed Resources
ADA aodhisto transformation
EGEE Middleware: gLite Information Systems (IS)
ADA analysis transformations
Presentation transcript:

Grid User Interface:Ganga Farida Fassi Master de Physique Informatique Rabat, Maroc th, May, 2011

Outline  Ganga Overview  Ganga Architecture  How to use Ganga  More on Ganga usage  Brief review on Panda

Ganga Overview The naive idea of submitting jobs to Grid assume the following steps: ▫ Prepare the “Job Description Language” file for job configuration ▫ Find suitable (e.g. Athena) software application ▫ Locate the datasets on different storage elements ▫ Job splitting, monitoring and book-keeping Ganga combines the components to provide a front- end client for interacting with Grid infrastructures Ganga combines the components to provide a front- end client for interacting with Grid infrastructures

Ganga architecture

Ganga allows simple switching between testing on a local batch system and large-scale data processing on Grid distributed resources ▫ Jobs look the same whether they run locally or on the Grid ▫ Configure once, run anywhere Ganga Overview

Architecture Job Object is where the Ganga journey starts: A job in Ganga is constructed from a set of building blocks, not all required for every job

Architecture Customized application, plug-in based design, eases job creation Incremental analysis development switching between different technologies: First test on local machine Intermediate sample analyzed on batch Full sample run using GRID backends

Few words on analysis, data Model… User has to define his/her analysis project specifying:   The physique process that he/her aim to study  Data format that continue the required information for the analysis.  Lean some tools p b W+W+ t W-W- q l-l- p b-jet jet b-jet

Example application: ATHENA Athena is the ATLAS framework used to control the execution workflow Support Athena applications: Simulation, Reconstruction, and Analysis

Some Analysis Work-flows  Classic analysis using AOD (ROOT file or Database format)  Athena user code sequentially processes large Monte Carlo  or Data stream sample on the Grid  Produces ROOT tuple output which is further processed  locally or on the Grid Small MC Sample Production:  Small MC Sample Production:  Use Production System Transformation (Geant) to produce a  small MC sample for special/official usage  ROOT:  Generic ROOT application e.g. Toy MC

How to use Ganga 11

Ganga processes, in the order they are specified, any configuration files pointed to by the environment variable ▫ GANGA_CONFIG_PATH  and then processes “.gangarc” configure file This makes possible the use of group configuration files  But allows settings to be overridden by user config Configurations

Ganga creates a directory gangadir in your home directory and uses this for storing job-related files and information ▫ created at the first launch [DefaultJobRepository] local_root = /alternative/gangadir [ Ganga Workspace

Example: ATLAS Analysis Job ATLAS Applications: Athena and AthenaMC Data input: ▫ DQ2Dataset: all DQ2 dataset handling in client, LFC/SE interaction on worker node, used by all backends ▫ ATLASDataset: LFC file access ▫ ATLASLocalDataset: local file system, Local/Batch backend Data output: ▫ DQ2OutputDataset: stores files on Grid SE, registration in DQ2 ▫ AtlasOutputDataset: multipurpose for Grid and Local output

[configuration] TextShell = IPython... [LCG] Vir tualOrganisation=atlas... [athena] LCGOutputLocation = srm://lsrm.ific.uv.es/lustre/ific.uv.es/grid/atlas/dq2/users/ LocalOutputLocation = srm://lsrm.ific.uv.es/lustre/ific.uv.es/grid/atlas/dq2/users/ ATLAS_SOFTWARE = /opt/exp_software/atlas/prod/releases/rel_12-0_2 …. …. Syntax Hardcoded configurations setenv GANGA_CONFIG_PATH GangaAtlas/Atlas.ini set path = (/afs/ific.uv.es/project/atlas/software/ganga/install/4.4.2/bin/ $path ) ~/.gangarc ganga -g user config > site config > release config Sequence Python ConfigParser standard How to set configurations release config site config user config Configurations

“ Hello World” example”: CLIP From a Ganga CLIP session, a job that writes “Hello World” can be created, and submitted to LCG, as follows app = Executeable() app.exe = “/bin/echo” app.env = {} app.args = [“Hello World”] # Property values set above are in fact the defaults # for Executable application j = Job(application = app, backend = LCG()) j.submit() # Check on job progress jobs # When job has completed, check the output j.peek(“stdout”)

Athena example: CLIP This assumes you are in the ATLAS VO, your cmt area set up and have checked out, built your package into a work area : j = Job() j.name='Test-AthenaJob-IFIC' j.application = Athena() j.application.exclude_from_user_area=["*.o","*.root.*","*.exe"] j.application.prepare(athena_compile=False) j.application.option_file='$HOME/AthenaTerstArea/12.0.6/PhysicsAnalysis/AnalysisCommon/UserAnalysis/UserAnalysis /run/AnalysisSkeleton_topOptions.py' j.application.atlas_release='12.0.6' j.inputdata.type='DQ2_LOCAL' j.application.max_events='10‘ j.inputdata=DQ2Dataset() j.inputdata.dataset="trig1_misal1_mc PythiaZmumu_pt100_fixed.recon.AOD.v _tid005906" j.splitter = AthenaSplitterJob(numsubjobs=2) j.merger = AthenaOutputMerger() j.outputdata=DQ2OutputDataset() j.outputdata.outputdata=['AnalysisSkeleton.aan.root'] j.backend=LCG() j.backend.CE='ce01.ific.uv.es:2119/jobmanager-pbs-short' j.submit() Aplication InputData Splitter & Merger OutputData Submission

list_plugins( “type”) # List plugins of specified type: # “applications”, “backends”, etc j1 = Job(backend =LSF()) # Create a new job for LSF a1 = Executable() # Create Executable application j1.application = a1 # Set value for job’s application j1.backend = LCG() # Change job’s backend to LCG export(j1, “myJob.py”) # Write job to specified file load( “myJob.py” ) # Load job(s) from specified file j2 = j1.copy() # Create j2 as a copy of job j1 jobs # List jobs jobs[i].subjobs # List subjobs for split job i Ganga CLIP commands (1) Useful commands

Ganga CLIP commands (2) When a job j has been defined, the following methods can be used j.submit() # Submit the job j.kill() # Kill the job (if running) j.remove() # Kill the job and delete associated files j.peek() # List files in job’s output directory Once a job has been submitted,  it can no longer be modified,  it cannot be resubmitted, but  the job can be copied and the copy can be modified/submitted

20 Ganga architecture CLIPGUI Scripts J = Job(backend=LSF()) j.submit() Ganga.Core Athena Gaudi Job repository File Workspace IIN/OUT Sandbox CondorG gLite LSF Monitoring Plugin modules

21  Support for managed production and user analysis  Coherent, homogeneous processing system layered over diverse resources  Pilot submission through CondorG, local batch or gLite WMS PanDA  Use of pilot jobs for acquisition of resources. Workload jobs assigned to successfully activated pilots based on Panda-managed brokerage criteria  integrated data management and monitoring system

Monitoring tools Jobs SAM Collect, store and expose to users information coming from different sources

23 You have a choice: 1 ).Select to see all jobs submitted in the selected time window,By default you get last 24 hours time Window 2).Select all jobs which had been terminated in last 24 hours or are pending or running at the current moment. Then select ‘all jobs regardless submission time’ option Monitoring tools

Useful links  Ganga angaAtlasTutorial Ganga angaAtlasTutorial  Panda Panda  Dashboard:  SAM: