Presentation is loading. Please wait.

Presentation is loading. Please wait.

INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo.

Similar presentations


Presentation on theme: "INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo."— Presentation transcript:

1 INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo CERN (IT-GD) / CNAF 1st IEEE International Conference on e-Science and Grid Computing Dec, 5-8 Melbourne (Australia)

2 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Outlook Scope of the talk Geant4 Experiences UNOSAT Experiences Summary This talk assumes you are familiar with the LCG/EGEE infrastructure and architecture

3 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Scope of this talk  LCG/EGEE is providing a Grid infrastructure to high energy physics experiments at CERN  HOWEVER...  An important number of other communities are getting in contact with us to use the Grid  LCG/EGEE foreseen the support to this new groups  The question is: How to begin and what to do to get involved  The scope of this talk is to show you how the gridifications of new projects are performed

4 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo First Example: Geant4  What is Geant4 (GEometry ANd Tracking)?  Generic Toolkit for Monte Carlo simulation of particle interactions with the matter (i.e. detectors)  Application domains:  High-Energy Physics: ATLAS, CMS and LHCb (LHC), BaBar (SLAC), etc  Space Radiation: ESA  Medical Physics: Proton and brachy therapies, etc  Object-Oriented (C++) project, modular and extensible. Significant improved with respect its predecessor, Geant3, not only from the software structure, but mainly for the physics coverage  Electromagnetic physics of Geant4 and even more Hadronic physics are complex fields. It is fundamental to test their models covering the widest possible range of particles, materials and energies

5 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Requirements  Geant4 works with those physics models the experiments will have to face to  Electromagnetic and Hadronic physics are fundamental features to be properly simulated by Geant4. However they are extremely CPU demanding  Number of events and energy depending:  1 event of 1GeV ~ 0.03 sec (2.4 GHz)  1 event of 300 Gev ~ 9-10 sec Geant4 wants to use the LCG environment to validate the software they provide to their users twice per year  Two large productions per year  Goal during the software validation: Comparison some shower observables between the two different Geant4 versions and check statistical significant changes  Small productions (some few thousands of jobs) during the whole year

6 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Parameters and Strategy  In order to test the software they provide, Geant4 has to run over a large range of parameters:  8 different particles  23 Beam energies  5 physics list (physics models)  7 simplified detectors  Make all the combinations; these are the independent jobs to be run in each validation of the Geant4 software  Geant4 Procedure  Already two productions have been validated in LCG  Comparison of two different software versions  Generation of samples for each version  First time separately  Second time both versions together in each job  Analysis procedure outside Grid  Third production going on at this moment

7 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Production in LCG  Stages: 1. Software installation:  Installation of the Geant4 packages (with all the required external additional packages: PI, AIDA, etc)  Software provided via a tar file  Installation through jobs using specific LCG tools  Fundamental request for the sites: Shared area between WNs and perfectly definition of the software installation region 2. Events production:  Jobs sent by bunches of about 1000 jobs (remember all the possible parameter combinations) defined by each physics list  5000 events in each job were produced 3. Analysis:  Statistical tests to perform the comparison between the two G4 versions

8 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Geant4 Production in LCG  General Characteristics:  VO:  1 st Production: dteam (6 certificates, one as dteamsgm)  2 nd Production: alice (2certificates, one as alicesgm)  3 rd Production: First time as Geant4!  The group is following all the necessary streps to become VO  Sites and middleware operating system:  1 st Production: RedHat7.3  2 nd and 3 rd Productions: Scientific Linux  Resources:  1 st Production: Own RB+BDII+UI: lxb2006 at CERN  2 nd and 3 rd Productions: lxplus resources and 2 BDII  All output:  1 st Production: About 30 GB stored at CERN (lxn1183)  2 nd Production: Comparable quantity stored at CERN (lxn1180)  3 rd Production: Retrieving the outputs to a delivered afs area at CERN

9 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Generation of a general framework consisting of 3 major tools:  Tool for general and automatic job submission  Tool for events generation in all those sites where the software has been installed  Tool for data analysis (not needed during the 2 nd Production)  First Part: Tool for job submission  Copy and registry of the Geant4 package  A file containing the TURL is created and is passed to the WN  Follow up of candidates able to admit Geant4 jobs  Selection of long queues only  Automatic built of the.jdl files for each long queue  Built taking as base those proposed by the user joining the name of the queue where to submit the job  Software Installation tools are used to perform the installation  Submission of these files to each queue

10 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Software Installation tool (Tool submitted in the first step to all sites to install the software)  First step:  The tar file is copied from the SE at CERN to the WN  It is untar and copied to the VO_DTEAM_SW_DIR area  Second Step: Software Installation tool  Some Geant4 tests are performed to validate the installation  If succeeded a tag is published in the Information System  Results:  The software installation was tried in 63 sites  1 st Production: 28 sites  2 nd Production: 35 sites  3 rd Production: Transition phase: configuring the G4 sites  Main Problems:  Sites were having submission problems  Sites did not have defined the VO_ _SW_DIR area or did not have shared area among WNs

11 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Second Step: Tool for the ProductionStrategy:  Only long queues will be used to run the production  All outputs (hbook files) will be stored at CERNMethodology:  Geant4 provides their own code to perform the events production  Python Script for each type of particle, energy, physics list and calorimeter is created by the framework from one template provided by Geant4  Generation of one jdl per job containing the code provided by Geant4 (the same for all jobs) + that script generated by the framework and changing for each job  Submission of all jdl files to all sites containing the Geant4 installation

12 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Framework developed for Geant4  Results (First and Second Production):  A hbook file containing 5000 event is created in the case the production succeeded  The name of the file is created by the framework containing the type of particles, the energy, the physics list and the calorimeter within the name (important to perform later the comparison)  The hbook file is copied and registered to a disk at CERN  During the 2nd production a tar file containing different files should have been created in the case the job succeeded. This file was retrieved to the afs area delivered for this aim and copied and registered to the grid  Around 4508 jobs (two physics list for both Geant4 versions) were run in lest than 2 weeks in 28 sites with a efficiency of about 87%

13 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  This framework covered the Geant4 requirements for its first production  It’s not useful for larger production  Difficult to deal with the output and visualize the results  A new complete tool has been developed for large production  Flexible enough to be used for any VO and any user application  Most of the improvements mostly relative to the outputs handle Documentation: “LCG2 User Guide” http://grid-deployment.web.cern.ch/grid-deployment/cgi- bin/index.cgi?var=eis/docs Download: http://goc.grid.sinica.edu.tw/gocwiki/User_tools

14 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  The new framework consists mainly of two tools:  Tool to perform the automatic job submission  Tool to retrieve and handle the corresponding output 1. Automatic job submission Overview:  Given an user’s jdl this tool performs the following actions:  It lists all sites able to run the jdl provided by the user  It creates automatically a jdl file based on that provided by the user  It submits the just created jdl containing the user application(s)  Moreover it creates a subdirectory (defined by the user) containing a list of the sites where the jobs have been submitted, the corresponding jdls and the jobs IDs

15 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework  Additional Features:  The user can define the queues where the jobs are submitted. These queues are checked to see whether it fixes the job requirements  Requested LFN files can be included. The corresponding TURLs are searched and included in a file passed in the InputSandbox to the WN 2. Retrieve and handle of the outputs  The 2 nd tool checks the status of the jobs from the job IDs included in the directory given by the user  It provides the following output: The job run in ramses.dcic.ups.es:2119/jobmanager-torque-dteam is in status: Scheduled The job run in grid01.phy.ncu.edu.tw:2119/jobmanager-torque-dteam is in status: running The job run in scaic10.scai.frauhofer.de:2119/jobmanager-torque-dteam is in status: over The user is queried to retrieve the output to the destination he has previously decided

16 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Update of the Framework Additional Features: It is possible to visualize the outputs on the web A html report is provided showing the files decided by the user

17 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo New Geant4 Production: DIANE Results obtained for another community: ITU

18 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Second Example: UNOSAT  Satellite imagery based web mapping service  Objectives  Easy access to quality geoinformation service  Organize the demand for geoinformation  Ensure cost-effective and timely products  Core Services  Humanitarian Mapping  Image Processing

19 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Second Example: UNOSAT Data suppliers UNOSAT Central Unit USER WWW Ground station    

20 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Relief Projects of UNOSAT  Case Study: Indian Ocean Tsunami Relief and Development  29 th Dec 2004: First Map distributed online to field users  14 th Jan 2005: Imagery Bank online:  100 Tsunami-related maps (pre and post)  670 raw satellite images  January: 200,000 tsunami maps downloaded in total  UNOSAT has a huge amount of data to stored  CERN has provided a good amount of space for this aim  From Summer 2005 the collaboration with GRID began  Running and storing data in LCG/EGEE can certainly assist UNOSAT in their purposes

21 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo UNOSAT and LCG/EGEE  In summer 2005 we have provided a whole structure at CERN for UNOSAT  UNOSAT Virtual Organization (VO)  3.5TB in CASTOR  Computing Elements, Resource Brokers  Collaboration with ARDA group  AFS area of 5GB  We have run some UNOSAT tests (images compression) inside the GRID environment (quite successful)  The framework developed for Geant4 has been adapted for UNOSAT needs

22 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo ARDA Support LFC ARDA APP Oracle DB CASTOR Metadata (x,y,z) LFN PFN SRM

23 Enabling Grids for E-sciencE INFSO-RI-508833 Melbourne 5th-8 th December Patricia Méndez Lorenzo Summary  Just one message  We are involving already different communities inside the GRID  Huge applications field for GRID  We have created different frameworks to gridify in a short time the new projects  Thanks to ARDA developers we have covered many needs of each community  One of the EGGE purposes (involved different communities inside the GRID) is already a reality


Download ppt "INFSO-RI-508833 Enabling Grids for E-sciencE www.eu-egee.org UNOSAT and Geant4: Experiences of their merge in the LCG Environment Patricia Méndez Lorenzo."

Similar presentations


Ads by Google