ESA DataGrid Review – 10 June 2002 – Summary item 3: ESA and WP9 part 1 (45m)

Presentation transcript:

ESA DataGrid Review – 10 June 2002 – n° 1
Summary item 3
- ESA and WP9 part 1 (45m)
- DataGrid Frascati infrastructure (AT, 7.5m)
- ESA GOME application (SC, 7.5m)
- ESA experience in using DataGrid testbed 0 and testbed 1 (JL, 15m)

ESA DataGrid Review – 10 June 2002 – n° 2
Summary
- Experiences with Testbed0
- Experiences with Testbed1
- GOME Application Use Case
- EO Application Components
- Issues

ESA DataGrid Review – 10 June 2002 – n° 3
Testbed0
- Installation and testing of Globus
- Globus and Condor installed on several ESRIN machines and tested (Task 00 Activity Report by Fabrizio Pacini)
- First GOME processing prototype
- Globus installed at ENEA
- Solved security issues using GSIKLOG workaround
- Demonstration and report at EDGC3 Frascati (October 2001)

Testbed0 (ESRIN – ENEA)
Hosts: firefox.esa.esrin.it, datagrid.esa.esrin.it, aix44p.frascati.enea.it
- The firefox machine (ESRIN) is intended to be used as the user interface.
- The datagrid machine (ESRIN) is intended to be used as an interface to the AMS system, which provides the archive storage for testbed0.
- aix44p.frascati.enea.it is part of the ENEA network, which is composed of many machines distributed around Italy and governed by the AFS/LSF system. This forms the computing element of our experiment and is used as a gateway to ENEA HPC computing resources. A job sent to aix44p.frascati.enea.it is submitted to one of the ENEA clusters, either by means of the LSF scheduler or by directing the job to a given machine.
- Network: currently the GARR network; in the near future it will be replaced by a dedicated 1 Gbps line.

The following solution has been adopted to allow Globus to acquire the AFS token. First, the Globus Security Infrastructure (GSI) software has been installed on both the client (aix44p) and the server (aixfs, the AFS file server) at ENEA. Second, two background tasks based on the GSI software have been configured: the GSIKLog client, invoked by the Globus software each time a job is submitted, and the GSIKLogD server, which authenticates the Globus user by checking the Grid certificate associated with the token.

Figure: Globus acquires the token to access AFS.
- Step 1: once a job request has been submitted to the Globus Resource Allocation Manager (GRAM), the configuration file globus.gatekeeper.conf is sent to the GSIKLog client.
- Step 2: a file server is found in the AFS configuration file.
- Step 3: the request for a token is sent to the server.
- Step 4: the AFSgridkmap file is created.
- Step 5: an AFS token is returned to the client.

The user saves the file infostart.txt on firefox. It contains high-level information: the start and stop dates of the period that he or she needs to study. This file is sent from firefox to datagrid by means of the globus-rcp command:
# globus-rcp /home_firefox/user_home/infostart.txt datagrid.esrin.esa.it/home_datagrid/esagrid/Grid_Data_Proj/info/infostart.txt
Then the shell script on datagrid is launched from firefox using the globusrun command:
# globusrun -s -r datagrid.esrin.esa.it/jobmanager -f /home_firefox/user_home/get.rsl
This command copies the data from the AMS to datagrid.
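As a rough illustration, the two steps above (staging infostart.txt to datagrid, then launching get.rsl there) can be assembled as argument lists in Python. This is only a sketch: the helper names are hypothetical, the paths and contact strings are copied from the commands on the slide, and nothing is executed.

```python
import shlex

# Paths and contact string copied from the slide; adjust for a real setup.
LOCAL_INFO = "/home_firefox/user_home/infostart.txt"
REMOTE_INFO = ("datagrid.esrin.esa.it"
               "/home_datagrid/esagrid/Grid_Data_Proj/info/infostart.txt")
JOBMANAGER = "datagrid.esrin.esa.it/jobmanager"
GET_RSL = "/home_firefox/user_home/get.rsl"


def stage_command(source, destination):
    """Build the globus-rcp call that copies one file (hypothetical helper)."""
    return ["globus-rcp", source, destination]


def run_command(contact, rsl_file):
    """Build the globusrun call, using only the flags shown on the slide."""
    return ["globusrun", "-s", "-r", contact, "-f", rsl_file]


def retrieval_steps():
    """Return the two commands in the order the slide describes."""
    return [
        stage_command(LOCAL_INFO, REMOTE_INFO),
        run_command(JOBMANAGER, GET_RSL),
    ]


if __name__ == "__main__":
    for argv in retrieval_steps():
        print(shlex.join(argv))  # dry run: print, do not execute
```

Keeping each command as a list of arguments rather than one string avoids quoting problems if the same lists are later passed to a process launcher.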

Once the files have been retrieved from the AMS, they are transferred to the machine aix44p using:
# globus-rcp /home_datagrid/esagrid/Grid_Data_Proj/archive/ lv1 aix44p.frascati.enea.it:/afs/enea.it/frascati/info/grid/ lv1

The user is now ready to run the application on aix44p with IDL:
# globusrun -s -r datagrid.esrin.esa.it/jobmanager -f /home_firefox/user_home/gome.rsl
/afs/enea.it/software/rsi/idl/bin/idl < /afs/enea.it/frascati/info/grid/gome.es
Finally, the resulting file is transferred back to firefox by means of globus-rcp.

ESA DataGrid Review – 10 June 2002 – n° 9
Testbed1
- Integration team assembled at CERN (September)
- First presentation of Integration Testbed (October)
- WP9 Validation Plans drawn up (October)
- Loose Cannons formed
- Validation plans presented at CERN (November)
- Demonstration of integrated testbed (December)
- Officially opened to the applications
- IDL packaged, delivered and tested (January)
- Report to PTB (January)

ESA DataGrid Review – 10 June 2002 – n° 10
Report to PTB (January)
- Tested: Basic Job Submission OK
  - High throughput simultaneous jobs NOT tested
  - dg-job-get-output problem
- Not Tested: RC / RM / GDMP
  - No clear procedure; end-to-end demo by ITeam not seen
  - Access to SEs: we assume it's not ready for us to test yet
- CEs/SEs for EO VO installed at few sites: CERN, Lyon, NIKHEF
  - Testbed1 sites lacking in conformity
- Preparing to test IDL: Application Kits currently being installed
  - Grid-wide installation procedure?
- Documentation quality: verify that examples actually work
  - Prevented IPSL from testing GDMP
- Integration effort underestimated
  - Missing integration layer; installation procedures unclear; not automated; needs help of an expert

ESA DataGrid Review – 10 June 2002 – n° 11
GOME Use Case
[Figure: GOME use case data flow between ESA, KNMI, IPSL and the End User (Science Application), with product stages RAW, L1, L2, L3 and VAL. Annotations: "Regulated Access to Grid processing power" and "Secure access to Grid-registered high-volume data storage". Data volumes shown: 66 Gb (file count not recoverable) and 9,448,000 files = 108 Gb.]

Earth Observation DEMO for PM 24
[Figure: demo data flow on DataGRID. Elements shown: raw satellite data from the GOME instrument; processing of raw GOME data to ozone profiles with OPERA and NNO; validation of GOME ozone profiles with ground-based measurements (LIDAR data); visualization.]

File numbers for one year of GOME data (to be stored and replicated):

Product    Number of files          File size
Level 1    4,724                    15 Mb
NNO        9,448,000                10 kb
Opera      9,448,000                12 kb
Lidar      [value lost]             [value lost] Mb
Total      18,900,[digits lost]     [value lost] Gbyte

- GOME has a data set of 5 years
- GOME is relatively small (in both size and number of files)
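The per-product volumes implied by these figures can be checked with a few lines of Python. This sketch assumes binary units (1 Mb = 1024 kb, 1 Gb = 1024 Mb) and that the quoted sizes are per-file averages; both are assumptions, not statements from the slides. Lidar is left out because its figures are not available here.

```python
# File counts and per-file sizes (in kb) for one year of GOME data,
# taken from the slide. Lidar is omitted: its values are not given.
KB_PER_MB = 1024  # assumption: binary units
PRODUCTS = {
    "Level 1": (4_724, 15 * KB_PER_MB),
    "NNO": (9_448_000, 10),
    "Opera": (9_448_000, 12),
}


def volume_gb(n_files, kb_per_file):
    """Total volume in Gb for n_files files of kb_per_file kb each."""
    return n_files * kb_per_file / (1024 * 1024)


def summarize(products):
    """Map each product name to its total volume in Gb."""
    return {name: volume_gb(n, kb) for name, (n, kb) in products.items()}


if __name__ == "__main__":
    for name, gb in summarize(PRODUCTS).items():
        print(f"{name:8s} {gb:7.1f} Gb")
    total_files = sum(n for n, _ in PRODUCTS.values())
    print("files (excluding Lidar):", total_files)
```

Under these assumptions Opera comes to roughly 108 Gb, which matches the "9,448,000 files = 108 Gb" figure on the GOME use-case slide, and the three products together account for 18,900,724 files before Lidar is counted.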

Use case flow for profile processing:

ESA DataGrid Review – 10 June 2002 – n° 15
EO Application Interface Components
- DataGrid provides a series of low-level grid commands
- In order to support a wide range of EO applications we need generic components embedded in a Grid Application Layer
- Provides functions such as:
  - Information Services: Obtaining information about EO products and services
  - Job Execution Cycle: Composing JDL scripts, submitting them, monitoring, retrieving job results
  - MSS interface: Moving files between MSS archives and Grid SEs
  - Data Replication: Replicating SE datasets to other Grid locations
  - Metadata Services: Registering file metadata with Grid catalogs
  - File retrieval directly from metadata keys: Search Grid catalogs to retrieve file names for the dataset to be processed
- Provide support for higher level application interfaces, e.g. repository, workflow
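The "Job Execution Cycle" component could, for example, start from a small helper that composes a JDL description. The sketch below is illustrative only: the attribute names (Executable, Arguments, StdOutput, StdError, InputSandbox, OutputSandbox) follow the ClassAd-style JDL of the EDG workload management system to the best of our knowledge, and the script and file names are hypothetical placeholders.

```python
def jdl_value(value):
    """Render a Python value as a JDL literal (strings and string lists)."""
    if isinstance(value, (list, tuple)):
        return "{" + ", ".join(jdl_value(v) for v in value) + "}"
    return '"' + str(value).replace('"', '\\"') + '"'


def compose_jdl(attributes):
    """Compose a JDL job description from an attribute mapping."""
    lines = [f"{key} = {jdl_value(val)};" for key, val in attributes.items()]
    return "\n".join(lines) + "\n"


# Hypothetical job: file and script names are placeholders, not from the slides.
gome_job = {
    "Executable": "run_gome.sh",
    "Arguments": "infostart.txt",
    "StdOutput": "gome.out",
    "StdError": "gome.err",
    "InputSandbox": ["run_gome.sh", "infostart.txt"],
    "OutputSandbox": ["gome.out", "gome.err"],
}

if __name__ == "__main__":
    print(compose_jdl(gome_job), end="")
```

Generating the description from a mapping keeps the application layer independent of the exact text format, so the submission, monitoring and retrieval steps can be layered on top without each EO application writing JDL by hand.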

ESA DataGrid Review – 10 June 2002 – n° 16
Issues (1)
- Very-large-scale, complex system with large numbers of participants
  - Dealing with new concepts and technology
  - Communication and coordination in a decentralised, multi-cultural, multi-institutional development group
  - Ongoing aggressive deployment of middleware releases
  - Huge turnover in output: INFORMATION OVERLOAD risk
- Driven by needs of HEP
  - With EO & Biology contributions
  - Discussion forum is rather crowded
  - Lack of EO scientists in other WPs (1-8, 10-12)
- Terms of reference in meetings & discussions:
  - Physicist : Scientist; Experiment : Application; Loose Cannons : Validation Team; CASTOR : Archive Storage; etc.

ESA DataGrid Review – 10 June 2002 – n° 17
Issues (2)
- Yet the basic requirements are mostly in common
- Need first to develop the basic functional layer
- Can then be extended for application-specific requirements
- Testbed stability, usability, performance and scalability
- Application Grid interfacing layer needs to be developed
- APIs needed
- Limited functionality
- Limited performance
- Ongoing rapid prototyping and development
- Keeping step with code & documentation in a rapidly changing distributed environment
- Integration with existing environment

First collaborative scientific result using GRID!