DILIGENT Project Andrea Manzi ISTI-CNR, Pisa. 09/01/2006NA4 Generic Application Meeting2 Outline Project Description Interaction with EGEE gLite DILIGENT.

Slides:



Advertisements
Similar presentations
DILIGENT Digital libraries powered by the Grid Peter Fankhauser
Advertisements

The DRIVER Infrastructure (Digital Repository Infrastructure Vision for European Research) Paolo Manghi ISTI - National Research Council, Italy.
EGEE-II INFSO-RI Enabling Grids for E-sciencE The gLite middleware distribution OSG Consortium Meeting Seattle,
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Digital Libraries of the Future – and the Role of Libraries Donatella Castelli ISTI-CNR.
DataGrid Kimmo Soikkeli Ilkka Sormunen. What is DataGrid? DataGrid is a project that aims to enable access to geographically distributed computing power.
D4Science Project (DILIGENT For Science) Donatella Castelli CNR-ISTI DRIVER Summit January 2008 Gottingen (Germany)
Consorzio COMETA - PI2S2 Project UNIONE EUROPEA SAGE – Storage Accounting for Grid Environments in gLite Fabio Scibilia Consorzio.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
Enabling Grids for E-sciencE Medical image processing web portal : Requirements analysis. An almost end user point of view … H. Benoit-Cattin,
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
CGW 2003 Institute of Computer Science AGH Proposal of Adaptation of Legacy C/C++ Software to Grid Services Bartosz Baliś, Marian Bubak, Michał Węgiel,
Computing for ILC experiment Computing Research Center, KEK Hiroyuki Matsunaga.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
EGEE is a project funded by the European Union under contract IST Testing processes Leanne Guy Testing activity manager JRA1 All hands meeting,
Relationships July 9, Producers and Consumers SERI - Relationships Session 1.
:: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: ::::: :: GridKA School 2009 MPI on Grids 1 MPI On Grids September 3 rd, GridKA School 2009.
A DΙgital Library Infrastructure on Grid EΝabled Technology ETICS Usage in DILIGENT Pedro Andrade
QCDGrid Progress James Perry, Andrew Jackson, Stephen Booth, Lorna Smith EPCC, The University Of Edinburgh.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Grid Workload Management Massimo Sgaravatto INFN Padova.
The huge amount of resources available in the Grids, and the necessity to have the most up-to-date experimental software deployed in all the sites within.
November SC06 Tampa F.Fanzago CRAB a user-friendly tool for CMS distributed analysis Federica Fanzago INFN-PADOVA for CRAB team.
Enabling Grids for E-sciencE SA1 EGEE-II INFSO-RI The Pre-Production Service in WLCG/EGEE A. Retico, N. Thackray CERN – Geneva, Switzerland PPS.
Oct 2008 RCDL 2008, Dubna, Russian Federation D4Science Tutorial Preface George Kakaletris (NKUA)
DILIGENT A step towards a knowledge infrastructure.
GLite – An Outsider’s View Stephen Burke RAL. January 31 st 2005gLite overview Introduction A personal view of the current situation –Asked to be provocative!
CEOS WGISS-21 CNES GRID related R&D activities Anne JEAN-ANTOINE PICCOLO CEOS WGISS-21 – Budapest – 2006, 8-12 May.
Owen SyngeTitle of TalkSlide 1 Storage Management Owen Synge – Developer, Packager, and first line support to System Administrators. Talks Scope –GridPP.
Production Grid Challenges in Hungary Péter Stefán Ferenc Szalai Gábor Vitéz NIIF/HUNGARNET.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
Conference name Company name INFSOM-RI Speaker name The ETICS Job management architecture EGEE ‘08 Istanbul, September 25 th 2008 Valerio Venturi.
Grid Security Vulnerability Group Linda Cornwall, GDB, CERN 7 th September 2005
EGEE is a project funded by the European Union under contract IST Presentation of NA4 Generic Applications Roberto Barbera NA4 Generic Applications.
26/05/2005 Research Infrastructures - 'eInfrastructure: Grid initiatives‘ FP INFRASTRUCTURES-71 DIMMI Project a DI gital M ulti M edia I nfrastructure.
Recent improvements in HLRmon, an accounting portal suitable for national Grids Enrico Fattibene (speaker), Andrea Cristofori, Luciano Gaido, Paolo Veronesi.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
INFSO-RI Enabling Grids for E-sciencE /10/20054th EGEE Conference - Pisa1 gLite Configuration and Deployment Models JRA1 Integration.
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
D4Science and ETICS Building and Testing gCube and gCore Pedro Andrade CERN EGEE’08 Conference 25 September 2008 Istanbul (Turkey)
DILIGENT A testbed digital library infrastructure for supporting the activity of the researchers.
INFSO-RI SA2 ETICS2 first Review Valerio Venturi INFN Bruxelles, 3 April 2009 Infrastructure Support.
INFSO-RI Enabling Grids for E-sciencE gLite Test and Certification Effort Nick Thackray CERN.
Objectives & Current Status Donatella Castelli ISTI-CNR, Italy.
ETICS An Environment for Distributed Software Development in Aerospace Applications SpaceTransfer09 Hannover Messe, April 2009.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks EGEE Operations: Evolution of the Role of.
II EGEE conference Den Haag November, ROC-CIC status in Italy
Pedro Andrade > IT-GD > D4Science Pedro Andrade CERN European Organization for Nuclear Research GD Group Meeting 27 October 2007 CERN (Switzerland)
DGAS Distributed Grid Accounting System INFN Workshop /05/1009, Palau Giuseppe Patania Andrea Guarise 6/18/20161.
WP5 – Infrastructure Operations Test and Production Infrastructures StratusLab kick-off meeting June 2010, Orsay, France GRNET.
Pasquale Pagano, CNR – ISTI on behalf of the DILIGENT Consortium Geneva, CERN, 16th of December 2004.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
1 Tutorial Outline 30’ From Content Management Systems to VREs 50’ Creating a VRE 80 Using a VRE 20’ Conclusions.
Enabling Grids for E-sciencE Claudio Cherubino INFN DGAS (Distributed Grid Accounting System)
EGEE-III INFSO-RI Enabling Grids for E-sciencE Application Porting Support SSC and proposal Gergely Sipos
Enabling Grids for E-sciencE University of Perugia Computational Chemistry status report EGAAP Meeting – 21 rst April 2005 Athens, Greece.
JRA1 Middleware re-engineering
Bob Jones EGEE Technical Director
Accessing the VI-SEEM infrastructure
Regional Operations Centres Core infrastructure Centres
StoRM: a SRM solution for disk based storage systems
Ian Bird GDB Meeting CERN 9 September 2003
Infrastructure Support
Short update on the latest gLite status
GSAF Grid Storage Access Framework
gLite The EGEE Middleware Distribution
Presentation transcript:

DILIGENT Project Andrea Manzi ISTI-CNR, Pisa

09/01/2006NA4 Generic Application Meeting2 Outline Project Description Interaction with EGEE gLite DILIGENT Infrastructures gLite Experimentation Problem Using gLite Services DILIGENT Requirements Future plans

09/01/2006NA4 Generic Application Meeting3 Project Description Duration: 36 Months Start Date: Sept 2004 Person/Months: 1024 Total Costs: 9.5 M € (6.3 M € from EU) Objective: Create a Digital Library Infrastructure that will allow members of dynamic virtual research organizations to create on-demand transient digital libraries based on shared computing, storage, multimedia, multi-type content, and application resources

09/01/2006NA4 Generic Application Meeting4 Participants Italian National Research Coucil – ISTI (Italy, Scientific Co-ordinator) European Research Consortium for Informatics and Mathematics (France, Administrative Co- ordinator) European Organization for Nuclear Research (Switzerland) Fraunhofer-Gesellschaft zur F ö rderung der angewandten Forschung e.V. – IPSI (Germany) University of Athens (Greece) University of Basel (Switzerland) University for Health Informatics and Technology Tyrol (Austria) University of Strathclyde (United Kingdom) Engineering Ingegneria Informatica SpA (Italy) Fast Search & Transfer ASA (Norway) 4D SOFT Software Development Ltd. (Hungary) European Space Agency – ESRIN (Italy) Scuola Normale Superiore (Italy) RAI Radio Televisione Italiana (Italy)

09/01/2006NA4 Generic Application Meeting5 DLCreation service Service C Service B Service A Service D Service E DILIGENT DL infrastructure simulation Speech recognition Feature extraction 3D processing Consumers Producers Implementation of Environmental Conventions Research on Culture Heritage

09/01/2006NA4 Generic Application Meeting6 Interaction with EGEE Coordination with EGEE Technical interactions 9 technical meetings (mainly with JRA1) gLite mailing lists subscription:   1 training on “Grid Technologies for Digital Libraries” 1 tutorial on “gLite Deployment” Other interactions 4 EGEE conferences (Cork, The Hague, Athens, Pisa)

09/01/2006NA4 Generic Application Meeting7 Interaction with EGEE Feedback to EGEE On EGEE activities gLite bugs submission (JRA1) On DILIGENT project status access to EGEE prototype testbeds (JRA1) access to EGEE PPS testbed (SA1) grid related DL requirements (JRA1, NA4) future plans

09/01/2006NA4 Generic Application Meeting8 gLite DILIGENT Infrastructures DILIGENT has 2 independent infrastructures (gLite v1.4) Development infrastructure Testing infrastructure Infrastructures are geographically distributed, linking 6 sites in Athens, Budapest, Darmstadt, Pisa, Innsbruck and Rome Running gLite experimentation tests since July 2005

09/01/2006NA4 Generic Application Meeting9 Development Infrastructures

09/01/2006NA4 Generic Application Meeting10 Testing Infrastructure Job Management Services Data Management Services 4DSOFT Information Services CNR Security Services ENG

09/01/2006NA4 Generic Application Meeting11 gLite Experimentation Goal store/manage collections of objects run applications organized in DAGs store the application results for future usage Tests plan   Data Upload   Job Submission   Data transfer Data 800K XML files of the Reuters corpus (from Aug96 to Aug97) Application Feature extraction tool (JIRE Application) Implementation of prototypes to test the feasibility of the proposed solutions

09/01/2006NA4 Generic Application Meeting12 gLite Experimentation – Data Upload Two Mass Storage Systems (MSS) were tested: dCache and DPM dCache: success rate: 69,06 % avg. rate: 16,18 s/file several problems! DPM: success rate: 97,26 % avg. rate: 6,10 s/file

09/01/2006NA4 Generic Application Meeting13 gLite Experimentation – Job Submission Jobs using dCache data MSS: several problems! Jobs using DPM data MSS: success rate: 100% avg. rate: 5,77 s/file comparable performance using 10 and 100 jobs due to the small number of available worker nodes

09/01/2006NA4 Generic Application Meeting14 gLite Experimentation DILIGENT Vs PPS infras. Data upload similar results (for DPM) Job submission similar results DILIGENT dCache not considered (didn't work with 1000 files)

09/01/2006NA4 Generic Application Meeting15 Process Management gLite Experimentation The experimental DILIGENT DL exploits gLite storing and processing on demand the stored products on the GRID. This allows to produce usable end-user manifestations upon requests. Storage Management Content Management Metadata Management Index and Search Management Authentication Authorization gLite StorageBroker Information Service gLite JM gLite SE gLite WMS Storage Management User Interface Inf. Service R-GMA DVOS VOMS

09/01/2006NA4 Generic Application Meeting16 Problem Using gLite Services gLite deployment gLite architecture and configuration are complex gLite 1.0 was released in April 2005 (since then four new releases were made available) limited information available (it has been made available gradually) several bugs were found in deploying and using gLite (many are solved) Software porting to 64 bit is not complete. Some gLite services ( WMS, CE) can’t be deployed on 64 bit machines.

09/01/2006NA4 Generic Application Meeting17 Problem Using gLite Services [cont] Job submission: Slow Job execution phase Anyway gLite job management system showed to be reliable: more jobs same performance Data upload: A lot of performance issues using DCache backend gLite-put/gLite-get/gLite-rm simultaneous large amount of small files DILIGENT needs 100% successful upload rate-> DPM dead-links on Fireman when glite-put ends with errors

09/01/2006NA4 Generic Application Meeting18 DILIGENT Requirements DILIGENT aims to run executables that repeat the same operations for each input files belonging to a given collection. Each single execution takes few minutes (or less) but it must be repeated for hundreds of thousands times (even millions). These executables usually are organised in a DAG to deliver a more complex functionality

09/01/2006NA4 Generic Application Meeting19 DILIGENT Requirements [cont] In order to support this framework, it should be possible: To query for the maximum number of CPUs concurrently available in order to allow to a DILIGENT high level service to automatically prepare a DAG where each node will be entitled to process a partition of the data collection To use parametric jobs/automatic partitioning on data Submission of a same computation on a set of n input data should be more efficient than the submission of n jobs To use Condor as LRMS (Local resource management System)

09/01/2006NA4 Generic Application Meeting20 DILIGENT Requirements [cont] To support service certificate it should be possible to obtain a service certificate for a high level service To specify a job specific priority the same user/service should be able to specify priorities for his/its own jobs To specify a priority for a user or for a service it is required to prioritize the DILIGENT infrastructural services jobs with respect to the end-user services requests

09/01/2006NA4 Generic Application Meeting21 DILIGENT Requirements [cont] To ask for on-disk encryption of data It should be possible to ask for encryption of the data on disk to prevent data leaks at the storage site level To dynamically manage VO creation The creation of a new VO should be supported without deploying and configuration of services by hand To dynamically support user/service affiliation to a VO The user/service affiliation to a VO should be automathized as much as possible

09/01/2006NA4 Generic Application Meeting22 Future Plans Monitor gLite developments and continue the current work of deploying gLite in DILIGENT infrastructures Continue the ongoing gLite experimentation using DILIGENT and EGEE PPS infrastructures Continue gridifying the following services needed in the DILIGENT DL experimentation. Metadata Management Content Management Index and Search Management Process (workflow) Management

09/01/2006NA4 Generic Application Meeting23 Tips / Summary DILIGENT has successfully installed and now maintains its own gLite infrastructures. DILIGENT development infrastructure can join the EGEE infrastructure An active EGEE-DILIGENT collaboration has been established and this has been key for the achievement of our first goals DILIGENT has identified a concrete set of open issues that we need to address. The gLite and DL experimentation activities have shown that we are on the right track

09/01/2006NA4 Generic Application Meeting24 DILIGENT Web Site DILIGENT Training DLhttp://diligent-training.isti.cnr.ithttp://diligent-training.isti.cnr.it Experimental DL Andrea Thank you