Porting applications to EU-IndiaGrid: EGEE Marco Verlato EU-IndiaGrid Workshop 16-18 April 2007 Bangalore, India.

Slides:



Advertisements
Similar presentations
FP7-INFRA Enabling Grids for E-sciencE EGEE Induction Grid training for users, Institute of Physics Belgrade, Serbia Sep. 19, 2008.
Advertisements

The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Services Abderrahman El Kharrim
EU-GRID Work Program Massimo Sgaravatto – INFN Padova Cristina Vistoli – INFN Cnaf as INFN members of the EU-GRID technical team.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
1 Grid services based architectures Growing consensus that Grid services is the right concept for building the computing grids; Recent ARDA work has provoked.
Makrand Siddhabhatti Tata Institute of Fundamental Research Mumbai 17 Aug
The SAM-Grid Fabric Services Gabriele Garzoglio (for the SAM-Grid team) Computing Division Fermilab.
INFSO-RI Enabling Grids for E-sciencE FloodGrid application Ladislav Hluchy, Viet D. Tran Institute of Informatics, SAS Slovakia.
Riccardo Bruno INFN.CT Sevilla, Sep 2007 The GENIUS Grid portal.
GRACE Project IST EGAAP meeting – Den Haag, 25/11/2004 Giuseppe Sisto – Telecom Italia Lab.
Workload Management WP Status and next steps Massimo Sgaravatto INFN Padova.
INFSO-RI Enabling Grids for E-sciencE gLite Data Management Services - Overview Mike Mineter National e-Science Centre, Edinburgh.
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Configuring and Maintaining EGEE Production.
INFSO-RI Enabling Grids for E-sciencE The GENIUS Grid portal Tony Calanducci INFN Catania - Italy First Latin American Workshop.
INFSO-RI Enabling Grids for E-sciencE Logging and Bookkeeping and Job Provenance Services Ludek Matyska (CESNET) on behalf of the.
Grid Resource Allocation and Management (GRAM) Execution management Execution management –Deployment, scheduling and monitoring Community Scheduler Framework.
Enabling Grids for E-sciencE ENEA and the EGEE project gLite and interoperability Andrea Santoro, Carlo Sciò Enea Frascati, 22 November.
L ABORATÓRIO DE INSTRUMENTAÇÃO EM FÍSICA EXPERIMENTAL DE PARTÍCULAS Enabling Grids for E-sciencE Grid Computing: Running your Jobs around the World.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
1 st December 2003 JIM for CDF 1 JIM and SAMGrid for CDF Mòrag Burgon-Lyon University of Glasgow.
INFSO-RI Enabling Grids for E-sciencE Workload Management System Mike Mineter
LCG Middleware Testing in 2005 and Future Plans E.Slabospitskaya, IHEP, Russia CERN-Russia Joint Working Group on LHC Computing March, 6, 2006.
Group 1 : Grid Computing Laboratory of Information Technology Supervisors: Alexander Ujhinsky Nikolay Kutovskiy.
Grid Execution Management for Legacy Code Applications Grid Enabling Legacy Code Applications Tamas Kiss Centre for Parallel.
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
June 24-25, 2008 Regional Grid Training, University of Belgrade, Serbia Introduction to gLite gLite Basic Services Antun Balaž SCL, Institute of Physics.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
MTA SZTAKI Hungarian Academy of Sciences Introduction to Grid portals Gergely Sipos
EGEE-II INFSO-RI Enabling Grids for E-sciencE Application Development on Grids Gergely Sipos MTA SZTAKI Hungarian.
Getting started DIRAC Project. Outline  DIRAC information system  Documentation sources  DIRAC users and groups  Registration with DIRAC  Getting.
Interactive Workflows Branislav Šimo, Ondrej Habala, Ladislav Hluchý Institute of Informatics, Slovak Academy of Sciences.
EGEE-II INFSO-RI EGEE and gLite are registered trademarks The EGEE Production Grid Ian Bird EGEE Operations Manager HEPiX Jefferson Lab, 12 th October.
US LHC OSG Technology Roadmap May 4-5th, 2005 Welcome. Thank you to Deirdre for the arrangements.
FRANEC and BaSTI grid integration Massimo Sponza INAF - Osservatorio Astronomico di Trieste.
E-science grid facility for Europe and Latin America Porting a program to run on the Grid Marcello Iacono Manno Consorzio COMETA
1 Grid2Win: porting of gLite middleware to Windows Dario Russo INFN Catania
Glite. Architecture Applications have access both to Higher-level Grid Services and to Foundation Grid Middleware Higher-Level Grid Services are supposed.
FP6−2004−Infrastructures−6-SSA E-infrastructure shared between Europe and Latin America Alexandre Duarte CERN IT-GD-OPS UFCG LSD 1st EELA Grid School.
Testing and integrating the WLCG/EGEE middleware in the LHC computing Simone Campana, Alessandro Di Girolamo, Elisa Lanciotti, Nicolò Magini, Patricia.
FESR Consorzio COMETA - Progetto PI2S2 Porting a program to run on the Grid Marcello Iacono Manno Consorzio COMETA
Università di Perugia Enabling Grids for E-sciencE Status of and requirements for Computational Chemistry NA4 – SA1 Meeting – 6 th April.
DIRAC Project A.Tsaregorodtsev (CPPM) on behalf of the LHCb DIRAC team A Community Grid Solution The DIRAC (Distributed Infrastructure with Remote Agent.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Practical using WMProxy advanced job submission.
EGEE is a project funded by the European Union under contract IST Experiment Software Installation toolkit on LCG-2
Grid Execution Management for Legacy Code Architecture Exposing legacy applications as Grid services: the GEMLCA approach Centre.
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Introduction to Grid Application Development Vangelis Floros Application.
Breaking the frontiers of the Grid R. Graciani EGI TF 2012.
EGEE is a project funded by the European Union under contract IST Generic Applications Requirements Roberto Barbera NA4 Generic Applications.
EGEE-II INFSO-RI Enabling Grids for E-sciencE Overview of gLite, the EGEE middleware Mike Mineter Training Outreach Education National.
First South Africa Grid Training June 2008, Catania (Italy) GILDA t-Infrastructure Valeria Ardizzone INFN Catania.
EGEE is a project funded by the European Union under contract IST GENIUS and GILDA Guy Warner NeSC Training Team Induction to Grid Computing.
SAM architecture EGEE 07 Service Availability Monitor for the LHC experiments Simone Campana, Alessandro Di Girolamo, Nicolò Magini, Patricia Mendez Lorenzo,
Antonio Fuentes RedIRIS Barcelona, 15 Abril 2008 The GENIUS Grid portal.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
BaBar & Grid Eleonora Luppi for the BaBarGrid Group TB GRID Bologna 15 febbraio 2005.
Enabling Grids for E-sciencE University of Perugia Computational Chemistry status report EGAAP Meeting – 21 rst April 2005 Athens, Greece.
Introduction to Grid Application Development
Gri2Win: Porting gLite to run under Windows XP Platform
Grid2Win Porting of gLite middleware to Windows XP platform
Grid Computing: Running your Jobs around the World
Belle II Physics Analysis Center at TIFR
NA4 Medical Imaging Geneva, September 26, 2006 Johan Montagnat.
Roberto Barbera (a nome di Livia Torterolo)
Grid2Win: Porting of gLite middleware to Windows XP platform
Gri2Win: Porting gLite to run under Windows XP Platform
Overview of the EGEE project and the gLite middleware
How To Integrate an Application on Grid
Job Application Monitoring (JAM)
Presentation transcript:

Porting applications to EU-IndiaGrid: EGEE Marco Verlato EU-IndiaGrid Workshop April 2007 Bangalore, India

2 Outline Concepts Job categories Levels of Gridification Software remote installation Applications invocation

3 Grid Applications Development Where computer science meets the application communities! VO-specific developments built on higher-level tools and core services Makes Grid services useable by non-specialists Grids provide the compute and data storage resources Basic Grid services: AA, job submission, info, … Higher-level grid services (brokering,…) Application toolkits, ….. Application Production grids provide these core services.

4 Job concept gLite middleware follows the job submission concept for application execution and resource sharing A job is a self contained entity that packages and conveys all the required information and artifacts for the successful remote execution of an application −Executable files −Input/Output data −Parameters −Environment −Infrastructure Requirements −Workflows

5 Type of applications The work done to port the application depends on its characteristics, complexity and the level of interactions with external grid services Size of input/output data Metadata request Access to external software (database, libraries...)

6 Applications categories Simulation Bulk Processing Responsive Apps. Workflow Parallel Jobs

7 Simulation Examples –LHC Monte Carlo simulation –Fusion Characteristics –Jobs are CPU-intensive –Large number of independent jobs –Run by few (expert) users –Small input; large output Needs –Batch-system services –Minimal data management for storage of results ATLAS ITER

8 Bulk Processing Examples –HEP processing of raw data, analysis –Earth observation data processing Characteristics –Widely-distributed input data –Significant amount of input and output data Needs –Job management tools (workload management) –Meta-data services –More sophisticated data management

9 Responsive Apps. (I) Examples –Prototyping new applications –Monitoring grid operations Characteristics –Small amounts of input and output data –Not CPU-intensive –Short response time (few minutes) Needs –Configuration which allows “immediate” execution (QoS) –Services must treat jobs with minimum latency

10 Responsive Apps. (II) Grid as a backend infrastructure: –gPTM3D: interactive analysis of medical images bioinformatics via web portal –GATE: radiotherapy planning –DILIGENT: digital libraries Characteristics –Rapid response: a human waiting for the result! –Many small but CPU-intensive tasks –User is not aware of “grid”! Needs –Interfacing (data & computing) with non-grid application or portal –User and rights management between front-end and grid

11 Workflow Examples –“Bronze Standard”: medical images registration –Flood prediction Characteristics –Use of grid and non-grid services –Complex set of algorithms for the analysis –Complex dependencies between individual tasks Needs –Tools for managing the workflow itself –Standard interfaces for services (I.e. web- services)

12 Parallel Jobs Examples –Climate modeling –Earthquake analysis –Computational chemistry Characteristics –Many interdependent, communicating tasks –Many CPUs needed simultaneously –Use of MPI libraries Needs –Configuration of resources for flexible use of MPI –Pre-installation of optimized MPI libraries

13 Universal Needs Security –Ability to control access to services and to data Fine-grained access control lists Encryption & logging for more demanding disciplines Access control consistently implemented over all services VO Management –Management of users, groups, and roles –Changing the priority of jobs for different users, groups, roles –Quota management for users, groups, roles –Definition and access to special resources Application-level services Responsive queues (guaranteed, low-latency execution)

14 Levels of “Gridification” No development: wrap existing applications as jobs. No source code modification is required Minor modifications: the application exposes minimal interaction with the grid services (e.g. Data Managements) Major modifications: a wide portion of the code is rewritten to adopt to the new environment (e.g. parallelization, metadata, information) Pure grid applications: developed from scratch. Extensively exploit existing grid services to provide new capabilities customized for a specific domain (e.g. metadata, job management, credential management )

15 No development It is the simplest case : the application has not external dependencies, just needs computational power Being sure that the application runs on the Worker Node, the executable, and the input files it may need are carried in the Input Sandbox and therefore executed Although the application can be directly run, it’s recommended to invoke it from a supplied script which prepare the environment before the execution

16 Minor modifications The application has minimal interactions with grid service : it is self consistent but... – e.g. takes input files from an SE – output files are stored on a SE – Perform I/O from a metadata catalog – the application itself can be downloaded from a SE User provides an adapter script which executes needed tasks, set the environment and run the application

17 Major modifications A part of the application software is rewritten taking care of the new environment : e.g. because of parallelization, interaction at run time with grid services... – Use of API – Web Service invoked – CLI wrap Source code recompiled taking into account this

18 Pure grid application Entirely developed from scratch taking into account grid capabilities Better exploits grid facilities Needs a deep knowledge of grid environment and possibilities in order to engineer entirely the application logic over a grid environment Very rare to meet...

19 Additional software installed Independently from the integration level, applications may need external software (libraries) that, as application itself, can be provided in several ways – at run time carrying out it within InputSandbox downloading it from a Storage Element – permanent install on the CE as tar.gz installed by the VO software manager as rpm being part of the gLite release

20 The Experiment Software Manager (ESM) is the member of the experiment VO entitled to install Application Software in the different sites. The ESM is generally acknowledged through a VOMS role (SoftwareManager) The ESM can manage (install, validate, remove...) Experiment Software on a site at any time through a normal Grid job, without previous communication to the site administrators. Such job has in general no scheduling priorities and will be treated as any other job of the same VO in the same queue. lcg-asis is a good generator for such installation jobs Software installation on CE

21 Software installation on CE The site provides a dedicated space where each supported VO can install or remove software. The amount of available space must be negotiated between the VO and the site An environmental variable holds the path for the location of a such space. Its format is the following: VO_ _SW_DIR

22 Software validation Once the software is installed, the ESM can perform the validation The validation is a process verifying that the software was properly installed and works as expected It can be performed in the same job, after installation (validation on the fly) or later on, via a dedicated job.

23 Software validation If the ESM judges that the software was properly installed he publishes in the Information System (IS) the tag which identifies unequivocally the software, e.g.: VO-cms-CMSSW_1_2_2 Such a tag is added to the GlueHostApplicationSoftwareRunTimeEnvironment attribute of the IS, using the GRIS running in the CE Jobs requesting for a particular piece of software can be directed to the appropriate CE just by setting special requirements on the JDL, e.g.: Requirements=Member("VO-cms-CMSSW_1_2_2", other.GlueHostApplicationSoftwareRunTimeEnvironment);

24 Invoke Application From the UI −Command Line Interfaces / Scripts −APIs −Higher level tools From portals −Accessible from any browser −Tailored to applications

25 References (I)

26 References (II)

27 Acknowledgements Thanks to Mike Mineter (NeSC), Valeria Ardizzone and Emidio Giorgio of the INFN GILDA team