Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz.

Slides:



Advertisements
Similar presentations
W w w. h p c - e u r o p a. o r g HPC-Europa Portal: Uniform Access to European HPC Infrastructure Ariel Oleksiak Poznan Supercomputing.
Advertisements

Enhanced Middleware Architecture for the Remote Instrumentation e-Infrastructure OGF28 Alexey Cheptsov High Performace Computing Center.
GridSAM Overview Grid Job S ubmission A nd M onitoring Service What is GridSAM? Funded by the OMII Managed Programme (Started in Sept, 04) Client Perspective.
Höchstleistungsrechenzentrum Stuttgart SEGL Parameter Study Slide 1 Science Experimental Grid Laboratory (SEGL) Dynamical Parameter Study in Distributed.
WS-JDML: A Web Service Interface for Job Submission and Monitoring Stephen M C Gough William Lee London e-Science Centre Department of Computing, Imperial.
1 Project overview Presented at the Euforia KoM January, 2008 Marcin Płóciennik, PSNC, Poland.
EGEE 08, Istanbul, September 25th, Support for experimental science communities Istanbul, September 25th, 2008 Norbert Meyer, Marcin Płóciennik.
GLOBUS PLUG-IN FOR WINGS WOKFLOW ENGINE Elizabeth Martí ITACA Universidad Politécnica de Valencia
ProActive Task Manager Component for SEGL Parameter Sweeping Natalia Currle-Linde and Wasseim Alzouabi High Performance Computing Center Stuttgart (HLRS),
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
DORII Deployment of Remote Instrumentation Infrastructure Norbert Meyer Poznań Supercomputing and Netwoking Center DORII Workshop,
SUN HPC Consortium, Heidelberg 2004 Grid(Lab) Resource Management System (GRMS) and GridLab Services Krzysztof Kurowski Poznan Supercomputing and Networking.
ArcGIS Workflow Manager An Introduction
Polish Infrastructure for Supporting Computational Science in the European Research Space Policy Driven Data Management in PL-Grid Virtual Organizations.
DORII Joint Research Activities DORII Joint Research Activities Status and Progress 4 th All-Hands-Meeting (AHM) Alexey Cheptsov on.
DORII review Deployment and management of production infrastructure SA2 Ioannis Liabotis Greek Research and Technology Network - GRNET.
Flexibility and user-friendliness of grid portals: the PROGRESS approach Michal Kosiedowski
Nicholas LoulloudesMarch 3 rd, 2009 g-Eclipse Testing and Benchmarking Grid Infrastructures using the g-Eclipse Framework Nicholas Loulloudes On behalf.
J-OCM is a system for monitoring distributed Java applications conforming to OMIS specification with J-OMIS extensions. It is used to: gather information.
Transparent Grid Enablement Using Transparent Shaping and GRID superscalar I. Description and Motivation II. Background Information: Transparent Shaping.
DORII review Remote instrumentation communities and application support NA3 Roberto Pugliese Sincrotrone Trieste SCpA.
Dr. Harald KornmayerISCG 2007 – 28 th of March 2007 g-Eclipse A framework for Grid users, operators and developers Harald Kornmayer (Forschungszentrum.
© 2008 Open Grid Forum Independent Software Vendor (ISV) Remote Computing Primer Steven Newhouse.
DORII Joint Research Activities DORII Joint Research Activities Status and Progress 6 th All-Hands-Meeting (AHM) Alexey Cheptsov on.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
Accelerating Scientific Exploration Using Workflow Automation Systems Terence Critchlow (LLNL) Ilkay Altintas (SDSC) Scott Klasky(ORNL) Mladen Vouk (NCSU)
GLOBAL GRID FORUM 10 Workflows in PROGRESS and GridLab environments Michał Kosiedowski.
Migrating Desktop The graphical framework for running grid applications Bartek Palak Poznan Supercomputing and Networking Center The.
Application portlets within the PROGRESS HPC Portal Michał Kosiedowski
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Giuseppe Codispoti INFN - Bologna Egee User ForumMarch 2th BOSS: the CMS interface for job summission, monitoring and bookkeeping W. Bacchi, P.
Migrating Desktop Marcin Płóciennik Marcin Płóciennik Kick-off Meeting, Santander, Graphical.
BalticGrid-II Project BalticGrid-II Kick-off Meeting, , Vilnius1 Joint Research Activity Enhanced Application Services on Sustainable e-Infrastructure.
George Tsouloupas University of Cyprus Task 2.3 GridBench ● 1 st Year Targets ● Background ● Prototype ● Problems and Issues ● What's Next.
SEE-GRID-SCI The SEE-GRID-SCI initiative is co-funded by the European Commission under the FP7 Research Infrastructures contract no.
Parameter Study Principles & Practices. What is Parameter Study? Parameter study is the application of a single algorithm over a set of independent inputs:
Terena conference, June 2004, Rhodes, Greece Norbert Meyer The effective integration of scientific instruments in the Grid.
6 February 2009 ©2009 Cesare Pautasso | 1 JOpera and XtremWeb-CH in the Virtual EZ-Grid Cesare Pautasso Faculty of Informatics University.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
EGEE-III INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks Grid Web Portal for Chemists M. Sterzel,
Migrating Desktop Bartek Palak Bartek Palak Poznan Supercomputing and Networking Center The Graphical Framework.
Easy Access to Grid infrastructures Dr. Harald Kornmayer (NEC Laboratories Europe) Dr. Mathias Stuempert (KIT-SCC, Karlsruhe) EGEE User Forum 2008 Clermont-Ferrand,
August 2003 At A Glance The IRC is a platform independent, extensible, and adaptive framework that provides robust, interactive, and distributed control.
Parameter Study Principles & Practices. Outline Data Model of the PS Part I Simple PS –Generating simple PS Workflow by introducing PS Input port – using.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
Experiences Running Seismic Hazard Workflows Scott Callaghan Southern California Earthquake Center University of Southern California SC13 Workflow BoF.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
PROGRESS: GEW'2003 Using Resources of Multiple Grids with the Grid Service Provider Michał Kosiedowski.
BalticGrid-II Project EGEE UF’09 Conference, , Catania Partner’s logo Framework for Grid Applications Migrating Desktop Framework for Grid.
1 P-GRADE Portal hands-on Gergely Sipos MTA SZTAKI Hungarian Academy of Sciences.
The NGS Grid Portal David Meredith NGS + Grid Technology Group, e-Science Centre, Daresbury Laboratory, UK
EGEE 3 rd conference - Athens – 20/04/2005 CREAM JDL vs JSDL Massimo Sgaravatto INFN - Padova.
EGEE-II INFSO-RI Enabling Grids for E-sciencE EGEE and gLite are registered trademarks gLite – UNICORE interoperability Daniel Mallmann.
D.Spiga, L.Servoli, L.Faina INFN & University of Perugia CRAB WorkFlow : CRAB: CMS Remote Analysis Builder A CMS specific tool written in python and developed.
ECHO Technical Interchange Meeting 2013 Timothy Goff 1 Raytheon EED Program | ECHO Technical Interchange 2013.
Migrating Desktop Uniform Access to the Grid Marcin Płóciennik Poznan Supercomputing and Networking Center Poland EGEE’08 Conference, Istanbul, 24 Sep.
Ganga/Dirac Data Management meeting October 2003 Gennady Kuznetsov Production Manager Tools and Ganga (New Architecture)
Tutorial on Science Gateways, Roma, Catania Science Gateway Framework Motivations, architecture, features Riccardo Rotondo.
InSilicoLab – Grid Environment for Supporting Numerical Experiments in Chemistry Joanna Kocot, Daniel Harężlak, Klemens Noga, Mariusz Sterzel, Tomasz Szepieniec.
ECLIPSE RICH CLIENT PLATFORM Part 1 Introduction.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
© 2006 Open Grid Forum Remote Instrumentation (not only) for fun LEGO Mindstorm Exploration Demo Authors: Andrea Del Linz, Roberto Pugliese (Sincrotrone.
and Alexandre Duarte OurGrid/EELA Interoperability Meeting
BOSS: the CMS interface for job summission, monitoring and bookkeeping
BOSS: the CMS interface for job summission, monitoring and bookkeeping
EMI Interoperability Activities
BOSS: the CMS interface for job summission, monitoring and bookkeeping
Interactive grid framework
Overview of Workflows: Why Use Them?
Gordon Erlebacher Florida State University
Presentation transcript:

Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz Pabiś, Mariusz Wojtysiak, Paweł Wolniewicz Poznań Supercomputing and Netwoking Center , Alghero, Italy

Outline Background DORII and HORUS application JSDL and its extension Parameter Sweep capabilities Technical implementation of HORUS workflow JAVA library for Parameter Sweep

New oportunities = new problems Simplify access to the Grid → wider community → new users → new use cases → new needs → extending classical e-Infrastructure → new problems Problem: Instrument elements as a virtualisation of data sources - data processing Input: do the same with many input files Output: output the same file with changed content or output many different files as a result of slightly changed processing Goal – to streamline the process – 2 levels: Automatization Describing the logic of job

Real life example - DORII Situation: well established communities (earthquake, environmental science, experimental science)‏ applications not or only partially integrated in the European e- Infrastructures Applications' needs: To make the daily work more efficient To automatize the jobs' flow Enhance usage of scientific devices Solutions: Integrate applications to e-Infrastructure Convert applications into Grid workflows

Case study: HORUS_bench Description: Insitituto de Hidráulica Ambiental, Universidad de Cantabria Used to process data gathered via HORUS system images of Puntal beach, Santander, Spain to measure beach user density, calculate the wather line, etc. Requirements: User chooses set of processing algorithms which constitue binary model to run over input images The same model is run over GB of images data

HORUS_bench workflow SE Predefined binary models Archived images HORUS_bench output IE – cameras on the beach 0. make photos 0.1 send photos to application 0.2 store photos on SE 1. model input data 2. processing model 3. computing 4. store results

HORUS workflow adaptation DORII E-Infrastructure (access to IEs)‏ VCR and Workflow System Workflows: advantages Automation of the process Single Sing-On (MyProxy)‏ All application's task managed from one point Monitoring of workflow execution Hide the Grid complexity from user Problems Execution of thousands of the same jobs for different input data

JSDL specification JSDL = Job Submission Description Language JSDL Working Group in Open Grid Forum Project created: 09/25/2003, version 1.0 available Goal: to specify an abstract standard of job description language that is independent of underlying middleware to replace existing languages (JDL, RSL, etc.)‏ to make it extensible: POSIX Application HPC Profile Application SPMD Application Parameter Sweep

Basic JSDL POSIX Application definition Data transfer $./algorithm_model_bin -inputFile file.jpg -outputDir.

Parameter Sweep - values algorithm_model_bin -inputFile file000.jpg algorithm_model_bin -inputFile file001.jpg... algorithm_model_bin -inputFile file999.jpg

Parameter Sweep - Functions algorithm_model_bin -inputFile file0.jpg... algorithm_model_bin -inputFile file68.jpg algorithm_model_bin -inputFile file70.jpg... algorithm_model_bin -inputFile file665.jpg algorithm_model_bin -inputFile file667.jpg... algorithm_model_bin -inputFile file999.jpg

HORUS JSDL - 2 nd approach Second approach create one JSDL file and make HORUS take care of many input files

Parameter sweep - FileSweep #! /bin/bash #some pre-processing./algorithm_model_bin -inputFile input_name #some post-processing

Parameter Sweep - summary Parameter – specifies the target JSDL element to be parametrised DocumentNode (whole value, XPath substring)‏ FileSweep Function – specifies the values to be substitued for parameters Values LoopInteger DoubleLoop Assigment – define the order and dependencies between parameters Sweep at the same time Independent sweep Nested Sweep

Limitations of JSDL and Parameter Sweep No support for JDL collection jobs (shared sandbox) - JSDL limitation Problem with supporting MW MWs don't support JSDL, not to say about Parameter Sweep No parametrisation at the level of workflow lack of workflow language that would support parametrisation‏ User specification awareness sweep of elements corellated witch each other in JSDL (e.g. DataStaging of POSIX Application elements)‏

Technical solution gLite 1. user submits HORUS workflow 2. Workflow manager decomposites the workflow 2.1. Each task is submited on the Grid through CommonLib 3. Parametric JSDL is translated to JSDLs – one for each iteration 3.1. JSDL is translated to JDL 3.2. JSD is submited to gLite Workflow Editor Workflow Manager Common Lib: g-Eclipse

g-Eclipse JSDL and Parameter Sweep Integrated, middleware independent Grid enabled workbench tool EC and Eclipse project JSDL as a default job description language support for Parameter Sweep extension multi-page editor for JSDL...with special page for Parameter Sweep

g-Eclipse's JAVA Parameter Sweep Library Standalone library extracted from g-Eclipse plug-ins 23kB JAR file dev.eclipse.org/svnroot/technology/eu.geclipse/trunk/plugins/eu.geclipse.jsdl

g-Eclipse's JAVA Parameter Sweep Library

Work in progress Current implementation of extension (newest: 27 th draft)‏ Extend Parameter Sweep support XPath substring function FileSweep DoubleLoop Processing of valid yet questionable values‏ Extract JSDL editor as a standalone Eclipse plug-in Polish API within DORII's Common Lib

Summary Problems: IE shortens a time-to-grid for input data Problem of a lot of input that has to processed Applications' demands are to empower data processing Grid complexity has to be hiden from users Solutions: Workflows – abstract, high level job presentation and its automation Parametrisation – abstracting the job's logic to make processing more clear

Contact information DORII project g-Eclipse or 1.0 release: OGF JSDL WG questions/comments? Thank you!