Presentation is loading. Please wait.

Presentation is loading. Please wait.

Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz.

Similar presentations


Presentation on theme: "Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz."— Presentation transcript:

1 http://www.dorii.eu/ Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz Pabiś, Mariusz Wojtysiak, Paweł Wolniewicz Poznań Supercomputing and Netwoking Center 03.04.2009, Alghero, Italy

2 Outline Background DORII and HORUS application JSDL and its extension Parameter Sweep capabilities Technical implementation of HORUS workflow JAVA library for Parameter Sweep

3 New oportunities = new problems Simplify access to the Grid → wider community → new users → new use cases → new needs → extending classical e-Infrastructure → new problems Problem: Instrument elements as a virtualisation of data sources - data processing Input: do the same with many input files Output: output the same file with changed content or output many different files as a result of slightly changed processing Goal – to streamline the process – 2 levels: Automatization Describing the logic of job

4 Real life example - DORII Situation: well established communities (earthquake, environmental science, experimental science)‏ applications not or only partially integrated in the European e- Infrastructures Applications' needs: To make the daily work more efficient To automatize the jobs' flow Enhance usage of scientific devices Solutions: Integrate applications to e-Infrastructure Convert applications into Grid workflows

5 Case study: HORUS_bench Description: Insitituto de Hidráulica Ambiental, Universidad de Cantabria Used to process data gathered via HORUS system images of Puntal beach, Santander, Spain to measure beach user density, calculate the wather line, etc. Requirements: User chooses set of processing algorithms which constitue binary model to run over input images The same model is run over GB of images data

6 HORUS_bench workflow SE Predefined binary models Archived images HORUS_bench output IE – cameras on the beach 0. make photos 0.1 send photos to application 0.2 store photos on SE 1. model input data 2. processing model 3. computing 4. store results

7 HORUS workflow adaptation DORII E-Infrastructure (access to IEs)‏ VCR and Workflow System Workflows: advantages Automation of the process Single Sing-On (MyProxy)‏ All application's task managed from one point Monitoring of workflow execution Hide the Grid complexity from user Problems Execution of thousands of the same jobs for different input data

8 JSDL specification JSDL = Job Submission Description Language JSDL Working Group in Open Grid Forum Project created: 09/25/2003, version 1.0 available Goal: to specify an abstract standard of job description language that is independent of underlying middleware to replace existing languages (JDL, RSL, etc.)‏ to make it extensible: POSIX Application HPC Profile Application SPMD Application Parameter Sweep

9 Basic JSDL POSIX Application definition Data transfer $./algorithm_model_bin -inputFile file.jpg -outputDir.

10 Parameter Sweep - values algorithm_model_bin -inputFile file000.jpg algorithm_model_bin -inputFile file001.jpg... algorithm_model_bin -inputFile file999.jpg

11 Parameter Sweep - Functions algorithm_model_bin -inputFile file0.jpg... algorithm_model_bin -inputFile file68.jpg algorithm_model_bin -inputFile file70.jpg... algorithm_model_bin -inputFile file665.jpg algorithm_model_bin -inputFile file667.jpg... algorithm_model_bin -inputFile file999.jpg

12 HORUS JSDL - 2 nd approach Second approach create one JSDL file and make HORUS take care of many input files

13 Parameter sweep - FileSweep #! /bin/bash #some pre-processing./algorithm_model_bin -inputFile input_name #some post-processing

14 Parameter Sweep - summary Parameter – specifies the target JSDL element to be parametrised DocumentNode (whole value, XPath substring)‏ FileSweep Function – specifies the values to be substitued for parameters Values LoopInteger DoubleLoop Assigment – define the order and dependencies between parameters Sweep at the same time Independent sweep Nested Sweep

15 Limitations of JSDL and Parameter Sweep No support for JDL collection jobs (shared sandbox) - JSDL limitation Problem with supporting MW MWs don't support JSDL, not to say about Parameter Sweep No parametrisation at the level of workflow lack of workflow language that would support parametrisation‏ User specification awareness sweep of elements corellated witch each other in JSDL (e.g. DataStaging of POSIX Application elements)‏

16 Technical solution gLite 1. user submits HORUS workflow 2. Workflow manager decomposites the workflow 2.1. Each task is submited on the Grid through CommonLib 3. Parametric JSDL is translated to JSDLs – one for each iteration 3.1. JSDL is translated to JDL 3.2. JSD is submited to gLite Workflow Editor Workflow Manager Common Lib: g-Eclipse

17 g-Eclipse JSDL and Parameter Sweep Integrated, middleware independent Grid enabled workbench tool EC and Eclipse project 07.2006 - 12.2008 JSDL as a default job description language support for Parameter Sweep extension multi-page editor for JSDL...with special page for Parameter Sweep

18 g-Eclipse's JAVA Parameter Sweep Library Standalone library extracted from g-Eclipse plug-ins 23kB JAR file dev.eclipse.org/svnroot/technology/eu.geclipse/trunk/plugins/eu.geclipse.jsdl

19 g-Eclipse's JAVA Parameter Sweep Library

20 Work in progress Current implementation of extension http://schemas.ogf.org/jsdl/2007/01/jsdl-sweep (newest: 27 th draft)‏ http://schemas.ogf.org/jsdl/2007/01/jsdl-sweep Extend Parameter Sweep support XPath substring function FileSweep DoubleLoop Processing of valid yet questionable values‏ Extract JSDL editor as a standalone Eclipse plug-in Polish API within DORII's Common Lib

21 Summary Problems: IE shortens a time-to-grid for input data Problem of a lot of input that has to processed Applications' demands are to empower data processing Grid complexity has to be hiden from users Solutions: Workflows – abstract, high level job presentation and its automation Parametrisation – abstracting the job's logic to make processing more clear

22 Contact information DORII project www.dorii.eu contact@dorii.eu g-Eclipse www.geclipse.eu contact@geclipse.eu or geclipse-dev@eclipse.org contact@geclipse.eugeclipse-dev@eclipse.org 1.0 release: http://www.geclipse.eu/index.php?id=downloadshttp://www.geclipse.eu/index.php?id=downloads OGF JSDL WG http://forge.ogf.org/sf/projects/jsdl-wg questions/comments? katarzyna.bylec@man.poznan.pl katarzyna.bylec@man.poznan.pl Thank you!


Download ppt "Parametric jobs – faciliation of Instrument Elements usage in Grid applications INGRID 2009 Katarzyna Bylec, Szymon Mueller, Mateusz."

Similar presentations


Ads by Google