Presentation is loading. Please wait.

Presentation is loading. Please wait.

Www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Kepler Workflow Manager.

Similar presentations


Presentation on theme: "Www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Kepler Workflow Manager."— Presentation transcript:

1 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Kepler Workflow Manager with Serpens Suite - Planck simulations use case Marcin Plociennik 1 Marcos López-Caniego 2 Isabel Campos 2 Jesús Marco 2 Enrique Martínez 2 (1) Poznan Supercomputing and Networking Center (2) Instituto de Física de Cantabria (CSIC-UC) 3/11/2016 1

2 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 The Observational Cosmology and Instrumentation Group at the Instituto de Física de Cantabria (CSIC-UC) is involved in several aspects of the data analysis of Planck, ESA’s mission to study the Cosmic Microwave Background Radiation. During the EGEE-III project we dedicated a fair amount of time and effort to port to the GRID several applications to do CMB-related analysis. Detection of Point Sources in single frequency maps Detection of Point Sources using multifrequency information Detection of Clusters of Galaxies using the SZ effect Detection of Non-Gaussian features in CMB maps 2 Grid activities

3 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Experience so far However, while some of the applications run flawlessly in the GRID, others that involved complex workflows had very high rates of failed jobs. In general, these workflows involved registering 1000’s of large files (in many cases >200 MB) to the storage elements and copying them to the nodes to run the analysis. In other cases it was the RAM memory requirements of the application that triggered the large amount of failed jobs. In all cases, resubmission of failed jobs had to be done manually and the efficiency of the process was very low.

4 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 To improve the efficiency of the workflow, we have deployed new scenario using Serpens for Kepler - a workflow manager to control the production of realistic simulations of the anisotropies of the Cosmic Microwave Background (CMB) radiation in the GRID. These simulations can be used to test the performance of various algorithms designed to detect the presence or absence of non- Gaussian features in the simulated CMB maps, before they are applied to Planck data. In order to test the algorithms we need to produce large numbers of simulations. Each one of them is made of a combination of a Gaussian and non-Gaussian component plus realistic instrumental noise that takes into account the observing strategy of the satellite. CMB workflow

5 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 CMB workflow This workflow moves to the storage element the necessary information to produce the simulations, then each job of the parameter study will generate in each node the actual simulation that will be copied back to the storage element once the job is finished. Serpens for Kepler is dealing in automatic manner with all kinds of failures, problems with infrastructure (resubmitting automatically jobs in case of failures, re-uploading files, restarting, checkpointing workflow statuses, etc.)

6 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 About Kepler www.kepler-project.org Scientific Workflow System Builds upon the open-source Ptolemy II framework Current release: 2.3 (past 2.0, 1.0 – May 2008), initiated 2003 Allows scientists to visually design and execute scientific workflows Actor-oriented model with directors acting as the main workflow engine Enables different models of computation Workflows are saved as XML files - can easily be shared/published Kepler is supported by the NSF-funded Kepler/CORE team, which spans several of the key institutions that originated the Kepler project: UC Davis, UC Santa Barbara, and UC San Diego useful across disciplines: Ecology, Engineering, Geology, Physics,... 3/11/2016 6

7 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 About Kepler Kepler is a Java based tool for building and executing workflows Kepler allows you to use existing elements Kepler allows you to extend it by creating new elements Kepler is an Open Source project (BSD License) Kepler can support you with: –building and executing workflows –executing tasks locally –executing tasks within distributed environments http://kepler-project.org 3/11/2016 EGI User Forum 20127

8 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Serpens for Kepler Serpens: –Actors, workflows, templates for using grid middleware: gLite Unicore QCG –Accessing cloud: OpenNebula Amazon EC2 Developed under EU FP7 Euforia project Used mainly by Fusion community (EFDA ITM) Support for new workflow scenarios as a part of SA3 –Fusion, Astrophysics, Compchem use cases http://serpens.psnc.pl 3/11/2016 8

9 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 gLite module Set of Kepler actors, composites and template workflows Based on RAS services Contains actors allowing: –File upload/download, –Job submission, status checking, –Output retrieval. Templates: Simple scenarios Complex scenarios (parameter study + loops) Managing thousands jobs within one workflow High level of workflow reliability assurance (handling all kinds of failures crashes, service/infrastrucuture problems) 3/11/2016 9

10 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Used mainly by fusion physics community – building blocks for complex fusion modeling workflows scenarios developed in EUFORIA project (EU Fusion fOR ITER Applications) and also in terms EGI-InSPIRE project and in EFDA ITM-TF Use cases templates: Basic (a->b->c) Parameter study Loops Heterogeneous workflows Serpens for Kepler – use cases 3/11/2016 10

11 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Fusion: Part of the ITM-TF platform: (50+ codes) BIT1 (code that solves the plasma transport in a divertor), Helena-Ilsa, (used to compute linearly unstable Magnetohydrodynamics modes in tokamak plasma), ASTRA-Truba (plasma evolution), VMEC- Mercier-COBRA-Visualisation, FAFNER2-ISDEP, CHEASE + MARS-F etc., VMEC-DKES (2500 jobs in minimal case) Astrophysics: production of realistic simulations of the anisotropies of the Cosmic Microwave Background (CMB) radiation CompChem: (first use cases): GFIT3C+ABC Example scenarios 3/11/2016 11

12 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 W-f: Kod 2,3 Kod 6 kod 4 Kod 5 Kod 1 HPC Parameter study - Grid Remote w-f Example use case 3/11/2016 12

13 www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 All modules and external libraries are available under BSD license or compatible. Serpens was successfully validated by Kepler developers and was incorporated within its official repository, it is also part of the official 2.2/2.3 release http://serpens.psnc.pl/ Tutorials Examples Movies Documentation Bugtracker We are open for any new scenarios, collaborations Summary 3/11/2016 13


Download ppt "Www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 EGI-InSPIRE www.egi.eu EGI-InSPIRE RI-261323 EGI Community Forum 2012 Kepler Workflow Manager."

Similar presentations


Ads by Google