An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul.

Slides:



Advertisements
Similar presentations
Designing Services for Grid-based Knowledge Discovery A. Congiusta, A. Pugliese, Domenico Talia, P. Trunfio DEIS University of Calabria ITALY
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
An Automata-based Approach to Testing Properties in Event Traces H. Hallal, S. Boroday, A. Ulrich, A. Petrenko Sophia Antipolis, France, May 2003.
Towards a Generic Platform for Developing CSCL Applications Using Grid Infrastructure by Santi Caballé Open University of Catalonia Barcelona, Spain with.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
0 General information Rate of acceptance 37% Papers from 15 Countries and 5 Geographical Areas –North America 5 –South America 2 –Europe 20 –Asia 2 –Australia.
CLAG 2004 – April/041 A Workflow-based Architecture for e- Learning in the Grid Luiz A. Pereira, Fábio A. Porto, Bruno Schulze, Rubens N. Melo
L4-1-S1 UML Overview © M.E. Fayad SJSU -- CmpE Software Architectures Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
INDUSTRIAL & SYSTEMS ENGINEERING
Distributed Database Management Systems
Review of “Embedded Software” by E.A. Lee Katherine Barrow Vladimir Jakobac.
eGovernance Under guidance of Dr. P.V. Kamesam IBM Research Lab New Delhi Ashish Gupta 3 rd Year B.Tech, Computer Science and Engg. IIT Delhi.
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
An Agent-Oriented Approach to the Integration of Information Sources Michael Christoffel Institute for Program Structures and Data Organization, University.
Diffusion scheduling in multiagent computing system MotivationArchitectureAlgorithmsExamplesDynamics Robert Schaefer, AGH University of Science and Technology,
Emerging Research Dimensions in IT Security Dr. Salar H. Naqvi Senior Member IEEE Research Fellow, CoreGRID Network of Excellence European.
The Role of Modeling in Systems Integration and Business Process Analysis © Sparx Systems Pty Ltd 2011 Ben Constable Sparx Systems.
SS ZG653Second Semester, Topic Architectural Patterns Pipe and Filter.
WORKFLOWS IN CLOUD COMPUTING. CLOUD COMPUTING  Delivering applications or services in on-demand environment  Hundreds of thousands of users / applications.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
GMD German National Research Center for Information Technology Innovation through Research Jörg M. Haake Applying Collaborative Open Hypermedia.
EUROPEAN UNION Polish Infrastructure for Supporting Computational Science in the European Research Space Cracow Grid Workshop’10 Kraków, October 11-13,
Ch4: Distributed Systems Architectures. Typically, system with several interconnected computers that do not share clock or memory. Motivation: tie together.
SOA, BPM, BPEL, jBPM.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
Framework: ISA-95 WG We are here User cases Studies
Managing Service Metadata as Context The 2005 Istanbul International Computational Science & Engineering Conference (ICCSE2005) Mehmet S. Aktas
WP-8, ZIB WP-8: Data Handling And Visualization Review Meeting Report Felix Hupfeld, Andrei Hutanu, Andre Merzky, Thorsten Schütt, Brygg Ullmer Zuse-Institute-Berlin.
CONTENTS Arrival Characters Definition Merits Chararterstics Workflows Wfms Workflow engine Workflows levels & categories.
Dynamic Reconfiguration Dynamic selection of handler functionality: currently through use of parameterizable handlers or by selecting from a set of existing.
Miguel Branco CERN/University of Southampton Enabling provenance on large-scale e-Science applications.
Chapter 1: Overview of Workflow Management Dr. Shiyong Lu Department of Computer Science Wayne State University.
Peter Bajcsy, Rob Kooper, Luigi Marini, Barbara Minsker and Jim Myers National Center for Supercomputing Applications (NCSA) University of Illinois at.
FI-CORE Data Context Media Management Chapter Release 4.1 & Sprint Review.
RELATIONAL FAULT TOLERANT INTERFACE TO HETEROGENEOUS DISTRIBUTED DATABASES Prof. Osama Abulnaja Afraa Khalifah
Week 5 Lecture Distributed Database Management Systems Samuel ConnSamuel Conn, Asst Professor Suggestions for using the Lecture Slides.
© DATAMAT S.p.A. – Giuseppe Avellino, Stefano Beco, Barbara Cantalupo, Andrea Cavallini A Semantic Workflow Authoring Tool for Programming Grids.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Objectives Functionalities and services Architecture and software technologies Potential Applications –Link to research problems.
L6-S1 UML Overview 2003 SJSU -- CmpE Advanced Object-Oriented Analysis & Design Dr. M.E. Fayad, Professor Computer Engineering Department, Room #283I College.
NA-MIC National Alliance for Medical Image Computing UCSD: Engineering Core 2 Portal and Grid Infrastructure.
1 BRUSSELS - 14 July 2003 Full Security Support in a heterogeneous mobile GRID testbed for wireless extensions to the.
Replica Consistency in a Data Grid1 IX International Workshop on Advanced Computing and Analysis Techniques in Physics Research December 1-5, 2003 High.
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Cooperative experiments in VL-e: from scientific workflows to knowledge sharing Z.Zhao (1) V. Guevara( 1) A. Wibisono(1) A. Belloum(1) M. Bubak(1,2) B.
Enabling Grids for E-sciencE Astronomical data processing workflows on a service-oriented Grid architecture Valeria Manna INAF - SI The.
Data Integration Hanna Zhong Department of Computer Science University of Illinois, Urbana-Champaign 11/12/2009.
CASE (Computer-Aided Software Engineering) Tools Software that is used to support software process activities. Provides software process support by:- –
Architecture View Models A model is a complete, simplified description of a system from a particular perspective or viewpoint. There is no single view.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
Towards an IoT Ecosystem Flavia C. Delicato 1, Paulo F. Pires 1, Thais Batista 2, Everton Cavalcante 2, Bruno Costa 1, Thomaz Barros 1 1 Department of.
Collaborative Scientific Visualization: from your lab to Internet2 and beyond Matthew Wolf College of Computing Georgia Institute of Technology
March 2004 At A Glance The AutoFDS provides a web- based interface to acquire, generate, and distribute products, using the GMSEC Reference Architecture.
Collection and storage of provenance data Jakub Wach Master of Science Thesis Faculty of Electrical Engineering, Automatics, Computer Science and Electronics.
Wide Area Grid – Technical Requirements Paul Kopp.
Cyberinfrastructure Overview of Demos Townsville, AU 28 – 31 March 2006 CREON/GLEON.
National Aeronautics and Space Administration Jet Propulsion Laboratory March 17, 2009 Workflow Orchestration: Conducting Science Efficiently on the Grid.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
Data Grids, Digital Libraries and Persistent Archives: An Integrated Approach to Publishing, Sharing and Archiving Data. Written By: R. Moore, A. Rajasekar,
Business process management (BPM)
CIM Modeling for E&U - (Short Version)
Business process management (BPM)
Similarities between Grid-enabled Medical and Engineering Applications
The 2007 Winter Conference on Business Intelligence
University of Technology
Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering
Data Path through host/ANP.
1st International Conference on Semantics, Knowledge and Grid
Software Development Process Using UML Recap
Presentation transcript:

An Overview of Scientific Workflows: Domains & Applications Laboratoire Lorrain de Recherche en Informatique et ses Applications Presented by Khaled Gaaloul Environments COOperation

Plan 1 I. Context & Problematic II. State of Art III. In Progress IV. Conclusion & Perspectives

I. Context & Problematic

2 Context: Scientific applications Need of WFMS for the orchestration and optimization of the scientific endeavors. Collecting, generating and analyzing of a large data flow Need of mechanisms supporting interactions between heterogeneous applications Context & Problématic State of ArtIn Progress Conclusion & Perspectives

3 Context: Scientific applications integration Context & Problematic State of ArtIn progress Conclusion & Perspectives Step1 Step2 ANDAND Labo.2 Labo.3 Labo.4 Definition & specification of processes Data flow managing Process orchestration Step5 Step4 Step3 Step6 XORXOR ANDAND Labo.1 Dynamic Scheduling of a Scientific Process

4 Prerequisites for scientific applications High flexibility degree High-performance for resources distribution Workflow ad hoc architecture: moving and hierarchical Data flow Management: - Automate data streaming - Enriching the semantic level - Documentation & reutilisability Context & Problematic State of ArtIn progress Conclusion & Perspectives

5 Problematic: How to optimize and orchester scientific processes execution? Problems in managing shared resources: heterogeneous environment, virtual organizations (VO), etc. Moving Applications: Non-determinism aspect Current approaches: lack of reutilisability and documentations, business process oriented Evolution format within data exchanges Context & Problematic State of ArtIn progress Conclusion & Perspectives

6 Problematic: New requirements Context & Problematic State of ArtIn progress Conclusion & Perspectives Designers Step1 Step2 ANDAND Step5 Step4 Step3 Step6 XORXOR ANDAND sub process1 sub process2 sub process3 To deal with heterogeneity To deal with data exchange

State of Art

7 Scientific workflow Definition: the application of workflow technology to scientific endeavors, recognized as a valuable approach for assisting scientists in accessing and analyzing data. Features: - Support for large data flows; - Dynamic environment; - Incomplete workflow: partial definition; - Ad hoc planning; - Reutilisabilty, documentation, etc. Context & ProblematicState of ArtIn progress Conclusion & Perspectives ScientificWorkflow GRID PBIO

8 Scientific Workflow Scientific domain: dedicated to the data flow managing More dynamic: non predefined workflow Traceability and documentation: enriching the semantic level within data exchanges Business Workflow Business domain: dedicated to the processes managing and optimization Lot of constraints: predefined workflow, satisfying end, execution constraints, etc. Lack of formalism: Syntactic level Context & ProblematicState of ArtIn progress Conclusion & Perspectives Scientific Workflow GRID PBIO

9 Scientific Workflow Vs Business Workflow Context & ProblematicState of ArtIn progress Conclusion & Perspectives Scientific Workflow GRID PBIO

10 Solution for intensive computing Virtual organization (VO) - including different users committees - sharing global resources (storing, processing) - Strong impact on organization structure, networks, security Context & ProblematicState of ArtIn progress Conclusion & Perspectives ScientificWorkflow GRID PBIO GRID (Globalization of Informatics' Resources and Data)

11 GridFlow (1) : GRID and Workflow? GRID complexity - Virtual organization - Needs of visualization, managing, and simulation WfMS as a Grid service - Transparent access to one or many GRID regrouping heterogeneous machines - Portals for users Context & ProblematicState of ArtIn progress Conclusion & Perspectives Scientific Workflow GRID PBIO

12 GridFlow (2): Architecture Context & ProblematicState of ArtIn progress Conclusion & Perspectives Scientific Workflow GRID PBIO

13 PBIO: or how to deal with format evolution? Heterogeneous environment, ad hoc solutions - Data exchanges and complex communication - Format evolution: lack of standardization of data streaming PBIO (Portable Binary Input/Output) - Approach to deal with binary data in storage and transmission - Record oriented binary communication mechanism - Data meta-representation - Optimizing data storage/transmission - Improving the communication between processes Context & ProblematicState of ArtIn progress Conclusion & Perspectives Scientific Workflow GRID PBIO

In Progress

14 Cooperative processes for scientific workflows Cooperation between applications - Applications more flexible - Working and communicating within the same virtual space of work - Doing common tasks in synchronous or asynchronous way BONITA: a flexible system for cooperative workflow - Define, specify, execute, and coordinate different flows of work - Based on the anticipating model - Ensure an interface for the modeling and the visualization of the processes - Managing flexible data Context & ProblematicState of ArtIn progress Conclusion & Perspectives

15 Motivating Example: Numerizing scenario Context & ProblematicState of ArtIn progress Conclusion & Perspectives

16 Deploying the scenario into Bonita Enhance execution flexibility Anticipation: process optimizing Context & ProblematicState of ArtIn progress Conclusion & Perspectives

17 Mapping Data-Intensive Science into BONITA Considerable data flows Goal: Optimize the data streaming & enhance the data exchange mechanism Context & ProblematicState of ArtIn progress Conclusion & Perspectives Data flow computing

17 Discussions Existing approach: Flow-Based Programming (FBP) - A new/old approach to scientific application development - Data flow Vs. Workflow: which one fit to us? - Anticipating an activity, is it possible with a partial result? PBIO implementation - Interactivity with Bonita services call - Need of middleware like Echo Event to support messages exchange - Portability of the PBIO approach for existing platforms Context & ProblematicState of ArtIn progress Conclusion & Perspectives

Conclusions: Cooperative aspect for scientific applications Combining strong concepts (GRID & workflows) Developing a new middleware for scientific process Perspectives: Application onto the GRID: Bonita as a GRID service Adding Non intrusive and user friendly aspects Collaboration with AURARYD on others scenarios (Volkswagen, BP) 18 Context & ProblematicState of ArtIn progress Conclusion & Perspectives