Workflow management within DIET Raphaël Bolze LIP ENS Lyon, CNRS INRIA Rhône-Alpes, GRAAL project

Slides:



Advertisements
Similar presentations
Design and Evaluation of an Autonomic Workflow Engine Thomas Heinis, Cesare Pautasso, Gustavo Alsonso Dept. of Computer Science Swiss Federal Institute.
Advertisements

1 Senn, Information Technology, 3 rd Edition © 2004 Pearson Prentice Hall James A. Senns Information Technology, 3 rd Edition Chapter 7 Enterprise Databases.
1 Vorlesung Informatik 2 Algorithmen und Datenstrukturen (Parallel Algorithms) Robin Pomplun.
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Distributed Systems Architectures
1 Towards an Open Service Framework for Cloud-based Knowledge Discovery Domenico Talia ICAR-CNR & UNIVERSITY OF CALABRIA, Italy Cloud.
Processes and Operating Systems
1 Copyright © 2013 Elsevier Inc. All rights reserved. Chapter 3 CPUs.
11 Application of CSF4 in Avian Flu Grid: Meta-scheduler CSF4. Lab of Grid Computing and Network Security Jilin University, Changchun, China Hongliang.
Tom Sugden EPCC OGSA-DAI Future Directions OGSA-DAI User's Forum GridWorld 2006, Washington DC 14 September 2006.
Cultural Heritage in REGional NETworks REGNET T1.4: Development of the system specification.
1 Hyades Command Routing Message flow and data translation.
Business Transaction Management Software for Application Coordination 1 Business Processes and Coordination. Introduction to the Business.
18 Copyright © 2005, Oracle. All rights reserved. Distributing Modular Applications: Introduction to Web Services.
11 Copyright © 2005, Oracle. All rights reserved. Creating the Business Tier: Enterprise JavaBeans.
GridPP July 2003Stefan StonjekSlide 1 SAM middleware components Stefan Stonjek University of Oxford 7 th GridPP Meeting 02 nd July 2003 Oxford.
A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
|epcc| NeSC Workshop Open Issues in Grid Scheduling Ali Anjomshoaa EPCC, University of Edinburgh Tuesday, 21 October 2003 Overview of a Grid Scheduling.
Enterprise Java and Data Services Designing for Broadly Available Grid Data Access Services.
Chapter 1 Introduction Copyright © Operating Systems, by Dhananjay Dhamdhere Copyright © Introduction Abstract Views of an Operating System.
Database Systems: Design, Implementation, and Management
Overview Environment for Internet database connectivity
Auto-scaling Axis2 Web Services on Amazon EC2 By Afkham Azeez.
13 Copyright © 2005, Oracle. All rights reserved. Monitoring and Improving Performance.
The Platform as a Service Model for Networking Eric Keller, Jennifer Rexford Princeton University INM/WREN 2010.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 31 Slide 1 Service-centric Software Engineering.
All-in-one graphical tool for grid middleware management Eddy Caron, Abdelkader Amar, Frédéric Desprez, David Loureiro LIP ENS Lyon, INRIA Rhône-Alpes,
DIET Overview and some recent work A middleware for the large scale deployment of applications over the Grid Frédéric Desprez LIP ENS Lyon / INRIA GRAAL.
CS 6143 COMPUTER ARCHITECTURE II SPRING 2014 ACM Principles and Practice of Parallel Programming, PPoPP, 2006 Panel Presentations Parallel Processing is.
Database System Concepts and Architecture
31242/32549 Advanced Internet Programming Advanced Java Programming
Executional Architecture
Global Analysis and Distributed Systems Software Architecture Lecture # 5-6.
Chapter 10: The Traditional Approach to Design
Systems Analysis and Design in a Changing World, Fifth Edition
PSSA Preparation.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 13 Slide 1 Application architectures.
Introduction Peter Dolog dolog [at] cs [dot] aau [dot] dk Intelligent Web and Information Systems September 9, 2010.
Chapter 24 Replication and Mobile Databases Transparencies © Pearson Education Limited 1995, 2005.
From Model-based to Model-driven Design of User Interfaces.
TCP/IP Protocol Suite 1 Chapter 18 Upon completion you will be able to: Remote Login: Telnet Understand how TELNET works Understand the role of NVT in.
GridRPC Sources / Credits: IRISA/IFSIC IRISA/INRIA Thierry Priol et. al papers.
Online Educational Game of Snakes and Ladders -Shalini Pradhan -Manali Joshi -Uttara Paingankar -Seema Joshi.
Task Scheduling and Distribution System Saeed Mahameed, Hani Ayoub Electrical Engineering Department, Technion – Israel Institute of Technology
The new The new MONARC Simulation Framework Iosif Legrand  California Institute of Technology.
McGraw-Hill/Irwin Copyright © 2007 by The McGraw-Hill Companies, Inc. All rights reserved. Chapter 17 Client-Server Processing, Parallel Database Processing,
Architecture overview 6/03/12 F. Desprez - ISC Cloud Context : Development of a toolbox for deploying application services providers with a hierarchical.
Eddy Caron Join work with Jonathan Rouzaud-Cornabas, Frédéric Desprez, Rajesh Palanichamy and the DIET Team Ecole Normale Supérieure de Lyon AVALON Research.
 Cloud computing  Workflow  Workflow lifecycle  Workflow design  Workflow tools : xcp, eucalyptus, open nebula.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Workshop.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
LEGO – Rennes, 3 Juillet 2007 Deploying Gfarm and JXTA-based applications using the ADAGE deployment tool Landry Breuil, Loïc Cudennec and Christian Perez.
DISTRIBUTED COMPUTING
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
1 CCA Meeting, Januray 25th 2007 Supporting the Master-Worker Paradigm in the Common Component Architecture Hinde Lilia Bouziane, Christian Pérez, Thierry.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Jean-Sébastien Gay LIP ENS Lyon, Université Claude Bernard Lyon 1 INRIA Rhône-Alpes GRAAL Research Team Join work with DIET TEAM D istributed I nteractive.
Peer-to-Peer Distributed Shared Memory? Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan/Bretagne France Dagstuhl seminar, October 2003.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
JuxMem: An Adaptive Supportive Platform for Data Sharing on the Grid Gabriel Antoniu, Luc Bougé, Mathieu Jan IRISA / INRIA & ENS Cachan, France Grid Data.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
1 VLDB - Data Management in Grids B. Del-Fabbro, D. Laiymani, J.M. Nicod and L. Philippe Laboratoire d’Informatique de l’Université de Franche-Comté Séoul,
- DAG Scheduling with Reliability - - GridSolve - - Fault Tolerance In Open MPI - Asim YarKhan, Zhiao Shi, Jack Dongarra VGrADS Workshop April 2007.
- Eddy Caron.
Open Source distributed document DB for an enterprise
Replication Middleware for Cloud Based Storage Service
Distributed Systems Bina Ramamurthy 11/30/2018 B.Ramamurthy.
Distributed Systems Bina Ramamurthy 4/22/2019 B.Ramamurthy.
Presentation transcript:

Workflow management within DIET Raphaël Bolze LIP ENS Lyon, CNRS INRIA Rhône-Alpes, GRAAL project

R. Bolze – 19 oct 2006 Edinburgh 2 Introduction Distributed Interactive Engineering Toolbox RPC and grid-computing : gridRPC DIET goals DIET environment & architecture Request management Research topics & features DIET and workflow management Needs Language Architectures Scheduling propose Target applications PipeAlign Docking Robinson Cosmology Current works

D istributed I nteractive E ngineering T oolbox

R. Bolze – 19 oct 2006 Edinburgh 4 RPC and Grid-Computing: GridRPC One simple idea One simple (and efficient) paradigm for grid computing: offering (or leasing) computational power and/or storage capacity through the Internet One simple solution: implementing the RPC programming model over the Grid –Using resources accessible through the network –Mixed parallelism model (data-parallel model at server level and task parallelism between servers) Features needed –Load-balancing (resource localization and performance evaluation, scheduling), –Data and replica management, –Security, –Fault-tolerance, –Interoperability with other systems, –… Design of a standard interface – within the GGF/OGF (GridRPC WG, C. Lee) – forge.gridforum.org/projects/gridrpc-wgwww.ogf.org – Existing implementations: GridSolve, Ninf, DIET, XtremWeb

R. Bolze – 19 oct 2006 Edinburgh 5 RPC and Grid Computing: Grid RPC AGENT(s) S1S2 S3 S4 A, B, C Answer (C) S2 ! Request Op(C, A, B) Client

R. Bolze – 19 oct 2006 Edinburgh DIETs Goals Our goals To develop a toolbox for the deployment of environments using the Application Service Provider (ASP) paradigm with different applications Use as much as possible public domain and standard software To obtain a high performance and scalable environment Implement and validate our more theoretical results Scheduling for heterogeneous platforms, data (re)distribution and replication, performance evaluation, algorithmic for heterogeneous and distributed platforms, … Based on CORBA, NWS, LDAP, and our own software developments CoRI for performance evaluation, FAST CoRI-easy LogService for monitoring, VizDIET for the visualization, GoDIET for the deployment Several applications in different fields (simulation, bioinformatic, cosmological application…) Release 2.1 available on the web Release 2.2 coming soon

R. Bolze – 19 oct 2006 Edinburgh 7 DIET Environment CLIENT Sequential Application Data management Application Parallel Application C C C C C C C C C A A A SSS A SSS A A A A A A A

R. Bolze – 19 oct 2006 Edinburgh 8 DIET Architecture LA MA LA ServerDeamons Master Agent Local Agent Client LA

R. Bolze – 19 oct 2006 Edinburgh 9 Requests Management agent server estimate() { predExecTime(…); } FindServer() Aggregate() { min(…); } Aggregate() { min(…); } bestServer = S3 runService(…);

R. Bolze – 19 oct 2006 Edinburgh 10 Research Topics Scheduling Distributed scheduling Plug-in schedulers Data-management Scheduling of computation requests and links with data-management Replication, data prefetching Deployment Mapping components on available (selected) resources Software platform deployment with or without dynamic connections between components Performance evaluation Application modeling Dynamic information about the platform (network, clusters) Fault Tolerance Failure Detection Application recovery …

Scheduling

R. Bolze – 19 oct 2006 Edinburgh 12 DIET Scheduling SeD level Performance estimation function Estimation metric vector (estVector_t) - dynamic collection of performance estimation values Performance measures available through DIET FAST-NWS performance metrics Time elapsed since the last execution CoRI (Collector of Resource Information) Developer defined values Standard estimation tags for accessing the fields of an estVector_t EST_FREEMEM EST_TCOMP EST_TIMESINCELASTSOLVE EST_FREECPU Aggregation Methods Defining mechanism how to sort SeD responses: associated with the service and defined at SeD level Tunable comparison/aggregation routines for scheduling Priority Scheduler Performs pairwise server estimation comparisons returning a sorted list of server responses; Can minimize or maximize based on SeD estimations and taking into consideration the order in which the request for those performance estimations was specified at SeD level.

R. Bolze – 19 oct 2006 Edinburgh 13 DIET Scheduling Collector of Resource Information (CoRI) CoRI-Easy – provides basic measurements of the environment CoRI Manager – manage the use of different collectors CoRI-Easy Collector FAST Collector CoRI Manager Other Collectors like Ganglia FAST Software

Data management

R. Bolze – 19 oct 2006 Edinburgh 15 Data/replica management Two needs Keep the data in place to reduce the overhead of communications between clients and servers Replicate data whenever possible Two approaches for DIET DTM (LIFC, Besançon) Hierarchy similar to the DIETs one Distributed data manager Redistribution between servers JuxMem (Paris, Rennes) P2P data cache Work done within the GridRPC Working Group (OGF) Relations with workflow management Client A F G Y Server 1 Server 2 X B B B

R. Bolze – 19 oct 2006 Edinburgh 16 Data management with DTM within DIET Persistence at the server level To avoid useless data transfers Intermediate results Between clients and servers Between servers transparent for the client Data Manager/Loc Manager Hierarchy mapped on the DIET one modularity Proposition to the Grid-RPC WG (OGF) Data handles Persistence flag Data management functions

R. Bolze – 19 oct 2006 Edinburgh 17 JUXMEM A peer-to-peer architecture for a data-sharing service in memory Persistence and data coherency mechanism Transparent data localization PARIS project, IRISA, France Peer Firewall Peer TCP/IP HTTP Peer ID Firewall Toolbox for the development of P2P applications Set of protocols One peer Unique ID Several communication protocols (TCP, HTTP, …)

Deployment and visualization

R. Bolze – 19 oct 2006 Edinburgh 19 Deployment Management XML: - Resources - Machines - Storage - DIET hierarchy Distributed deployment of DIET LogServiceGoDIETVizDIET DIET Administration Traces Trace subset Trace Subset

R. Bolze – 19 oct 2006 Edinburgh 20 VizDIET

Workflow management

R. Bolze – 19 oct 2006 Edinburgh 22 Workflow Management : needs ? Workflow representation : Direct Acyclic Graph (DAG) Each vertex is a tasks Each directed edge represents communication between tasks Questions : Ordering problem ? Mapping problem ?

R. Bolze – 19 oct 2006 Edinburgh 23 Workflow Management : goals Goals Build and execute workflow Use different heuristic methods to solve scheduling problems Extensibility to address mutli-workflows submission and large grid platform Manage heterogeneity and variability of environment

R. Bolze – 19 oct 2006 Edinburgh 24 Workflow Management : existing languages ? Workflows languages: No standard (XML, scripts) Exemples : Condor DAGman : script Pegasus : DAX (xml) Taverna : XScuffl (xml) 2 levels of description : Abstract : application description Concrete : execution description

R. Bolze – 19 oct 2006 Edinburgh 25 Workflow Management Workflow description in DIET Xml format DIET profile : problem (id), parameters (in, inout,out) Description of tasks and data dependency

R. Bolze – 19 oct 2006 Edinburgh 26 Workflow Management : architecture 2 Architectures : Meta scheduler in the client side Meta scheduler distributed in the client and in the MA-DAG

R. Bolze – 19 oct 2006 Edinburgh 27 Workflow Management : Meta scheduler : client Architecture 1 : Meta scheduler in the client side Client MA LA SeD

R. Bolze – 19 oct 2006 Edinburgh 28 Workflow management : Meta scheduler : client Disadvantages : No coordination between the different clients Depends on client capability Benefits : More flexible for evolution : Client can use his own algorithm. More scalable, depends on client capability.

R. Bolze – 19 oct 2006 Edinburgh 29 Workflow management Architecture 2 : Meta scheduler distributed in the client and in the MA-DAG Client MA LA SeD MA DAG

R. Bolze – 19 oct 2006 Edinburgh 30 Workflow management - Meta scheduler Base Scheduler : No ranking, respect the topological order of the DAG HEFT heuristic Flexibility : Architecture 1 : Client can have his own schedule No needs to re-build the platform Architecture 2 : Schedulers are define at the compile time. Needs to re-build the platform if some decide the change. Abstract Workflow Scheduler Virtual void execute(); Virtual void reSchedule(); User defined Scheduler Virtual void execute(); Virtual void reSchedule();

Target applications

R. Bolze – 19 oct 2006 Edinburgh 32 Docking Application Detection of protein-protein and protein-DNA interactions. Screening a database containing thousands of proteins for functional sites involved in binding to other proteins, DNA or ligand targets. docking merge params docking

R. Bolze – 19 oct 2006 Edinburgh 33 PipeAlign Application The sequence-to-function relationship can be understood through the analysis of conserved patterns and evolution of protein organization mainly based on amino acid sequence comparisons in the context of the multiple alignments. blastall ballast filtering clustalw normd rascal normd leon normd

R. Bolze – 19 oct 2006 Edinburgh 34 Robinson application This application annotate human genes according to their expression in neurological or muscular tissues, but also to the expression of their homolog other species. extract Build DB blastall

R. Bolze – 19 oct 2006 Edinburgh 35 Cosmology application rollWhiteNoise Grapfic1 Grapfic2 Ramses3D HaloMaker TreeMaker + GalaxyMaker HaloMaker Simulate the evolution of dark matter particles during time to compare it to the real observation. Centre de Recherche en Astronomie de Lyon

Current Work

R. Bolze – 19 oct 2006 Edinburgh 37 Multi-Workflow Deal with multiple workflow submission On-line scheduling, different submission time Implements fair scheduling strategies Implements specific scheduling heuristics Distribute the workflow management ? grid

R. Bolze – 19 oct 2006 Edinburgh 38 Multi-Workflow Simulations Real experiments on Grid5000

R. Bolze – 19 oct 2006 Edinburgh 39 Conclusion DIET Workflow enabled Data management : DTM, JuXMEM Performance information : CoRI, FAST Plugin schedulers Multi-Applications

Questions ?