Pegasus on the Virtual Grid: A Case Study of Workflow Planning over Captive Resources Yang-Suk Kee, Eun-Kyu Byun, Ewa Deelman, Kran Vahi, Jin-Soo Kim Oracle.

Slides:



Advertisements
Similar presentations
Gfarm v2 and CSF4 Osamu Tatebe University of Tsukuba Xiaohui Wei Jilin University SC08 PRAGMA Presentation at NCHC booth Nov 19,
Advertisements

A Workflow Engine with Multi-Level Parallelism Supports Qifeng Huang and Yan Huang School of Computer Science Cardiff University
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
A Dynamic World, what can Grids do for Multi-Core computing? Daniel Goodman, Anne Trefethen and Douglas Creager
The ADAMANT Project: Linking Scientific Workflows and Networks “Adaptive Data-Aware Multi-Domain Application Network Topologies” Ilia Baldine, Charles.
Ewa Deelman, Integrating Existing Scientific Workflow Systems: The Kepler/Pegasus Example Nandita Mangal,
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
23/04/2008VLVnT08, Toulon, FR, April 2008, M. Stavrianakou, NESTOR-NOA 1 First thoughts for KM3Net on-shore data storage and distribution Facilities VLV.
A Grid Parallel Application Framework Jeremy Villalobos PhD student Department of Computer Science University of North Carolina Charlotte.
Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.
Pegasus: Mapping complex applications onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
Workload Management Massimo Sgaravatto INFN Padova.
A Brief Overview by Aditya Dutt March 18 th ’ Aditya Inc.
CONDOR DAGMan and Pegasus Selim Kalayci Florida International University 07/28/2009 Note: Slides are compiled from various TeraGrid Documentations.
E-Science Workflow Support with Grid-Enabled Microsoft Project Gregor von Laszewski and Leor E. Dilmanian, Rochester Institute of Technology Abstract von.
The Grid is a complex, distributed and heterogeneous execution environment. Running applications requires the knowledge of many grid services: users need.
Workflow Systems for LQCD SciDAC LQCD Software meeting, Boston, Feb 2008 Fermilab, IIT, Vanderbilt.
Copyright © Clifford Neuman and Dongho Kim - UNIVERSITY OF SOUTHERN CALIFORNIA - INFORMATION SCIENCES INSTITUTE Advanced Operating Systems Lecture.
Cluster Reliability Project ISIS Vanderbilt University.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Combining the strengths of UMIST and The Victoria University of Manchester Utility Driven Adaptive Workflow Execution Kevin Lee School of Computer Science,
1 A Framework for Data-Intensive Computing with Cloud Bursting Tekin Bicer David ChiuGagan Agrawal Department of Compute Science and Engineering The Ohio.
Pegasus-a framework for planning for execution in grids Ewa Deelman USC Information Sciences Institute.
Pegasus: Planning for Execution in Grids Ewa Deelman Information Sciences Institute University of Southern California.
Scientific Workflow Scheduling in Computational Grids Report: Wei-Cheng Lee 8th Grid Computing Conference IEEE 2007 – Planning, Reservation,
Dr. Ahmed Abdeen Hamed, Ph.D. University of Vermont, EPSCoR Research on Adaptation to Climate Change (RACC) Burlington Vermont USA MODELING THE IMPACTS.
Pegasus: Mapping Scientific Workflows onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
Condor Week 2005Optimizing Workflows on the Grid1 Optimizing workflow execution on the Grid Gaurang Mehta - Based on “Optimizing.
Workflow Project Status Update Luciano Piccoli - Fermilab, IIT Nov
Combining the strengths of UMIST and The Victoria University of Manchester Adaptive Workflow Processing and Execution in Pegasus Kevin Lee School of Computer.
A Hierarchical MapReduce Framework Yuan Luo and Beth Plale School of Informatics and Computing, Indiana University Data To Insight Center, Indiana University.
 Apache Airavata Architecture Overview Shameera Rathnayaka Graduate Assistant Science Gateways Group Indiana University 07/27/2015.
Pegasus: Running Large-Scale Scientific Workflows on the TeraGrid Ewa Deelman USC Information Sciences Institute
Pegasus: Mapping complex applications onto the Grid Ewa Deelman Center for Grid Technologies USC Information Sciences Institute.
July 11-15, 2005Lecture3: Grid Job Management1 Grid Compute Resources and Job Management.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
Review of Condor,SGE,LSF,PBS
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Experiment Management from a Pegasus Perspective Jens-S. Vöckler Ewa Deelman
San Diego Supercomputer Center National Partnership for Advanced Computational Infrastructure San Diego Supercomputer Center National Partnership for.
Planning Ewa Deelman USC Information Sciences Institute GriPhyN NSF Project Review January 2003 Chicago.
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Xi He Golisano College of Computing and Information Sciences Rochester Institute of Technology Rochester, NY THERMAL-AWARE RESOURCE.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
SHIWA and Coarse-grained Workflow Interoperability Gabor Terstyanszky, University of Westminster Summer School Budapest July 2012 SHIWA is supported.
Funded by the NSF OCI program grants OCI and OCI Mats Rynge, Gideon Juve, Karan Vahi, Gaurang Mehta, Ewa Deelman Information Sciences Institute,
1 Pegasus and wings WINGS/Pegasus Provenance Challenge Ewa Deelman Yolanda Gil Jihie Kim Gaurang Mehta Varun Ratnakar USC Information Sciences Institute.
© Geodise Project, University of Southampton, Workflow Support for Advanced Grid-Enabled Computing Fenglian Xu *, M.
1 USC Information Sciences InstituteYolanda Gil AAAI-08 Tutorial July 13, 2008 Part IV Workflow Mapping and Execution in Pegasus (Thanks.
Managing LIGO Workflows on OSG with Pegasus Karan Vahi USC Information Sciences Institute
CMS Experience with the Common Analysis Framework I. Fisk & M. Girone Experience in CMS with the Common Analysis Framework Ian Fisk & Maria Girone 1.
VgES Version 0.7 Release Overview UCSD VGrADS Team Andrew A. Chien, Henri Casanova, Yang-suk Kee, Jerry Chou, Dionysis Logothetis, Richard.
Resource Allocation and Scheduling for Workflows Gurmeet Singh, Carl Kesselman, Ewa Deelman.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Building on virtualization capabilities for ExTENCI Carol Song and Preston Smith Rosen Center for Advanced Computing Purdue University ExTENCI Kickoff.
VGrADS and GridSolve Asim YarKhan Jack Dongarra, Zhiao Shi, Fengguang Song Innovative Computing Laboratory University of Tennessee VGrADS Workshop – September.
INTRODUCTION TO XSEDE. INTRODUCTION  Extreme Science and Engineering Discovery Environment (XSEDE)  “most advanced, powerful, and robust collection.
VGES Demonstrations Andrew A. Chien, Henri Casanova, Yang-suk Kee, Richard Huang, Dionysis Logothetis, and Jerry Chou CSE, SDSC, and CNS University of.
Lessons from LEAD/VGrADS Demo Yang-suk Kee, Carl Kesselman ISI/USC.
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Pegasus WMS Extends DAGMan to the grid world
SC’07 Demo Draft VGrADS Team June 2007.
Cloudy Skies: Astronomy and Utility Computing
Wide Area Workload Management Work Package DATAGRID project
rvGAHP – Push-Based Job Submission Using Reverse SSH Connections
A General Approach to Real-time Workflow Monitoring
Frieda meets Pegasus-WMS
GGF10 Workflow Workshop Summary
Presentation transcript:

Pegasus on the Virtual Grid: A Case Study of Workflow Planning over Captive Resources Yang-Suk Kee, Eun-Kyu Byun, Ewa Deelman, Kran Vahi, Jin-Soo Kim Oracle US Inc Korea Advanced Institute of Science and Technology Information Sciences Institute/University of Southern California Sungkyunkwan University

Overview  Motivation  Background – Pegasus – Virtual Grid  Pegasus-VG Proxy  Conclusion  Discussion

Motivation  Challenges in scientific application development – Data/control flow, task scheduling, data replication, fault-tolerance, etc  Challenges in resource management – Availability, performance, cost, reliability, fault- tolerance, etc  How to leverage existing cyber infrastructures for easy and efficient scientific computing?

Separations of Concerns  Application domain – Workflow management: application management can be conducted independently of target execution environments. – E.g.) Pegasus, Askalon, Triana  Resource domain – Resource provisioning: resource management can be encapsulated underneath abstractions or virtualizations – E.g.) Virtual Grid, virtual cluster, cloud

Workflow planning and execution over provisioned resources

Pegasus  A framework for workflow planning and execution  Workflow lifecycle – Design: describe the data/control flows of application via an abstract workflow – Planning: map the workflow tasks onto physical resources – Execution: schedule and run the workflow tasks on the mapped resources

Pegasus Workflow Management Pegasus mapper Condor DAGman Condor Computing environment Monitoring Information provenance Pegasus Executable workflow tasks Monitoring Information provenance Abstract workflow Condor pool

Virtual Grid  A programmable virtualized resource provisioning framework  Components – vgDL (Virtual Grid Description Language)  Specifies resource requirements – vgES (Virtual Grid Execution System)  Compiles and coordinates resources – PC (Personal Cluster)  Provides uniform job management

Timeshare A BC D Application Virtual Grid Resource Abstraction Virtual Grid Resource Abstraction VG Timeshare Lease Batch VG PBS P4 VGDL vgdl=clusterof (node) [2] { node = [Processor==“P4”] } program run AB C D ClassificationSelectionBindingEnvironment ok

Pegasus on Virtual Grid  Scope – A basic integration for workflow planning and execution over provisioned resources  Issues – Resource capacity estimation  Resource specification (vgDL) synthesis for Virtual Grid – Resource information publication  Site catalog generation for Pegasus

Resource Capacity Estimation  What Virtual Grid expects from Pegasus – vgDL description  Available information – Task execution time, data transfer time, performance metrics, minimum memory capacity, cost, deadline, etc  Unknown information – # of virtual processors  Resource capacity estimate – Minimize the # of processors that can execute a workflow within a deadline

BTS (Balanced Time Scheduling) Ref: E-science’08 E.-K. Byun, Y.-S. Kee et. al ID ET Time p1 p2 How many processors do we need to run this workflow within 7 units?

Example  Execution time of each task - Xeon processor  Data transfer time - network with 1Gbs bandwidth.  Deadline is 1 hour. Diamond = ClusterOf [2] (nd) [, 0:30:00] { nd = [Processor == “Xeon”] } preprocess findrange analyze f.input f.output

Resource Information Publication  What Pegasus expects from Virtual Grid – Site catalog  Virtual Grid – VG instance  Resource information publication – Devirtualize a VG instance and generate a site catalog for Pegasus

Timeshare A BC D Application Virtual Grid Resource Abstraction Virtual Grid Resource Abstraction VG Timeshare Lease Batch VG PBS P4 VGDL vgdl=clusterof (node) [2] { node = [Processor==“P4”] } program run AB C D ClassificationSelectionBindingEnvironment ok

Personal Cluster  A partition of resources dedicated to a user under the control of a user-level resource manager during a limited time period GT4/PBS Ref: HCW’08 Y.-S. Kee and C. Kesselman

Site Catalog Publication … /home/globus/pegasus gt4 PBS $HOME/workdir …

Workflow Planning over Provisioned Resources Creation Planning Scheduling/ Execution A BC D CC A BC D CC Executable workflow Abstract workflow BTS VG Virtual Grid VGDL Devirtualization Site catalog vgdl = ClusterOf (nd) [2] { nd = [Proc==“Xeon”] } GT4+PBS PegasusVG-Pegasus Proxy

Conclusion  Pegasus on Virtual Grid – Implements workflow planning and execution over on-demand captive resources – Enables easy and efficient application development and execution  Issues – Resource capacity estimation – Site catalog publication

Discussion  Effective performance – What is the cost that a user has to pay to have a successful execution?  Ongoing studies – Find-grain planning for resource provisioning  Performance, cost, reliability – Workflow execution for virtualization  Recovery of failed tasks

Need More Information?  Pegaus –  VGrADS – Tuesday, 11:30am, RENCI booth (2633) – Wednesday, noon, GCAS booth (285) – Wednesday, 2:00Pm, SDSC booth (568) – Wednesday, 4:00pm, RENCI booth (2633)

A Q & Q U E S T I O N S A N S W E R S