Adaptive Computing on the Grid Using AppLeS Francine Berman, Richard Wolski, Henri Casanova, Walfredo Cirne, Holly Dail, Marcio Faerman, Silvia Figueira,

Presentation transcript:

Adaptive Computing on the Grid Using AppLeS
Francine Berman, Richard Wolski, Henri Casanova, Walfredo Cirne, Holly Dail, Marcio Faerman, Silvia Figueira, Jim Hayes, Graziano Obertelli, Jennifer Schopf, Gary Shao, Shava Smallen, Neil Spring, Alan Su, and Dmitrii Zagorodnov
IEEE Transactions on Parallel and Distributed Systems, Vol. 14, No. 5, May 2003

Agenda
Introduction
Problems
AppLeS and its components
Result products
Related works
Discussions
Conclusions

Introduction
What is a Grid?
  – A collection of resources that can be used as an ensemble
What are resources?
  – Computational devices, networks, online instruments, storage archives, etc.

Problems
Heterogeneity
  – Different performance
Inconsistency
  – Resources are shared
  – Resources may fail
  – Resources may be upgraded

AppLeS Project
Application-Level Scheduling
Goals
  – Investigate adaptive scheduling for Grid computing
  – Apply research results to applications, to validate the efficacy of the approach and to extract Grid performance for the end user

Steps
(1) Resource Discovery
(2) Resource Selection
(3) Schedule Generation
(4) Schedule Selection
(5) Application Execution
(6) Schedule Adaptation
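The six steps can be read as a loop in which adaptation feeds back into discovery. The following is a minimal sketch of that reading, not the AppLeS implementation: every callable (`discover`, `select`, `generate`, `estimate`, `execute`) and the `max_rounds` limit are hypothetical placeholders that the caller supplies.

```python
def adaptive_schedule(discover, select, generate, estimate, execute, max_rounds=10):
    """Run the six-step cycle until the application reports completion.

    All five callables are hypothetical hooks supplied by the caller:
      discover() -> list of resources
      select(resources) -> iterable of candidate resource sets
      generate(resource_set) -> schedule
      estimate(schedule) -> predicted execution time
      execute(schedule) -> True once the application has finished
    """
    chosen = []
    for _ in range(max_rounds):
        resources = discover()                          # (1) resource discovery
        candidates = select(resources)                  # (2) resource selection
        schedules = [generate(c) for c in candidates]   # (3) schedule generation
        best = min(schedules, key=estimate)             # (4) schedule selection
        chosen.append(best)
        if execute(best):                               # (5) application execution
            break
        # (6) schedule adaptation: loop again with fresh resource information
    return chosen
```

Passing the steps in as functions keeps the sketch independent of any particular Grid or application, which mirrors the idea that the scheduler is customized per application.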

Resource Discovery
Depends on the Grid
  – A list of the user’s logins
  – Resource discovery services of each Grid

Resource Selection
Simple SARA
  – Synthetic Aperture Radar Atlas
  – Developed by JPL and SDSC
  – Provides access to satellite images distributed across various repositories
  – End-to-end available bandwidth is predicted using NWS
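As an illustration of bandwidth-driven selection, the sketch below picks the repository with the highest predicted bandwidth and estimates the transfer time. It assumes the forecasts are already available as a plain dictionary. It does not call any real NWS interface, and the function name, units, and server names are hypothetical.

```python
def pick_replica(file_size_mb, predicted_bandwidth_mbps):
    """Choose the server with the best predicted end-to-end bandwidth.

    file_size_mb: size of the requested image in megabytes (assumed unit).
    predicted_bandwidth_mbps: {server: forecast in megabits per second};
        in this sketch the forecasts are just inputs, however they were obtained.
    Returns (server, estimated transfer time in seconds).
    """
    # Ignore servers whose forecast says no bandwidth is available.
    usable = {s: bw for s, bw in predicted_bandwidth_mbps.items() if bw > 0}
    if not usable:
        raise ValueError("no reachable server")
    server = max(usable, key=usable.get)
    seconds = file_size_mb * 8 / usable[server]  # megabytes -> megabits
    return server, seconds


if __name__ == "__main__":
    forecasts = {"repo-a": 10.0, "repo-b": 40.0, "repo-c": 0.0}
    print(pick_replica(100, forecasts))
```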

Performance Modeling
Jacobi 2D
Main loop
  – Loop until convergence
  – For all matrix entries $A_{i,j}$: $A_{i,j} = \frac{1}{4}\,(A_{i,j} + A_{i+1,j} + A_{i-1,j} + A_{i,j+1} + A_{i,j-1})$
  – Compute local error
[Figure: five-point stencil around entry (i,j) with neighbors (i-1,j), (i+1,j), (i,j-1), (i,j+1)]
Model
  – $T_i = \mathrm{Area}_i \cdot \mathrm{Oper}_i \cdot \mathrm{AvailCPU}_i + C_i$, for $1 \le i \le p$
  – Area: the size of the strip
  – Oper: execution time to compute one entry
  – AvailCPU: percentage of available CPU
  – C: communication time
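One way such a model can drive a schedule is time balancing: pick the strip sizes so that every processor is predicted to finish at the same time. The sketch below is an assumption about how that could be done, not code from the paper. The per-entry factors of the model (Oper and AvailCPU) are folded into one hypothetical parameter, `cost_per_entry[i]`, so that $T_i = \mathrm{Area}_i \cdot k_i + C_i$.

```python
def balance_strips(total_area, cost_per_entry, comm_time):
    """Split total_area into strips with equal predicted finish time.

    Solves Area_i * k_i + C_i = T for every processor i, subject to
    sum(Area_i) == total_area, where k_i = cost_per_entry[i] (hypothetical
    per-entry cost) and C_i = comm_time[i].
    Returns (T, [Area_1, ..., Area_p]). Areas are real-valued; a negative
    area means that processor's communication cost alone exceeds T and it
    should be dropped before re-solving.
    """
    inverse = [1.0 / k for k in cost_per_entry]
    finish = (total_area + sum(c * w for c, w in zip(comm_time, inverse))) / sum(inverse)
    areas = [(finish - c) * w for c, w in zip(comm_time, inverse)]
    return finish, areas


if __name__ == "__main__":
    # Two processors, the second twice as slow per entry, no communication cost.
    print(balance_strips(30, [1.0, 2.0], [0.0, 0.0]))
```

With equal finish times no processor sits idle waiting for the slowest strip, which is the point of sizing strips from a per-resource prediction rather than splitting the matrix evenly.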

Schedule Generation
Complib
  – A computational biology application
  – Compares a library of unknown sequences against a database of “known” sequences using the FASTA scoring method
Parallelization
  – Master/Worker
  – Work size
      Small unit size (self-scheduling): high overhead
      Big unit size: load imbalance

AppLeS’s Approach

Schedule Adaptation
MCell
  – A computational neuroscience application
  – Studies biochemical interactions within living cells at the molecular level
  – Multiple independent tasks
  – Shared input

XSufferage
Based on Sufferage
Sufferage value = second-best completion time - best completion time
XSufferage also accounts for data replication time (zero for locally available data)
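The sufferage rule can be sketched as a greedy loop: repeatedly schedule the task that would lose the most if it were denied its best host. The code below is a simplified illustration of plain Sufferage with the data transfer time added to each completion time. It does not reproduce XSufferage itself, and the function name and the input matrices are hypothetical.

```python
def sufferage_schedule(compute_time, transfer_time, host_ready):
    """Greedy Sufferage-style assignment of independent tasks to hosts.

    compute_time[t][h]: predicted run time of task t on host h.
    transfer_time[t][h]: time to move task t's input to host h
        (0 when the data is already available there).
    host_ready[h]: time at which host h becomes free.
    Returns a list of (task, host, finish_time) in scheduling order.
    """
    ready = list(host_ready)
    unscheduled = set(range(len(compute_time)))
    schedule = []
    while unscheduled:
        best = None
        for t in sorted(unscheduled):
            # Completion time of task t on every host, best first.
            completions = sorted(
                (ready[h] + transfer_time[t][h] + compute_time[t][h], h)
                for h in range(len(ready))
            )
            first_time, first_host = completions[0]
            second_time = completions[1][0] if len(completions) > 1 else first_time
            value = second_time - first_time  # sufferage value
            if best is None or value > best[0]:
                best = (value, t, first_host, first_time)
        _, task, host, finish = best
        ready[host] = finish          # the host is busy until the task finishes
        unscheduled.remove(task)
        schedule.append((task, host, finish))
    return schedule
```

For example, with two tasks whose run times are `[[2, 10], [3, 4]]` and no transfer cost, task 0 has the larger sufferage value (8 versus 1), so it claims host 0 first and task 1 then goes to host 1.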

Outcome
APST - AppLeS Parameter Sweep Template
AMWAT - AppLeS Master/Worker Application Template
SA - Supercomputer AppLeS

APST
Parameter Sweep Applications
  – Mostly independent tasks
Provides
  – Transparent deployment
  – Automatic scheduling
Capabilities
  – Launching tasks
  – Moving and storing data
  – Discovering and monitoring resources

AMWAT
Master/Worker
Provides
  – APIs for
      Discovering
      Scheduling
      Predicting
SS - Self-Scheduling
FSC - Fixed Size Chunking
GSS - Guided Self-Scheduling
TSS - Trapezoidal Self-Scheduling
FAC2 - Factoring
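The listed policies differ mainly in how much work the master hands out per request. The sketch below computes chunk-size sequences for four of them, using their commonly cited textbook definitions rather than AMWAT's actual code. TSS is omitted, and all function names are hypothetical.

```python
import math


def fixed_size_chunks(units, size):
    """FSC: every chunk has the same size; size=1 gives plain self-scheduling (SS)."""
    chunks = []
    while units > 0:
        take = min(size, units)
        chunks.append(take)
        units -= take
    return chunks


def guided_chunks(units, workers):
    """GSS: each chunk is the remaining work divided by the number of workers."""
    chunks = []
    while units > 0:
        take = math.ceil(units / workers)
        chunks.append(take)
        units -= take
    return chunks


def factoring_chunks(units, workers):
    """FAC2: hand out half of the remaining work per round, split evenly among workers."""
    chunks = []
    while units > 0:
        size = math.ceil(units / (2 * workers))
        for _ in range(workers):
            if units == 0:
                break
            take = min(size, units)
            chunks.append(take)
            units -= take
    return chunks


if __name__ == "__main__":
    print(fixed_size_chunks(10, 4))
    print(guided_chunks(100, 4))
    print(factoring_chunks(100, 4))
```

Large early chunks keep the number of master requests low, while small late chunks limit the load imbalance at the end of the run.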

SA
Space-shared
Moldable jobs
Reduce response times
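A moldable job can run on several partition sizes, so one way to reduce response time is to compare the candidate sizes by predicted queue wait plus predicted run time. The sketch below illustrates only that comparison. The predictions are hypothetical inputs, and nothing here is taken from the SA implementation.

```python
def pick_partition(options):
    """Choose the partition size with the smallest predicted response time.

    options: {node_count: (predicted_wait, predicted_runtime)}, both in
        seconds; the predictions are assumed to come from elsewhere.
    Returns (node_count, predicted response time).
    """
    nodes = min(options, key=lambda n: sum(options[n]))
    return nodes, sum(options[nodes])


if __name__ == "__main__":
    # A larger request runs faster but waits longer in the queue.
    print(pick_partition({16: (100, 400), 32: (300, 220), 64: (900, 120)}))
```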

Related Works
Environment
  – MARS and Dome - Run-time checkpointing environment
Structure
  – MARS - SPMD
  – VDCE and SEA - Task graph
  – IOS - Real-time, fine-grained, task graph
  – Dome and SPP - Abstract language
      Dome - SPMD
      SPP - Task graph
Performance model
  – Depends on program structure
Objective
  – Minimize execution time

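Related Works
(ChkPnt = run-time checkpointing, TG = task graph)
System | Env    | Struct | Perf       | Approach
AppLeS |        | Any    | Provided   | Adaptive
MARS   | ChkPnt | SPMD   | Statistics | Data Dist
Dome   | ChkPnt | SPMD   |            | Data Dist
VDCE   |        | TG     | Derived    | List Sched
SPP    |        | TG     | Derived    |
SEA    |        | TG     | Data Flow  | Expert Sys
IOS    |        | TG     | Derived    | GA
GrADS  |        |        |            |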

Discussions
The performance of distributed applications depends on both application-specific and platform-specific information
Storage and service are usually separated
Communication must be accounted for in the model
Multi-application environments have not been addressed

Conclusions
AppLeS
  – An application-level scheduling framework
  – Provides adaptive, flexible, and reusable components
  – Is being integrated into GrADS for building next-generation Grid applications
Each component has been shown to yield an improvement