Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute.

Slides:



Advertisements
Similar presentations
BARNALI CHAKRABARTY. What is an Operating System ?
Advertisements

Distributed Systems Major Design Issues Presented by: Christopher Hector CS8320 – Advanced Operating Systems Spring 2007 – Section 2.6 Presentation Dr.
The Anatomy of the Grid: An Integrated View of Grid Architecture Carl Kesselman USC/Information Sciences Institute Ian Foster, Steve Tuecke Argonne National.
High Performance Computing Course Notes Grid Computing.
Condor-G: A Computation Management Agent for Multi-Institutional Grids James Frey, Todd Tannenbaum, Miron Livny, Ian Foster, Steven Tuecke Reporter: Fu-Jiun.
Resource Management of Grid Computing
1 Software & Grid Middleware for Tier 2 Centers Rob Gardner Indiana University DOE/NSF Review of U.S. ATLAS and CMS Computing Projects Brookhaven National.
Introduction and Overview “the grid” – a proposed distributed computing infrastructure for advanced science and engineering. Purpose: grid concept is motivated.
Transparent Process Migration for Distributed Applications in a Beowulf Cluster Mark Claypool and David Finkel Computer Science Department Worcester Polytechnic.
CompuNet Grid Computing Milena Natanov Keren Kotlovsky Project Supervisor: Zvika Berkovich Lab Chief Engineer: Dr. Ilana David Spring, /
Milos Kobliha Alejandro Cimadevilla Luis de Alba Parallel Computing Seminar GROUP 12.
Globus Ian Foster and Carl Kesselman Argonne National Laboratory and University of Southern California
1 GRID D. Royo, O. Ardaiz, L. Díaz de Cerio, R. Meseguer, A. Gallardo, K. Sanjeevan Computer Architecture Department Universitat Politècnica de Catalunya.
NPACI: National Partnership for Advanced Computational Infrastructure August 17-21, 1998 NPACI Parallel Computing Institute 1 Cluster Archtectures and.
GRID COMPUTING & GRID SCHEDULERS - Neeraj Shah. Definition A ‘Grid’ is a collection of different machines where in all of them contribute any combination.
Grid Computing Net 535.
Introduction to Grid Computing Ann Chervenak Carl Kesselman And the members of the Globus Team.
Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.
Grid Toolkits Globus, Condor, BOINC, Xgrid Young Suk Moon.
Yavor Todorov. Introduction How it works OS level checkpointing Application level checkpointing CPR for parallel programing CPR functionality References.
Self Adaptivity in Grid Computing Reporter : Po - Jen Lo Sathish S. Vadhiyar and Jack J. Dongarra.
DISTRIBUTED COMPUTING
Grid Computing - AAU 14/ Grid Computing Josva Kleist Danish Center for Grid Computing
Resource Provisioning based on Lease Preemption in InterGrid Mohsen Amini Salehi, Bahman Javadi, Rajkumar Buyya Cloud Computing and Distributed Systems.
GT Components. Globus Toolkit A “toolkit” of services and packages for creating the basic grid computing infrastructure Higher level tools added to this.
Grid Technologies  Slide text. What is Grid?  The World Wide Web provides seamless access to information that is stored in many millions of different.
The Anatomy of the Grid: An Integrated View of Grid Architecture Ian Foster, Steve Tuecke Argonne National Laboratory The University of Chicago Carl Kesselman.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Evaluation of Agent Teamwork High Performance Distributed Computing Middleware. Solomon Lane Agent Teamwork Research Assistant October 2006 – March 2007.
The Globus Project: A Status Report Ian Foster Carl Kesselman
Ajou University, South Korea GCC 2003 Presentation Dynamic Data Grid Replication Strategy based on Internet Hierarchy Sang Min Park , Jai-Hoon Kim, and.
The Anatomy of the Grid Mahdi Hamzeh Fall 2005 Class Presentation for the Parallel Processing Course. All figures and data are copyrights of their respective.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
1 4/23/2007 Introduction to Grid computing Sunil Avutu Graduate Student Dept.of Computer Science.
Service - Oriented Middleware for Distributed Data Mining on the Grid ,劉妘鑏 Antonio C., Domenico T., and Paolo T. Journal of Parallel and Distributed.
Middleware for Grid Computing and the relationship to Middleware at large ECE 1770 : Middleware Systems By: Sepehr (Sep) Seyedi Date: Thurs. January 23,
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Uni Innsbruck Informatik - 1 Open Issues and New Challenges for End-to-End Transport E2E RG Meeting - July 28/29, MIT, Cambridge MA Michael Welzl
Ames Research CenterDivision 1 Information Power Grid (IPG) Overview Anthony Lisotta Computer Sciences Corporation NASA Ames May 2,
The Grid the united computing power Jian He Amit Karnik.
GRID ARCHITECTURE Chintan O.Patel. CS 551 Fall 2002 Workshop 1 Software Architectures 2 What is Grid ? "...a flexible, secure, coordinated resource- sharing.
More on Adaptivity in Grids Sathish S. Vadhiyar Source/Credits: Figures from the referenced papers.
Authors: Ronnie Julio Cole David
GRID Overview Internet2 Member Meeting Spring 2003 Sandra Redman Information Technology and Systems Center and Information Technology Research Center National.
Globus – Part II Sathish Vadhiyar. Globus Information Service.
GridLab Resource Management System (GRMS) Jarek Nabrzyski GridLab Project Coordinator Poznań Supercomputing and.
Introduction to Grids By: Fetahi Z. Wuhib [CSD2004-Team19]
1 ©2004 Board of Trustees of the University of Illinois Computer Science Overview Laxmikant (Sanjay) Kale ©
1 Observations on Architecture, Protocols, Services, APIs, SDKs, and the Role of the Grid Forum Ian Foster Carl Kesselman Steven Tuecke.
7. Grid Computing Systems and Resource Management
International Symposium on Grid Computing (ISGC-07), Taipei - March 26-29, 2007 Of 16 1 A Novel Grid Resource Broker Cum Meta Scheduler - Asvija B System.
Globus and PlanetLab Resource Management Solutions Compared M. Ripeanu, M. Bowman, J. Chase, I. Foster, M. Milenkovic Presented by Dionysis Logothetis.
Scheduling MPI Workflow Applications on Computing Grids Juemin Zhang, Waleed Meleis, and David Kaeli Electrical and Computer Engineering Department, Northeastern.
Securing the Grid & other Middleware Challenges Ian Foster Mathematics and Computer Science Division Argonne National Laboratory and Department of Computer.
GRID ANATOMY Advanced Computing Concepts – Dr. Emmanuel Pilli.
An approach to Web services Management in OGSA environment By Shobhana Kirtane.
The Globus Toolkit The Globus project was started by Ian Foster and Carl Kesselman from Argonne National Labs and USC respectively. The Globus toolkit.
- DAG Scheduling with Reliability - - GridSolve - - Fault Tolerance In Open MPI - Asim YarKhan, Zhiao Shi, Jack Dongarra VGrADS Workshop April 2007.
The EPIKH Project (Exchange Programme to advance e-Infrastructure Know-How) gLite Grid Introduction Salma Saber Electronic.
INTRODUCTION TO HIGH PERFORMANCE COMPUTING AND TERMINOLOGY.
Clouds , Grids and Clusters
Globus —— Toolkits for Grid Computing
Grid Computing.
University of Technology
Distributed System Concepts and Architectures
Grid Computing B.Ramamurthy 9/22/2018 B.Ramamurthy.
CSS490 Grid Computing Textbook No Corresponding Chapter
Resource and Service Management on the Grid
The Anatomy and The Physiology of the Grid
The Anatomy and The Physiology of the Grid
Presentation transcript:

Computer Science Department 1 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute

Computer Science Department 2 References “The Anatomy of the Grid”, Ian Foster, Carl Kesselman, Steven Tuccke, International Journal of Supercomputer Applications, 2001 “A Performance Oriented Migration Framework for the Grid”, Satish S. Vadhiyar and Jack J. Dongarra, Proceedings of CCGrid 2003, Third IEEE/ACM International Symposium on Cluster Computing and the Grid Innumerable papers by PEDS members Finkel, Wills and Finkel, and Claypool and Finkel, with additional co- authors.

Computer Science Department 3 What is the Grid? (Foster et al paper) Distributed computing infrastructure for advanced science and engineering Runs over the Internet, potentially world- wide Several approaches have emerged: Paper discusses Globus Toolkit

Computer Science Department 4 The Grid Concept Coordinated resource sharing and problem solving in dynamic, multi-institutional virtual organizations. Highly controlled, with resource providers and consumers defining what is shared and the conditions of sharing. Issues to address: Protocols, privacy, security, costs, …

Computer Science Department 5 Related approaches Application Service Providers Storage Service Providers CORBA DCE Volunteer Computing home, Distriblets, SLINC)

Computer Science Department 6

7 Fabric Layer Provides access and control to resources Resources: Computational, storage, network Enquiry functions: to determine characteristics and state of a resource Management functions: Start, stop computations, reserve bandwidth

Computer Science Department 8 Collective Layer Protocols and services not associated with a particular resource –Directory services for discovery of resources –Co-allocation, scheduling, brokering –Monitoring the Virtual Organization for failure, intrusion detection, etc.

Computer Science Department 9 Load Sharing - Overview Transferring work from a heavily loaded node to a lightly loaded node Purpose: To improve application performance Transferring processes not suitable for fine- grain parallelism Also known as: Load Balancing, Process Migration.

Computer Science Department 10 Load Sharing Issues Criteria for heavily-loaded, lightly loaded Measuring load (policy, implementation) Exchanging information about load, state Which jobs to transfer When to transfer (new processes only, already-running processes)

Computer Science Department 11 Load Sharing in the Grid “A Performance Oriented Migration Framework for the Grid”, Vadhiyar and Donngarra Part of the GrADS project – Grid Application Development System – based at Univ. of Tennessee and other institutions Designed for long-running computations

Computer Science Department 12 Load Sharing in the Grid - 2 Basic idea – the load sharing system can run a performance model of a computation to estimate running time and resource requirements. Application programmer is responsible for providing performance model for the application, and hooks to stop application, checkpoint state, and re-start application. Based on MPI Programming Library, Globus Toolkit

Computer Science Department 13 Load Sharing in the Grid - 3 Before application begins, Application Manager runs performance model to predict execution times, number of processors. Determines whether an appropriate set of processors is available, schedules jobs Monitors process of application as it runs

Computer Science Department 14

Computer Science Department 15 Load Sharing in the Grid - 4 Load sharing can occur if –Application progress is delayed –Additional resources become available App Manager sends message to application so it will –Checkpoint –Stop computation Re-start on new collection of nodes

Computer Science Department 16

Computer Science Department 17 Research Directions Load sharing on the Grid: –There’s a large body of pre-Grid research of load balancing in distributed systems –Can the results of this research be used to design load balancing systems for the Grid

Computer Science Department 18 Load Balancing and Grid Computing David Finkel Computer Science Department Worcester Polytechnic Institute