Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002.

Slides:



Advertisements
Similar presentations
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
Advertisements

Development of Parallel Simulator for Wireless WCDMA Network Hong Zhang Communication lab of HUT.
Performance-responsive Middleware for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.
Evaluation of Advertising Effectiveness Using Agent-Based Modeling and Simulation Junwei Cao Department of Computer Science University of Warwick.
Cracow Grid Workshop, November 5-6, 2001 Towards the CrossGrid Architecture Marian Bubak, Marek Garbacz, Maciej Malawski, and Katarzyna Zając.
Software Tools for Dynamic Resource Management Irina V. Shoshmina, Dmitry Yu. Malashonok, Sergay Yu. Romanov Institute of High-Performance Computing and.
University of Minho School of Engineering Centre Algoritmi Uma Escola a Reinventar o Futuro – Semana da Escola de Engenharia - 24 a 27 de Outubro de 2011.
A Grid Parallel Application Framework Jeremy Villalobos PhD student Department of Computer Science University of North Carolina Charlotte.
GridFlow: Workflow Management for Grid Computing Kavita Shinde.
A Grid Resource Broker Supporting Advance Reservations and Benchmark- Based Resource Selection Erik Elmroth and Johan Tordsson Reporter : S.Y.Chen.
The CrossGrid project Juha Alatalo Timo Koivusalo.
Parallel Programming Models and Paradigms
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
GHS: A Performance Prediction and Task Scheduling System for Grid Computing Xian-He Sun Department of Computer Science Illinois Institute of Technology.
Cross Cluster Migration Remote access support Adianto Wibisono supervised by : Dr. Dick van Albada Kamil Iskra, M. Sc.
Chapter 2 Introduction to Systems Architecture. Chapter goals Discuss the development of automated computing Describe the general capabilities of a computer.
A Load Balancing Framework for Adaptive and Asynchronous Applications Kevin Barker, Andrey Chernikov, Nikos Chrisochoides,Keshav Pingali ; IEEE TRANSACTIONS.
Introduction to Systems Analysis and Design
Self-Organizing Agents for Grid Load Balancing Junwei Cao Fifth IEEE/ACM International Workshop on Grid Computing (GRID'04)
PMIT-6102 Advanced Database Systems
Parallel Computing The Bad News –Hardware is not getting faster fast enough –Too many architectures –Existing architectures are too specific –Programs.
 What is an operating system? What is an operating system?  Where does the OS fit in? Where does the OS fit in?  Services provided by an OS Services.
Self-Organizing Agents for Grid Load Balancing Junwei Cao, Ph.D. Research Scientist Center for Space Research Massachusetts Institute of Technology Cambridge,
ARGONNE  CHICAGO Ian Foster Discussion Points l Maintaining the right balance between research and development l Maintaining focus vs. accepting broader.
Computational Design of the CCSM Next Generation Coupler Tom Bettge Tony Craig Brian Kauffman National Center for Atmospheric Research Boulder, Colorado.
WP9 Resource Management Current status and plans for future Juliusz Pukacki Krzysztof Kurowski Poznan Supercomputing.
1 (1)Cluster computing (2) Grid computing) Part 4 Current trend of parallel processing.
Through the development of advanced middleware, Grid computing has evolved to a mature technology in which scientists and researchers can leverage to gain.
Grid Workload Management & Condor Massimo Sgaravatto INFN Padova.
Performance Model & Tools Summary Hung-Hsun Su UPC Group, HCS lab 2/5/2004.
Jean-Sébastien Gay LIP ENS Lyon, Université Claude Bernard Lyon 1 INRIA Rhône-Alpes GRAAL Research Team Join work with DIET TEAM D istributed I nteractive.
CCGrid 2003, Tokyo, Japan GridFlow: Workflow Management for Grid Computing Junwei Cao ( 曹军威 ) C&C Research Labs, NEC Europe Ltd., Germany Stephen A. Jarvis.
A Survey of Distributed Task Schedulers Kei Takahashi (M1)
Scientific Workflow Scheduling in Computational Grids Report: Wei-Cheng Lee 8th Grid Computing Conference IEEE 2007 – Planning, Reservation,
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
The Grid System Design Liu Xiangrui Beijing Institute of Technology.
Virtual Data Grid Architecture Ewa Deelman, Ian Foster, Carl Kesselman, Miron Livny.
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Perspectives on Grid Technology Ian Foster Argonne National Laboratory The University of Chicago.
Department of Electronic Engineering Challenges & Proposals INFSO Information Day e-Infrastructure Grid Initiatives 26/27 May.
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
Chapter 2 Introduction to Systems Architecture. Chapter goals Discuss the development of automated computing Describe the general capabilities of a computer.
Service-oriented Resource Broker for QoS-Guaranteed in Grid Computing System Yichao Yang, Jin Wu, Lei Lang, Yanbo Zhou and Zhili Sun Centre for communication.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
GSAF: A Grid-based Services Transfer Framework Chunyan Miao, Wang Wei, Zhiqi Shen, Tan Tin Wee.
Motivation: Sorting is among the fundamental problems of computer science. Sorting of different datasets is present in most applications, ranging from.
Domain Decomposition in High-Level Parallelizaton of PDE codes Xing Cai University of Oslo.
Connections to Other Packages The Cactus Team Albert Einstein Institute
Parallelization Strategies Laxmikant Kale. Overview OpenMP Strategies Need for adaptive strategies –Object migration based dynamic load balancing –Minimal.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Performance-responsive Scheduling for Grid Computing Dr Stephen Jarvis High Performance Systems Group University of Warwick, UK High Performance Systems.
Performance Modelling of Parallel and Distributed Computing Using PACE High Performance Systems Laboratory University of Warwick Junwei Cao Darren J. Kerbyson.
Performance Evaluation of Parallel Algorithms on a Computational Grid Environment Simona Blandino 1, Salvatore Cavalieri 2 1 Consorzio COMETA, 2 Faculty.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
Grid Activities in CMS Asad Samar (Caltech) PPDG meeting, Argonne July 13-14, 2000.
Agent-Based Grid Load-Balancing Daniel P. Spooner University of Warwick, UK Junwei Cao NEC Europe Ltd., Germany.
1 Hierarchical Parallelization of an H.264/AVC Video Encoder A. Rodriguez, A. Gonzalez, and M.P. Malumbres IEEE PARELEC 2006.
Computer Science and Engineering Parallel and Distributed Processing CSE 8380 April 28, 2005 Session 29.
2004 Queue Scheduling and Advance Reservations with COSY Junwei Cao Falk Zimmermann C&C Research Laboratories NEC Europe Ltd.
IPDPS 2003, Nice, France Agent-Based Grid Load Balancing Using Performance-Driven Task Scheduling Junwei Cao (C&C Research Labs, NEC Europe Ltd., Germany)
Towards a High Performance Extensible Grid Architecture Klaus Krauter Muthucumaru Maheswaran {krauter,
Flexibility and Interoperability in a Parallel MD code Robert Brunner, Laxmikant Kale, Jim Phillips University of Illinois at Urbana-Champaign.
Agent-Based Grid Load-Balancing
Introduction to Load Balancing:
Junwei Cao Darren J. Kerbyson Graham R. Nudd
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
Department of Computer Science University of Warwick
Agent-based Resource Management for Grid Computing
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

Use of Performance Prediction Techniques for Grid Management Junwei Cao University of Warwick April 2002

Outline Introduction Performance Prediction Local Grid Scheduling Global Grid Management A Case Study Conclusions & Future Work

Introduction Glossary Overview Related Works

Glossary Grids – computational grids Resources – multiprocessors or clusters Applications (tasks) – MPI & PVM parallel programs Users – developers and end users Agents, Requests & Services Performance – execution time

Overview Grid Users Grid Resources Global Grid Management Local Grid Scheduling Application Tools Performance Evaluation Engine Resource Tools

Related Works Performance evaluation POEMS, AppLeS, … Local grid schedulers Condor, LSF, Ninf, Nimrod, … Global grid management Globus, Legion, DPSS, …

Performance Prediction Methodology Implementation

PACE Methodology Application Layer Hardware Layer Subtask Layer Parallel Template Layer Model Parameters Predicted Execution Time acts as the entry point to the performance study describes the sequential parts within an application describes the parallel characteristics of subtasks characterises the comm. and comp. abilities of a particular system

PACE Toolkit Application Tools Resource Tools Evaluation Engine Source Code Analysis Object Editor Object Library PSL Compiler CPU Network (MPI, PVM) Cache (L1, L2) HMCL Compiler

Summary Advantages Reasonable accuracy Rapid evaluation time Easy cross-platform comparisons Limitations Application source codes required Static resource configurations

Local Grid Scheduling Algorithms Implementation (Titan)

FIFO Algorithm Processor 1 Processor 2 Processor 3 Processor 4 Processor 5 Processor 6 Processor 7 Processor 8 2 n -1

Genetic Algorithm Heuristic Evolutionary Near-optimal: Makespan Idletime Deadlines

Titan Implementation Communication Module PACE Evaluation Engine Task Management GA Scheduling Resource MonitoringTask Execution RequestsResultsService

Global Grid Management Methodology Implementation (ARMS) Metrics

Agent-based Methodology Agent structure Communication layer Decision-making layer Local management layer Agent hierarchy Service advertisement Service discovery Agent Capability Tables A AA AA User

Optimisation Strategies A AA AA M Advertisement Data-push & data-pull Periodic & event-driven Discovery Local services Services in ACTs Lower or upper agents Optimisation Modelling Simulation User

ARMS Implementation A AA AA M TTTTT Service information PACE models Makespan Request information Application binary PACE model Deadline Matchmaking Estimation (FIFO) Deadline User

Metrics Average advance time of application execution completions (compared to required deadlines) Average processor utilisation rate – utilisedtime/totaltime Load balancing level – mean square deviation of processor utilisation rates Average discovery speed – #req./#disc.conn. Average discovery efficiency – #req./(#disc.conn.+ #adver.conn.)

A Case Study Design Demonstrations Results

Experiment Design S 1 (SGIOrigin2000, 16) S 2 (SGIOrigin2000, 16) S 4 (SunUltra10, 16) S 3 (SunUltra10, 16) S 5 (SunUltra5, 16) S 6 (SunUltra5, 16) S 12 (SunSPARCstat ion2, 16) S 11 (SunSPARCstat ion2, 16) S 8 (SunUltra1, 16) S 7 (SunUltra5, 16) S 10 (SunUltra1, 16) S 9 (SunUltra1, 16) sweep3d fft improc closure jacobi memsort cpi

Experiment 1 FIFO

Experiment 1 FIFO

Experiment 2 GA

Experiment 3 GA

Application Execution Both GA and agents contribute towards the improvement in application executions.

Resource Utilisation S11 & S12 benefit mainly from the GA. S1 & S2 benefit mainly from agents.

Load Balancing The GA contributes more to local grid load balancing. Agents contribute more to global grid load balancing.

Discovery Speed & Efficiency No advertisement: Low speed Low efficiency Reasonable advertisement: High speed High efficiency Too much advertisement: Very high speed Very low efficiency Discovery speed (*100) Discovery efficiency (*100)

Conclusions & Future Work

Conclusions Performance prediction capabilities are essential to grid management. Genetic algorithm is applied for local grid scheduling. Global grid management is achieved using an agent-based methodology. Agent-based framework is scalable, flexible, extensible and easy for further enhancement.

Future Work Impact of prediction accuracy on grid management and scheduling Transaction-based application performance modelling Integration with Globus and NWS More than discovery, enabling negotiation and coordination

JAG: Java Agents for Grids Agent Coordination Agent Negotiation Agent Discovery Agent Communication Knowledge Representation Performance Prediction Application Scheduling Information Service Grid Monitoring PACE AppLeS Globus NWS ……

Questions are welcome …