1 Supporting Dynamic Migration in Tightly Coupled Grid Applications Liang Chen Qian Zhu Gagan Agrawal Computer Science & Engineering The Ohio State University.

Slides:



Advertisements
Similar presentations
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Advertisements

High Availability Group 08: Võ Đức Vĩnh Nguyễn Quang Vũ
A Grid Parallel Application Framework Jeremy Villalobos PhD student Department of Computer Science University of North Carolina Charlotte.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
Web-based Portal for Discovery, Retrieval and Visualization of Earth Science Datasets in Grid Environment Zhenping (Jane) Liu.
Computer Science and Engineering A Middleware for Developing and Deploying Scalable Remote Mining Services P. 1DataGrid Lab A Middleware for Developing.
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
Designing Efficient Systems Services and Primitives for Next-Generation Data-Centers K. Vaidyanathan, S. Narravula, P. Balaji and D. K. Panda Network Based.
Efficient Evaluation of XQuery over Streaming Data Xiaogang Li Gagan Agrawal The Ohio State University.
Virtualization. Virtualization  In computing, virtualization is a broad term that refers to the abstraction of computer resources  It is "a technique.
Active Monitoring in GRID environments using Mobile Agent technology Orazio Tomarchio Andrea Calvagna Dipartimento di Ingegneria Informatica e delle Telecomunicazioni.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
A Metadata Based Approach For Supporting Subsetting Queries Over Parallel HDF5 Datasets Vignesh Santhanagopalan Graduate Student Department Of CSE.
Ohio State University Department of Computer Science and Engineering 1 Cyberinfrastructure for Coastal Forecasting and Change Analysis Gagan Agrawal Hakan.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Tekin Bicer Gagan Agrawal 1.
A Framework for Elastic Execution of Existing MPI Programs Aarthi Raveendran Graduate Student Department Of CSE 1.
G-JavaMPI: A Grid Middleware for Distributed Java Computing with MPI Binding and Process Migration Supports Lin Chen, Cho-Li Wang, Francis C. M. Lau and.
Porting Irregular Reductions on Heterogeneous CPU-GPU Configurations Xin Huo, Vignesh T. Ravi, Gagan Agrawal Department of Computer Science and Engineering.
LIGO-G9900XX-00-M ITR 2003 DMT Sub-Project John G. Zweizig LIGO/Caltech.
Euro-Par, A Resource Allocation Approach for Supporting Time-Critical Applications in Grid Environments Qian Zhu and Gagan Agrawal Department of.
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
Performance Prediction for Random Write Reductions: A Case Study in Modelling Shared Memory Programs Ruoming Jin Gagan Agrawal Department of Computer and.
OMIS Approach to Grid Application Monitoring Bartosz Baliś Marian Bubak Włodzimierz Funika Roland Wismueller.
CISC Machine Learning for Solving Systems Problems Presented by: Alparslan SARI Dept of Computer & Information Sciences University of Delaware
Tool Integration with Data and Computation Grid GWE - “Grid Wizard Enterprise”
Computer Science and Engineering Predicting Performance for Grid-Based P. 1 IPDPS’07 A Performance Prediction Framework.
1 A Grid-Based Middleware for Processing Distributed Data Streams Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering.
Computer Science and Engineering Parallelizing Defect Detection and Categorization Using FREERIDE Leonid Glimcher P. 1 ipdps’05 Scaling and Parallelizing.
What is SAM-Grid? Job Handling Data Handling Monitoring and Information.
High-level Interfaces and Abstractions for Data-Driven Applications in a Grid Environment Gagan Agrawal Department of Computer Science and Engineering.
CCGrid, 2012 Supporting User Defined Subsetting and Aggregation over Parallel NetCDF Datasets Yu Su and Gagan Agrawal Department of Computer Science and.
Compiler and Runtime Support for Enabling Generalized Reduction Computations on Heterogeneous Parallel Configurations Vignesh Ravi, Wenjing Ma, David Chiu.
GSAF: A Grid-based Services Transfer Framework Chunyan Miao, Wang Wei, Zhiqi Shen, Tan Tin Wee.
Department of Computer Science MapReduce for the Cell B. E. Architecture Marc de Kruijf University of Wisconsin−Madison Advised by Professor Sankaralingam.
Implementing Data Cube Construction Using a Cluster Middleware: Algorithms, Implementation Experience, and Performance Ge Yang Ruoming Jin Gagan Agrawal.
Supporting Load Balancing for Distributed Data-Intensive Applications Leonid Glimcher, Vignesh Ravi, and Gagan Agrawal Department of ComputerScience and.
Computer Science and Engineering FREERIDE-G: A Grid-Based Middleware for Scalable Processing of Remote Data Leonid Glimcher Gagan Agrawal.
Development of e-Science Application Portal on GAP WeiLong Ueng Academia Sinica Grid Computing
A N I N - MEMORY F RAMEWORK FOR E XTENDED M AP R EDUCE 2011 Third IEEE International Conference on Coud Computing Technology and Science.
High-level Interfaces for Scalable Data Mining Ruoming Jin Gagan Agrawal Department of Computer and Information Sciences Ohio State University.
Tool Integration with Data and Computation Grid “Grid Wizard 2”
Ohio State University Department of Computer Science and Engineering Servicing Range Queries on Multidimensional Datasets with Partial Replicas Li Weng,
Research Overview Gagan Agrawal Associate Professor.
Euro-Par, HASTE: An Adaptive Middleware for Supporting Time-Critical Event Handling in Distributed Environments ICAC 2008 Conference June 2 nd,
1 A Grid-Based Middleware’s Support for Processing Distributed Data Streams Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering.
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Efficient Opportunistic Sensing using Mobile Collaborative Platform MOSDEN.
1 Supporting a Volume Rendering Application on a Grid-Middleware For Streaming Data Liang Chen Gagan Agrawal Computer Science & Engineering Ohio State.
Efficient Evaluation of XQuery over Streaming Data
Introduction to Load Balancing:
GWE Core Grid Wizard Enterprise (
QianZhu, Liang Chen and Gagan Agrawal
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
Performance Evaluation of Adaptive MPI
Liang Chen Advisor: Gagan Agrawal Computer Science & Engineering
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Supporting Fault-Tolerance in Streaming Grid Applications
Year 2 Updates.
Li Weng, Umit Catalyurek, Tahsin Kurc, Gagan Agrawal, Joel Saltz
An Adaptive Middleware for Supporting Time-Critical Event Response
Smita Vijayakumar Qian Zhu Gagan Agrawal
Pramod Bhatotia, Ruichuan Chen, Myungjin Lee
GATES: A Grid-Based Middleware for Processing Distributed Data Streams
Resource Allocation in a Middleware for Streaming Data
Resource Allocation for Distributed Streaming Applications
Quality-aware Middleware
Support for Adaptivity in ARMCI Using Migratable Objects
L. Glimcher, R. Jin, G. Agrawal Presented by: Leo Glimcher
Presentation transcript:

1 Supporting Dynamic Migration in Tightly Coupled Grid Applications Liang Chen Qian Zhu Gagan Agrawal Computer Science & Engineering The Ohio State University

2 Introduction- Motivation –Grid resources vary frequently –Tightly coupled applications in Grid v.s. bag of tasks Pipelined applications and streaming applications Features: –Dependencies –Run-longing –Large volumes of data transfer between tasks (stages) –Dynamically allocating new resources and migrating applications to the new resources improve performance

3 Introduction- Challenges –Checkpointing is a classic method to support dynamic migration A snapshot of system’s running state Transmit to a remote site Restore execution context and restart processes –Pros of checkingpointing Maybe transparent to applications –Cons of checkingpointing Platform dependent Inefficient

4 Introduction- Our approach –Typical processing structure of a data streaming application:... while(true) { read_data_from_streams(); process_data(); accumulate_intermediate_results(); reset_auxiliary_structures(); }... –Our approach is based on Light-weight Summary Structure (LSS) Data structure storing summary information is Light-weight summary structure Others are Auxiliary structures

5 Introduction - Contribution Proposed the notion of LSS that enables efficient process migration Implemented application migration using LSS in the GATES middleware Designed a dynamic resource allocation algorithm for pipeline processing on streaming data Demonstrated an architecture for resource monitoring and allocation Extensively evaluated the LSS implementation using 3 data stream applications

6 Middleware System Architecture Features of data steam –Data arrive continuously –Enormous volume and must be processed online –Need to be processed in real-time –Data sources could be distributed The needs for processing distributed data streams –A middleware running in Grid –Allocate Grid resources –Provide self-adaptation function

7 Middleware System Architecture –GATES (Grid-Based AdapTive Execution on Streams) middleware Use Globus Toolkit 3.0, built on OGSA Allows users to specify their algorithms implemented in Java Take care of plugging user-defined algorithms into the system and running them in Grid. Applications need be broken down into a number of pipelined stages

8 ABC Stage A Stage BStage C :GATES services :Stages of an application :Queues between Grid services :Buffers for applications Middleware System Architecture Application Stage A Stage B Stage C

9 Public class Second-Stage implements StreamProcessing { … void work(buffer in, buffer out) { … while(true) { DATA = GATES.getFromInputBuffer(in); Inter-Results = Processing(Data); GATES.putToOutputBuffer (out, Inter-Results); } System Architecture and Design (GATES API Functions)

10 Roadmap Introduction ‒ Motivation for tight-coupled applications in Grid ‒ Challenge and our approach Middleware System Overview ‒ Introduce the system architecture and design Implementing Dynamic Migration Using LSS ‒ Light-weight summary structure (LSS) and its example ‒ Advantages of utilizing LSS ‒ LSS Implementation Detail ‒ Architecture of dynamic resource allocation scheme Evaluation ‒ Three distributed data stream applications ‒ Memory usage of LSS ‒ Efficient migration by using LSS ‒ Processing accuracy and LSS migration Related work Conclusion

11 Light-weight Summary Structure (LSS) & its Example LSS is a data structure that stores summary information of processing Auxiliary structures An application calculates the average value of all integer numbers in a stream –Two stage: the first is data source the second calculates the sum and counts the number of integers, ave=sum/count – LSS would be the sum and the count – Auxiliary structures would be loop index and other temporary variables

12 Advantages of using LSS Efficient, only LSS is migrated –Only “sum” and “count” migrate Not impact the accuracy of processing Support migration across heterogeneous platforms –“sum” and “count” are logic structures Reduce application developers’ efforts on making application capable of migration

13 An Example of LSS LSS can be used to support dynamic migration –GAETS provides an API function to allocate memory to be LSS –An application stores summary information to LSS –transmit only LSS at the end of the loop to a new node –Restore the LSS at the new node

14 Public class Second-Stage implements StreamProcessing { … void work(buffer in, buffer out) { … while(true) { DATA = GATES.getFromInputBuffer(in); Inter-Results = Processing(Data); GATES.putToOutputBuffer (out, Inter-Results); } Application using LSS LSS = Get a LSS from GATES Accumulate Inter-Results to LSS Reset all Auxiliary structures Inform GATES migration could be executed

15 Implementation Detail

16 Architecture of dynamic resource allocation scheme Using Information Service to collect resource information Apply dynamic resource allocation algorithm Advise and assist GATES services to migrate

17 Roadmap Introduction ‒ Motivation for tight-coupled applications in Grid ‒ Challenge and our approach Middleware System Overview ‒ Introduce the system architecture and design Implementing Dynamic Migration Using LSS ‒ Light-weight summary structure (LSS) and its example ‒ How applications utilize LSS ‒ LSS Implementation Detail ‒ Architecture of dynamic resource allocation scheme Evaluation ‒ Three distributed data stream applications ‒ Memory usage of LSS ‒ Efficient migration by using LSS ‒ Processing accuracy and LSS migration Related work Conclusion

18 Experimental Evaluation Evaluation –Three applications Counting sample –LSS stores intermediate top M frequently occurring numbers Clustream, clustering data points in streams –LSS stores micro-clusters computed at the second stage Dist-Freq-Counting, finding frequent itemsets in distributed streams. –LSS stores unprocessed itemsets

19 Experimental Evaluation Memory usage of LSS

20 Experimental Evaluation Memory usage of LSS

21 Experimental Evaluation Migration using LSS is efficient

22 Experimental Evaluation Migration using LSS is efficient

23 Experimental Evaluation Migration using LSS is efficient

24 Experimental Evaluation Migration using LSS is efficient

25 Experimental Evaluation Benefits of migration in a dynamic environment

26 Experimental Evaluation Benefits of migration in a dynamic environment

27 Experimental Evaluation LSS migration does not impact processing accuracy –The counting sample application was used –Compared the average accuracy of the processing results from the non- migration and the migration versions, they are 97.28% and 97.51% accurate

28 Related work  Condor  XCATS  Charm++  User Level Processes (ULP)  Migratable PVM  Piranha

29 Related Work Middleware for data stream processing –Data cutter, Stampede –Differences: in a cluster, no self-adaptation, no specifically for real-time processing Continuous query systems –STREAM, dQUOB, TelegraphCQ, NiagraCQ –Differences: centralized, no adaptation supports Distributed continuous query systems –Aurora*, Medusa, Borealis –Differences: continuous queries, not in Grid environment In- Network aggregation in sensor network Stream-based overlay networks

30 Conclusion  LSS enables efficient migration for distributed data stream applications  The main observations from our experiments –Enables efficient process migration; the size of process state reduced by times –Introduces a very small overhead –Significantly improve the performance of long- running applications. –Our migration scheme does not impact the accuracy of the processing.

31 Questions?

32 Implementing Dynamic Migration Using LSS

33 Architecture of Dynamic Resource Allocation Scheme