Edinburgh Napier University

Slides:

Advertisements

Similar presentations

Aggressive Cloning of Jobs for Effective Straggler Mitigation Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, Ion Stoica.

Advertisements

Effective Straggler Mitigation: Attack of the Clones Ganesh Ananthanarayanan, Ali Ghodsi, Srikanth Kandula, Scott Shenker, Ion Stoica.

LIBRA: Lightweight Data Skew Mitigation in MapReduce

Based on the text by Jimmy Lin and Chris Dryer; and on the yahoo tutorial on mapreduce at index.html

Ilias Tachmazidis 1,2, Grigoris Antoniou 1,2,3, Giorgos Flouris 2, Spyros Kotoulas 4 1 University of Crete 2 Foundation for Research and Technology, Hellas.

Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu Toward Optimal Deployment of Communication-Intensive Cloud Applications 1.

Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.

Software Quality Ranking: Bringing Order to Software Modules in Testing Fei Xing Michael R. Lyu Ping Guo.

Center-of-Gravity Reduce Task Scheduling to Lower MapReduce Network Traffic Mohammad Hammoud, M. Suhail Rehman, and Majd F. Sakr 1.

UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.

Fault-tolerant Adaptive Divisible Load Scheduling Xuan Lin, Sumanth J. V. Acknowledge: a few slides of DLT are from Thomas Robertazzi ’ s presentation.

1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.

Grid Load Balancing Scheduling Algorithm Based on Statistics Thinking The 9th International Conference for Young Computer Scientists Bin Lu, Hongbin Zhang.

1 Optimizing Utility in Cloud Computing through Autonomic Workload Execution Reporter : Lin Kelly Date : 2010/11/24.

UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.

UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.

CPS216: Advanced Database Systems (Data-intensive Computing Systems) How MapReduce Works (in Hadoop) Shivnath Babu.

Improving MapReduce Performance Using Smart Speculative Execution Strategy Qi Chen, Cheng Liu, and Zhen Xiao Oct 2013 To appear in IEEE Transactions on.

Table of ContentsTable of Contents  Overview  Scheduling in Hadoop  Heterogeneity in Hadoop  The LATE Scheduler(Longest Approximate Time to End) 

Research on cloud computing application in the peer-to-peer based video-on-demand systems Speaker : 吳靖緯 MA0G rd International Workshop.

An approach for solving the Helmholtz Equation on heterogeneous platforms An approach for solving the Helmholtz Equation on heterogeneous platforms G.

A Dynamic MapReduce Scheduler for Heterogeneous Workloads Chao Tian, Haojie Zhou, Yongqiang He,Li Zha 簡報人：碩資工一甲董耀文.

Presented By HaeJoon Lee Yanyan Shen, Beng Chin Ooi, Bogdan Marius Tudor National University of Singapore Wei Lu Renmin University Cang Chen Zhejiang University.

MARISSA: MApReduce Implementation for Streaming Science Applications 作者 : Fadika, Z. ; Hartog, J. ; Govindaraju, M. ; Ramakrishnan, L. ; Gunter, D. ; Canon,

1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.

Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.

GreenSched: An Energy-Aware Hadoop Workflow Scheduler

Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.

MC 2 : Map Concurrency Characterization for MapReduce on the Cloud Mohammad Hammoud and Majd Sakr 1.

Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.

5 May CmpE 516 Fault Tolerant Scheduling in Multiprocessor Systems Betül Demiröz.

Using Map-reduce to Support MPMD Peng

MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.

Dynamic Slot Allocation Technique for MapReduce Clusters School of Computer Engineering Nanyang Technological University 25 th Sept 2013 Shanjiang Tang,

DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.

Big traffic data processing framework for intelligent monitoring and recording systems 學生 : 賴弘偉教授 : 許毅然作者 : Yingjie Xia a, JinlongChen a,b,n, XindaiLu.

Efficient Load Balancing Algorithm for Cloud Computing Network Che-Lun Hung 1, Hsiao-hsi Wang 2 and Yu-Chen Hu 2 1 Dept. of Computer Science & Communication.

Author : Cedric Augonnet, Samuel Thibault, and Raymond Namyst INRIA Bordeaux, LaBRI, University of Bordeaux Workshop on Highly Parallel Processing on a.

Genetic algorithms for task scheduling problem J. Parallel Distrib. Comput. (2010) Fatma A. Omara, Mona M. Arafa 2016/3/111 Shang-Chi Wu.

INTRODUCTION TO HADOOP. OUTLINE  What is Hadoop  The core of Hadoop  Structure of Hadoop Distributed File System  Structure of MapReduce Framework.

Name : Mamatha J M Seminar guide: Mr. Kemparaju. GRID COMPUTING.

1 ”MCUDA: An efficient implementation of CUDA kernels for multi-core CPUs” John A. Stratton, Sam S. Stone and Wen-mei W. Hwu Presentation for class TDT24,

MapReduce MapReduce is one of the most popular distributed programming models Model has two phases: Map Phase: Distributed processing based on key, value.

Talal H. Noor, Quan Z. Sheng, Lina Yao,

Introduction to Load Balancing:

Tao Zhu1,2, Chengchun Shu1, Haiyan Yu1

CS 425 / ECE 428 Distributed Systems Fall 2016 Nov 10, 2016

CS 425 / ECE 428 Distributed Systems Fall 2017 Nov 16, 2017

Efficient Load Balancing Algorithm for Cloud

PA an Coordinated Memory Caching for Parallel Jobs

Chapter 6: CPU Scheduling

MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner

Distributed Systems CS

Dynamic Routing Using Inter Capsule Routing Protocol Between Capsules

湖南大学-信息科学与工程学院-计算机与科学系

MapReduce: Data Distribution for Reduce

On Spatial Joins in MapReduce

Module 5: CPU Scheduling

Cse 344 May 4th – Map/Reduce.

MapReduce Scheduling in Cloud Computing

ECE539 final project Instructor: Yu Hen Hu Fall 2005

SMEM Algorithm for Mixture Models

Chapter 2: The Linux System Part 3

Smita Vijayakumar Qian Zhu Gagan Agrawal

Pei Fan*, Ji Wang, Zibin Zheng, Michael R. Lyu

Outline Scheduling algorithms Multi-processor scheduling

Distributed Systems CS

Cloud Computing MapReduce in Heterogeneous Environments

Shortest-Job-First (SJR) Scheduling

MapReduce: Simplified Data Processing on Large Clusters

Presentation transcript:

Edinburgh Napier University An Optimized Speculative Execution Strategy Based on Local Data Prediction in a Heterogeneous Hadoop Environment Edinburgh Napier University Xiaodong Liu and Qi Liu

Contents Background Introduction Related Work Model and Algorithm Results and Evaluation Conclusion

Background Hadoop, which acts as the top project of Apache and one of the most popular cloud computing frameworks, has been widely adopted for its distributed features on data storage, computing and searching. Job scheduling is the core component of Hadoop and aims to divide a job into multiple tasks, and then provoke a JobTracker service to assign the tasks to corresponding TaskTracker nodes.

Background Distributing tasks as fast as possible cannot guarantee that subsequent execution in each TaskTracker still maintains its superiority [3], and may lead to the so-called slow tasks-Straggler. Speculative Execution (SE) is the current effective mechanism to recognize and correct inefficient allocation made by a JobTracker service so as to improve the fault tolerance feature of the Hadoop.

Related Work Due to the poor performance of Hadoop-naïve speculative execution strategy in heterogeneous environments, many optimized SE algorithm was proposed. LATE-using the remaining time as the speculative execution priority. MCP-optimizing the SE strategy by maximizing the benefits of launching backup tasks. ERUL-calculating the remaining time by the real-time system load and improves the accuracy of the prediction.

Model and Algorithm

Model and Algorithm (1) The Recognition of Straggler Candidates The LWR method was implemented to calculate the remaining time of tasks. Where X is an input matrix, Y is the output vector. W is a diagonal weight function matrix.

Model and Algorithm A Gaussian kernel function is therefore used to calculate the weight function ω(d) ,where γ is the wave-length parameter and is set to 0.08 in this paper.

One slot for trem-tbackup Model and Algorithm (2) The Benefit Calculation of Replicating Stragglers SE Enabled SE Disabled Cluster Consumption Two slots for tbackup One slot for trem Cluster Benefits One slot for trem-tbackup trem is the remaining time predicted by the LWR model, tavg is the average execution time of completed tasks. μ is introduced to avoid the influence of the data skew of the input data.

Model and Algorithm (3) The Selection of Backup Nodes To enhance the performance of SE, we proposes a new method to measure and assess potential backup nodes by dividing the nodes into two good-at groups, i.e. “Map-Fast” nodes and “Reduce-Fast” nodes. PR represents the processing rate of node candidates.

Results & Evaluation The detailed information of experimental environment NodeID Memory(GB) Core Processors Node 1 10 8 Node 2 4 Node 3 1 Node 4 Node 5 Node 6 Node 7 18 Node 8 12

Results & Evaluation Job execution time and Cluster Throughtput of different SE strategies on Wordcount jobs in a normal load scenario

Results & Evaluation Job execution time and Cluster Throughtput of different SE strategies on Wordcount jobs in a busy load with data skew scenario

Conclusion LWR-SE was proposed inspired by the non-linear relationship between job execution time and progress. The experimental results have shown that the LWR-SE outperforms the MCP, LATE and Hadoop-None in three different heterogeneous scenarios designed with either normal or busy workloads

Thank you!