UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.

Slides:



Advertisements
Similar presentations
MapReduce Online Tyson Condie UC Berkeley Slides by Kaixiang MO
Advertisements

Aggressive Cloning of Jobs for Effective Straggler Mitigation Ganesh Ananthanarayanan, Ali Ghodsi, Scott Shenker, Ion Stoica.
Effective Straggler Mitigation: Attack of the Clones Ganesh Ananthanarayanan, Ali Ghodsi, Srikanth Kandula, Scott Shenker, Ion Stoica.
MapReduce.
GRASS: Trimming Stragglers in Approximation Analytics Ganesh Ananthanarayanan, Michael Hung, Xiaoqi Ren, Ion Stoica, Adam Wierman, Minlan Yu.
LIBRA: Lightweight Data Skew Mitigation in MapReduce
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
MapReduce Online Veli Hasanov Fatih University.
SkewTune: Mitigating Skew in MapReduce Applications
HadoopDB Inneke Ponet.  Introduction  Technologies for data analysis  HadoopDB  Desired properties  Layers of HadoopDB  HadoopDB Components.
UC Berkeley Job Scheduling for MapReduce Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Scott Shenker, Ion Stoica 1 RAD Lab, * Facebook Inc.
THE DATACENTER NEEDS AN OPERATING SYSTEM MATEI ZAHARIA, BENJAMIN HINDMAN, ANDY KONWINSKI, ALI GHODSI, ANTHONY JOSEPH, RANDY KATZ, SCOTT SHENKER, ION STOICA.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Discretized Streams An Efficient and Fault-Tolerant Model for Stream Processing on Large Clusters Matei Zaharia, Tathagata Das, Haoyuan Li, Scott Shenker,
Charles Reiss *, Alexey Tumanov †, Gregory R. Ganger †, Randy H. Katz *, Michael A. Kozuch ‡ * UC Berkeley† CMU‡ Intel Labs.
Backup Path Allocation Based on A Link Failure Probability Model in Overlay Networks Weidong Cui, Ion Stoica, and Randy H. Katz EECS, UC Berkeley {wdc,
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Delay.
Improving MapReduce Performance through Data Placement in Heterogeneous Hadoop Clusters Jiong Xie Ph.D. Student April 2010.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Map-Reduce and Parallel Computing for Large-Scale Media Processing Youjie Zhou.
CPS216: Advanced Database Systems (Data-intensive Computing Systems) How MapReduce Works (in Hadoop) Shivnath Babu.
Improving MapReduce Performance Using Smart Speculative Execution Strategy Qi Chen, Cheng Liu, and Zhen Xiao Oct 2013 To appear in IEEE Transactions on.
L22: SC Report, Map Reduce November 23, Map Reduce What is MapReduce? Example computing environment How it works Fault Tolerance Debugging Performance.
Table of ContentsTable of Contents  Overview  Scheduling in Hadoop  Heterogeneity in Hadoop  The LATE Scheduler(Longest Approximate Time to End) 
Survey on Programming and Tasking in Cloud Computing Environments PhD Qualifying Exam Zhiqiang Ma Supervisor: Lin Gu Feb. 18, 2011.
Outline | Motivation| Design | Results| Status| Future
Distributed Low-Latency Scheduling
A Platform for Fine-Grained Resource Sharing in the Data Center
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
By: Jeffrey Dean & Sanjay Ghemawat Presented by: Warunika Ranaweera Supervised by: Dr. Nalin Ranasinghe.
Google MapReduce Simplified Data Processing on Large Clusters Jeff Dean, Sanjay Ghemawat Google, Inc. Presented by Conroy Whitney 4 th year CS – Web Development.
MapReduce: Simplified Data Processing on Large Clusters 컴퓨터학과 김정수.
A Dynamic MapReduce Scheduler for Heterogeneous Workloads Chao Tian, Haojie Zhou, Yongqiang He,Li Zha 簡報人:碩資工一甲 董耀文.
Introduction to MapReduce ECE7610. The Age of Big-Data  Big-data age  Facebook collects 500 terabytes a day(2011)  Google collects 20000PB a day (2011)
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
MapReduce How to painlessly process terabytes of data.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Department of Computer Science, UIUC Presented by: Muntasir Raihan Rahman and Anupam Das CS 525 Spring 2011 Advanced Distributed Systems Cloud Scheduling.
A Hierarchical MapReduce Framework Yuan Luo and Beth Plale School of Informatics and Computing, Indiana University Data To Insight Center, Indiana University.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Fair.
Sparrow Distributed Low-Latency Spark Scheduling Kay Ousterhout, Patrick Wendell, Matei Zaharia, Ion Stoica.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
A Platform for Fine-Grained Resource Sharing in the Data Center
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
MapReduce: Simplied Data Processing on Large Clusters Written By: Jeffrey Dean and Sanjay Ghemawat Presented By: Manoher Shatha & Naveen Kumar Ratkal.
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center NSDI 11’ Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D.
Resilient Distributed Datasets A Fault-Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
Map reduce Cs 595 Lecture 11.
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
Edinburgh Napier University
Introduction to MapReduce and Hadoop
PA an Coordinated Memory Caching for Parallel Jobs
Auburn University COMP7500 Advanced Operating Systems I/O-Aware Load Balancing Techniques (2) Dr. Xiao Qin Auburn University.
MapReduce Simplied Data Processing on Large Clusters
EECS 582 Final Review Mosharaf Chowdhury EECS 582 – F16.
湖南大学-信息科学与工程学院-计算机与科学系
February 26th – Map/Reduce
Cse 344 May 4th – Map/Reduce.
Discretized Streams: A Fault-Tolerant Model for Scalable Stream Processing Zaharia, et al (2012)
Introduction to MapReduce
Cloud Computing Large-scale Resource Management
Cloud Computing MapReduce in Heterogeneous Environments
5/7/2019 Map Reduce Map reduce.
COS 518: Distributed Systems Lecture 11 Mike Freedman
MapReduce: Simplified Data Processing on Large Clusters
Presentation transcript:

UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University of California at Berkeley

Motivation MapReduce programming model growing in popularity –Open-source implementation, Hadoop, used at Yahoo, Facebook, CMU, Berkeley,… Virtualized computing services like Amazon EC2 provide on-demand compute power, but less control over performance

Results Main challenge for Hadoop on EC2 was node performance heterogeneity Designed heterogeneity-aware scheduler that improves response time up to 2x

Outline MapReduce background The challenge of heterogeneity LATE: a heterogeneity-aware scheduler

What is MapReduce? Programming model to split computations into independent parallel tasks –Map tasks filter data set –Reduce tasks aggregate values by key Goal: hide the complexity of distributed programming and fault tolerance

Fault Tolerance in MapReduce Nodes fail  re-run tasks Nodes very slow (stragglers)  launch backup copies of tasks How to do this in heterogeneous env.?

Heterogeneity in Virtualized Environments VM technology isolates CPU and memory, but disk and network are shared –Full bandwidth when no contention –Equal shares when there is contention 2.5x I/O performance difference on EC2

Disk Performance Heterogeneity Experiment

Backup Task Scheduling in Hadoop Scheduler starts all primary tasks, then looks for tasks to back up Tasks report “progress score” from 0 to 1 Backup launched if progress < avgProgress - 20%

Problems in Heterogeneous Environment Too many backups (trash shared resources) Wrong tasks may be backed up Backups may be placed on slow nodes Tasks never backed up if progress > 80% Result: 80% of reduces backed up in some experiments, network overloaded

Progress Rate Approaches Compute average progress rate, back up tasks that are “far enough” below this Problems: –How long to wait for statistics? –Can still select the wrong tasks

Example Time (min) Node 1 Node 2 Node 3 3x slower 1.9x slower 1 task/min

Example Node 1 Node 2 Node 3 What if the job had 5 tasks? time left: 1 min time left: 1.8 min 2 min Time (min) Should back up node 3’s task

Our Scheduler: LATE Insight: back up the task with the largest estimated finish time –“Longest Approximate Time to End” Sanity thresholds: –Cap backup tasks to ~10% –Launch backups on fast nodes –Only back up tasks that are sufficiently slow

LATE Details Estimating finish time: Thresholds: –25 th percentiles for slow node/task, 10% cap –Sensitivity analysis shows robustness progress score execution time progress rate = 1 – progress score progress rate estimated time left =

Evaluation EC2 experiments (3 job types, 200 nodes) Experiments in small controlled testbed Contention through VM placement Results: –2x better response time if there are stragglers –30% better response time when no stragglers

Conclusion Heterogeneity is a challenge for parallel applications, and is getting more important Lessons for scheduling backup tasks: –Detecting slow nodes isn’t enough; do it early –Pick tasks which hurt response time the most –Be mindful of shared resources 2x improvement using simple algorithm

Questions? ? ? ?