Papers on Storage Systems 1) Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud, SC 2011. 2) Making Cloud Intermediate Data Fault-Tolerant,

Slides:



Advertisements
Similar presentations
Remus: High Availability via Asynchronous Virtual Machine Replication
Advertisements

Starfish: A Self-tuning System for Big Data Analytics.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
SDN + Storage.
LIBRA: Lightweight Data Skew Mitigation in MapReduce
SLA-Oriented Resource Provisioning for Cloud Computing
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Load Rebalancing for Distributed File Systems in Clouds Hung-Chang Hsiao, Member, IEEE Computer Society, Hsueh-Yi Chung, Haiying Shen, Member, IEEE, and.
Power Management in Cloud Computing using Green Algorithm -Kushal Mehta COP 6087 University of Central Florida.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Availability in Globally Distributed Storage Systems
Study of Hurricane and Tornado Operating Systems By Shubhanan Bakre.
Center-of-Gravity Reduce Task Scheduling to Lower MapReduce Network Traffic Mohammad Hammoud, M. Suhail Rehman, and Majd F. Sakr 1.
Performance Evaluation of Peer-to-Peer Video Streaming Systems Wilson, W.F. Poon The Chinese University of Hong Kong.
Distributed Computations MapReduce
Google Distributed System and Hadoop Lakshmi Thyagarajan.
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
On Availability of Intermediate Data in Cloud Computations Steven Y. Ko, Imranul Hoque, Brian Cho, and Indranil Gupta Distributed Protocols Research Group.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
Cloud Data Center/Storage Power Efficiency Solutions Junyao Zhang 1.
CS492: Special Topics on Distributed Algorithms and Systems Fall 2008 Lab 3: Final Term Project.
Predicting performance of applications and infrastructures Tania Lorido 27th May 2011.
Map Reduce for data-intensive computing (Some of the content is adapted from the original authors’ talk at OSDI 04)
Location-aware MapReduce in Virtual Cloud 2011 IEEE computer society International Conference on Parallel Processing Yifeng Geng1,2, Shimin Chen3, YongWei.
Network Aware Resource Allocation in Distributed Clouds.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
MapReduce: Simplified Data Processing on Large Clusters Jeffrey Dean and Sanjay Ghemawat.
Storage Management in Virtualized Cloud Environments Sankaran Sivathanu, Ling Liu, Mei Yiduo and Xing Pu Student Workshop on Frontiers of Cloud Computing,
EXPOSE GOOGLE APP ENGINE AS TASKTRACKER NODES AND DATA NODES.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
Hadoop Hardware Infrastructure considerations ©2013 OpalSoft Big Data.
Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.
Optimizing Cloud MapReduce for Processing Stream Data using Pipelining 作者 :Rutvik Karve , Devendra Dahiphale , Amit Chhajer 報告 : 饒展榕.
Challenges towards Elastic Power Management in Internet Data Center.
MapReduce M/R slides adapted from those of Jeff Dean’s.
1 Distributed Energy-Efficient Scheduling for Data-Intensive Applications with Deadline Constraints on Data Grids Cong Liu and Xiao Qin Auburn University.
Eneryg Efficiency for MapReduce Workloads: An Indepth Study Boliang Feng Renmin University of China Dec 19.
Performance Evaluation of Image Conversion Module Based on MapReduce for Transcoding and Transmoding in SMCCSE Speaker : 吳靖緯 MA0G IEEE.
Bi-Hadoop: Extending Hadoop To Improve Support For Binary-Input Applications Xiao Yu and Bo Hong School of Electrical and Computer Engineering Georgia.
BALANCED DATA LAYOUT IN HADOOP CPS 216 Kyungmin (Jason) Lee Ke (Jessie) Xu Weiping Zhang.
CARDIO: Cost-Aware Replication for Data-Intensive workflOws Presented by Chen He.
1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
Computing Scientometrics in Large-Scale Academic Search Engines with MapReduce Leonidas Akritidis Panayiotis Bozanis Department of Computer & Communication.
1 Enabling Efficient and Reliable Transitions from Replication to Erasure Coding for Clustered File Systems Runhui Li, Yuchong Hu, Patrick P. C. Lee The.
Network-Aware Scheduling for Data-Parallel Jobs: Plan When You Can
Efficient Live Checkpointing Mechanisms for computation and memory-intensive VMs in a data center Kasidit Chanchio Vasabilab Dept of Computer Science,
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
HADOOP DISTRIBUTED FILE SYSTEM HDFS Reliability Based on “The Hadoop Distributed File System” K. Shvachko et al., MSST 2010 Michael Tsitrin 26/05/13.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Optimizing Live Migration of Virtual Machines across Wide Area Networks using Integrated Replication and Scheduling Sumit Kumar Bose, Unisys Scott Brock,
IBM Research ® © 2007 IBM Corporation Introduction to Map-Reduce and Join Processing.
MapReduce & Hadoop IT332 Distributed Systems. Outline  MapReduce  Hadoop  Cloudera Hadoop  Tutorial 2.
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
Data Consolidation: A Task Scheduling and Data Migration Technique for Grid Networks Author: P. Kokkinos, K. Christodoulopoulos, A. Kretsis, and E. Varvarigos.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
- 세부 1 - 이종 클라우드 플랫폼 데이터 관리 브로커 연구 및 개발 Network and Computing Lab.
1 Student Date Time Wei Li Nov 30, 2015 Monday 9:00-9:25am Shubbhi Taneja Nov 30, 2015 Monday9:25-9:50am Rodrigo Sanandan Dec 2, 2015 Wednesday9:00-9:25am.
BAHIR DAR UNIVERSITY Institute of technology Faculty of Computing Department of information technology Msc program Distributed Database Article Review.
By Chris immanuel, Heym Kumar, Sai janani, Susmitha
Curator: Self-Managing Storage for Enterprise Clusters
Distributed Network Traffic Feature Extraction for a Real-time IDS
MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner
Ministry of Higher Education
湖南大学-信息科学与工程学院-计算机与科学系
MapReduce: Simplified Data Processing on Large Clusters
Towards Predictable Datacenter Networks
Presentation transcript:

Papers on Storage Systems 1) Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud, SC ) Making Cloud Intermediate Data Fault-Tolerant, SOCC Present by: Qiangju Xiao

Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud SC ’11, 2011 Authors: Balaji Palanisamy Aameek Singh Ling Liu Bhushan Jain

Introduction (1) What does the paper present? – This paper designed Purlieus, a MapReduce resource allocation system aimed to enhance the performance of MapReduce jobs in the cloud. How does Purlieus work? – Provision virtual MapReduce clusters in a locality- aware manner; – Enable MapReduce VMs access to input data (Map Phase) and intermediate data (Reduce phase) from local or close-by physical machines

Introduction (2) What are the improvements for Purlieus? – Reduces cumulative data center network traffic; – 50% reduction in job execution times for a variety of workloads because network transfer times are big components of total execution time

Impact of Reduce Locality

System Model (1) – Current Cloud Infrastructure Data Load

System Model (2) – Purlieus Infrastructure 1)Data is broken into chunks 2)Blocks stored on distributed file system of the physical machines 3)VM access data on physical machines

System Model (3) – Dataflow from physical to virtual machines

Two Key Questions Data Placement – Which physical machines should be used for each dataset? VM Placement – Where should the VMs be provisioned to process these data blocks?

Purlieus’ Solution – Principles (1) Job Specific Locality-awareness – Placing data in the MapReduce cloud service should incorporate job characteristics like the amount of data accessed in the map and reduce phases. – Three distinct classes of jobs – (1) Map-input heavy; (2) Map-and-Reduce heavy; (3) Reduce- input-heavy.

Purlieus’ Solution – Principles (2) Load Awareness – Placing data in a MapReduce cloud should also account for computational load (CPU, memory ) on the physical machines. – Ensure that the expected load on the servers does not exceed a configurable threshold.

Purlieus’ Solution – Principles (3) Job-specific Data Replication – Replicas of the data set are placed based on the type and frequency of jobs. For example, if an input dataset is used by three sets of MapReduce jobs, two of which are reduce- input heavy and one map-input heavy, Purlieus places two replicas of data blocks in a reduce- input heavy fashion and the third one using map- input heavy strategy.

Purlieus – Placement Techniques (1) Map-input heavy jobs – Data placement Do not require reducers to be executed close to each other; Purlieus chooses machines that have the least expected load. – VM placement Attempt to place VMs on the physical machines that contain the input data chunks for the map phase; if those machines have lower expected computational load, the VM may be placed close to the node that stores the actual data chunk. Among the physical machines at a same network distance, the one having the least load is chosen.

Purlieus – Placement Techniques (2) Map and Reduce-input heavy jobs – Data Placement Should support reduce-locality – VMs should be machines close to each other; Data blocks get placed in a set of closely connected physical machines. – VM placement Ensure that VMs get placed on either the physical machines storing the input data or the close-by ones. Map tasks use local reads and reduce tasks also read within the same rack, maximizing the reduce locality

Purlieus – Placement Techniques (3) Reduce-input heavy jobs – Data Placement Map-locality is not so important; Chooses the physical machine with maximum free storage – VM placement Network traffic for transferring intermediate data among MapReduce VMs is intense in reduce-input heavy jobs and hence the set of VMs for the job should be placed close to each other.

Experiments Data Placement Techniques – Purlieus proposed locality and load-aware data placement (LLADP) – Random data placement (RDP) VM placement techniques: – Locality-unaware VMPlacement(LUAVP) – Map-locality aware VM placement (MLVP) – Reduce-locality aware VM placement (RLVP) – Map and Reduce-locality aware VM placement (MRLVP) – Hybrid locality-aware VM placement (HLVP): Our proposed HLVP technique adaptively picks the placement strategy based on type of the input job. It uses MLVP for map-input heavy, RLVP for reduce-input heavy jobs and MRLVP for map and reduce- input heavy jobs.

Results – Map and Reduce-input heavy workload

Results – Map-input heavy workload

Results – Reduce-input heavy workload

Results – Macro analysis using MapReduce simulator, PurSim (1)

Results – Macro analysis using MapReduce simulator, PurSim (2)

Conclusions Purlieus’ proposed placement techniques optimize for data locality during both map and reduce phases of the job by considering VM placement, MapReduce job characteristics and load on the physical cloud infrastructure at the time of data placement. Purlieus’ evaluation shows significant performance gains with some scenarios showing close to 50% reduction in the cross-rack network traffic.

Making Cloud Intermediate Data Fault-Tolerant SOCC 2010 Authors: Steven Y. Ko Imranul Hoque Brian Cho Indranil Gupta

MapReduce Phases – Map – Shuffle – Reduce Data –Input –Intermediate –Output

Intermediate Data Short lived Used immediately Discarded on completion Write once/ Read bounded Large Many blocks

Intermediate Data – Failures Cascaded re-execution

Intermediate Data Loss requires recomputation

Intermediate Data – Behavior breakdown 0f-10min 1f-30sec

Intermediate Data – Repliation Traditional replication expensive

Can replication be accomplished without significantly affecting execution speed?

Extend HDFS Asynchronous replication Replicate within rack Minimize replicated data

Asynchronous Replication HDFS Replication usually pessimistic – Blocks until replicas made Do not block (Async) – Consistency loss not problem - only one writer

Asynchronous Replication

Replicate within Rack HDFS replicates to a different rack for greater availability Lifespan of intermediate data short “Safe” to replicate to machine in same rack

Replicate within Rack

Minimize Data Replicated HDFS replication – Shuffle phase replicates most data as side effect – Only data used locally is not copied ISS – Replicate only local data

Minimize Data Replicated

IIS under failure

Conclusion Intermediate data properties allow a tailored replication strategy to outperform a traditional one Replication improves MapReduce performance in the case of failure

References 1) Balaji Palanisamy, Aameek Singh, Ling Liu, Bhushan Jain; Purlieus: Locality-aware Resource Allocation for MapReduce in a Cloud, SC ) Steven Y. Ko, Imranul Hoque, Brian Cho, Indranil Gupta; Making Cloud Intermediate Data Fault-Tolerant, SOCC 2010