Yahoo! Research Johns Hopkins University Chris Olston Anish Das Sarma Xiaodan Wang Randal Burns Shared Scan Batch Scheduling in Cloud Computing.

Slides:



Advertisements
Similar presentations
Meet Hadoop Doug Cutting & Eric Baldeschwieler Yahoo!
Advertisements

A MapReduce Workflow System for Architecting Scientific Data Intensive Applications By Phuong Nguyen and Milton Halem phuong3 or 1.
MapReduce Online Tyson Condie UC Berkeley Slides by Kaixiang MO
MAP REDUCE PROGRAMMING Dr G Sudha Sadasivam. Map - reduce sort/merge based distributed processing Best for batch- oriented processing Sort/merge is primitive.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
The Big Picture Scientific disciplines have developed a computational branch Models without closed form solutions solved numerically This has lead to.
HDFS & MapReduce Let us change our traditional attitude to the construction of programs: Instead of imagining that our main task is to instruct a computer.
Piccolo: Building fast distributed programs with partitioned tables Russell Power Jinyang Li New York University.
SkewReduce YongChul Kwon Magdalena Balazinska, Bill Howe, Jerome Rolia* University of Washington, *HP Labs Skew-Resistant Parallel Processing of Feature-Extracting.
Johns Hopkins University Xiaodan Wang Eric Perlman Randal Burns Tamas Budavari Charles Meneveau Alexander Szalay Purdue University Tanu Malik JAWS: J ob-
LIBRA: Lightweight Data Skew Mitigation in MapReduce
Mapreduce and Hadoop Introduce Mapreduce and Hadoop
Based on the text by Jimmy Lin and Chris Dryer; and on the yahoo tutorial on mapreduce at index.html
MapReduce Online Created by: Rajesh Gadipuuri Modified by: Ying Lu.
EHarmony in Cloud Subtitle Brian Ko. eHarmony Online subscription-based matchmaking service Available in United States, Canada, Australia and United Kingdom.
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
Meeting Service Level Objectives of Pig Programs Zhuoyao Zhang, Ludmila Cherkasova, Abhishek Verma, Boon Thau Loo University of Pennsylvania Hewlett-Packard.
Distributed Computations
Distributed Computations MapReduce
Take An Internal Look at Hadoop Hairong Kuang Grid Team, Yahoo! Inc
HADOOP ADMIN: Session -2
Design Patterns for Efficient Graph Algorithms in MapReduce Jimmy Lin and Michael Schatz University of Maryland MLG, January, 2014 Jaehwan Lee.
Advanced Topics: MapReduce ECE 454 Computer Systems Programming Topics: Reductions Implemented in Distributed Frameworks Distributed Key-Value Stores Hadoop.
SIDDHARTH MEHTA PURSUING MASTERS IN COMPUTER SCIENCE (FALL 2008) INTERESTS: SYSTEMS, WEB.
Hadoop Ida Mele. Parallel programming Parallel programming is used to improve performance and efficiency In a parallel program, the processing is broken.
By: Jeffrey Dean & Sanjay Ghemawat Presented by: Warunika Ranaweera Supervised by: Dr. Nalin Ranasinghe.
Committed to Deliver….  We are Leaders in Hadoop Ecosystem.  We support, maintain, monitor and provide services over Hadoop whether you run apache Hadoop,
Detecting Near-Duplicates for Web Crawling Manku, Jain, Sarma
Map Reduce for data-intensive computing (Some of the content is adapted from the original authors’ talk at OSDI 04)
NOVA: CONTINUOUS PIG/HADOOP WORKFLOWS. storage & processing scalable file system e.g. HDFS distributed sorting & hashing e.g. Map-Reduce dataflow programming.
CS525: Special Topics in DBs Large-Scale Data Management Hadoop/MapReduce Computing Paradigm Spring 2013 WPI, Mohamed Eltabakh 1.
Whirlwind tour of Hadoop Inspired by Google's GFS Clusters from systems Batch Processing High Throughput Partition-able problems Fault Tolerance.
Presented by CH.Anusha.  Apache Hadoop framework  HDFS and MapReduce  Hadoop distributed file system  JobTracker and TaskTracker  Apache Hadoop NextGen.
Panagiotis Antonopoulos Microsoft Corp Ioannis Konstantinou National Technical University of Athens Dimitrios Tsoumakos.
MapReduce: Hadoop Implementation. Outline MapReduce overview Applications of MapReduce Hadoop overview.
Distributed Indexing of Web Scale Datasets for the Cloud {ikons, eangelou, Computing Systems Laboratory School of Electrical.
Hadoop/MapReduce Computing Paradigm 1 Shirish Agale.
Introduction to Hadoop and HDFS
f ACT s  Data intensive applications with Petabytes of data  Web pages billion web pages x 20KB = 400+ terabytes  One computer can read
Whirlwind Tour of Hadoop Edward Capriolo Rev 2. Whirlwind tour of Hadoop Inspired by Google's GFS Clusters from systems Batch Processing High.
L/O/G/O 云端的小飞象系列报告之二 Cloud 组. L/O/G/O Hadoop in SIGMOD
MapReduce High-Level Languages Spring 2014 WPI, Mohamed Eltabakh 1.
MapReduce M/R slides adapted from those of Jeff Dean’s.
Map-Reduce-Merge: Simplified Relational Data Processing on Large Clusters Hung-chih Yang(Yahoo!), Ali Dasdan(Yahoo!), Ruey-Lung Hsiao(UCLA), D. Stott Parker(UCLA)
Grid Computing at Yahoo! Sameer Paranjpye Mahadev Konar Yahoo!
Xiaodan Wang Department of Computer Science Johns Hopkins University Processing Data Intensive Queries in Scientific Database Federations.
MC 2 : Map Concurrency Characterization for MapReduce on the Cloud Mohammad Hammoud and Majd Sakr 1.
Xiaodan Wang, Randal Burns Department of Computer Science Johns Hopkins University Tanu Malik Cyber Center Purdue University LifeRaft: Data-Driven, Batch.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
MapReduce and Data Management Based on slides from Jimmy Lin’s lecture slides ( (licensed.
CS525: Big Data Analytics MapReduce Computing Paradigm & Apache Hadoop Open Source Fall 2013 Elke A. Rundensteiner 1.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
ApproxHadoop Bringing Approximations to MapReduce Frameworks
Hadoop/MapReduce Computing Paradigm 1 CS525: Special Topics in DBs Large-Scale Data Management Presented By Kelly Technologies
MapReduce: Simplified Data Processing on Large Clusters By Dinesh Dharme.
Next Generation of Apache Hadoop MapReduce Owen
PARALLEL AND DISTRIBUTED PROGRAMMING MODELS U. Jhashuva 1 Asst. Prof Dept. of CSE om.
Google Cloud computing techniques (Lecture 03) 18th Jan 20161Dr.S.Sridhar, Director, RVCT, RVCE, Bangalore
Department of Computer Science Johns Hopkins University Xiaodan Wang Advisor: Randal Burns Processing Data-Intensive Queries in Petabyte-Scale Scientific.
Organizations Are Embracing New Opportunities
Hadoop.
An Open Source Project Commonly Used for Processing Big Data Sets
Introduction to MapReduce and Hadoop
MapReduce Computing Paradigm Basics Fall 2013 Elke A. Rundensteiner
Database Applications (15-415) Hadoop Lecture 26, April 19, 2016
MapReduce Simplied Data Processing on Large Clusters
Omega: flexible, scalable schedulers for large compute clusters
Cse 344 May 2nd – Map/reduce.
Cse 344 May 4th – Map/Reduce.
Charles Tappert Seidenberg School of CSIS, Pace University
Presentation transcript:

Yahoo! Research Johns Hopkins University Chris Olston Anish Das Sarma Xiaodan Wang Randal Burns Shared Scan Batch Scheduling in Cloud Computing

Project Goals Eliminate redundant data processing for concurrent workflows that access the same dataset in the Cloud Batch MapReduce workflows to enable scan sharing – Single pass scan of shared data segments – Alleviate contention and improve scalability – Utilize fewer map/reduce slots under load Data-intensive workloads (tens of minutes to hours) – Joins across multiple datasets – User specified rewards for early completion Trade-offs between efficient resource utilization and deadlines

Shared Scan Batch Scheduling in Cloud Computing Data-Driven Batch Scheduling Throughput scales with contention (Astro. & Turbulence) Decompose into sub-queries based on data access Co-schedule sub-queries to amortize I/O Evaluate data atoms based on utility metric – Reordering based on contention vs. arrival order (CIDR’09) – Adaptive starvation resistance – Job-aware (queries with data dependency) (SC’10) Turbulence DB R1R2R3 R2R3R4 R1R2 Q1 Q2 Q3 Decomposition Data Access by Query Q1Q2Q3 Q1Q3 Q1Q2 R2 R1 R3 Q2 R3 Co-schedule by Sub-query Batch Sched. Query Results Query Results

Shared Scan Batch Scheduling in Cloud Computing Application in Cloud Computing Fixed Cloud (fixed resources) – Single pass scan of shared data – Alleviate contention (utilize less map/reduce slots, shared loading and shuffling of data) – Earn rewards for early completion (soft deadlines) – Local improvement w/ simulated annealing, greedy ordering Elastic Cloud – Machine charge = (# of machines) x (# hours) – Speed-up factors w/ more machines (i.e. more parallelism) – Add machines to meet soft deadlines – Aggressive batching to minimize machine charge (efficiency)

Shared Scan Batch Scheduling in Cloud Computing Sample Pig A = load ‘input1' as (a, b, c); B = filter A by a > 5; store B into 'output1'; C = group B by b; store C into 'output2'; Nova Workflow Platform What is Nova? – Content mgmt and workflow scheduling for the Cloud – Leverages existing resources Cloud Data: HDFS/Zebra storage Cloud Computing: Oozie, Pig/MR/Hadoop Users define complex workflows in Oozie that consume the data Storage: HDFS Processing: Hadoop M- R Simple workflow: Oozie Dataflow: Pig Advanced workflow: Nova App 1 App 2 App 3 Oozie Workflow engine for coordinating Map- Reduce/Pig jobs in Hadoop (i.e. Workflow DAG in which nodes are MR tasks and edges are dataflows)

Shared Scan Batch Scheduling in Cloud Computing Sample Nova Workflow cand. entity extractor candidate entity occurrences (url, entity string) crawled pages (url, content) entities (entity id, entity string) validated entity occurrences (url, entity id) entity occurrence counts (entity id, count) crawle r editors join group- wise count output Nova Tasks Nova Task Nova Data

Shared Scan Batch Scheduling in Cloud Computing Shared Scan via Workflow Merging Nova Workflow 1 Nova Workflow 2 c2s0c2s0 c1s0c1s0c3s0c3s0 c4s0c4s0 c2s0c2s0 c5s0c5s0 Workflow Merger Nova Workflow 1.2 (scans c2s0 once) c2s0c2s0 c1s0c1s0c3s0c3s0 c4s0c4s0 c5s0c5s0 Input Data Pig/MR Tasks Output Data Sample Use Cases in Nova – Concurrent research, production, maintenance workflows over same data – Content enrichment workflows (i.e. dedup, clustering) over news content – Webmap workflows consuming same URL table

Shared Scan Batch Scheduling in Cloud Computing Output Data Map 1 Reduce 1 Shuffle Split(Tuple)Split(Tuple) NestedPlanNestedPlan NestedPlanNestedPlan NestedPlanNestedPlan Combine(Tuple)Combine(Tuple) Demux(Tuple)Demux(Tuple) NestedPlanNestedPlan NestedPlanNestedPlan NestedPlanNestedPlan Map 2 Reduce 2 … … Output Data Map n Reduce m Input Data Performance Impact (1) Shared Loading (network, redundant proc.) (2) Consolidated computation (shared startup/tear down) (3) Reducer parallelism (Max/Sum # of reducers) (1) (2) (3)

Shared Scan Batch Scheduling in Cloud Computing Completion Time by Scheduling Strategy Performance in Nova for different enrichment workflows (ie. de-dup) on news content (SIGMOD’11)

Shared Scan Batch Scheduling in Cloud Computing Utilization of Grid Resources (Slot Time)

Shared Scan Batch Scheduling in Cloud Computing PigMix: Load Cost Savings

Shared Scan Batch Scheduling in Cloud Computing PigMix: Estimating Makespan

Shared Scan Batch Scheduling in Cloud Computing Ongoing Work Starvation resistance – Account for heterogeneity in workflow sizes – Provide soft deadline guarantees – Handling cascading failures – Prefer jobs with high load cost (less dilation, high slot time savings, map-only jobs) Predicting workflow runtime and frequency – Robustness to inaccuracies in cost estimates – Conserve or expend Cloud resources based on deadline requirements and system load Jobs that join/scan multiple input sources

Shared Scan Batch Scheduling in Cloud Computing Questions?

Shared Scan Batch Scheduling in Cloud Computing Nova Workflow Platform Nova features – Abstraction for complex workflows that consume data Incrementally arriving data (logs, crawls, feeds,...) Incremental processing of arriving data – Stateless: shingle every newly-crawled page – Stateful: maintain inlink counts as web grows Scheduling processing steps – Periodic: run inlink counter once per week – Triggered: run inlink counter after link extractor – Provides provenance, metadata management, incremental processing (i.e. joins), data replication, transactional guarantees

Shared Scan Batch Scheduling in Cloud Computing PigMix: Reducer Parallelism

Shared Scan Batch Scheduling in Cloud Computing Optimizing for Shared Scan Define a job J (i.e. MapReduce or Pig) – Scans files f(J) = (F 1, …, F i ), scan time per file: s(F i ) – Fixed processing cost c(J) d(J) defines a soft deadline of each job – Step: d defined by n pairs of (t i, p i ) where 0 p i+1 (a job that completes by t i is award p i points) – Linearly decay: enforce eventual completion w/ negative pts Cost of shared scan for Jobs J 1 and J 2 c(J 1 ) + c(J 2 ) + ∑ Fє(f(J1) U f(J2)) s(F) Maximize points and minimize resources – Local improvement w/ simulated annealing, greedy ordering – Aggressive batching when load is high

Shared Scan Batch Scheduling in Cloud Computing Performance Evaluation Experimental Setup – Nova with Shared Scan Module – 200 node Hadoop cluster 128MB HDFS block size 1GB RAM per node 640 mapper and 320 reducer slots – Shingling workflow (offline content enrichment) De-duplication of news Filter and extract features from content Cluster content by feature and pick one per cluster Execution of multiple de-dup workflows using different clustering alg. – Scheduling strategies compared Sequential-NoMerge (slower, conserve Grid resources) Concurrent-NoMerge (fast, elastic Grid resources) Merged (fast, conserve Grid resources)