Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.

Slides:



Advertisements
Similar presentations
The Datacenter Needs an Operating System Matei Zaharia, Benjamin Hindman, Andy Konwinski, Ali Ghodsi, Anthony Joseph, Randy Katz, Scott Shenker, Ion Stoica.
Advertisements

Locality-Aware Dynamic VM Reconfiguration on MapReduce Clouds Jongse Park, Daewoo Lee, Bokyeong Kim, Jaehyuk Huh, Seungryoul Maeng.
Beyond Dominant Resource Fairness David Parkes (Harvard) Ariel Procaccia (CMU) Nisarg Shah (CMU)
UC Berkeley Job Scheduling for MapReduce Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Scott Shenker, Ion Stoica 1 RAD Lab, * Facebook Inc.
No Agent Left Behind: Dynamic Fair Division of Multiple Resources Ian Kash 1 Ariel Procaccia 2 Nisarg Shah 2 (Speaker) 1 MSR Cambridge 2 Carnegie Mellon.
THE DATACENTER NEEDS AN OPERATING SYSTEM MATEI ZAHARIA, BENJAMIN HINDMAN, ANDY KONWINSKI, ALI GHODSI, ANTHONY JOSEPH, RANDY KATZ, SCOTT SHENKER, ION STOICA.
Soft Real-Time Semi-Partitioned Scheduling with Restricted Migrations on Uniform Heterogeneous Multiprocessors Kecheng Yang James H. Anderson Dept. of.
Reciprocal Resource Fairness: Towards Cooperative Multiple-Resource Fair Sharing in IaaS Clouds School of Computer Engineering Nanyang Technological University,
Quincy: Fair Scheduling for Distributed Computing Clusters Microsoft Research Silicon Valley SOSP’09 Presented at the Big Data Reading Group by Babu Pillai.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Mesos A Platform for Fine-Grained Resource Sharing in Data Centers Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D. Joseph, Randy.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Delay.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
The Weighted Proportional Allocation Mechanism Milan Vojnović Microsoft Research Joint work with Thành Nguyen Harvard University, Nov 3, 2009.
UC Berkeley Improving MapReduce Performance in Heterogeneous Environments Matei Zaharia, Andy Konwinski, Anthony Joseph, Randy Katz, Ion Stoica University.
Outline | Motivation| Design | Results| Status| Future
DRFQ: Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3, Vyas Sekar 2, Matei Zaharia 1, Ion Stoica 1 1 UC Berkeley, 2 Intel ISTC/Stony.
Network Aware Resource Allocation in Distributed Clouds.
G-commerce Computational economies for resource allocation in computational Grid settings Fundamentals Resources are not free Resources are not free For.
DRFQ: Multi-Resource Fair Queueing for Packet Processing Ali Ghodsi 1,3, Vyas Sekar 2, Matei Zaharia 1, Ion Stoica 1 1 UC Berkeley, 2 Intel ISTC/Stony.
SLA-based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments Author Linlin Wu, Saurabh Kumar Garg and Rajkumar.
Rensselaer Polytechnic Institute Rajagopal Iyengar Combinatorial Approaches to QoS Scheduling in Multichannel Wireless Systems Rajagopal Iyengar Rensselaer.
E VALUATION OF F AIRNESS IN ICN X. De Foy, JC. Zuniga, B. Balazinski InterDigital
Mesos A Platform for Fine-Grained Resource Sharing in the Data Center Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony Joseph, Randy.
Scheduling policies for real- time embedded systems.
August 16, 2010 MPREF’10 Dynamic House Allocation Sujit Gujar 1, James Zou 2 and David C. Parkes 2 5 th Multidisciplinary Workshop on Advances in Preference.
Exploiting Context Analysis for Combining Multiple Entity Resolution Systems -Ramu Bandaru Zhaoqi Chen Dmitri V.kalashnikov Sharad Mehrotra.
Scheduling Periodic Real-Time Tasks with Heterogeneous Reward Requirements I-Hong Hou and P.R. Kumar 1 Presenter: Qixin Wang.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
EECS 262a Advanced Topics in Computer Systems Lecture 13 Resource allocation: Lithe/DRF October 16 th, 2012 John Kubiatowicz and Anthony D. Joseph Electrical.
1 11 Channel Assignment for Maximum Throughput in Multi-Channel Access Point Networks Xiang Luo, Raj Iyengar and Koushik Kar Rensselaer Polytechnic Institute.
Matei Zaharia, Dhruba Borthakur *, Joydeep Sen Sarma *, Khaled Elmeleegy +, Scott Shenker, Ion Stoica UC Berkeley, * Facebook Inc, + Yahoo! Research Fair.
1 Iterative Integer Programming Formulation for Robust Resource Allocation in Dynamic Real-Time Systems Sethavidh Gertphol and Viktor K. Prasanna University.
Towards Economic Fairness for Big Data Processing in Pay-as-you-go Cloud Computing Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
June 4, 2003EE384Y1 Demand Based Rate Allocation Arpita Ghosh and James Mammen {arpitag, EE 384Y Project 4 th June, 2003.
Shanjiang Tang, Bu-Sung Lee, Bingsheng He, Haikun Liu School of Computer Engineering Nanyang Technological University Long-Term Resource Fairness Towards.
Dynamic Slot Allocation Technique for MapReduce Clusters School of Computer Engineering Nanyang Technological University 25 th Sept 2013 Shanjiang Tang,
Operational Research & ManagementOperations Scheduling Economic Lot Scheduling 1.Summary Machine Scheduling 2.ELSP (one item, multiple items) 3.Arbitrary.
DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.
Multi-Resource Packing for Cluster Schedulers Robert Grandl Aditya Akella Srikanth Kandula Ganesh Ananthanarayanan Sriram Rao.
A Platform for Fine-Grained Resource Sharing in the Data Center
John Kubiatowicz and Anthony D. Joseph
Presented by Qifan Pu With many slides from Ali’s NSDI talk Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion Stoica.
Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion.
Part III BigData Analysis Tools (YARN) Yuan Xue
PACMan: Coordinated Memory Caching for Parallel Jobs Ganesh Ananthanarayanan, Ali Ghodsi, Andrew Wang, Dhruba Borthakur, Srikanth Kandula, Scott Shenker,
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center NSDI 11’ Benjamin Hindman, Andy Konwinski, Matei Zaharia, Ali Ghodsi, Anthony D.
Resilient Distributed Datasets A Fault-Tolerant Abstraction for In-Memory Cluster Computing Matei Zaharia, Mosharaf Chowdhury, Tathagata Das, Ankur Dave,
COS 518: Advanced Computer Systems Lecture 13 Michael Freedman
Hierarchical Scheduling for Diverse Datacenter Workloads
COS 418: Distributed Systems Lecture 23 Michael Freedman
Tao Zhu1,2, Chengchun Shu1, Haiyan Yu1
Mesos: A Platform for Fine-Grained Resource Sharing in the Data Center
MESOS.
Lottery Scheduling and Dominant Resource Fairness (Lecture 24, cs262a)
Edinburgh Napier University
CS 425 / ECE 428 Distributed Systems Fall 2016 Nov 10, 2016
EECS 262a Advanced Topics in Computer Systems Lecture 13 Resource allocation: Lithe/DRF March 7th, 2016 John Kubiatowicz Electrical Engineering and Computer.
CS 425 / ECE 428 Distributed Systems Fall 2017 Nov 16, 2017
PA an Coordinated Memory Caching for Parallel Jobs
EECS 582 Final Review Mosharaf Chowdhury EECS 582 – F16.
COS 518: Advanced Computer Systems Lecture 14 Michael Freedman
Shanjiang Tang1, Bingsheng He2, Shuhao Zhang2,4, Zhaojie Niu3
Cloud Computing Large-scale Resource Management
Cloud Computing MapReduce in Heterogeneous Environments
Uniprocessor scheduling
Presentation transcript:

Dominant Resource Fairness: Fair Allocation of Multiple Resource Types Ali Ghodsi, Matei Zaharia, Benjamin Hindman, Andy Konwinski, Scott Shenker, Ion Stoica University of California, Berkeley nsdi’11

Introduction Resource allocation is a key building block of any shared computer system. One of the most popular allocation policies proposed so far has been max-min fairness,which maximizes the minimum allocation received by a user in the system. The focus has so far been primarily on a single resource type, or allocate resources at the level of fixed-size partitions of the nodes, called slots.

Introduction In this paper, we address the problem of fair allocation of multiple types of resources to users with heterogeneous demands This paper proposes Dominant Resource Fairness (DRF), a generalization of max-min fairness for multiple resources. For example: if user A runs CPU-heavy tasks and user B runs memory-heavy tasks, DRF attempts to equalize user A’s share of CPUs with user B’s share of memory.

Introduction The strength of DRF lies in the properties it satisfies.

Motivation

Motivation

Motivation Figure1 shows that though the majority of tasks are CPU-heavy, there exist tasks that are memory heavy as well, especially for reduce operations. Figure 2 shows that most of the tasks either underutilize or overutilize some of their slot resources.

Allocation Properties 1. Sharing incentive: Each user should be better off sharing the cluster, than exclusively using her own partition of the cluster. Consider a cluster with identical nodes and n users. Then a user should not be able to allocate more tasks in a cluster partition consisting of 1/n of all resources.

Allocation Properties 2. Strategy-proofness: Users should not be able to benefit by lying about their resource demands. This provides incentive compatibility, as a user cannot improve her allocation by lying.

Allocation Properties 3. Envy-freeness: A user should not prefer the allocation of another user. 4. Pareto efficiency: It should not be possible to increase the allocation of a user without decreasing the allocation of at least another user.

Allocation Properties we believe that the strategy-proofness and sharing incentive properties,are of special importance in datacenter environments. For example, one of Yahoo!’s Hadoop MapReduce datacenters has different numbers of slots for map and reduce tasks. (strategy-proofness) Another big search company provided dedicated machines for jobs only if the users could guarantee high utilization.(strategy-proofness)

Some other nice-to-have properties Single resource fairness: For a single resource, the solution should reduce to max-min fairness. Bottleneck fairness: If there is one resource that is percent-wise demanded most of by every user, then the solution should reduce to max-min fairness for that resource.

Some other nice-to-have properties Population monotonicity: When a user leaves the system and relinquishes her resources, none of the allocations of the remaining users should decrease. Resource monotonicity: If more resources are added to the system, none of the allocations of the existing users should decrease.

Dominant Resource Fairness (DRF) For every user, DRF computes the share of each resource allocated to that user. The maximum among all shares of a user is called that user’s dominant share. The resource corresponding to the dominant share is called the dominant resource.

An Example a system with of 9 CPUs, 18 GB RAM Two users, where user A runs tasks with demand vector, and user B runs tasks with demand vector each. Dominant share: A:2/9 (memory) B:1/3 (CPU) With this allocation, each user ends up with the same dominant share, i.e., user A gets 2/3 of RAM, while user B gets 2/3 of the CPUs.

An Example

This allocation can be computed mathematically as follows. Let x and y be the number of tasks allocated by DRF to users A and B Solving this problem yields2 x = 3 and y = 2. User A gets and B gets

Alternative Fair Allocation Policies Asset Fairness: The idea behind Asset Fairness is that equal shares of different resources are worth the same, i.e., that 1% of all CPUs worth is the same as 1% of memory and 1% of I/O bandwidth.

Alternative Fair Allocation Policies Consider the example Since there are twice as many GB of RAM as CPUs (i.e., 9 CPUs and 18 GB RAM), one CPU is worth twice as much as one GB of RAM.

Alternative Fair Allocation Policies Solving the above problem yields x = 2:52 and y = 2:16. Thus, user A gets while user B gets Simple but violate the sharing incentive property

Alternative Fair Allocation Policies Competitive Equilibrium from Equal Incomes With CEEI, each user receives initially 1/n of every resource, and subsequently, each user trades her resources with other users in a perfectly competitive market Maximizes where ui(ai) is the utility that user i gets from her allocation ai. To simplify the comparison, we assume that the utility that a user gets from her allocation is simply her dominant share, si.

Alternative Fair Allocation Policies Solving the above problem yields x = 45/11 and y = 18/11. Thus, user A gets while user B gets But it is not strategy-proof

Comparison with DRF

Fairness Properties

Properties Violated by Asset Fairness Theorem1. Asset Fairness violates the sharing incentive property. Proof: two users in a system with total resources D1 = D2 = user1: 6 tasks user2: 12tasks user1 receives resources user2 receives resources

Properties Violated by Asset Fairness

Theorem 2. Asset Fairness violates the bottleneck fairness property. Proof: total resource : D1 = and D2 = Asset fairness will give each user 3 tasks However, this only gives the first user 3/7 of resource 1

Properties Violated by CEEI Theorem 4. CEEI is not strategy-proof. Proof: Total resources D1 = D2 = CEEI allocates 3.2 and 48.8 tasks to each user if: D1 = D2 = CEEI allocates 4.2 and 33.3 tasks to each user

Theorem 6. No allocation policy that satisfies the sharing incentive and Pareto efficiency properties can also satisfy resource monotonicity. Proof: Total resources: D1 = D2 = (sharing incentive, Pareto dfficiency) if total resources change to The only feasible allocation that satisfies the sharing incentive is to give both users half of resource 1

Discrete Resource Allocation Max-task: Theorem 7. In the discrete scenario, it is possible to allocate resources such that the difference between the allocations of any two users is bounded by one max- task compared to the continuous allocation scenario.

Experimental Results We evaluate DRF with respect to two alternative schemes: slot-based fair scheduling (a common policy in current systems, such as the Hadoop Fair Scheduler ) and (max-min) fair sharing applied only to a single resource (CPU).

Experimental Results We ran a 48-node cluster on EC2 instances with 8 CPU cores and 6 GB RAM each two classes of users(total 8 users): small jobs with task demands large jobs with task demands For the slot-based allocation scheme, we varied the number of slots per machine from 3 to 6

Experimental Results

Simulation

Simulation

Conclusion We have introduced Dominant Resource Fairness (DRF), a fair sharing model that generalizes max-min fairness to multiple resource types. allows cluster schedulers to take into account the heterogeneous demands of datacenter applications Fairer, high utilization, properties We have evaluated DRF by implementing it and shown that it can lead to better overall performance than the slot-based fair schedulers that are commonly in use today.