Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy.

Slides:



Advertisements
Similar presentations
Pricing for Utility-driven Resource Management and Allocation in Clusters Chee Shin Yeo and Rajkumar Buyya Grid Computing and Distributed Systems (GRIDS)
Advertisements

Feedback Control Real-Time Scheduling: Framework, Modeling, and Algorithms Chenyang Lu, John A. Stankovic, Gang Tao, Sang H. Son Presented by Josh Carl.
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Hadi Goudarzi and Massoud Pedram
SLA-Oriented Resource Provisioning for Cloud Computing
Fast Algorithms For Hierarchical Range Histogram Constructions
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Provisioning for Multi-tier Internet Applications Bhuvan Urgaonkar, Prashant.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
Computer Science Deadline Fair Scheduling: Bridging the Theory and Practice of Proportionate-Fair Scheduling in Multiprocessor Servers Abhishek Chandra.
Adaptive Sampling for Sensor Networks Ankur Jain ٭ and Edward Y. Chang University of California, Santa Barbara DMSN 2004.
Chapter 6: CPU Scheduling. 5.2 Silberschatz, Galvin and Gagne ©2005 Operating System Concepts – 7 th Edition, Feb 2, 2005 Chapter 6: CPU Scheduling Basic.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Quantifying the Benefits of Resource Multiplexing in On-Demand Data Centers Abhishek.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
U NIVERSITY OF M ASSACHUSETTS, A MHERST – Department of Computer Science Dynamic Resource Allocation for Shared Data Centers Using Online Measurements.
Fair Scheduling in Web Servers CS 213 Lecture 17 L.N. Bhuyan.
An Adaptive Multi-Objective Scheduling Selection Framework For Continuous Query Processing Timothy M. Sutherland Bradford Pielech Yali Zhu Luping Ding.
Bandwidth Allocation in a Self-Managing Multimedia File Server Vijay Sundaram and Prashant Shenoy Department of Computer Science University of Massachusetts.
Computer Science Cataclysm: Policing Extreme Overloads in Internet Applications Bhuvan Urgaonkar and Prashant Shenoy University of Massachusetts.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
Business Process Performance Prediction on a Tracked Simulation Model Andrei Solomon, Marin Litoiu– York University.
Particle Filtering in Network Tomography
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
OPTIMAL SERVER PROVISIONING AND FREQUENCY ADJUSTMENT IN SERVER CLUSTERS Presented by: Xinying Zheng 09/13/ XINYING ZHENG, YU CAI MICHIGAN TECHNOLOGICAL.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
Silberschatz, Galvin and Gagne  Operating System Concepts Chapter 6: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms.
Budget-based Control for Interactive Services with Partial Execution 1 Yuxiong He, Zihao Ye, Qiang Fu, Sameh Elnikety Microsoft Research.
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
Entities and Objects The major components in a model are entities, entity types are implemented as Java classes The active entities have a life of their.
Performance Prediction for Random Write Reductions: A Case Study in Modelling Shared Memory Programs Ruoming Jin Gagan Agrawal Department of Computer and.
Silberschatz and Galvin  Operating System Concepts Module 5: CPU Scheduling Basic Concepts Scheduling Criteria Scheduling Algorithms Multiple-Processor.
An Optimal Service Ordering for a World Wide Web Server A Presentation for the Fifth INFORMS Telecommunications Conference March 6, 2000 Amy Csizmar Dalal.
1 11/29/2015 Chapter 6: CPU Scheduling l Basic Concepts l Scheduling Criteria l Scheduling Algorithms l Multiple-Processor Scheduling l Real-Time Scheduling.
MROrder: Flexible Job Ordering Optimization for Online MapReduce Workloads School of Computer Engineering Nanyang Technological University 30 th Aug 2013.
1/22 Optimization of Google Cloud Task Processing with Checkpoint-Restart Mechanism Speaker: Sheng Di Coauthors: Yves Robert, Frédéric Vivien, Derrick.
“A cost-based admission control algorithm for digital library multimedia systems storing heterogeneous objects” – I.R. Chen & N. Verma – The Computer Journal.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
03/03/051 Performance Engineering of Software and Distributed Systems Research Activities at IIT Bombay Varsha Apte March 3 rd, 2005.
Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Ensieea Rizwani An energy-efficient management mechanism for large-scale server clusters By: Zhenghua Xue, Dong, Ma, Fan, Mei 1.
Task Mapping and Partition Allocation for Mixed-Criticality Real-Time Systems Domițian Tămaș-Selicean and Paul Pop Technical University of Denmark.
Zeta: Scheduling Interactive Services with Partial Execution Yuxiong He, Sameh Elnikety, James Larus, Chenyu Yan Microsoft Research and Microsoft Bing.
1 Performance Impact of Resource Provisioning on Workflows Gurmeet Singh, Carl Kesselman and Ewa Deelman Information Science Institute University of Southern.
Basic Concepts Maximum CPU utilization obtained with multiprogramming
1 Chapter 5: CPU Scheduling. 2 Basic Concepts Scheduling Criteria Scheduling Algorithms.
Spark on Entropy : A Reliable & Efficient Scheduler for Low-latency Parallel Jobs in Heterogeneous Cloud Huankai Chen PhD Student at University of Kent.
Towards Robust Revenue Management: Capacity Control Using Limited Demand Information Michael Ball, Huina Gao, Yingjie Lan & Itir Karaesmen Robert H Smith.
Optimization of Time-Partitions for Mixed-Criticality Real-Time Distributed Embedded Systems Domițian Tămaș-Selicean and Paul Pop Technical University.
Web Servers load balancing with adjusted health-check time slot.
Resource Provision for Batch and Interactive Workloads in Data Centers Ting-Wei Chang, Pangfeng Liu Department of Computer Science and Information Engineering,
OPERATING SYSTEMS CS 3502 Fall 2017
Analyzing Security and Energy Tradeoffs in Autonomic Capacity Management Wei Wu.
Chapter 6: CPU Scheduling
CPU Scheduling G.Anuradha
Module 5: CPU Scheduling
3: CPU Scheduling Basic Concepts Scheduling Criteria
Chapter5: CPU Scheduling
Chapter 6: CPU Scheduling
Feifei Li, Ching Chang, George Kollios, Azer Bestavros
Lecture 2 Part 3 CPU Scheduling
Uniprocessor scheduling
Operating System , Fall 2000 EA101 W 9:00-10:00 F 9:00-11:00
Realizing Closed-loop, Online Tuning and Control for Configurable-Cache Embedded Systems: Progress and Challenges Islam S. Badreldin*, Ann Gordon-Ross*,
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Chapter 6: CPU Scheduling
Module 5: CPU Scheduling
Presentation transcript:

Dynamic Resource Allocation for Shared Data Centers Using Online Measurements By- Abhishek Chandra, Weibo Gong and Prashant Shenoy

Overview Outline  Motivation  System Model  Dynamic Allocation Techniques  Experimental Results  Conclusions

Motivation  Data Centers  Server farms  Rent computing and storage resources to applications  Revenue for meeting QoS guarantees  Goals  Satisfy application QoS guarantees  Maximize resource utilization of platform  Robustness against “Slashdot” effects  Cluster of servers – Dedicated or Shared  Static Allocation is problematic

Dynamic Resource Allocation  Periodically re-allocate resources among applications  Estimate resource requirements for near future  Challenges  Reallocation at short time-scales  No prior workload profiling/knowledge  Low overhead  Approach: Online Measurement-based Allocation

Research Contribution  Generalized processor sharing (GPS)  Time domain queuing model & Non-linear optimization technique  Prediction algorithm  Synthetic Workloads & Real Web Traces

Problem Formulation  Resource Model  Queue are assumed to be served in FIFO order and the resource capacity C is shared among the queues using GPS  Queue is assigned a weight  Allocated a resource share in proportion to its weight.  GPS Scheduler

 Problem Definition  If denotes the target response time of application and is its observed mean response time, then the application should be allocated a share, such that.  The discontent of an application grows as its response time deviates from the target di. This discontent function can be represented as follows  System goal then is to assign a share to each application such that the total system-wide discontent, i.e., the quantity is minimized.

Dynamic Resource Allocation

Monitoring  Measure system and application metrics  Queue lengths  Request response times  Monitoring windows Adaptation Window History Measurement Interval Time

Allocating  Invoked periodically to dynamically partition the resource capacity among the various applications running on the shared server.  Resource Model Types  Time-domain Queuing Model  Online optimization-based Model

Time Domain Queuing Model  Transient queuing behavior over adaptation window  The request service rate is  Relation between mean response time T ¯ and application share. Average response time in near future:  Relation is parameterized by the measured workload  Arrival rate λ and mean service time s ¯

Optimization-based Resource Allocation  Discontent function  Non-linear Optimization Problem:  Solved using Lagrange multiplier method

Prediction  Short-term prediction of workload characteristics  Request arrival process  Service demand distribution  Use history of measured system metrics

Prediction Techniques  Estimating the Arrival Rate  Accurate estimate of allows the time domain queuing model to estimate the average queue length for the next adaptation window.  We represent Ai at any time by the sequence of values from the measurement history.  To predict, model using the AR(1), a sample value of Ai is estimated as  Estimating the Service Demand  Computes the probability distribution of the per-request service demands  Mean of the distribution is used to represent the service demand of application requests  Measuring the Queue Length  Monitoring module records the no. of outstanding requests at the beginning of each adaptation window.

Experiments  Soccer World Cup’98 Traces  Results based on a 24-hour portion of the trace  755,000 requests  Mean req rate: 8.7 req/sec  Mean req size: 8.47 KB

Experiments Evaluation  Synthetic Web Workload Comparison of static and dynamic resource allocations for a synthetic web workload

 Trace-driven Web Workloads Comparison of static and dynamic resource allocations in the presence of heavy-tailed request sizes and varying arrival rates.

Adaptation to Transient Overloads The workload and the resulting allocations in the presence of varying arrival rates and varying request sizes

Conclusions  Dynamic Resource Allocation needed for data centers  Measurement-based allocation:  Monitoring and Prediction gather online state  Use this state for application modeling and allocation  Results showed that these techniques can judiciously allocate system resources, especially under transient overload conditions

Thank You