Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.

Slides:



Advertisements
Similar presentations
Key Metrics for Effective Storage Performance and Capacity Reporting.
Advertisements

Autonomic Scaling of Cloud Computing Resources
Virtual Memory (II) CSCI 444/544 Operating Systems Fall 2008.
University of Minnesota Optimizing MapReduce Provisioning in the Cloud Michael Cardosa, Aameek Singh†, Himabindu Pucha†, Abhishek Chandra
Hadi Goudarzi and Massoud Pedram
Resource Management §A resource can be a logical, such as a shared file, or physical, such as a CPU (a node of the distributed system). One of the functions.
SLA-Oriented Resource Provisioning for Cloud Computing
Walter Binder University of Lugano, Switzerland Niranjan Suri IHMC, Florida, USA Green Computing: Energy Consumption Optimized Service Hosting.
Virtual Memory Introduction to Operating Systems: Module 9.
A SLA Framework for QoS Provisioning and Dynamic Capacity Allocation Rahul Garg (IBM India Research Lab), R. S. Randhawa (Stanford University), Huzur Saran.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
Automatic Resource Scaling for Web Applications in the Cloud Ching-Chi Lin Institute of Information Science, Academia Sinica Department of Computer Science.
Kuang-Hao Liu et al Presented by Xin Che 11/18/09.
SLA-aware Virtual Resource Management for Cloud Infrastructures
1 On Constructing Efficient Shared Decision Trees for Multiple Packet Filters Author: Bo Zhang T. S. Eugene Ng Publisher: IEEE INFOCOM 2010 Presenter:
OS Fall ’ 02 Performance Evaluation Operating Systems Fall 2002.
Performance Evaluation
Fair Scheduling in Web Servers CS 213 Lecture 17 L.N. Bhuyan.
1 Introduction to Load Balancing: l Definition of Distributed systems. Collection of independent loosely coupled computing resources. l Load Balancing.
Proteus: Power Proportional Memory Cache Cluster in Data Centers Shen Li, Shiguang Wang, Fan Yang, Shaohan Hu, Fatemeh Saremi, Tarek Abdelzaher.
By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and
New Challenges in Cloud Datacenter Monitoring and Management
Host Load Prediction in a Google Compute Cloud with a Bayesian Model Sheng Di 1, Derrick Kondo 1, Walfredo Cirne 2 1 INRIA 2 Google.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
AUTONOMOUS RESOURCE PROVISIONING FOR MULTI-SERVICE WEB APPLICATIONS Jiang Dejun,Guillaume Pierre,Chi-Hung Chi WWW '10 Proceedings of the 19th international.
MobSched: An Optimizable Scheduler for Mobile Cloud Computing S. SindiaS. GaoB. Black A.LimV. D. AgrawalP. Agrawal Auburn University, Auburn, AL 45 th.
Adaptive Control of Virtualized Resources in Utility Computing Environments HP Labs: Xiaoyun Zhu, Mustafa Uysal, Zhikui Wang, Sharad Singhal University.
Department of Computer Science Engineering SRM University
A Shaft Sensorless Control for PMSM Using Direct Neural Network Adaptive Observer Authors: Guo Qingding Luo Ruifu Wang Limei IEEE IECON 22 nd International.
Click to add text TWA Cloud Integration with Tivoli Service Automation Manager TWS Education.
Parallel Programming Models Jihad El-Sana These slides are based on the book: Introduction to Parallel Computing, Blaise Barney, Lawrence Livermore National.
November , 2009SERVICE COMPUTATION 2009 Analysis of Energy Efficiency in Clouds H. AbdelSalamK. Maly R. MukkamalaM. Zubair Department.
Cloud Computing Energy efficient cloud computing Keke Chen.
Heterogeneity and Dynamicity of Clouds at Scale: Google Trace Analysis [1] 4/24/2014 Presented by: Rakesh Kumar [1 ]
Lecture 2 Process Concepts, Performance Measures and Evaluation Techniques.
SLA-based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments Author Linlin Wu, Saurabh Kumar Garg and Rajkumar.
An Autonomic Framework in Cloud Environment Jiedan Zhu Advisor: Prof. Gagan Agrawal.
RECON: A TOOL TO RECOMMEND DYNAMIC SERVER CONSOLIDATION IN MULTI-CLUSTER DATACENTERS Anindya Neogi IEEE Network Operations and Management Symposium, 2008.
Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.
Dynamic Resource Monitoring and Allocation in a virtualized environment.
Various topics Petter Mostad Overview Epidemiology Study types / data types Econometrics Time series data More about sampling –Estimation.
Energy Aware Consolidation for Cloud Computing Srikanaiah, Kansal, Zhao Usenix HotPower 2008.
Problem Formulation Elastic cloud infrastructures provision resources according to the current actual demand on the infrastructure while enforcing service.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
Embedded System Lab 김해천 Thread and Memory Placement on NUMA Systems: Asymmetry Matters.
Managing Server Energy and Operational Costs Chen, Das, Qin, Sivasubramaniam, Wang, Gautam (Penn State) Sigmetrics 2005.
Distributed Information Systems. Motivation ● To understand the problems that Web services try to solve it is helpful to understand how distributed information.
VGreen: A System for Energy Efficient Manager in Virtualized Environments G. Dhiman, G Marchetti, T Rosing ISLPED 2009.
Handling Session Classes for Predicting ASP.NET Performance Metrics Ágnes Bogárdi-Mészöly, Tihamér Levendovszky, Hassan Charaf Budapest University of Technology.
Authors: Mianyu Wang, Nagarajan Kandasamy, Allon Guez, and Moshe Kam Proceedings of the 3 rd International Conference on Autonomic Computing, ICAC 2006,
June 30 - July 2, 2009AIMS 2009 Towards Energy Efficient Change Management in A Cloud Computing Environment: A Pro-Active Approach H. AbdelSalamK. Maly.
Modeling Virtualized Environments in Simalytic ® Models by Computing Missing Service Demand Parameters CMG2009 Paper 9103, December 11, 2009 Dr. Tim R.
1 Hidra: History Based Dynamic Resource Allocation For Server Clusters Jayanth Gummaraju 1 and Yoshio Turner 2 1 Stanford University, CA, USA 2 Hewlett-Packard.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
Module 4 Forecasting Multiple Variables from their own Histories EC 827.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
Developing resource consolidation frameworks for moldable virtual machines in clouds Author: Liang He, Deqing Zou, Zhang Zhang, etc Presenter: Weida Zhong.
Online Parameter Optimization for Elastic Data Stream Processing Thomas Heinze, Lars Roediger, Yuanzhen Ji, Zbigniew Jerzak (SAP SE) Andreas Meister (University.
Web Servers load balancing with adjusted health-check time slot.
Introduction to Load Balancing:
Smita Vijayakumar Qian Zhu Gagan Agrawal
CPU SCHEDULING.
From Rivulets to Rivers: Elastic Stream Processing in Heron
COMP755 Advanced Operating Systems
Reinforcement Learning (2)
Presentation transcript:

Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference on Cloud Computing (Cloud 2011)

Agenda Introduction Related Work Challenge Solution Evaluation Conclusion Comment

Introduction (1/4) Typically customers maintain SLAs with service providers for the QoS properties. Failure to comply with satisfying these QoS metrics leads to a major loss of revenue in the form of decreased user base Catering to the SLA while still keeping costs low is challenging for such enterprise systems

Introduction (2/4) – naïve method

Introduction (3/4) A problem with such a resource allocation scheme Is the chance of thrashing where due to frequent variation of workload, machines can be added and released on every sample A desirable solution would require an ability to predict the incoming workload on the system and allocate resources a priori

Introduction (4/4) autoscaling the resources in a cloud environment is not an easy and straightforward task. (i) overheads related to state transition when number of resources are changed (ii) ability to accurately predict future workload (iii) compute the right number of resources required for the expected increase or decrease in workload.

Related Work (1/2) (1) Heuristics-based virtual machine allocation and migration Urgaonkar et. al. [2], 2008 VM, dynamic provisioning, Queueing model Only a single VM can be run in a host Wood et. al. [3], 2007 VM, dynamic migration define a unique metric based on the consumption data of the three resources to make the migration decision CPU, network and memory Cunha et. al. [4], 2007 Queueing model Pricing Model that gives rewards for throughput to be within SLA limits and penalty for throughput going above

Related Work (2/2) (2) Autonomic management of virtual computing environment using control-theoretic approaches: Wang et. al. [6] A load balancing controller VMs are all load-balanced and the response time of the applications in all the VMs are the same Moreno et. al. [7] An architecture for elastic management of cluster-based services Waheed et. al. [8] Reactive algorithm to allocate resources to a cluster farm Yang et. al [9] Profiled based approach

Challenge (1/4) Challenge 1: Workload Forecasting Correctness of prediction Releasing resources is easy, but.. Acquiring resources Make a call on the cloud API which starts the acquisition process The machine will be needed to boot up with the specified image The application need to be started

Challenge (2/4) Challenge 2: Identify Resource Requirement for Incoming Load The required number of resources is a function of the number of customers the nature of the application the type of calls that each customer makes on the application

Challenge (3/4) Challenge 3: Resource Allocation while Optimizing Multiple Cost Factors To optimize resource usage and/or minimize idle resources define a time interval and change resources as many times as possible as workload changes. In the limit, this interval could be made infinitesimally small and resources are changed continuously in accordance with the change in load

Challenge (4/4) Obviously, such as scheme is not possible the overhead in allocating a resource scaling up or down resources also involves cost and needs to be optimized

Solution (1/9) Control theory offers a promising methodology to address the challenges

Solution (2/9) 1. For every future time step, it computes the cost of selecting each possible resource allocation 2. To compute the cost of a particular allocation, it uses Algorithm 1 to compute the estimated response time for that particular machine configuration 3. Once the response time is calculated, it is used to calculate the cost of the allocation which is a combination of how far the estimated response time is from the SLA bounds (SLA violation) cost of leasing additional machines and also a cost of re-configuration

Solution (3/9) A. Workload Prediction Authors used a second order autoregressive moving average method (ARMA) filter for the workload The value for the variables β and γ are given by the values 0.8 and 0.15 ARMA models are widely used for prediction of economic and industrial time series

ARMA Autoregressive Model (AR) a model depends on the level of the lagged observations For example, if we observe a high realisation of GDP we would expect that the GDP in the next few periods are high as well

ARMA Moving Average Model (MA) model that the observations of a random variable at time t are not only affected by the shock at time t, but also the shocks of prior periods Ex. if we observe a negative shock to the economy, say, 9/11, then we would expect that the negative effect affects the economy also for the near future.

ARMA Autoregressive Moving Average Model combine both models we get a ARMA(p,q) model ARMA models are widely used for prediction of economic and industrial time series

Solution (4/9)

Solution (5/9) B. Performance Model The next challenge we resolve is identifying resource requirements for the predicted workload The workload used in this work is the number of users currently in the system. It also depends upon what each user does. In prior work [20] we have used Customer Behavior Modeling Graphs (CBMG) (?) to model the overall behavior of customers

Solution (6/9) A CBMG is built from a log of previous customer behavior and computes the probability of a typical user to visit each page Using this information, we can calculate the number of visits to a single page from the total number of customers in the system. The number of visits to each page helps in calculating the average load on each page.

Solution (7/9)

Solution (8/9) C. Optimizing Resource Provisioning The intuition is to identify the right number of time intervals Our solution works on look-ahead optimization iteratively solves an optimization problem, Costopt, starting from t0

Solution (9/9) The actual algorithm is not described here The next challenge is the choice of the look-ahead period. A small look-ahead period will neglect trends A very large period will increase computational complexity The actual algorithm is not described here because the implementation requires recursive data structures is difficult to describe in the limited space available.

Evaluation (1/7) Cost Function

Evaluation (2/7) Just in time Resource Allocation the weights on each component of the cost function is the same

Evaluation (3/7) -- Resource Usage under Different Cost Priorities 1) SLA violation against Resource Cost The ratio of SLA penalty to machine cost is varied from 4 : 1 to 1 : 13

Evaluation (4/7) -- Resource Usage under Different Cost Priorities

Evaluation (5/7) -- Resource Usage under Different Cost Priorities

Evaluation (6/7) -- Resource Usage under Different Cost Priorities 2) Including the Cost of Reconfiguration

Evaluation (7/7) -- Resource Usage under Different Cost Priorities

Conclusion this paper describes a look-ahead resource allocation algorithm based on model predictive control predicts future workload adjusts resources allocated to users ahead-of-time

Comments The detail of the model in the paper is too simple I cannot understand why the authors did these evaluations The paper use control theory and it seems to have a good prediction of workload Something in 3 challenges the overhead of allocating resources the prediction interval the costs of SLA violation, reconfiguring machine….