Problem Formulation Elastic cloud infrastructures provision resources according to the current actual demand on the infrastructure while enforcing service.

Slides:



Advertisements
Similar presentations
Hadi Goudarzi and Massoud Pedram
Advertisements

Agile Infrastructure built on OpenStack Building The Next Generation Data Center with OpenStack John Griffith, Senior Software Engineer,
Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.
A SLA Framework for QoS Provisioning and Dynamic Capacity Allocation Rahul Garg (IBM India Research Lab), R. S. Randhawa (Stanford University), Huzur Saran.
SLA Basics Describes a set of non functional requirements of the service. Example : RTO time – Return to Operation Time if case of failure SLO – Service.
Green Cloud Computing Hadi Salimi Distributed Systems Lab, School of Computer Engineering, Iran University of Science and Technology,
Proactive Prediction Models for Web Application Resource Provisioning in the Cloud _______________________________ Samuel A. Ajila & Bankole A. Akindele.
CloudScale: Elastic Resource Scaling for Multi-Tenant Cloud Systems Zhiming Shen, Sethuraman Subbiah, Xiaohui Gu, John Wilkes.
Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Marakas: Decision Support Systems, 2nd Edition © 2003, Prentice-Hall Chapter Chapter 4: Modeling Decision Processes Decision Support Systems in the.
1 Abstract This study presents an analysis of two modified fuzzy ARTMAP neural networks. The modifications are first introduced mathematically. Then, the.
What is Cloud Computing? o Cloud computing:- is a style of computing in which dynamically scalable and often virtualized resources are provided as a service.
11 Quantifying Benefits of Traffic Information Provision under Stochastic Demand and Capacity Conditions: A Multi-day Traffic Equilibrium Approach Mingxin.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
Distributed-Dynamic Capacity Contracting: A congestion pricing framework for Diff-Serv Murat Yuksel and Shivkumar Kalyanaraman Rensselaer Polytechnic Institute,
Modeling Quality-Quantity based Communication Orr Srour under the supervision of Ishai Menache.
INTRODUCTION TO CLOUD COMPUTING CS 595 LECTURE 4.
Rohit Ray ESE 251. What are Artificial Neural Networks? ANN are inspired by models of the biological nervous systems such as the brain Novel structure.
SOFTWARE AS A SERVICE PLATFORM AS A SERVICE INFRASTRUCTURE AS A SERVICE.
Presented by : Ran Koretzki. Basic Introduction What are VM’s ? What is migration ? What is Live migration ?
Jiazhang Liu;Yiren Ding Team 8 [10/22/13]. Traditional Database Servers Database Admin DBMS 1.
* Who we are? * Animation Industry, Challenges… * What is Render Cloud Farm? * Render Cloud Farm for Whom? * Scope of Blender? * Types of Rendering farms.
ASK Labs Company We help machines make decisions.
Self-Adaptive QoS Guarantees and Optimization in Clouds Jim (Zhanwen) Li (Carleton University) Murray Woodside (Carleton University) John Chinneck (Carleton.
Capacity Scaling for Elastic Compute Clouds Ahmed Aleyeldin Hassan
A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.
By Saparila Worokinasih
University of Zagreb MMVE 2012 workshop1 Towards Reinterpretation of Interaction Complexity for Load Prediction in Cloud-based MMORPGs Mirko Sužnjević,
Neurons, Neural Networks, and Learning 1. Human brain contains a massively interconnected net of (10 billion) neurons (cortical cells) Biological.
MSE 2400 EaLiCaRA Spring 2015 Dr. Tom Way
Dynamic Resource Allocation Using Virtual Machines for Cloud Computing Environment.
A Cloud is a type of parallel and distributed system consisting of a collection of inter- connected and virtualized computers that are dynamically provisioned.
Predicting performance of applications and infrastructures Tania Lorido 27th May 2011.
Network Aware Resource Allocation in Distributed Clouds.
A Comparison of Nature Inspired Intelligent Optimization Methods in Aerial Spray Deposition Management Lei Wu Master’s Thesis Artificial Intelligence Center.
1 High-Level Carrier Requirements for Cross Layer Optimization Dave McDysan Verizon.
SLA-based Resource Allocation for Software as a Service Provider (SaaS) in Cloud Computing Environments Author Linlin Wu, Saurabh Kumar Garg and Rajkumar.
Service Transition & Planning Service Validation & Testing
© 2009 IBM Corporation 1 Improving Consolidation of Virtual Machines with Risk-aware Bandwidth Oversubscription in Compute Clouds Amir Epstein Joint work.
Autonomic SLA-driven Provisioning for Cloud Applications Nicolas Bonvin, Thanasis Papaioannou, Karl Aberer Presented by Ismail Alan.
Machine Learning Dr. Shazzad Hosain Department of EECS North South Universtiy
Chapter 3 System Performance and Models. 2 Systems and Models The concept of modeling in the study of the dynamic behavior of simple system is be able.
Challenges towards Elastic Power Management in Internet Data Center.
Smita Vijayakumar Qian Zhu Gagan Agrawal 1.  Background  Data Streams  Virtualization  Dynamic Resource Allocation  Accuracy Adaptation  Research.
Modelling Language Evolution Lecture 1: Introduction to Learning Simon Kirby University of Edinburgh Language Evolution & Computation Research Unit.
Job scheduling algorithm based on Berger model in cloud environment Advances in Engineering Software (2011) Baomin Xu,Chunyan Zhao,Enzhao Hua,Bin Hu 2013/1/251.
Managing Server Energy and Operational Costs Chen, Das, Qin, Sivasubramaniam, Wang, Gautam (Penn State) Sigmetrics 2005.
Akram Bitar and Larry Manevitz Department of Computer Science
Look, Ma, No Hardware -Stephanie Schossow. Cisco & VMware  September 16, Industry leaders in virtualization Cisco and VMware® announced that they.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems and Models Chapter 03.
Chapter 4 A First Analysis of Feedback Feedback Control A Feedback Control seeks to bring the measured quantity to its desired value or set-point (also.
Copyright © 2010, Performance and Power Management for Cloud Infrastructures Hien Nguyen Van; Tran, F.D.; Menaud, J.-M. Cloud Computing (CLOUD),
3/12/2013Computer Engg, IIT(BHU)1 CLOUD COMPUTING-1.
ECE 576 – Power System Dynamics and Stability Prof. Tom Overbye Dept. of Electrical and Computer Engineering University of Illinois at Urbana-Champaign.
Dynamic Placement of Virtual Machines for Managing SLA Violations NORMAN BOBROFF, ANDRZEJ KOCHUT, KIRK BEATY SOME SLIDE CONTENT ADAPTED FROM ALEXANDER.
1 Elastically Replicated Information Services: Sustaining the Availability of Distributed Storage Across Dynamic Topological Changes Sponsored by Program.
ANASOFT VIATUS. Challenges Supply chain optimization is necessary for achieving competitive price of final products Synchronization and utilization of.
© 2012 Eucalyptus Systems, Inc. Cloud Computing Introduction Eucalyptus Education Services 2.
The Biologically Inspired Distributed File System: An Emergent Thinker Instantiation Presented by Dr. Ying Lu.
Department of Computer, Control, and Management Engineering Providing Transaction Class-Based QoS in in-Memory Data Grids Via Machine Learning Pierangelo.
Determining How Costs Behave
A Simple Artificial Neuron
Cloud Computing By P.Mahesh
Adaptive Cloud Computing Based Services for Mobile Users
A Framework for Automatic Resource and Accuracy Management in A Cloud Environment Smita Vijayakumar.
Smita Vijayakumar Qian Zhu Gagan Agrawal
Cloud Computing Architecture
CHAPTER I. of EVOLUTIONARY ROBOTICS Stefano Nolfi and Dario Floreano
Akram Bitar and Larry Manevitz Department of Computer Science
Presentation transcript:

Problem Formulation Elastic cloud infrastructures provision resources according to the current actual demand on the infrastructure while enforcing service level agreements (SLAs). Elasticity is the ability of the cloud to rapidly vary the allocated resource capacity to a service according to the current load in order to meet the quality of service (QoS) requirements specified in the SLA agreements. 1

Problem Formulation Problems: – When to release resources allocated to a service? Oscillations & increased costs if done prematurely as (de)allocation is costly. – Can we do better prediction for future demand to predict flash crowds and diurnal/periodical loads? – Which resources to release? Release cost is non-uniform (server consolidation, etc.) The elasticity engine should decrease the costs of operation for the cloud provider while not violating SLAs 2

Our solution -We start by trying to solve the problems one at a time starting with the problem of when to release the resources, i.e. we try to predict the when is it safe to release resources

Problem Formulation Prediction of load/signal/future is not a new problem It arises in a wide range of applications and scientific disciplines – Time series analysis, econometrics, stock markets, biology, control theory… There are solutions to the prediction problem – Neural networks, Kriging models, Fuzzy logic, adaptive control, regression,, etc.

Desired properties Fast – Limited lookahead control is quite accurate, but needs close to 0.5h for control of 15 physical machines and 60 VMs… Reliable (even when the load dynamics and operating conditions change) – PID-controllers reliable for certain load patterns, but breaks/become unstable once the load surges Should not also affect the reliability of the hardware – Reactive controllers are the best choice if you only look at the amount of allocated resources No resources are allocated until needed Resources are de-allocated once not needed But they cause large oscillations. Adaptive to changing workloads Scalable: A cloud will host hundreds of services on tens of thousands of VMs and physical machines. Simple: simplicity is a key to wide scale adoption

Our solution Build an adaptive P-Controller – P-Controllers are one of the simplest types of closed-loop controllers. – The error signal (the difference between the wanted output and the current output) is multiplied by a constant. – This constant changes according to the load dynamics (adaptive gain). – The rate of changing the constant is also adaptive according to the load dyanmics

Results (Fifa’98 traces) Reactive controller Our Controller

More results Regression based Controller Our Controller

Aggregate results PolicyNumber of servers not provisioned on time Number of servers over- provisioned Failure %Over- Provisioning % Reactive %2.2% Our Controller %3.6% Simulation using wikipedia traces from the wikibench project. As comparison, Rackspace SLA pays 10% of the bill back for availability=99.89% as a penalty.

Do we meet our constraints? The algorithm is very fast as the gain calculation depends on simple mathematical operations Adaptive according to load dynamics and thus reliable – Premature release of resources reduced significantly and thus hardware reliability is maintained Scalable: the calculations involved does not depend on the load of the volume or the number of VMs Simple: There is nothing simpler than a P- Controller ! – Except if you use thresholds Adaptive: Even the rate of changing the controller parameter changes