Power and Performance Modeling in a Virtualized Server System M. Pedram and I. Hwang Department of Electrical Engineering Univ. of Southern California.

Slides:



Advertisements
Similar presentations
Virtualization Dr. Michael L. Collard
Advertisements

Virtual Machine Technology Dr. Gregor von Laszewski Dr. Lizhe Wang.
KAIST Computer Architecture Lab. The Effect of Multi-core on HPC Applications in Virtualized Systems Jaeung Han¹, Jeongseob Ahn¹, Changdae Kim¹, Youngjin.
Hadi Goudarzi and Massoud Pedram
Virtualization and Cloud Computing. Definition Virtualization is the ability to run multiple operating systems on a single physical system and share the.
VSphere vs. Hyper-V Metron Performance Showdown. Objectives Architecture Available metrics Challenges in virtual environments Test environment and methods.
XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.
Virtual Machine Security Design of Secure Operating Systems Summer 2012 Presented By: Musaad Alzahrani.
© 2008 Hewlett-Packard Development Company, L.P. The information contained herein is subject to change without notice In search of a virtual yardstick:
By Aaron Thomas. Quick Network Protocol Intro. Layers 1- 3 of the 7 layer OSI Open System Interconnection Reference Model  Layer 1 Physical Transmission.
Towards High-Availability for IP Telephony using Virtual Machines Devdutt Patnaik, Ashish Bijlani and Vishal K Singh.
CS-3013 & CS-502, Summer 2006 Virtual Machine Systems1 CS-502 Operating Systems Slides excerpted from Silbershatz, Ch. 2.
Cloud Computing and Virtualization Sorav Bansal CloudCamp 2010 IIT Delhi.
Server Virtualization Gina Myers. Definition Creating virtual machines (VMs) “VMs are software entities that emulate a real machine’s functionality” ◦
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Virtual Machines. Virtualization Virtualization deals with “extending or replacing an existing interface so as to mimic the behavior of another system”
Virtualization for Cloud Computing
Virtualization Performance H. Reza Taheri Senior Staff Eng. VMware.
Resource Management in Virtualization-based Data Centers Bhuvan Urgaonkar Computer Systems Laboratory Pennsylvania State University Bhuvan Urgaonkar Computer.
VMware vSphere 4 Introduction. Agenda VMware vSphere Virtualization Technology vMotion Storage vMotion Snapshot High Availability DRS Resource Pools Monitoring.
PowerVM and VMware. What this presentation is Basic Terms that can be used to discuss multiple forms of virtualization Concepts common to virtualization.
Virtualization Technology Prof D M Dhamdhere CSE Department IIT Bombay Moving towards Virtualization… Department of Computer Science and Engineering, IIT.
Tanenbaum 8.3 See references
1 Scheduling I/O in Virtual Machine Monitors© 2008 Diego Ongaro Scheduling I/O in Virtual Machine Monitors Diego Ongaro, Alan L. Cox, and Scott Rixner.
Virtualization Virtualization is the creation of substitutes for real resources – abstraction of real resources Users/Applications are typically unaware.
Green IT and Data Centers Darshan R. Kapadia Gregor von Laszewski 1.
Cellular Disco: resource management using virtual clusters on shared memory multiprocessors Published in ACM 1999 by K.Govil, D. Teodosiu,Y. Huang, M.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
SAIGONTECH COPPERATIVE EDUCATION NETWORKING Spring 2010 Seminar #1 VIRTUALIZATION EVERYWHERE.
SAIGONTECH COPPERATIVE EDUCATION NETWORKING Spring 2009 Seminar #1 VIRTUALIZATION EVERYWHERE.
Cloud Computing Energy efficient cloud computing Keke Chen.
Secure & flexible monitoring of virtual machine University of Mazandran Science & Tecnology By : Esmaill Khanlarpour January.
การติดตั้งและทดสอบการทำคลัสเต อร์เสมือนบน Xen, ROCKS, และไท ยกริด Roll Implementation of Virtualization Clusters based on Xen, ROCKS, and ThaiGrid Roll.
Virtualization Paul Krzyzanowski Distributed Systems Except as otherwise noted, the content of this presentation is licensed.
Improving Network I/O Virtualization for Cloud Computing.
Virtual Machine and its Role in Distributed Systems.
Our work on virtualization Chen Haogang, Wang Xiaolin {hchen, Institute of Network and Information Systems School of Electrical Engineering.
Xen (Virtual Machine Monitor) Operating systems laboratory Esmail asyabi- April 2015.
High Performance Computing on Virtualized Environments Ganesh Thiagarajan Fall 2014 Instructor: Yuzhe(Richard) Tang Syracuse University.
Power-Aware Scheduling of Virtual Machines in DVFS-enabled Clusters
1 Kansas City CMG, 2005 Ethan Bolker and Yiping Ding October, 2005 Virtual performance won't do : Capacity planning for virtual systems.
OPERATING SYSTEMS CS 3530 Summer 2014 Systems with Multi-programming Chapter 4.
A. Frank - P. Weisberg Operating Systems Structure of Operating Systems.
Full and Para Virtualization
Enabling Technologies for Distributed Computing Dr. Sanjay P. Ahuja, Ph.D. Fidelity National Financial Distinguished Professor of CIS School of Computing,
Ian Gable HEPiX Spring 2009, Umeå 1 VM CPU Benchmarking the HEPiX Way Manfred Alef, Ian Gable FZK Karlsruhe University of Victoria May 28, 2009.
Virtualization One computer can do the job of multiple computers, by sharing the resources of a single computer across multiple environments. Turning hardware.
Operating-System Structures
Technical Reading Report Virtual Power: Coordinated Power Management in Virtualized Enterprise Environment Paper by: Ripal Nathuji & Karsten Schwan from.
Research on Embedded Hypervisor Scheduler Techniques 2014/10/02 1.
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
E Virtual Machines Lecture 1 What is Virtualization? Scott Devine VMware, Inc.
Unit 2 VIRTUALISATION. Unit 2 - Syllabus Basics of Virtualization Types of Virtualization Implementation Levels of Virtualization Virtualization Structures.
Multi-Core CPUs Matt Kuehn. Roadmap ► Intel vs AMD ► Early multi-core processors ► Threads vs Physical Cores ► Multithreading and Multi-core processing.
1 Automated Power Management Through Virtualization Anne Holler, VMware Anil Kapur, VMware.
Open Source Virtualization Andrey Meganov RHCA, RHCX Consultant / VDEL
Virtualization for Cloud Computing
Virtualization.
OPERATING SYSTEMS CS 3502 Fall 2017
Virtualization Dr. Michael L. Collard
Virtualization Virtualization is the creation of substitutes for real resources – abstraction of real resources Users/Applications are typically unaware.
Virtual Servers.
1. 2 VIRTUAL MACHINES By: Satya Prasanna Mallick Reg.No
Comparison of the Three CPU Schedulers in Xen
Virtualization Virtualization is the creation of substitutes for real resources – abstraction of real resources Users/Applications are typically unaware.
Hui Chen, Shinan Wang and Weisong Shi Wayne State University
Virtualization Techniques
Virtual machines benefits
Virtualization Dr. S. R. Ahmed.
Xing Pu21 Ling Liu1 Yiduo Mei31 Sankaran Sivathanu1 Younggyun Koh1
Presentation transcript:

Power and Performance Modeling in a Virtualized Server System M. Pedram and I. Hwang Department of Electrical Engineering Univ. of Southern California

Outline Study and its purpose Motivation Concepts and Definitions Methodology Experimental system setup Simulation results Power modeling Performance modeling Power-performance tradeoff analysis

Motivation  Modern data centers consume large amounts of energy and put a lot of stress on the aging Power Grid  Average utilization level of enterprise servers in a typical datacenter is quite low  Service level agreements (SLAs) between clients and datacenter operators are critically important esp. for hosting centers  Accurate models of power and performance are essential for dynamic resource provisioning and allocation as well as power/thermal management

Virtualization  Virtualization is disassociating the tight bond between software and hardware by introducing a hypervisor between the OS and hardware  One can then use the same hardware to serve up the needs of the different software servers: Oracle, MS SQL Server, Exchange, Dynamics CRM, etc.  It is also possible to run different operating systems so one could run MS SQL Server 2008 on Windows 2008 Server and run Oracle on Linux all running on the same hardware  By doing this, the resources will be better utilized since we can easily add/migrate virtual machines  Examples include Microsoft Hyper-V, VMware ESX Server 3.5, Linux Xen hypervisor

Virtualization Cont’d

Concepts and Definitions  Processor: An integrated circuit containing possibly multiple cores, caches, memory and other I/O interfaces  Hypervisor (Virtual Machine Monitor): Essentially a hardware platform virtualization software with which one can run different OS on the same hardware at the same time  VMM is responsible for CPU scheduling and memory partitioning of the various virtual machines running on the hardware device; Also it is the software that allocates basic machine resources including CPU time and memory  Fully-virtualized: A virtual machine not aware of its virtualization (needs no OS support, is fully transparent but has high overhead)  Para-virtualized: A virtual machine running on a hypervisor that is aware of it being virtualized

Definitions Cont’d  Domain 0, the Privileged Domain (Dom0): Privileged guest running on the hypervisor with direct hardware access and guest management responsibilities  Multiple Domain U, Unprivileged Domain Guests (DomU): Unprivileged guests running on the hypervisor; they have no direct access to hardware  Virtual machine (VM): Same as a domain  (Physical) CPU : A physical core in a processor  Virtual CPU: (one or more per VM) – maybe a process, request, or job that must run on a CPU  Each CPU manages a local run queue of runnable VCPUs; This queue is sorted by VCPU priority  CPU utilization: Total of utilization of all CPUs in the system  E.g., 400% for 4 physical CPUs that are fully utilized

Methodology

Workload Generator  Generate tasks for load the system  TCP/IP based structure  Mimic web-based services  Schedule various types of tasks  CPU/IO/MEM bound, inter-arrival time, etc

Experimental System Setup  Each processor is equipped with "Demand Based Switching" power management  We cut the (12V) power line for the processor chip and measure DC power dissipation  Processors  Voltage regulator  Model: Intel Dual Xeon E5400 series; Code Name: Harpertown  Two processor system; Each processor is quad-core  The chip supports two frequency levels: 2.0GHz, 2.33GHz  Two cores in a socket must run the exact same frequency

Test Cases: Taxonomy Case # of vCPUs ID of pCPUs Processor 1Processor ,2,4,6LLLL 240,2,4LLLL 74 HHHH 1310LLLL 1840,1HHHHHHHH  Variables: # of vCPUs, set of active pCPUs, and their freq. levels

Complete set of Test Cases Case # of vCPUs ID of pCPUs Processor 1Processor ,2,4,6LLLL 240,2,4LLLL 340,2LLLL 440,4LLLL 540LLLL 640,2,4,6HHHH 740,2,4HHHH 840,2HHHH 940,4HHHH 1040HHHH 1140,2LLHH 1240,4LLHH 1310LLLL 1410HHHH 1540,2LLLLLLLL 1640,1HHHHLLLL 1740,2HHHHLLLL 1840,1HHHHHHHH

Power Measurement Results  Linear model  Cluster 1: cases 1-5, 11  Low frequency  Cluster 2: cases 6-10, 12  High & mixed frequency Case ID of pCPU Processor 1 Case ID of pCPU Processor ,2,4,6LLLL6 HHHH 20,2,4LLLL7 HHHH 30,2LLLL8 HHHH 40,4LLLL9 HHHH 50LLLL100HHHH 110,2LLHH120,4LLHH

Power Analysis Observation 1: When all active CPUs are running at the low frequency level, the processor power dissipation is nearly independent of which subset of CPUs is used by the running domain (cf. cases 1 through 5, and 11) Even if some of inactive CPUs are running at a high frequency, they do not make any difference (case 3 vs. case 11) Core consolidation is ineffective for power saving CaseabR squared Case # of vCPUs ID of pCPUs Processor ,2,4,6LLLL 240,2,4LLLL 340,2LLLL 440,4LLLL 540LLLL 1140,2LLHH

Power Analysis Cont’d Observation 2: When all active CPUs are running at high frequency, processor power dissipation is only weakly dependent on the subset of CPUs used by the running domain (cf. cases 6 through 10) caseabR squared Case # of vCPUs ID of pCPUs Processor ,2,4,6HHHH 740,2,4HHHH 840,2HHHH 940,4HHHH 1040HHHH Power slope increases as # of pCPUs decreases There is a small change in power offset, but this is mainly caused by regression error Core consolidation is ineffective for power saving

Power Analysis Cont’d Observation 3: Power dissipation for the case where active CPUs are running at different frequency levels is similar to that of the case that has the same # of active CPUs all running at the high frequency level (cf. cases 9 and 12) Power slopes are nearly the same Only small difference in the power offsets caseabR squared Case # of vCPUs ID of pCPUs Processor ,4HHHH 1240,4LLHH

Power Analysis Cont’d Observation 4: The idle power dissipation of the processor chip is somewhat higher for the higher frequency level (cf. cases 1-5 vs. 6-10) caseabR squared caseabR squared Case # of vCPUs ID of pCPUs Processor 1Processor ,2,4,6LLLL … 540LLLL 64 HHHH … 1040HHHH

Power Analysis Cont’d Observation 5: Processor consolidation is helpful in reducing power dissipation (cf. case 17 vs. case 18) Case # of vCPUs ID of pCPUs Processor 1Processor ,2HHHHLLLL 1840,1HHHHHHHH case 17 = cases 8 & 9 Processor 2 is not being used, so it only increases the offset power case 18 = case 10 Settings of the two processors are identical The maximum possible utilization of case 18 is 200%, which is twice that of case 10

Power Model caseabR squared caseabR squared  Linear model  Cluster 1: cases 1-5, 11  Low frequency  Independent of parameters  Cluster 2: cases 6-10, 12  High & mixed frequency  Dependent on parameters Low f  High f (for same λ)

Power Model Cont’d caseabR squared  Power dissipation as a function of the frequency level  Exclude offset because no difference in power offsets of cases 1,6  P(case6) / P(case1) = / =  Freq(case6) / Freq(case1) = 2.33 / 2.0 = Low f  High f (for same λ) Case ID of pCPU Processor 1 Case ID of pCPU Processor ,2,4,6LLLL6 HHHH

Unified Power Model – High Freq. caseabR squared cluster1 1-5, cluster  Additional required parameter  # of active CPUs

Extension to Multi-Processors Case # of vCPUs ID of pCPUs Processor 1Processor ,2LLLLLLLL 1640,1HHHHLLLL 1740,2HHHHLLLL 1840,1HHHHHHHH Caseestimatedmeasured a*a* b*b* ab  Estimated from power model  Measured from experimental results  Linear relationship between ‘pwr’ and ‘# of processors’ under the same utilization level  a * = average of the slopes  b * = sum of the offsets

Performance Model casecdeR squared , ,  Queuing theory result  Average wait time:  Our performance model: Case # of vCPUs ID of pCPUs 140,2,4,6 240,2,4 340,2 440, ,2,4,6 740,2,4 840,2 940,4 1040

Performance Analysis  Observation 1: At the same normalized utilization level, response time of cases with smaller number of active CPUs is much higher than response times of cases with larger number of active CPUs (cases 5 & 10 vs. cases 1 & 6) Case # of vCPUs # of pCPUs freqCase # of vCPUs # of pCPUs freq 144Low644High 243Low743High 3,442Low8,942High 541Low1041High norm util = total util / (# of active CPUs)

Performance Analysis Cont’d. Case # of vCPUs ID of pCPUs Processor 1Processor LLLL 1310LLLL 1040HHHH 1410HHHH  Observation 2: When # of vCPUs is larger than # of pCPUs, response time increases (cf. case 5 vs. case 13, or case 10 vs. case 14)  # of vCPUs can be a parameter of the model

Pareto Surface Cost function Energy * resp. time / task Select best parameter One must choose the best parameters for different levels of load intensity or CPU utilization levels Either case 1 or 6 is the best one (cases w/o consolidation) CPU consolidation in a physical machine is not a good idea from a power savings perspective (but processor consolidation is effective)

Conclusion Accurate power/performance models for CPUs and servers in a virtualized computer system were derived through extensive simulations and hardware measurements These models can be used for doing a better power and performance tradeoff analysis or dynamic management Future work will include getting power/performance models for a more general virtualized system (multi-tiered service or multi- guest domains) using more advanced architectures (multi-core processor chips with hardware assists for virtualization)