Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines Petter Svärd, Benoit Hudzia, Johan Tordsson and Erik.

Slides:

Advertisements

Similar presentations

Remus: High Availability via Asynchronous Virtual Machine Replication

Advertisements

Live migration of Virtual Machines Nour Stefan, SCPD.

Difference Engine: Harnessing Memory Redundancy in Virtual Machines by Diwaker Gupta et al. presented by Jonathan Berkhahn.

A Fast Rejuvenation Technique for Server Consolidation with Virtual Machines Kenichi Kourai Shigeru Chiba Tokyo Institute of Technology.

Parallelizing Live Migration of Virtual Machines

VSphere vs. Hyper-V Metron Performance Showdown. Objectives Architecture Available metrics Challenges in virtual environments Test environment and methods.

Exploiting Data Deduplication to Accelerate Live Virtual Machine Migration Xiang Zhang 1,2, Zhigang Huo 1, Jie Ma 1, Dan Meng 1 1. National Research Center.

Live Migration of Virtual Machines Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen, Eric Jul, Christian Limpach, Ian Pratt, Andrew Warfield.

1 Cheriton School of Computer Science 2 Department of Computer Science RemusDB: Transparent High Availability for Database Systems Umar Farooq Minhas 1,

XENMON: QOS MONITORING AND PERFORMANCE PROFILING TOOL Diwaker Gupta, Rob Gardner, Ludmila Cherkasova 1.

Efficient Autoscaling in the Cloud using Predictive Models for Workload Forecasting Roy, N., A. Dubey, and A. Gokhale 4th IEEE International Conference.

Post-Copy Live Migration of Virtual Machines Michael R. Hines, Umesh Deshpande, Kartik Gopalan Computer Science, Binghamton University(SUNY) SIGOPS 09’

NFS setup in vLab Yunji Zhong. Without NFS Each VM is binding to each host machine.

Towards High-Availability for IP Telephony using Virtual Machines Devdutt Patnaik, Ashish Bijlani and Vishal K Singh.

Remus: High Availability via Asynchronous Virtual Machine Replication.

CacheMind: Fast Performance Recovery Using a Virtual Machine Monitor Kenichi Kourai Kyushu Institute of Technology, Japan.

By- Jaideep Moses, Ravi Iyer , Ramesh Illikkal and

Implementing Failover Clustering with Hyper-V

Cache Memory By Sean Hunter.

Slingshot: Deploying Stateful Services in Wireless Hotspots Ya-Yunn Su Jason Flinn University of Michigan.

Presented by : Ran Koretzki. Basic Introduction What are VM’s ? What is migration ? What is Live migration ?

Robert Bradford, Evangelos Kotsovinos, Anja Feldmann, Harald Schiöberg Presented by Kit Cischke.

Design and Implementation of a Single System Image Operating System for High Performance Computing on Clusters Christine MORIN PARIS project-team, IRISA/INRIA.

Computer Measurement Group, India Reliable and Scalable Data Streaming in Multi-Hop Architecture Sudhir Sangra, BMC Software Lalit.

Virtualization and Cloud Computing Research at Vasabilab Kasidit Chanchio Vasabilab Dept of Computer Science, Faculty of Science and Technology, Thammasat.

Distributed Systems Early Examples. Projects NOW – a Network Of Workstations University of California, Berkely Terminated about 1997 after demonstrating.

Profiling Grid Data Transfer Protocols and Servers George Kola, Tevfik Kosar and Miron Livny University of Wisconsin-Madison USA.

Types of Computers Mainframe/Server Two Dual-Core Intel ® Xeon ® Processors 5140 Multi user access Large amount of RAM ( 48GB) and Backing Storage Desktop.

VM Algorithm Improvement Student’s Name: Kamlesh Patel Date: Oct 13, 2008 Advisor’s Name: Dr. Chung-E-Wang Prof. Dick Smith Department of Computer Science.

Virtual Machine Scheduling for Parallel Soft Real-Time Applications

Improving Network I/O Virtualization for Cloud Computing.

Kinshuk Govil, Dan Teodosiu*, Yongqiang Huang, and Mendel Rosenblum

Measuring System Performance The speed of a computer is often referred to as THROUGHPUT. This is very difficult to measure. It can be done with Measures.

1 Wenguang WangRichard B. Bunt Department of Computer Science University of Saskatchewan November 14, 2000 Simulating DB2 Buffer Pool Management.

Dynamic Resource Monitoring and Allocation in a virtualized environment.

Amy Apon, Pawel Wolinski, Dennis Reed Greg Amerson, Prathima Gorjala University of Arkansas Commercial Applications of High Performance Computing Massive.

Live Migration of Virtual Machines Christopher Clark, Keir Fraser, Steven Hand, Jacob Gorm Hansen†,Eric Jul†, Christian Limpach, Ian Pratt, Andrew Warfield.

1 University of Maryland Linger-Longer: Fine-Grain Cycle Stealing in Networks of Workstations Kyung Dong Ryu © Copyright 2000, Kyung Dong Ryu, All Rights.

1 Process migration n why migrate processes n main concepts n PM design objectives n design issues n freezing and restarting a process n address space.

Server VirtualizationServer Virtualization Hyper-V 2012.

VTurbo: Accelerating Virtual Machine I/O Processing Using Designated Turbo-Sliced Core Embedded Lab. Kim Sewoog Cong Xu, Sahan Gamage, Hui Lu, Ramana Kompella,

Embedded System Lab. 정범종 A_DRM: Architecture-aware Distributed Resource Management of Virtualized Clusters H. Wang et al. VEE, 2015.

Efficient Live Checkpointing Mechanisms for computation and memory-intensive VMs in a data center Kasidit Chanchio Vasabilab Dept of Computer Science,

A BRIEF INTRODUCTION TO CACHE LOCALITY YIN WEI DONG 14 SS.

DynamicMR: A Dynamic Slot Allocation Optimization Framework for MapReduce Clusters Nanyang Technological University Shanjiang Tang, Bu-Sung Lee, Bingsheng.

Full and Para Virtualization

Memory Hierarchy: Terminology Hit: data appears in some block in the upper level (example: Block X)  Hit Rate : the fraction of memory access found in.

Ian Gable HEPiX Spring 2009, Umeå 1 VM CPU Benchmarking the HEPiX Way Manfred Alef, Ian Gable FZK Karlsruhe University of Victoria May 28, 2009.

Using Uncacheable Memory to Improve Unity Linux Performance

PROOF Benchmark on Different Hardware Configurations 1 11/29/2007 Neng Xu, University of Wisconsin-Madison Mengmeng Chen, Annabelle Leung, Bruce Mellado,

Application Design Document Developers: o Uri Goldenberg o Henry Abravanel o Academic.

FroNtier Stress Tests at Tier-0 Status report Luis Ramos LCG3D Workshop – September 13, 2006.

Computer Performance. Hard Drive - HDD Stores your files, programs, and information. If it gets full, you can’t save any more. Measured in bytes (KB,

Friendly Virtual Machines Zhang,Bestavros etc., Boston Univ. ACM/USENIX VEE 2005 CSE 598c April 17, 2006 Bhuvan Urgaonkar CSE 598c April 17, 2006 Bhuvan.

Capacity Planning in a Virtual Environment Chris Chesley, Sr. Systems Engineer

Taeho Kgil, Trevor Mudge Advanced Computer Architecture Laboratory The University of Michigan Ann Arbor, USA CASES’06.

Urgent Virtual Machine Eviction with Enlightened Post-Copy Yoshihisa Abe†, Roxana Geambasu‡, Kaustubh Joshi, and Mahadev Satyanarayanan† †Carnegie Mellon.

Split Migration of Large Memory Virtual Machines

Chapter 13: I/O Systems Modified by Dr. Neerja Mhaskar for CS 3SH3.

Optimizing the Migration of Virtual Computers

BD-CACHE Big Data Caching for Datacenters

Windows Azure Migrating SQL Server Workloads

Slingshot: Deploying Stateful Services in Wireless Hotspots

Unistore: Project Updates

I'm Kenichi Kourai from Kyushu Institute of Technology.

Process Migration Troy Cogburn and Gilbert Podell-Blume

A workload-aware energy model for VM migration

Microsoft Virtual Academy

Kenichi Kourai Kyushu Institute of Technology

Efficient Migration of Large-memory VMs Using Private Virtual Memory

Presentation transcript:

Evaluation of Delta Compression Techniques for Efficient Live Migration of Large Virtual Machines Petter Svärd, Benoit Hudzia, Johan Tordsson and Erik Elmroth Umeå University, Dept of Computing Science VEE 2011, Newport Beach, CA, USA

Live migration “Transfer a VM from one host to another without disrupting services.” The VM:s state (memory pages) is transferred in the background with the VM still running The file system is typically located on a NFS and is not moved

Live migration - Typical algorithm The time between the VM is suspended and resumed is defined as the downtime Our goal is to reduce the downtime

Live migration - Problems with the typical algorithm When migrating memory intensive VM or over slow NW links: 1.Memory pages can be dirtied faster than they are transferred over the network 2.The VM has to be suspended for an extended period of time -> long downtime 3.Network connections time out and drop / triggers fail Leads to disruption of services

Live migration - Problems with the typical algorithm (cont) Problem dirtying rate > migration throughput Possible Solutions Decrease dirtying rate or increase migration throughput Decreasing dirtying rate might hurt server performance and disrupts services Increase migration throughput!

Delta compression - Increasing migration throughput Overall idea: transfer changes to pages instead of the full page contents thus increasing migration throughput Store sent pages in a cache When transferring, if the page is cached, compute an XOR delta page Compress the delta page

Delta compression - continued Wasting time on cache misses Efficient caching scheme and compression algorithm is vital! Vanilla (no compr.) Delta compression

Delta compression - caching Desired properties: Lean Constant seek time regardless of size L2 caching scheme

Delta compression - compression Desired properties: Lean (low cpu usage) Effective (high compression ratio) General purpose The XOR delta page is suitable for RLE compression –(Symbol)(Repetitions) → AAAAABBBCCCCC = 5A3B5C XOR BinaryRunLengthEncoding -> XBRLE

XBRLE compression - Source side algorithm

XBRLE compression - Destination side algorithm

XBRLE compression - conceptual illustration

Implementation Modified version of qemu-kvm userspace code to support the XBRLE migration algorithm. Lean, ~500 LoC Evaluation done on version

Demo Migrating streaming video over 10Mbit/s Before migration:

Demo Migrating streaming video (cont) After migration:

Demo Migrating streaming video (cont)

Evaluation - Test cases Memory write benchmark (lm_bench) –1 GB RAM, 1 vcpu VM –Near ideal case Transcoded HD Video –1 GB RAM, 1 vcpu VM –Real-world, non-ideal case SAP ERP application –8 GB RAM, 4 vcpus VM –Large business application –Relies on transactions and is thus sensitive to extended downtime

Evaluation - Experimental setup Benchmark and HD Video 2x 2,66GHz core2quad 16GB RAM NFS share on source machine 100Mbit/s Network SAP ERP 2x 3,0GHz Xeon dual-core 32GB RAM 16TB Raid 5, 6Gbits/s trunked NFS server 1000Mbit/s Network

Evaluation - Benchmark Downtime reduced by a factor of 100 Throughput increased by 63 %

Evaluation - Streaming video UDP downtime reduced from 8 s to 1 Migration is transparent using XBRLE

Evaluation - SAP ERP The ERP application was non-responsive on resume using the vanilla algorithm but survived using XBRLE “Rule of thumb” is that more than 0.5 s of downtime might hurt the system. Measured downtime was 0.2 for XBRLE and 2 for vanilla. Vanilla XBRLE

Conclusion Delta compression works well migrating VMs running workloads with a highly compressible working set VMs running heavy workloads with large working sets and/or over slow networks (i.e., WANs).

Future work Page priority algorithm Avoid re-sends of pages that are dirtied frequently Promising early results