Exploiting Data Deduplication to Accelerate Live Virtual Machine Migration Xiang Zhang 1,2, Zhigang Huo 1, Jie Ma 1, Dan Meng 1 1. National Research Center.

Presentation transcript:

Exploiting Data Deduplication to Accelerate Live Virtual Machine Migration
Xiang Zhang 1,2, Zhigang Huo 1, Jie Ma 1, Dan Meng 1
1. National Research Center for Intelligent Computing Systems, Institute of Computing Technology, Chinese Academy of Sciences
2. Graduate University of Chinese Academy of Sciences

Outline
- Introduction
- Design
- Implementation
- Evaluation
- Conclusion & Future Work

Live Migration
- Definition: migrating the OS and applications as a whole to another physical machine without rebooting the VM
- Advantages: load balancing, service consolidation, fault tolerance, ...
- Shared storage is usually deployed
- Migration transfers the VCPU context and the memory image

Pre-Copy
- Pre-Copy is the default choice in Xen
- First phase: the initial memory pages are copied
- Second phase: several rounds of incremental synchronization are employed
- Last phase: the VM is suspended, and the remaining memory image and VCPU context are copied
- Pre-Copy is reliable
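The three phases above can be sketched as a small simulation. This is an illustration only, not Xen's implementation: the function name `pre_copy`, the parameters `dirty_rate`, `max_rounds` and `stop_threshold`, and the assumption that the number of pages dirtied in a round is proportional to the pages sent in that round are all hypothetical.

```python
import random


def pre_copy(num_pages, dirty_rate, max_rounds=30, stop_threshold=50, seed=0):
    """Simulate pre-copy; return (pages sent per live round, pages sent while suspended)."""
    rng = random.Random(seed)
    to_send = set(range(num_pages))  # first phase: every page is pending
    sent_per_round = []
    for _ in range(max_rounds):
        sent_per_round.append(len(to_send))
        # Assumption: while a round is in flight, the running VM dirties a
        # number of pages proportional to the pages sent in that round.
        dirtied = int(len(to_send) * dirty_rate)
        to_send = set(rng.sample(range(num_pages), dirtied))
        if len(to_send) <= stop_threshold:
            break  # few enough dirty pages left: stop iterating
    # Last phase: the VM is suspended and the remaining dirty pages are copied.
    return sent_per_round, len(to_send)


rounds, final = pre_copy(num_pages=1000, dirty_rate=0.5)
print(rounds, final)
```

In this toy model the pages sent during the last phase determine downtime, and the sum over all rounds determines total data transferred.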

Motivation of Research
- Performance metrics of migration
  - Total data transferred
  - Total migration time
  - Downtime
- Why migration performance needs to improve
  - Applications spend less time with degraded performance
  - Fewer migration opportunities are missed
  - Shorter downtime for latency-sensitive applications

Analyzing Migration Data Regularities
- During the first phase
  - Zero pages are in the majority for lightweight workloads
  - At least 25% of non-zero pages are identical or more than 80% similar
  - The ratio of identical and similar pages to reference pages is at least 8:1
- During the last two phases
  - Few zero pages
  - At least 50% of pages are more than 80% similar to their old versions
- Conclusion
  - Too much redundant data is transferred during migration
  - Migration with Data Deduplication (MDD)
[Figure: percentage of pages whose similarity is above 80% (%), by workload (Compilation, VOD, StaticWeb, Banking, Ecommerce, Support). Denominator: non-zero pages during the first phase; all transferred pages during the last two phases.]

How to Find Identical and Similar Pages (1)
- HashSimilarityDetector(k, s, c) [21]
  - Hashes (k * s) blocks on the page and groups them into k groups of s hashes each
  - For each hash fingerprint, c candidates are stored as reference pages
- Configuration used: HashSimilarityDetector(2, 1, 1), with SuperFastHash over 64-byte blocks
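A minimal sketch of a detector with the (k, s, c) parameters described above. Several details are assumptions made for illustration: the blocks are sampled at fixed, evenly spaced offsets, `zlib.crc32` stands in for SuperFastHash, and the method names `fingerprints`, `lookup` and `insert` are hypothetical. It also assumes c is at least 1.

```python
import zlib
from collections import defaultdict

PAGE_SIZE = 4096
BLOCK_SIZE = 64


class HashSimilarityDetector:
    def __init__(self, k=2, s=1, c=1, offsets=None):
        self.k, self.s, self.c = k, s, c
        n = k * s
        # Assumption: sample k*s blocks at fixed, evenly spaced page offsets.
        self.offsets = offsets or [i * (PAGE_SIZE // n) for i in range(n)]
        self.table = defaultdict(list)  # fingerprint -> candidate frame numbers

    def fingerprints(self, page):
        # crc32 is a stand-in for SuperFastHash in this sketch.
        hashes = [zlib.crc32(page[o:o + BLOCK_SIZE]) for o in self.offsets]
        # Group the k*s block hashes into k fingerprints of s hashes each.
        return [(g, tuple(hashes[g * self.s:(g + 1) * self.s]))
                for g in range(self.k)]

    def lookup(self, page):
        """Return frame numbers of candidate reference pages for this page."""
        found = []
        for fp in self.fingerprints(page):
            for frame in self.table.get(fp, []):
                if frame not in found:
                    found.append(frame)
        return found

    def insert(self, frame, page):
        """Register a transferred page as a possible reference page."""
        for fp in self.fingerprints(page):
            bucket = self.table[fp]
            if frame not in bucket:
                bucket.append(frame)
                del bucket[:-self.c]  # keep only the c most recent candidates


detector = HashSimilarityDetector(2, 1, 1)
reference = bytes(range(256)) * 16
detector.insert(7, reference)
similar = bytearray(reference)
similar[100] ^= 0xFF  # a change outside the two sampled blocks
print(detector.lookup(bytes(similar)))
```

A page that matches any one of its k fingerprints is treated as a candidate; with (2, 1, 1) that means two sampled blocks per page and one stored candidate per fingerprint.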

How to Find Identical and Similar Pages (2)
- Similarity is transitive: P_trans ≈ P_old and P_hash ≈ P_trans, so P_hash ≈ P_old
- There is no need to cache all transferred pages
- Only the privileged domain on the source needs to maintain the hash table
- Reference pages have already been transferred, so the destination can find them by their frame numbers

How to Find Identical and Similar Pages (3)
- Indexing only by hash fingerprints may cause data inconsistency
[Figure: source and destination page states with a single FPHash table. P_x appears as P_x-old (fingerprint b1) and later as P_x-new (fingerprint b2); P_y has fingerprint b1.]

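How to Find Identical and Similar Pages (4)
- Double-Hash eliminates the data inconsistency
[Figure: the same source and destination page states with two tables, FNHash and FPHash. P_x appears as P_x-old (fingerprint b1) and later as P_x-new (fingerprint b2); P_y has fingerprint b1.]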
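One plausible reading of the double-hash idea, sketched under assumptions: FPHash maps a fingerprint to a frame number and FNHash maps a frame number back to its current fingerprint, so a stale fingerprint entry can be removed when the same frame is sent again with new contents. The class name `DoubleHashIndex` and its methods are hypothetical, and the sketch keeps one candidate per fingerprint.

```python
class DoubleHashIndex:
    """FPHash: fingerprint -> frame number. FNHash: frame number -> fingerprint."""

    def __init__(self):
        self.fp_hash = {}
        self.fn_hash = {}

    def record_transfer(self, frame, fingerprint):
        # If this frame was sent before, the destination no longer holds its
        # old contents, so the old fingerprint must stop pointing at it.
        old_fp = self.fn_hash.get(frame)
        if old_fp is not None and self.fp_hash.get(old_fp) == frame:
            del self.fp_hash[old_fp]
        # If another frame was registered under this fingerprint, drop its
        # back-reference so the two tables stay consistent.
        displaced = self.fp_hash.get(fingerprint)
        if displaced is not None and displaced != frame:
            self.fn_hash.pop(displaced, None)
        self.fp_hash[fingerprint] = frame
        self.fn_hash[frame] = fingerprint

    def find_reference(self, fingerprint):
        """Return a frame whose destination copy still matches, or None."""
        return self.fp_hash.get(fingerprint)


index = DoubleHashIndex()
index.record_transfer("Px", "b1")  # P_x-old is sent
index.record_transfer("Px", "b2")  # P_x-new overwrites it on the destination
print(index.find_reference("b1"))  # P_y (fingerprint b1) must not reference P_x
```

With a fingerprint-only table, the lookup for P_y would still return P_x even though the destination now holds P_x-new, which is the inconsistency the previous slide describes.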

Data Deduplication during Migration
- On the source
  - P_parity = P_trans ⊕ P_ref
  - P_parity is encoded with RLE and then migrated
- On the destination
  - The received data is decoded to recover P_parity
  - P_trans = P_parity ⊕ P_ref
- Advantages
  - P_parity contains less information than P_trans
  - It reflects exactly the data that differs, at bit level
  - It contains many runs of consecutive zeros, so even RLE compresses it effectively
  - RLE is one of the fastest encoding algorithms
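The XOR and RLE steps above can be sketched as follows. The RLE byte format here (pairs of run length and byte value, with runs capped at 255) is an assumption chosen for brevity, not necessarily the format MDD uses, and the function names are hypothetical.

```python
def xor_pages(a, b):
    """Bytewise XOR of two equal-length pages."""
    return bytes(x ^ y for x, y in zip(a, b))


def rle_encode(data):
    """Encode data as (run length 1..255, byte value) pairs."""
    out = bytearray()
    i = 0
    while i < len(data):
        run = 1
        while i + run < len(data) and run < 255 and data[i + run] == data[i]:
            run += 1
        out += bytes((run, data[i]))
        i += run
    return bytes(out)


def rle_decode(encoded):
    """Invert rle_encode."""
    out = bytearray()
    for i in range(0, len(encoded), 2):
        out += bytes([encoded[i + 1]]) * encoded[i]
    return bytes(out)


# Source side: P_parity = P_trans XOR P_ref, then RLE.
p_ref = bytes(range(256)) * 16
p_trans = bytearray(p_ref)
p_trans[10] = 0x5A  # the page to transfer differs from its reference in one byte
encoded = rle_encode(xor_pages(p_trans, p_ref))

# Destination side: decode, then P_trans = P_parity XOR P_ref.
restored = xor_pages(rle_decode(encoded), p_ref)
print(len(encoded), restored == bytes(p_trans))
```

Because the two pages differ in a single byte, the parity page is almost entirely zeros and the 4096-byte page shrinks to a few dozen bytes in this format.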

Implementation
- Data deduplication is performed in parallel by multiple threads
- Hash table entries are managed with LRU replacement
- memcmp() is extended to reduce the overhead of detecting zero pages
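Two of the points above, LRU-managed hash tables and cheap zero-page detection, can be illustrated with a short sketch. The names `LRUTable` and `is_zero_page` are hypothetical, and the early-exit check on the first 8 bytes is an assumed optimization, not necessarily what the extended memcmp() does. Threading is left out.

```python
from collections import OrderedDict

PAGE_SIZE = 4096
ZERO_PAGE = bytes(PAGE_SIZE)


def is_zero_page(page):
    # Assumed optimization: compare a short prefix first so that most
    # non-zero pages are rejected without scanning the whole page.
    return page[:8] == ZERO_PAGE[:8] and page == ZERO_PAGE


class LRUTable:
    """Fixed-capacity table that evicts the least recently used entry."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.entries = OrderedDict()

    def get(self, key):
        if key not in self.entries:
            return None
        self.entries.move_to_end(key)  # mark as most recently used
        return self.entries[key]

    def put(self, key, value):
        self.entries[key] = value
        self.entries.move_to_end(key)
        if len(self.entries) > self.capacity:
            self.entries.popitem(last=False)  # evict the least recently used


table = LRUTable(capacity=2)
table.put("fp-a", 1)
table.put("fp-b", 2)
table.get("fp-a")      # touch fp-a so fp-b becomes the eviction candidate
table.put("fp-c", 3)
print(table.get("fp-b"), is_zero_page(bytes(PAGE_SIZE)))
```

Bounding the table this way keeps the memory used on the source fixed regardless of how many pages are transferred.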

Experimental Setup
- Experiment platform
  - Cluster composed of six identical servers
  - One storage server, using the iSCSI protocol over an isolated gigabit Ethernet
  - Two servers act as the source and destination of migration
  - Three servers act as clients for the workloads
- Server configuration
  - Two Intel Xeon E5520 quad-core CPUs, 2.2GHz
  - 8GB DDR RAM
  - Gigabit LAN
  - Xen and modified Linux
- The migrated VM is configured with one VCPU and 1GB RAM
- Migration shares the same network with the workloads
- Workloads: compilation, VOD, static web server, dynamic web server

Total Data Transferred
- Transferred data is reduced by 56.60% on average
- Number of transferred pages is reduced by 48.73% on average (Banking)
- Compression ratio is 49.27% on average (Banking)

Total Migration Time and Downtime
- MDD decreases total migration time and downtime by 34.93% and 26.16% on average, respectively
  - Less data is transferred
  - The number of migration rounds is not reduced

CPU Resource Required
- The extra CPU resource that MDD requires is 47.21% of a CPU
[Figure: average CPU utilization ratio of migration (%), by workload (Compilation, VOD, Banking, Ecommerce, Support); series: Xen, MDD, DD.]

Influence on Apps
- Apache runs in the migrated VM, which is migrated in normal mode and in adaptive mode
- The more limited the network bandwidth, the more essential data deduplication is
[Figure: benefits of MDD in different migration modes (%), for total data transferred, total migration time and downtime; series: normal migration, adaptive migration.]

Conclusion & Future Work
- Conclusion
  - Studied the characteristics of run-time memory image data during migration
  - Presented the design and implementation of MDD
  - MDD reduces total data transferred, total migration time and downtime by 56.60%, 34.93% and 26.16% respectively, and reduces the influence of migration on applications
- Future work
  - Extend MDD to live whole-system migration in wide-area environments

Thank You! Any Questions?

Related Work
- Reducing transferred data
  - Post-Copy [7][12]
  - Self-Ballooning [7]
  - Trace and replay [13]
  - Adaptive compression [8]
- Improving network bandwidth
  - InfiniBand RDMA [14]
