Remus: VM Replication Jeff Chase Duke University.

Slides:



Advertisements
Similar presentations
Remus: High Availability via Asynchronous Virtual Machine Replication
Advertisements

Unmodified Device Driver Reuse and Improved System Dependability via Virtual Machines J. LeVasseur V. Uhlig J. Stoess S. G¨otz University of Karlsruhe,
Virtualisation From the Bottom Up From storage to application.
Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
XEN AND THE ART OF VIRTUALIZATION Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, lan Pratt, Andrew Warfield.
Bart Miller. Outline Definition and goals Paravirtualization System Architecture The Virtual Machine Interface Memory Management CPU Device I/O Network,
Replication and Consistency (2). Reference r Replication in the Harp File System, Barbara Liskov, Sanjay Ghemawat, Robert Gruber, Paul Johnson, Liuba.
Memory-efficient Virtual Machine High Availability Karen Kai-Yuan Hou Prof. Kang G. Shin University of Michigan Mustafa Uysal (VMware) Arif Merchant (HP.
1 Cheriton School of Computer Science 2 Department of Computer Science RemusDB: Transparent High Availability for Database Systems Umar Farooq Minhas 1,
Acknowledgments University of Waterloo University of British Columbia
Database Systems on Virtual Machines: How Much Do We Lose? Kristin Travis March 2, 2011.
Towards High-Availability for IP Telephony using Virtual Machines Devdutt Patnaik, Ashish Bijlani and Vishal K Singh.
U NIVERSITY OF M ASSACHUSETTS, A MHERST Department of Computer Science Virtualization in Data Centers Prashant Shenoy
Remus: High Availability via Asynchronous Virtual Machine Replication.
Module – 11 Local Replication
16: Distributed Systems1 DISTRIBUTED SYSTEM STRUCTURES NETWORK OPERATING SYSTEMS The users are aware of the physical structure of the network. Each site.
November 1, 2004Introduction to Computer Security ©2004 Matt Bishop Slide #29-1 Chapter 33: Virtual Machines Virtual Machine Structure Virtual Machine.
Virtualization for Cloud Computing
VIRTUALISATION OF HADOOP CLUSTERS Dr G Sudha Sadasivam Assistant Professor Department of CSE PSGCT.
MULTICOMPUTER 1. MULTICOMPUTER, YANG DIPELAJARI Multiprocessors vs multicomputers Interconnection topologies Switching schemes Communication with messages.
Scalability, Fidelity and Containment in the Potemkin Virtual Honeyfarm Authors: Michael Vrable, Justin Ma, Jay chen, David Moore, Erik Vandekieft, Alex.
Presented by : Ran Koretzki. Basic Introduction What are VM’s ? What is migration ? What is Live migration ?
Highly Available ACID Memory Vijayshankar Raman. Introduction §Why ACID memory? l non-database apps: want updates to critical data to be atomic and persistent.
Tanenbaum 8.3 See references
Zen and the Art of Virtualization Paul Barham, et al. University of Cambridge, Microsoft Research Cambridge Published by ACM SOSP’03 Presented by Tina.
Microkernels, virtualization, exokernels Tutorial 1 – CSC469.
Disco : Running commodity operating system on scalable multiprocessor Edouard et al. Presented by Jonathan Walpole (based on a slide set from Vidhya Sivasankaran)
CS533 Concepts of Operating Systems Jonathan Walpole.
Virtualization Concepts Presented by: Mariano Diaz.
Benefits: Increased server utilization Reduced IT TCO Improved IT agility.
Xen I/O Overview. Xen is a popular open-source x86 virtual machine monitor – full-virtualization – para-virtualization para-virtualization as a more efficient.
Virtualization Paul Krzyzanowski Distributed Systems Except as otherwise noted, the content of this presentation is licensed.
Xen I/O Overview.
Improving Network I/O Virtualization for Cloud Computing.
Xen and The Art of Virtualization Paul Barham, Boris Dragovic, Keir Fraser, Steven Hand, Tim Harris, Alex Ho, Rolf Neugebauer, Ian Pratt & Andrew Warfield.
INTRODUCTION TO VIRTUALIZATION KRISTEN WILLIAMS MOSES IKE.
Our work on virtualization Chen Haogang, Wang Xiaolin {hchen, Institute of Network and Information Systems School of Electrical Engineering.
1 Xen and Co.: Communication-aware CPU Scheduling for Consolidated Xen-based Hosting Platforms Sriram Govindan, Arjun R Nath, Amitayu Das, Bhuvan Urgaonkar,
CS533 Concepts of Operating Systems Jonathan Walpole.
Nathanael Thompson and John Kelm
Chapter 8-2 : Multicomputers Multiprocessors vs multicomputers Multiprocessors vs multicomputers Interconnection topologies Interconnection topologies.
1 Process migration n why migrate processes n main concepts n PM design objectives n design issues n freezing and restarting a process n address space.
Virtualization 3 Subtitle: “What can we do to a VM?” Learning Objectives: – To understand the VM-handling mechanisms of a hypervisor – To understand how.
 Virtual machine systems: simulators for multiple copies of a machine on itself.  Virtual machine (VM): the simulated machine.  Virtual machine monitor.
Dynamic and Secure Application Consolidation with Nested Virtualization and Library OS in Cloud Kouta Sannomiya and Kenichi Kourai (Kyushu Institute of.
Disco : Running commodity operating system on scalable multiprocessor Edouard et al. Presented by Vidhya Sivasankaran.
Eduardo Gutarra Velez. Outline Distributed Filesystems Motivation Google Filesystem Architecture The Metadata Consistency Model File Mutation.
Introduction to virtualization
COLO: COarse-grain LOck-stepping Virtual Machine for Non-stop Service Li Zhijian Fujitsu Limited.
Chap 7: Consistency and Replication
Efficient Live Checkpointing Mechanisms for computation and memory-intensive VMs in a data center Kasidit Chanchio Vasabilab Dept of Computer Science,
COMP25212: Virtualization 3 Subtitle: “What can we do to a VM?” Learning Objectives: –To understand the VM-handling mechanisms of a hypervisor –To understand.
Operating-System Structures
Protection of Processes Security and privacy of data is challenging currently. Protecting information – Not limited to hardware. – Depends on innovation.
Cloud Computing Lecture 5-6 Muhammad Ahmad Jan.
What's New in Windows Server Hyper-V
Cloud Computing – UNIT - II. VIRTUALIZATION Virtualization Hiding the reality The mantra of smart computing is to intelligently hide the reality Binary->
VIRTUAL MACHINE – VMWARE. VIRTUAL MACHINE (VM) What is a VM? – A virtual machine (VM) is a software implementation of a computing environment in which.
Virtualization - an introduction Gordon Ross Computing Service.
Xen and the Art of Virtualization
NEWS LAB 薛智文 嵌入式系統暨無線網路實驗室
A Technical Overview of Microsoft® SQL Server™ 2005 High Availability Beta 2 Matthew Stephen IT Pro Evangelist (SQL Server)
XenFS Sharing data in a virtualised environment
Group 8 Virtualization of the Cloud
Introduction to Operating Systems
EECS 498 Introduction to Distributed Systems Fall 2017
OS Virtualization.
Virtualization Meetup Discussion
Virtualization Layer Virtual Hardware Virtual Networking
Prof. Leonardo Mostarda University of Camerino
Presentation transcript:

Remus: VM Replication Jeff Chase Duke University

Recall: virtual machines (VMs) Each guest VM runs a complete OS instance over an isolated “sliver” of host physical memory. Hypervisors support migration and suspend/resume. – Both operations require an atomic snapshot (checkpoint) of VM memory state and register contexts. – Capture modified pages and write them to snapshot. hypervisor (VMM) host guest guest kernel

Capturing modified pages How to do it? Recall the Address Translation Uses slides earlier.

Remus checkpoints Snapshot the VM, but don’t suspend it. – Snapshot periodically as it executes. – Snapshot concurrently: keep running while snap is in progress. Migrate the VM, but don’t start the remote copy. – Just load the snapshot on the remote host. – Transmit “live” incremental checkpoints over the network. – Update the remote snapshot/copy/instance in place. – Remote host is a warm standby or backup replica. All checkpoints are atomic: they capture a point in time.

Remus Checkpoints Remus divides time into epochs (~25ms) Performs a checkpoint at the end of each epoch 1. Suspend primary VM 2. Copy all state changes to a buffer in Domain 0 3. Resume primary VM 4. Send asynchronous message to backup containing state changes 5. Backup VM applies state changes 5 Periodic Checkpoints (Changes to VM State) Primary Server Domain 0 Backup Server Domain 0 Xen VMM Primary VM Xen VMM Backup VM [Ashraf Aboulnaga RemusDB]

Changes to VM State Transparent HA for DBMS RemusDB: efficient and transparent active/standby high availability for DBMS implemented in the virtualization layer Propagates all changes in VM state from primary to backup High availability with no code changes to the DBMS Completely transparent failover from primary to backup Failover to a warmed up backup server Backup Server DB DBMS Primary Server VM DB DBMS VM Primary Server 6 [Ashraf Aboulnaga RemusDB]

Remus

Remus Checkpoints After a failure, the backup resumes execution from the latest checkpoint Any work done by the primary during epoch C will be lost (unsafe) Remus provides a consistent view of execution to clients Any network packets sent during an epoch are buffered until the next checkpoint Guarantees that a client will see results only if they are based on safe execution Same principle is also applied to disk writes 8 [Ashraf Aboulnaga RemusDB]

Outbound packet buffering

Disk (FS) updates

Remus implementation

Tardigrade (NSDI-15)

Remus checkpoint latency

Remus overhead

Tardigrade