Chen Qian, Xin Li University of Kentucky

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Towards Predictable Datacenter Networks
Scalable Rule Management for Data Centers Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan 4/3/2013.
VCRIB: Virtual Cloud Rule Information Base Masoud Moshref, Minlan Yu, Abhishek Sharma, Ramesh Govindan HotCloud 2012.
PARIS: ProActive Routing In Scalable Data Centers Dushyant Arora, Theophilus Benson, Jennifer Rexford Princeton University.
Stratos: A Network-Aware Orchestration Layer for Middleboxes in the Cloud Aditya Akella, Aaron Gember, Anand Krishnamurthy, Saul St. John University of.
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. Presented by: Vinuthna Nalluri Shiva Srivastava.
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
PRESENTED BY: TING WANG PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Radhika Niranjan Mysore, Andreas Pamboris, Nathan.
Applying NOX to the Datacenter Arsalan Tavakoli, Martin Casado, Teemu Koponen, and Scott Shenker 10/22/2009Hot Topics in Networks Workshop 2009.
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
Datacenter Network Topologies
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Data-Center Traffic Management COS 597E: Software Defined Networking.
Authors: Vic Liu, Chen Li China Mobile Speaker: Vic Liu China Mobile NaaS (Network as a service) Requirement draft-liu-nvo3-naas-requirement-00.
Sanjay Ghemawat, Howard Gobioff, and Shun-Tak Leung Google∗
WHAT IS PRIVATE CLOUD? Michał Jędrzejczak Główny Architekt Rozwiązań Infrastruktury IT
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Utility Computing Casey Rathbone 1http://cyberaide.org.edu.
1 The Google File System Reporter: You-Wei Zhang.
Module 13: Network Load Balancing Fundamentals. Server Availability and Scalability Overview Windows Network Load Balancing Configuring Windows Network.
Virtual Machine Hosting for Networked Clusters: Building the Foundations for “Autonomic” Orchestration Based on paper by Laura Grit, David Irwin, Aydan.
Cloud Computing 1. Outline  Introduction  Evolution  Cloud architecture  Map reduce operation  Platform 2.
Network Aware Resource Allocation in Distributed Clouds.
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
Windows Azure Conference 2014 Deploy your Java workloads on Windows Azure.
1 Finding Constant From Change: Revisiting Network Performance Aware Optimizations on IaaS Clouds Yifan Gong, Bingsheng He, Dan Li Nanyang Technological.
Joint Power Optimization Through VM Placement and Flow Scheduling in Data Centers DAWEI LI, JIE WU (TEMPLE UNIVERISTY) ZHIYONG LIU, AND FA ZHANG (CHINESE.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Visual Studio Windows Azure Portal Rest APIs / PS Cmdlets US-North Central Region FC TOR PDU Servers TOR PDU Servers TOR PDU Servers TOR PDU.
1 Enabling Efficient and Reliable Transitions from Replication to Erasure Coding for Clustered File Systems Runhui Li, Yuchong Hu, Patrick P. C. Lee The.
Surviving Failures in Bandwidth Constrained Datacenters Authors: Peter Bodik Ishai Menache Mosharaf Chowdhury Pradeepkumar Mani David A.Maltz Ion Stoica.
Authors: Xiaoqiao Meng, Vasileio Pappas and Li Zhang
Resource Allocation in Network Virtualization Jie Wu Computer and Information Sciences Temple University.
Towards Predictable Data Centers Why Johnny can’t use the cloud and what we can do about it? Hitesh Ballani, Paolo Costa, Thomas Karagiannis, Greg O’Shea.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
VL2: A Scalable and Flexible Data Center Network
Data Center Architectures
Yiting Xia, T. S. Eugene Ng Rice University
Xin Li, Chen Qian University of Kentucky
R-Storm: Resource Aware Scheduling in Storm
Optimizing Distributed Actor Systems for Dynamic Interactive Services
CIS 700-5: The Design and Implementation of Cloud Networks
Resilient Datacenter Load Balancing in the Wild
Lecture 2: Cloud Computing
Data Center Network Topologies II
A Survey of Network Function Placement
Heitor Moraes, Marcos Vieira, Italo Cunha, Dorgival Guedes
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
Chapter 15: Networking Services Design Optimization
Improving Datacenter Performance and Robustness with Multipath TCP
ElasticTree Michael Fruchtman.
Anna Giannakou Christine Morin, Jean-Louis Pazat, Louis Rilling
NTHU CS5421 Cloud Computing
Aled Edwards, Anna Fischer, Antonio Lain HP Labs
Replication Middleware for Cloud Based Storage Service
VDN: Virtual Machine Image Distribution Network for Cloud Data Centers
NTHU CS5421 Cloud Computing
CloudMirror: Application-Driven Bandwidth Guarantees in Datacenters
Process Migration Troy Cogburn and Gilbert Podell-Blume
Internet and Web Simple client-server model
Data Center Architectures
Network Systems and Throughput Preservation
QoS routing Finding a path that can satisfy the QoS requirement of a connection. Achieving high resource utilization.
In-network computation
Elmo Muhammad Shahbaz Lalith Suresh, Jennifer Rexford, Nick Feamster,
Towards Predictable Datacenter Networks
Data Center Traffic Engineering
Presentation transcript:

Chen Qian, Xin Li University of Kentucky Traffic and Failure Aware VM Placement for Multi-tenant Cloud Computing Chen Qian, Xin Li University of Kentucky

Multi-tenant Cloud Datacenters with multiple tenants Provider: Amazon EC2, Windows Azure, etc. Tenants: Using renting virtual machines (VMs).

VM placement overview Cloud Interface Easy to express tenants’ requests Abstraction model #VMs, network performance, availability Fast to place VMs on physical networks Optimize network performances Request Virtual to Physical Cloud Interface Tenant

Datacenter Networks … … … …. Top-of-rack Switch Rack 1 Rack 2 Rack m Server 1 Server n Server 1 Server n Server 1 Server n Rack 1 Rack 2 Rack m

In-network traffic … … … …. Rack 1 Rack 2 Rack m More Bandwidth&latency Cross-rack Traffic In-rack Traffic … … … …. a c d b Rack 1 Rack 2 Rack m

Reducing cross-rack traffic In-rack traffic is more preferred than cross-rack traffic Switch can forward in-rack packets at line-rate between different ports Oversubscription is common in current DCNs Cross-rack traffic is a level of oversubscription. Packet- drop will occur for high cross-rack traffics

Existing work TMVPP [INFOCOM’10], Oktopus [SIGCOMM’11] Require for full traffic matrix information NOT consider fault tolerance Hose Model [SIGCOMM’99] and Virtual Cluster [SIGCOMM’11] NOT reflect communication patterns CloudMirror [SIGCOMM’14] Fault tolerance is not guaranteed

Function-based Abstraction Model (FAM) Utilize some application-level knowledge as the hint for traffic-aware and function-aware placement Tenant networks consists of functions Each VM serves one function A function consists of one or more VMs E.g. load balancer, getway, etc.

Function-based Abstraction Model (FAM) Inter-function traffic Vary significantly (e.g. B>>b) Distribute evenly between VM pairs DP1 DP2 DP3 B/9 MySQL1 MySQL2 MySQL2

FAM V.S. Hose Hose Model Hose Model Physical Deployment

FAM capitalizes on tenant communication patterns FAM V.S. Hose Smaller (compared to 2B) FAM FAM capitalizes on tenant communication patterns Suitable for typical applications Improved network performance FAM Physical Deployment

Network failure … Different levels of failures We focus on failures within a DCN Tenants want reliable services Server/rack failure may cause function disability If all VMs of “load balancer” function are in a same rack Rack failure causes the disability of “load balancer” … 12 lb1 lb2 lb3

FAM representation Functions (Vertex) Bandwidth (Link) #VMs Fault tolerance: max fraction of VMs in a same rack Bandwidth (Link) Load Balancer b Dev. Portal (3, 0.9) (3, 0.8) B B KMS b MySQL (3, 0.8) (3, 0.8)

VM Placement Goal Reduce the traffic-distance product of a multi-tenant DCN by smart VM placement, while preserving the reliability requirements

VM placement heuristic This optimization problem is NP-hard Quadratic Assignment Problem (QAP) Three steps: Partition Place Virtual Migration

VM placement : partition Split the set of VMs to multiple components that are placed to different racks Minimize cross-block traffic, while keeping fault tolerance requirement

VM placement: place … Core Place blocks onto DCN Rack2 Rack1 Block1

Fault tolerance requirement violated VM placement: place Core Fault tolerance requirement violated Place blocks onto DCN Split blocks if needed Virtual migration Block1 Block2 Rack1 Rack2

Evaluation Trace: 44 tenant networks, 512 VMs Physical topology: fattree 8 racks, 32 machines Each machine can host 16 VMs Comparison: random, swap, k-cut

Evaluation Outperform in all cases Traffic-network product Worse Better Requirement less strict

Evaluation Less than 20% More accurate

Conclusion Function-based Abstraction Model VM placement Easy and expressive VM placement Low overhead Good for low-granularity traffic

Q&A Thank you