Data Center Networking with Multipath TCP

Presentation transcript:

Data Center Networking with Multipath TCP
Costin Raiciu, University College London & Universitatea Politehnica Bucuresti
Christopher Pluntke, UCL
Adam Greenhalgh, UCL
Sebastien Barre, Universite Catholique Louvain
Damon Wischik, UCL
Mark Handley, UCL

Data Center Networking Today
Resource Allocation: TCP
Path Selection: Random load balancing
Routing: OSPF, VLANs, TRILL
Topology: FatTree, VL2, BCube, multi-rooted tree

Data Center Networking Tomorrow
Resource Allocation: Multipath TCP (instead of TCP)
Path Selection: Random load balancing
Routing: OSPF, VLANs, TRILL
Topology: FatTree, VL2, BCube, multi-rooted tree

Data Centers are Important
Cloud computing.
Economies of scale: networks of tens of thousands of hosts.
Cool apps: Web search, GFS, BigTable, DryadLINQ, MapReduce.
Dense traffic patterns.

Flexibility is Important in Data Centers
Apps are distributed across thousands of machines.
Flexibility: we want any machine to be able to play any role.
But: traditional data center topologies are tree-based and don’t cope well with non-local traffic patterns.
Many recent proposals offer better topologies.

Traditional Data Center Topology
[Figure labels, top to bottom: Core Switch; 10Gbps; Aggregation Switches; 10Gbps; Top of Rack Switches; 1Gbps; Racks of servers]

Fat Tree Topology [Al-Fares et al., 2008; Clos, 1953]
[Figure labels: K=4; Aggregation Switches; 1Gbps; K Pods with K Switches each; 1Gbps; Racks of servers]
Speaker notes: show multiple paths between servers; say that the network is a rearrangeably non-blocking Clos.
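The slide only gives the labels "K=4" and "K Pods with K Switches each". As a minimal sketch, assuming the standard k-ary fat-tree construction (each pod split into k/2 edge and k/2 aggregation switches, with every switch having k ports), the sizes of the topology can be worked out as below. The function name `fat_tree_sizes` and the dictionary keys are hypothetical, chosen for illustration.

```python
def fat_tree_sizes(k: int) -> dict:
    """Sizes of a k-ary fat tree built from k-port switches.

    Assumes the standard construction: k pods, each with k/2 edge and
    k/2 aggregation switches; each edge switch serves k/2 hosts.
    """
    if k <= 0 or k % 2 != 0:
        raise ValueError("k must be a positive even number")
    half = k // 2
    return {
        "pods": k,
        "edge_switches": k * half,
        "aggregation_switches": k * half,
        "core_switches": half * half,
        "hosts": k * half * half,          # k^3 / 4
        # Equal-cost shortest paths between hosts in different pods:
        # one per core switch.
        "inter_pod_paths": half * half,
    }


if __name__ == "__main__":
    print(fat_tree_sizes(4))
```

For K=4 this gives 16 hosts and 4 distinct core paths between any two hosts in different pods, which is the "multiple paths between servers" property the slide points at.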

VL2 Topology [Greenberg et al., 2009; Clos topology]
[Figure labels: 10Gbps; …; 10Gbps; 20 hosts]

BCube Topology [Guo et al, 2009]

How Do We Use this Capacity?
Need to distribute flows across paths.
Basic solution: Random Load Balancing.
Use Equal-Cost Multipath (ECMP) routing: hash to a path at random.
Use many differently rooted VLANs: the end-host hashes to a VLAN, which determines the path.
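The hashing step can be illustrated with a short sketch. It assumes the flow is identified by its 5-tuple and uses SHA-256 purely as a stand-in hash; real switches use their own hash functions, and the name `ecmp_path` is hypothetical.

```python
import hashlib


def ecmp_path(src_ip: str, dst_ip: str, src_port: int, dst_port: int,
              proto: str, n_paths: int) -> int:
    """Pick one of n_paths equal-cost paths from a hash of the 5-tuple.

    SHA-256 is an illustrative stand-in, not what any particular
    switch implements.
    """
    key = f"{src_ip}|{dst_ip}|{src_port}|{dst_port}|{proto}".encode()
    digest = hashlib.sha256(key).digest()
    return int.from_bytes(digest[:8], "big") % n_paths


if __name__ == "__main__":
    # Every packet of the same flow maps to the same path, so there is
    # no reordering within a flow, but two large flows may land on the
    # same path.
    print(ecmp_path("10.0.0.1", "10.0.1.2", 40000, 80, "tcp", 4))
    print(ecmp_path("10.0.0.1", "10.0.1.2", 40001, 80, "tcp", 4))
```

Because the choice depends only on the flow identifier and not on load, the selection is effectively random with respect to how busy each path is.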

Collisions
[Figure labels: Racks of servers; 1Gbps; 1Gbps; Racks of servers]
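To get a feel for how often random path choice collides, here is a rough Monte Carlo sketch. It deliberately simplifies the network to a "balls into bins" model in which each flow independently picks one of a fixed set of equally likely links; that model, and the name `collision_stats`, are assumptions for illustration and not the simulation used in the talk.

```python
import random
from collections import Counter


def collision_stats(n_flows: int, n_links: int, seed: int = 0):
    """Return (fraction of idle links, fraction of flows sharing a link).

    Simplified model: every flow picks one link uniformly at random.
    """
    rng = random.Random(seed)
    load = Counter(rng.randrange(n_links) for _ in range(n_flows))
    idle_links = n_links - len(load)
    collided_flows = sum(c for c in load.values() if c > 1)
    return idle_links / n_links, collided_flows / n_flows


if __name__ == "__main__":
    idle, collided = collision_stats(10_000, 10_000)
    print(f"idle links: {idle:.1%}, flows sharing a link: {collided:.1%}")
```

With as many flows as links, this model leaves roughly a third of the links idle (about 1/e) while most flows share a link with at least one other flow, which is the kind of waste the Collisions slide illustrates.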

Can MPTCP self-optimize data-center traffic?
With Multipath TCP we can explore many paths:
Instead of using one random path, use many random paths.
Don’t worry about collisions. Just don’t send (much) traffic on colliding paths.

Simulation Setup
~8000 hosts.
Long-lived flows.
Permutation traffic matrix: each host sends to and receives from a single other randomly chosen host.
This is the smallest amount of traffic that can fill the network.
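A permutation traffic matrix of this kind can be generated as a random permutation with no fixed points, so that no host sends to itself. The sketch below uses simple rejection sampling; the function name `permutation_traffic` is hypothetical, and the talk does not say how the authors generated theirs.

```python
import random


def permutation_traffic(n_hosts: int, seed=None) -> list:
    """Return dst, where host i sends to dst[i].

    Every host sends to exactly one other host and receives from
    exactly one other host (a random derangement).
    """
    if n_hosts < 2:
        raise ValueError("need at least two hosts")
    rng = random.Random(seed)
    dst = list(range(n_hosts))
    while True:
        rng.shuffle(dst)
        # Reject permutations in which some host would send to itself.
        if all(src != d for src, d in enumerate(dst)):
            return dst


if __name__ == "__main__":
    print(permutation_traffic(8, seed=1))
```

Rejection sampling is cheap here because roughly one random permutation in e (about 37%) has no fixed points, so only a few shuffles are needed on average.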

Multipath TCP in the Fat Tree Topology
[Figure: throughput allocation]

Performance depends on topology
[Figure panels: VL2; BCube]

Overloaded Fat Tree: better fairness with Multipath TCP

Centralized Scheduling
With Random Load Balancing (RLB), it’s really hard to utilize FatTree.
Hedera [Al-Fares et al., 2010] uses a centralized scheduler and flow switching:
Start by using RLB.
Measure all flow throughput periodically.
Any flow using more than 10% of its interface rate is explicitly scheduled onto an unloaded link.
How does centralized scheduling compare with MPTCP?
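The threshold rule on this slide can be sketched as follows. This is not Hedera's actual placement algorithm: the greedy "least-loaded path first" placement, the function name `schedule_large_flows`, and the flat list of interchangeable paths are all simplifying assumptions made for illustration.

```python
def schedule_large_flows(flow_rates: dict, n_paths: int,
                         interface_rate: float,
                         threshold: float = 0.10) -> dict:
    """Assign flows above threshold * interface_rate to explicit paths.

    flow_rates maps a flow id to its measured rate, in the same unit
    as interface_rate. Flows at or below the threshold are left to
    random load balancing and do not appear in the result.
    Placement is greedy (an assumption): largest flow first, onto the
    currently least-loaded path.
    """
    load = [0.0] * n_paths
    placement = {}
    large = {f: r for f, r in flow_rates.items()
             if r > threshold * interface_rate}
    for flow, rate in sorted(large.items(), key=lambda kv: kv[1],
                             reverse=True):
        path = min(range(n_paths), key=load.__getitem__)
        placement[flow] = path
        load[path] += rate
    return placement


if __name__ == "__main__":
    measured = {"a": 900.0, "b": 800.0, "c": 50.0}  # Mbps
    print(schedule_large_flows(measured, n_paths=2, interface_rate=1000.0))
```

In a real deployment this would run once per measurement interval, so flows that start or change rate between intervals are placed using stale information.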

MPTCP vs Centralized Dynamic Scheduling
[Figure labels: Centralized Scheduling; MPTCP; Infinite Scheduling Interval]

Can’t we just use many TCP connections?
[Figure: loss rate of MPTCP (“linked”) vs multiple uncoupled TCP flows]
[Figure: retransmit timeouts with MPTCP (“linked”) vs uncoupled TCP flows]

MPTCP Linked Increases in DCs
Better fairness and less aggressive than uncoupled TCP.
Improves throughput in dense traffic in BCube (25%).
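The slides do not spell out the "linked increases" rule. As a sketch, assuming it corresponds to the Linked Increases Algorithm later standardized in RFC 6356, the per-ACK window update looks like the code below. Windows are counted in packets (one ACK per packet, MSS of 1), and the function names are hypothetical.

```python
def lia_alpha(cwnds, rtts):
    """Aggressiveness factor alpha from RFC 6356 (windows in packets)."""
    total = sum(cwnds)
    best = max(w / (r * r) for w, r in zip(cwnds, rtts))
    denom = sum(w / r for w, r in zip(cwnds, rtts)) ** 2
    return total * best / denom


def on_ack(cwnds, rtts, i):
    """Coupled increase for one ACK on subflow i.

    The increase is capped by what a regular TCP flow would get on
    that path (1 / cwnd_i), so a subflow is never more aggressive
    than single-path TCP.
    """
    total = sum(cwnds)
    cwnds[i] += min(lia_alpha(cwnds, rtts) / total, 1.0 / cwnds[i])


def on_loss(cwnds, i):
    """Loss on subflow i: halve only that subflow's window, as in TCP."""
    cwnds[i] = max(cwnds[i] / 2.0, 1.0)


if __name__ == "__main__":
    cwnds, rtts = [10.0, 10.0], [0.1, 0.1]
    on_ack(cwnds, rtts, 0)
    print(cwnds)  # subflow 0 grows by less than the 0.1 uncoupled TCP would add
```

Because the increase on each subflow depends on the total window while the decrease is per subflow, windows on congested (colliding) paths shrink and traffic shifts to less loaded paths. With a single subflow, alpha is 1 and the rule reduces to ordinary TCP congestion avoidance.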

The bigger picture
Resource Allocation and Path Selection: Multipath TCP
Routing: OSPF, VLANs, etc.
Topology: ? (FatTree, VL2, BCube, multi-rooted tree)

Multipath TCP can utilize topologies TCP can’t
Requirement: a subset of hosts should be able to communicate at 10Gb/s.
[Figure labels: 1Gb/s; 10Gb/s]

Multipath TCP can utilize topologies TCP can’t [2]
Problem: ToR switch failures wipe out tens of hosts. Repair time is on the order of days.
Solution: use two ToRs per rack and multi-home the servers.
With single-path TCP: single flows still get the same maximum throughput. Which interface do I use?
With Multipath TCP: flows double their maximum throughput. Path selection is automatic.

Summary
Data center networking offers many paths between end-hosts. Yet:
Random Load Balancing does a poor job of utilizing them.
Centralized scheduling is laggy and has inherently limited knowledge.
Multipath TCP naturally optimizes data center networks: it improves throughput, improves fairness, and is more robust than centralized scheduling.
Question: what topologies does multipath TCP enable?

Backup Slides

Centralized Scheduling: Setting the Threshold
[Figure labels: Throughput; 1Gbps; Hope; App Limited; 100Mbps. Annotation: 17% worse than multipath TCP]

Centralized Scheduling: Setting the Threshold
[Figure labels: Throughput; 1Gbps; App Limited; 100Mbps; Hope. Annotation: 21% worse than multipath TCP]

Centralized Scheduling: Setting the Threshold
[Figure labels: Throughput; 1Gbps; 500Mbps: 45%, 51%; 100Mbps: 17%, 21%]