Improving Datacenter Performance and Robustness with Multipath TCP

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Improving Datacenter Performance and Robustness with Multipath TCP
Opportunistic Mobility with Multipath TCP
SDN + Storage.
Restless bandits and congestion control Mark Handley, Costin Raiciu, Damon Wischik UCL.
Multipath TCP Costin Raiciu University Politehnica of Bucharest Joint work with: Mark Handley, Damon Wischik, University College London Olivier Bonaventure,
PRESENTED BY: TING WANG PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Radhika Niranjan Mysore, Andreas Pamboris, Nathan.
Applying NOX to the Datacenter Arsalan Tavakoli, Martin Casado, Teemu Koponen, and Scott Shenker 10/22/2009Hot Topics in Networks Workshop 2009.
TDTS21 Advanced Networking
Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,
PFabric: Minimal Near-Optimal Datacenter Transport Mohammad Alizadeh Shuang Yang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker.
Utilizing Datacenter Networks: Dealing with Flow Collisions Costin Raiciu Department of Computer Science University Politehnica of Bucharest.
Balajee Vamanan et al. Deadline-Aware Datacenter TCP (D 2 TCP) Balajee Vamanan, Jahangir Hasan, and T. N. Vijaykumar.
Options or payload? Costin Raiciu UCL IETF 78, Maastricht.
Bertha & M Sadeeq.  Easy to manage the problems  Scalability  Real time and real environment  Free data collection  Cost efficient  DCTCP only covers.
Datacenter Network Topologies
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
A Scalable, Commodity Data Center Network Architecture.
Datacenter Networks Mike Freedman COS 461: Computer Networks
Second year review Resource Pooling Damon Wischik, UCL.
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Curbing Delays in Datacenters: Need Time to Save Time? Mohammad Alizadeh Sachin Katti, Balaji Prabhakar Insieme Networks Stanford University 1.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
Multipath TCP design, and application to data centers Damon Wischik, Mark Handley, Costin Raiciu, Christopher Pluntke.
VL2 – A Scalable & Flexible Data Center Network Authors: Greenberg et al Presenter: Syed M Irteza – LUMS CS678: 2 April 2013.
Routing & Architecture
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
IMPROVING DATACENTER PERFORMANCE AND ROBUSTNESS WITH MULTIPATH TCP Costin Raiciu†, Sebastien Barre‡, Christopher Pluntke†, Adam Greenhalgh†, Damon Wischik†,
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
Congestion control for Multipath TCP (MPTCP) Damon Wischik Costin Raiciu Adam Greenhalgh Mark Handley THE ROYAL SOCIETY.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Data Center Load Balancing T Seminar Kristian Hartikainen Aalto University, Helsinki, Finland
Network Traffic Characteristics of Data Centers in the Wild
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
Approaches to Improve Data Center Performance through Networking - Gurubaran.
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
Revisiting Transport Congestion Control Jian He UT Austin 1.
VL2: A Scalable and Flexible Data Center Network
Data Center Architectures
Yiting Xia, T. S. Eugene Ng Rice University
Resilient Datacenter Load Balancing in the Wild
Lecture 2: Cloud Computing
Data Center Network Topologies II
Confluent vs. Splittable Flows
How I Learned to Stop Worrying About the Core and Love the Edge
Data Center Network Architectures
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Networking Recap Storage Intro
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
NTHU CS5421 Cloud Computing
Multipath TCP Yifan Peng Oct 11, 2012
Congestion-Aware Load Balancing at the Virtual Edge
Dingming Wu+, Yiting Xia+*, Xiaoye Steven Sun+,
Congestion Control in Software Define Data Center Network
AMP: A Better Multipath TCP for Data Center Networks
VL2: A Scalable and Flexible Data Center Network
Jellyfish: Networking Data Centers Randomly
Data Center Architectures
Centralized Arbitration for Data Centers
Backbone Traffic Engineering
Congestion-Aware Load Balancing at the Virtual Edge
Lecture 17, Computer Networks (198:552)
Acknowledgement: slides include content from Hedera and MP-TCP authors
2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Towards Predictable Datacenter Networks
Presentation transcript:

Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik, Mark Handley SIGCOMM 2011

“Putting Things in Perspective” High performing network crucial for today’s datacenters Many takes… How to build better performing networks VL2, PortLand, c-Through How to manage these architectures Maximize link capacity utilization, Improve performance Hedera, Orchestra, DCTCP , MPTCP

Modern datacenters provide many parallel paths… Traditional topologies are tree-based Poor performance Not fault tolerant Shift towards multipath topologies: FatTree, BCube, VL2, Cisco, EC2

Fat Tree Topology (Fares et al., 2008; Clos, 1953) K=4 Aggregation Switches 1Gbps K Pods with K Switches each 1Gbps Racks of servers 4

How to efficiently utilize the capacity? Fat Tree Topology (Fares et al., 2008; Clos, 1953) K=4 Aggregation Switches How to efficiently utilize the capacity? K Pods with K Switches each What is the optimal load-balancing strategy? Racks of servers 5

State of the Art (as discussed in Hedera) Statically stripe flows across available paths using ECMP Collision What has been done in this space? Hedera paper discusses the state of the art in datacenters. Mention that the bottom link is not utilized.

How about mapping each flow to a different path?

How about mapping each flow to a different path?

How about mapping each flow to a different path? Not fair

How about mapping each flow to a different path? Not fair Mapping each flow to a path is the wrong approach!

Instead, pool capacity from links This is the key insight in this paper.

Use Multipath Transport Instead of using one path for each flow, use many random paths Don’t worry about collisions Just don’t send (much) traffic on colliding paths

Multipath TCP Primer (IETF MPTCP WG) A drop in replacement for TCP Spreads application data over multiple sub flows The challenging part is to do congestion control. The same group did congestion control for MPTCP which won the best paper award at NSDI 2011 - For each ACK on sub-flow r, increase the window wr by min(α/wtotal, 1/wr) - For each loss on sub-flow r, decrease the window wr by wr/2

MPTCP better utilizes the Fat Tree network Fairness matters for apps!

Understanding Gains How many sub-flows are needed? How does the topology affect results? How does the traffic matrix affect results?

At most 8 sub-flows are needed Total Throughput TCP

MPTCP improves fairness in VL2

MPTCP improves throughput and fairness in BCube

Performance improvements depend on traffic matrix Underloaded Sweet Spot Overloaded Increase Load

MPTCP enables better topologies In single homed topologies: - Host links are often bottlenecks - ToR switch failures wipe out tens of hosts for days Multi-homing is necessary 20

MPTCP enables better topologies 21

MPTCP enables better topologies Upper Pod Switch ToR Switch Servers Fat Tree Topology

MPTCP enables better topologies Upper Pod Switch ToR Switch Servers Dual Homed Fat Tree Topology

DHFT provides significant improvements when core is not overloaded Core Underloaded Core Overloaded

EC2 Experiment Same Rack

Conclusion Multipath topologies need multipath transport Multipath transport enables better topologies

Thoughts (1) Old idea applied to datacenters First suggested in 1995, then 2000s Not very nice for middleboxes Works on a wide variety of topologies (as long as there are multiple paths) Number of advantages Fairness Balanced congestion Robustness (hotspots) Backward compatible with normal TCP Can build optimized topologies

How much benefit in practice? Thoughts (2) However… Needs changes at all end-hosts Benefits heavily depend on traffic matrix, congestion control What’s the right number of sub flows? No evaluation “in the wild” No benefits for in-rack or many-to-one traffic Prioritization of flows might be hard Hedera claims 93% bisection bw. MPTCP says it can provide 3x improvements, but these two papers did not consider real traffic characteristics How much benefit in practice?

Understanding Datacenter Traffic A few papers that analyzed datacenter traffic: “The Nature of Data Center Traffic: Measurements and Analysis” – IMC 2009 “Network Traffic Characteristics of Data Centers in the Wild” – IMC 2010 3 US universities - distributed file servers, email server 2 private enterprises - custom line-of-business apps 5 commercial cloud data centers - MR, search, advertising, datamining etc. VL2, DCTCP also does this, but on specialized data centers

Understanding Datacenter Traffic “Most flows in the data centers are small in size (<10KB)...”. In other words, elephants are a very small fraction. All the results from this paper are from prolonged flows.

Understanding Datacenter Traffic Majority of the traffic in cloud datacenters stay within the rack. Does this mean there is no need for load balancing schemes?

Understanding Datacenter Traffic Only a fraction of the existing bisection capacity is likely to be utilized at any given time => no need for more bandwidth 25% of core links are hot spots at any time => Load balancing mechanisms for spreading traffic across the existing links in the network’s core helpful Which means all of the core won’t be overloaded at any time => scope for mptcp. Why not hedera (centralized)?

Understanding Datacenter Traffic Centralized controllers: Significant amount of flows (~20%) arrive within 20us. Parallelization important. Most flows last less than 100ms Reactive controllers add ~10ms overhead This overhead might not be acceptable. MPTCP would be useful, but totally depends on traffic characteristics

Backups

Hedera vs MPTCP Hedera MPTCP Load balancing Overhead Deployment Centralized Distributed Overhead Flow measurement, running scheduler Creating sub-flows Deployment Open flow support in switches, central scheduler Replace TCP stack Traffic differentiation Easier compared to a distributed solution Hard(?) Optimality Centralized, so better optimized solution Doesn’t have global view so might not be the most optimal

DCTCP vs MPTCP Perhaps MP-DCTCP? DCTPC MPTCP Deployment Coexistance ECN support in switches, TCP stack changes Replace TCP stack Coexistance Might throttle regular TCP flows due to the difference in congestion control Yes Multi-homed topologies Cannot fully utilize since it still is single flow Can fully utilize multi-homed topologies Perhaps MP-DCTCP?