Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik,

Slides:



Advertisements
Similar presentations
Data Center Networking with Multipath TCP
Advertisements

Improving Datacenter Performance and Robustness with Multipath TCP
Opportunistic Mobility with Multipath TCP
SDN + Storage.
Restless bandits and congestion control Mark Handley, Costin Raiciu, Damon Wischik UCL.
Multipath TCP Costin Raiciu University Politehnica of Bucharest Joint work with: Mark Handley, Damon Wischik, University College London Olivier Bonaventure,
Data Center Fabrics. Forwarding Today Layer 3 approach: – Assign IP addresses to hosts hierarchically based on their directly connected switch. – Use.
PRESENTED BY: TING WANG PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric Radhika Niranjan Mysore, Andreas Pamboris, Nathan.
Applying NOX to the Datacenter Arsalan Tavakoli, Martin Casado, Teemu Koponen, and Scott Shenker 10/22/2009Hot Topics in Networks Workshop 2009.
TDTS21 Advanced Networking
Fixing TCP in Datacenters Costin Raiciu Advanced Topics in Distributed Systems 2011.
PFabric: Minimal Near-Optimal Datacenter Transport Mohammad Alizadeh Shuang Yang, Milad Sharif, Sachin Katti, Nick McKeown, Balaji Prabhakar, Scott Shenker.
Utilizing Datacenter Networks: Dealing with Flow Collisions Costin Raiciu Department of Computer Science University Politehnica of Bucharest.
Balajee Vamanan et al. Deadline-Aware Datacenter TCP (D 2 TCP) Balajee Vamanan, Jahangir Hasan, and T. N. Vijaykumar.
Options or payload? Costin Raiciu UCL IETF 78, Maastricht.
Bertha & M Sadeeq.  Easy to manage the problems  Scalability  Real time and real environment  Free data collection  Cost efficient  DCTCP only covers.
Datacenter Network Topologies
Virtual Layer 2: A Scalable and Flexible Data-Center Network Work with Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Parantap Lahiri,
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
A Scalable, Commodity Data Center Network Architecture.
Datacenter Networks Mike Freedman COS 461: Computer Networks
Routing of Outgoing Packets with MP-TCP draft-handley-mptcp-routing-00 Mark Handley Costin Raiciu Marcelo Bagnulo.
ElasticTree: Saving Energy in Data Center Networks 許倫愷 2013/5/28.
Network Support for Cloud Services Lixin Gao, UMass Amherst.
TCP & Data Center Networking
Curbing Delays in Datacenters: Need Time to Save Time? Mohammad Alizadeh Sachin Katti, Balaji Prabhakar Insieme Networks Stanford University 1.
Advanced Topics in Distributed Systems Fall 2011 Instructor: Costin Raiciu.
Multipath TCP design, and application to data centers Damon Wischik, Mark Handley, Costin Raiciu, Christopher Pluntke.
VL2 – A Scalable & Flexible Data Center Network Authors: Greenberg et al Presenter: Syed M Irteza – LUMS CS678: 2 April 2013.
Routing & Architecture
DARD: Distributed Adaptive Routing for Datacenter Networks Xin Wu, Xiaowei Yang.
IMPROVING DATACENTER PERFORMANCE AND ROBUSTNESS WITH MULTIPATH TCP Costin Raiciu†, Sebastien Barre‡, Christopher Pluntke†, Adam Greenhalgh†, Damon Wischik†,
© Copyright 2010 Hewlett-Packard Development Company, L.P. 1 Jayaram Mudigonda, HP Labs Praveen Yalagandula, HP Labs Mohammad Al-Fares, UCSD Jeff Mogul,
A.SATHEESH Department of Software Engineering Periyar Maniammai University Tamil Nadu.
Congestion control for Multipath TCP (MPTCP) Damon Wischik Costin Raiciu Adam Greenhalgh Mark Handley THE ROYAL SOCIETY.
VL2: A Scalable and Flexible Data Center Network Albert Greenberg, James R. Hamilton, Navendu Jain, Srikanth Kandula, Changhoon Kim, Parantap Lahiri, David.
Demystifying and Controlling the Performance of Big Data Jobs Theophilus Benson Duke University.
Department of Computer Science A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares Alexander Loukissas Amin Vahdat SIGCOMM’08 Reporter:
CISC856 University of Delaware
Data Center Load Balancing T Seminar Kristian Hartikainen Aalto University, Helsinki, Finland
Network Traffic Characteristics of Data Centers in the Wild
Jiaxin Cao, Rui Xia, Pengkun Yang, Chuanxiong Guo,
Theophilus Benson*, Ashok Anand*, Aditya Akella*, Ming Zhang + *University of Wisconsin, Madison + Microsoft Research.
Approaches to Improve Data Center Performance through Networking - Gurubaran.
1 Three ways to (ab)use Multipath Congestion Control Costin Raiciu University Politehnica of Bucharest.
MMPTCP: A Multipath Transport Protocol for Data Centres 1 Morteza Kheirkhah University of Edinburgh, UK Ian Wakeman and George Parisis University of Sussex,
6.888: Lecture 4 Data Center Load Balancing Mohammad Alizadeh Spring
Revisiting Transport Congestion Control Jian He UT Austin 1.
VL2: A Scalable and Flexible Data Center Network
Data Center Architectures
Chen Qian, Xin Li University of Kentucky
Yiting Xia, T. S. Eugene Ng Rice University
Resilient Datacenter Load Balancing in the Wild
Data Center Network Topologies II
Confluent vs. Splittable Flows
Data Center Network Architectures
Hydra: Leveraging Functional Slicing for Efficient Distributed SDN Controllers Yiyang Chang, Ashkan Rezaei, Balajee Vamanan, Jahangir Hasan, Sanjay Rao.
Multipath TCP in SDN-enabled LEO Satellite Networks
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
Improving Datacenter Performance and Robustness with Multipath TCP
Multipath TCP Yifan Peng Oct 11, 2012
Congestion Control in Software Define Data Center Network
AMP: A Better Multipath TCP for Data Center Networks
Jellyfish: Networking Data Centers Randomly
Data Center Architectures
Centralized Arbitration for Data Centers
Acknowledgement: slides include content from Hedera and MP-TCP authors
2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,
Presentation transcript:

Improving Datacenter Performance and Robustness with Multipath TCP Costin Raiciu, Sebastien Barre, Christopher Pluntke, Adam Greenhalgh, Damon Wischik, Mark Handley SIGCOMM 2011 Presented by Anand Iyer (Most slides borrowed from Costin’s presentation)

“Putting Things in Perspective” High performing network crucial for today’s datacenters Many takes… –How to build better performing networks VL2, PortLand, c-Through –How to manage these architectures Maximize link capacity utilization, Improve performance Hedera, Orchestra, DCTCP, MPTCP

Modern datacenters provide many parallel paths… Traditional topologies are tree-based –Poor performance –Not fault tolerant Shift towards multipath topologies: FatTree, BCube, VL2, Cisco, EC2 …

Fat Tree Topology (Fares et al., 2008; Clos, 1953) K=4 1Gbps Aggregation Switches K Pods with K Switches each Racks of servers

K=4 Aggregation Switches K Pods with K Switches each Racks of servers Fat Tree Topology (Fares et al., 2008; Clos, 1953) How to efficiently utilize the capacity?

Collision State of the Art (as discussed in Hedera) Statically stripe flows across available paths using ECMP

How about mapping each flow to a different path?

Not fair How about mapping each flow to a different path?

Not fair How about mapping each flow to a different path? Mapping each flow to a path is the wrong approach!

Instead, pool capacity from links

–Instead of using one path for each flow, use many random paths –Don’t worry about collisions –Just don’t send (much) traffic on colliding paths Use Multipath Transport

A drop in replacement for TCP Spreads application data over multiple sub flows Multipath TCP Primer (IETF MPTCP WG) - For each ACK on sub-flow r, increase the window w r by min(α/w total, 1/w r ) - For each loss on sub-flow r, decrease the window w r by w r /2

MPTCP better utilizes the Fat Tree network

1.How many sub-flows are needed? 1.How does the topology affect results? 1.How does the traffic matrix affect results? Understanding Gains

At most 8 sub-flows are needed Total Throughput TCP

MPTCP improves fairness in VL2 VL2

MPTCP improves throughput and fairness in BCube BCube

Performance improvements depend on traffic matrix OverloadedUnderloaded Sweet Spot Increase Load

In single homed topologies: - Host links are often bottlenecks - ToR switch failures wipe out tens of hosts for days Multi-homing is necessary MPTCP enables better topologies

Fat Tree Topology ToR Switch Servers Upper Pod Switch MPTCP enables better topologies

Dual Homed Fat Tree Topology ToR Switch Servers Upper Pod Switch MPTCP enables better topologies

Core Overloaded Core Underloaded DHFT provides significant improvements when core is not overloaded

EC2 Experiment Same Rack

Multipath topologies need multipath transport Multipath transport enables better topologies Conclusion

Thoughts (1) Old idea applied to datacenters –First suggested in 1995, then 2000s –Not very nice for middleboxes Works on a wide variety of topologies (as long as there are multiple paths) Number of advantages –Fairness –Balanced congestion –Robustness (hotspots) –Backward compatible with normal TCP –Can build optimized topologies

Thoughts (2) However… –Needs changes at all end-hosts –Benefits heavily depend on traffic matrix, congestion control –What’s the right number of sub flows? –No evaluation “in the wild” –No benefits for in-rack or many-to-one traffic –Prioritization of flows might be hard How much benefit in practice?

Understanding Datacenter Traffic A few papers that analyzed datacenter traffic: –“The Nature of Data Center Traffic: Measurements and Analysis” – IMC 2009 –“Network Traffic Characteristics of Data Centers in the Wild” – IMC US universities - distributed file servers, server 2 private enterprises - custom line-of-business apps 5 commercial cloud data centers - MR, search, advertising, datamining etc.

Understanding Datacenter Traffic “Most flows in the data centers are small in size (<10KB)...”. In other words, elephants are a very small fraction.

Understanding Datacenter Traffic Majority of the traffic in cloud datacenters stay within the rack.

Understanding Datacenter Traffic Only a fraction of the existing bisection capacity is likely to be utilized at any given time => no need for more bandwidth 25% of core links are hot spots at any time => Load balancing mechanisms for spreading traffic across the existing links in the network’s core helpful

Understanding Datacenter Traffic Centralized controllers: –Significant amount of flows (~20%) arrive within 20us. Parallelization important. –Most flows last less than 100ms Reactive controllers add ~10ms overhead This overhead might not be acceptable. MPTCP would be useful, but totally depends on traffic characteristics

Backups

Hedera vs MPTCP HederaMPTCP Load balancing CentralizedDistributed Overhead Flow measurement, running scheduler Creating sub-flows Deployment Open flow support in switches, central scheduler Replace TCP stack Traffic differentiation Easier compared to a distributed solution Hard(?) Optimality Centralized, so better optimized solution Doesn’t have global view so might not be the most optimal

DCTCP vs MPTCP DCTPCMPTCP Deployment ECN support in switches, TCP stack changes Replace TCP stack Coexistance Might throttle regular TCP flows due to the difference in congestion control Yes Multi-homed topologies Cannot fully utilize since it still is single flow Can fully utilize multi- homed topologies Perhaps MP-DCTCP?