Presentation on theme: "Data Center Networking with Multipath TCP"— Presentation transcript:
1Data Center Networking with Multipath TCP UCLData Center Networking with Multipath TCPCostin RaiciuUniversity College London & Universitatea Politehnica BucurestiChristopher Pluntke, UCLAdam Greenhalgh, UCLSebastien Barre, Universite Catholique LouvainDamon Wischik. UCLMark Handley, UCL
2Data Center Networking Today Resource AllocationTCPPath SelectionRandom load balancingRoutingOSPF, VLANs, TRILLShow multiple paths between serversSay that network is rearrangeably non blockingClosTopologyFatTree, VL2, BCube,multi-rooted tree2
3Data Center Networking Tomorrow Resource AllocationMultipathTCPTCPPath SelectionRandom load balancingRoutingOSPF, VLANs, TRILLShow multiple paths between serversSay that network is rearrangeably non blockingClosTopologyFatTree, VL2, BCube,multi-rooted tree3
4Data Centers are Important Cloud computingEconomies of scale: networks of tens of thousands of hostsCool appsWeb search, GFS, BigTable, DryadLINQ, MapReduceDense traffic patterns
5Flexibility is Important in Data Centers Apps distributed across thousands of machines.Flexibility: want any machine to be able to play any role.But:Traditional data center topologies are tree based.Don’t cope well with non-local traffic patterns.Many recent proposals for better topologies.
6Traditional Data Center Topology Core Switch10GbpsAggregation Switches10GbpsTop of RackSwitches1GbpsRacks ofservers…
7Fat Tree Topology [Fares et al., 2008; Clos, 1953] K=4Aggregation Switches1GbpsK Pods withK Switcheseach1GbpsShow multiple paths between serversSay that network is rearrangeably non blockingClosRacks ofservers7
8VL2 Topology [Greenberg et al, 2009, Clos topology] 10Gbps…10Gbps20 hosts
10How Do We Use this Capacity? Need to distribute flows across paths.Basic solution: Random Load Balancing.Use Equal-Cost Multipath (ECMP) routing.Hash to a path at random.Use many differently rooted VLANs.End-host hashes to a VLAN; determines path.
11Collisions Racks of servers 1Gbps 1Gbps Show multiple paths between serversSay that network is rearrangeably non blockingClosRacks ofservers11
12Can MPTCP self-optimize data-center traffic? With Multipath TCP we can explore many paths:Instead of using one random path, use many random pathsDon’t worry about collisions.Just don’t send (much) traffic on colliding paths
13Simulation Setup~8000 hostsLong-lived flowsPermutation traffic matrixEach hosts sends and receives from a single other randomly chosen hostSmallest amount of traffic that can fill the network
14Multipath TCP in the Fat Tree Topology Throughput Allocation
16Overloaded Fat Tree: better fairness with Multipath TCP
17Centralized Scheduling With RLB, it’s really hard to utilize FatTree.Hedera [Fares et al.,2010] uses a centralized scheduler and flow switching.Start by using RLBMeasure all flow throughput periodically.Any flow using more than 10% of its interface rate is explicitly scheduled onto an unloaded link.How does centralized scheduling compare with MPTCP?
18MPTCP vs Centralized Dynamic Scheduling Centralized Scheduling MPTCPAnimate tough part of the graphInfiniteScheduling Interval18
19Can’t we just use many TCP connections? Loss rate of MP-TCP (“linked”) vs multiple uncoupled TCP flowsRetransmit timeouts with MPTCP (“linked”) vs uncoupled TCP flows
20MPTCP Linked Increases in DCs Better fairness and less aggressive than uncoupled TCPImproves throughput in dense traffic in BCube (25%)
21? The bigger picture Resource Allocation Path Selection Routing MultipathTCPPath SelectionRoutingOSPF, VLANs, etc.Topology?FatTree, VL2, Bcube,multi-rooted tree
22Multipath TCP can utilize topologies TCP can’t Requirement: a subset of hosts should be able to communicate at 10Gb/s1Gb/sShow multiple paths between serversSay that network is rearrangeably non blockingClos10Gb/s22
23Multipath TCP can utilize topologies TCP can’t  ProblemToR switch failures wipe out tens of hostsRepair time is on the order of daysSolution: use two ToRs/rack, multi-home serversSingle path TCPSingle flows still get same max throughputWhich interface do I use?With Multipath TCPFlows double their maximum throughputPath selection automaticShow multiple paths between serversSay that network is rearrangeably non blockingClos23
24SummaryData center networking offers many paths between end-hosts. Yet:Random Load Balancing does a poor job of utilizing themCentralized scheduling is laggy and has inherently limited knowledgeMultipath TCP naturally optimizes data center networks:Improves throughputImproves fairnessMore robust than centralized schedulingQuestion: what topologies does multipath TCP enable?