Presentation is loading. Please wait.

Presentation is loading. Please wait.

On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1.

Similar presentations


Presentation on theme: "On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1."— Presentation transcript:

1 On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1

2 Datacenter Networks 2 1000s of server ports  Must complete transfers or “flows” quickly need very high throughput, very low latency web app db map- reduce HPC monitoring cache

3 Leaf-Spine DC Fabric 3 Approximates ideal output-queued switch H1 H2 H3 H4 H5 H6 H7 H8 H9 H1 H2 H3 H4 H5 H6 H7 H8 H9 Leaf switches Spine switches ≈ How close is Leaf-Spine to ideal OQ switch? What impacts its performance? – Link speeds, oversubscription, buffering

4 Methodology 4 Widely deployed mechanisms – TCP-Reno+Sack – DropTail (10MB shared buffer per switch) – ECMP (hash-based) load balancing OMNET++ simulations – 100×10Gbps servers (2-tiers) – Actual Linux 2.6.26 TCP stack Realistic workloads – Bursty query traffic – All-to-all background traffic: web search, data mining Metric: Flow completion time

5 Impact of Link Speed 5 Three non-oversubscribed topologies: 20×10Gbps Downlinks 20×10Gbps Uplinks 20×10Gbps Downlinks 5×40Gbps Uplinks 20×10Gbps Downlinks 2×100Gbps Uplinks

6 Better Impact of Link Speed 6 40/100Gbps fabric: ~ same FCT as OQ 10Gbps fabric: FCT up 40% worse than OQ

7 Intuition Higher speed links improve ECMP efficiency 7 20×10Gbps Uplinks 2×100Gbps Uplinks 11×10Gbps flows (55% load) 12 1 220 Prob of 100% throughput = 3.27% Prob of 100% throughput = 99.95%

8 Impact of Buffering 8 Where do queues build up? 2.5:1 oversubscribed fabric with large buffers Spine port 24% load 60% load Leaf front-panel port Leaf fabric port Leaf fabric ports queues ≈ Spine port queues – 1:1 port correspondence; same speed & load

9 Impact of Buffering Where are large buffers more effective for Incast? 9 Better Larger buffers better at Leaf than Spine

10 Summary 40/100Gbps fabric + ECMP ≈ OQ-switch; some performance loss with 10Gbps fabric Buffering should generally be consistent in Leaf & Spine tiers Larger buffers more useful in Leaf than Spine for Incast 10

11 11


Download ppt "On the Data Path Performance of Leaf-Spine Datacenter Fabrics Mohammad Alizadeh Joint with: Tom Edsall 1."

Similar presentations


Ads by Google