2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu,

Slides:



Advertisements
Similar presentations
PortLand: A Scalable Fault-Tolerant Layer 2 Data Center Network Fabric. Presented by: Vinuthna Nalluri Shiva Srivastava.
Advertisements

Improving TCP Performance over Mobile Ad Hoc Networks by Exploiting Cross- Layer Information Awareness Xin Yu Department Of Computer Science New York University,
A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares, Alexander Loukissas, Amin Vahdat Presented by Gregory Peaker and Tyler Maclean.
A Scalable, Commodity Data Center Network Architecture.
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
OpenFlow-Based Server Load Balancing GoneWild Author : Richard Wang, Dana Butnariu, Jennifer Rexford Publisher : Hot-ICE'11 Proceedings of the 11th USENIX.
Introduction to Network Layer. Network Layer: Motivation Can we built a global network such as Internet by extending LAN segments using bridges? –No!
Packet Classification using Rule Caching Author: Nitesh B. Guinde, Roberto Rojas-Cessa, Sotirios G. Ziavras Publisher: IISA, 2013 Fourth International.
Packet Classification Using Multi-Iteration RFC Author: Chun-Hui Tsai, Hung-Mao Chu, Pi-Chung Wang Publisher: COMPSACW, 2013 IEEE 37th Annual (Computer.
CS 453 Computer Networks Lecture 18 Introduction to Layer 3 Network Layer.
Load-Balancing Routing in Multichannel Hybrid Wireless Networks With Single Network Interface So, J.; Vaidya, N. H.; Vehicular Technology, IEEE Transactions.
Department of Computer Science A Scalable, Commodity Data Center Network Architecture Mohammad Al-Fares Alexander Loukissas Amin Vahdat SIGCOMM’08 Reporter:
Selective Packet Inspection to Detect DoS Flooding Using Software Defined Networking Author : Tommy Chin Jr., Xenia Mountrouidou, Xiangyang Li and Kaiqi.
Early Detection of DDoS Attacks against SDN Controllers
Shadow MACs: Scalable Label- switching for Commodity Ethernet Author: Kanak Agarwal, John Carter, Eric Rozner and Colin Dixon Publisher: HotSDN 2014 Presenter:
Updating Designed for Fast IP Lookup Author : Natasa Maksic, Zoran Chicha and Aleksandra Smiljani´c Conference: IEEE High Performance Switching and Routing.
Binary-tree-based high speed packet classification system on FPGA Author: Jingjiao Li*, Yong Chen*, Cholman HO**, Zhenlin Lu* Publisher: 2013 ICOIN Presenter:
Lightweight Traffic-Aware Packet Classification for Continuous Operation Author: Shariful Hasan Shaikot, Min Sik Kim Presenter: Yen-Chun Tseng Date: 2014/11/26.
Packet Classification Using Dynamically Generated Decision Trees
LOP_RE: Range Encoding for Low Power Packet Classification Author: Xin He, Jorgen Peddersen and Sri Parameswaran Conference : IEEE 34th Conference on Local.
Hierarchical Hybrid Search Structure for High Performance Packet Classification Authors : O˜guzhan Erdem, Hoang Le, Viktor K. Prasanna Publisher : INFOCOM,
Scalable Multi-match Packet Classification Using TCAM and SRAM Author: Yu-Chieh Cheng, Pi-Chung Wang Publisher: IEEE Transactions on Computers (2015) Presenter:
INTRODUCTION NETWORKING CONCEPTS AND ADMINISTRATION CSIS 3723
COS 561: Advanced Computer Networks
HULA: Scalable Load Balancing Using Programmable Data Planes
Resilient Datacenter Load Balancing in the Wild
2018/4/23 Dynamic Load-balanced Path Optimization in SDN-based Data Center Networks Author: Yuan-Liang Lan , Kuochen Wang and Yi-Huai Hsu Presenter: Yi-Hsien.
Minimizing latency of critical traffic through SDN
How I Learned to Stop Worrying About the Core and Love the Edge
Data Center Network Architectures
How I Learned to Stop Worrying About the Core and Love the Edge
Introduction to Wireless Sensor Networks
3. Internetworking (part 1)
ECE 544: Traffic engineering (supplement)
Improving Datacenter Performance and Robustness with Multipath TCP
2018/6/26 An Energy-efficient TCAM-based Packet Classification with Decision-tree Mapping Author: Zhao Ruan, Xianfeng Li , Wenjun Li Publisher: 2013.
NOX: Towards an Operating System for Networks
VIRTUAL SERVERS Presented By: Ravi Joshi IV Year (IT)
Congestion-Aware Load Balancing at the Virtual Edge
Hamed Rezaei, Mojtaba Malekpourshahraki, Balajee Vamanan
2018/11/19 Source Routing with Protocol-oblivious Forwarding to Enable Efficient e-Health Data Transfer Author: Shengru Li, Daoyun Hu, Wenjian Fang and.
Using Link Cost as a Metric
Parallel Processing Priority Trie-based IP Lookup Approach
2018/12/10 Energy Efficient SDN Commodity Switch based Practical Flow Forwarding Method Author: Amer AlGhadhban and Basem Shihada Publisher: 2016 IEEE/IFIP.
2018/12/29 A Novel Approach for Prefix Minimization using Ternary trie (PMTT) for Packet Classification Author: Sanchita Saha Ray, Abhishek Chatterjee,
PRESENTATION COMPUTER NETWORKS
2019/1/1 High Performance Intrusion Detection Using HTTP-Based Payload Aggregation 2017 IEEE 42nd Conference on Local Computer Networks (LCN) Author: Felix.
Memory-Efficient Regular Expression Search Using State Merging
Virtual TCAM for Data Center Switches
EE 122: Lecture 7 Ion Stoica September 18, 2001.
A Small and Fast IP Forwarding Table Using Hashing
Scalable Multi-Match Packet Classification Using TCAM and SRAM
A New String Matching Algorithm Based on Logical Indexing
Congestion-Aware Load Balancing at the Virtual Edge
Packet Switching Outline Store-and-Forward Switches
2019/5/2 Using Path Label Routing in Wide Area Software-Defined Networks with OpenFlow ICNP = International Conference on Network Protocols Presenter:Hung-Yen.
SDN-Guard: DoS Attacks Mitigation in SDN Networks
Fast Network Congestion Detection And Avoidance Using P4
Large-scale Packet Classification on FPGA
OpenSec:Policy-Based Security Using Software-Defined Networking
Design principles for packet parsers
A Hybrid IP Lookup Architecture with Fast Updates
2019/7/26 OpenFlow-Enabled User Traffic Profiling in Campus Software Defined Networks Presenter: Wei-Li,Wang Date: 2016/1/4 Author: Taimur Bakhshi and.
2019/9/14 The Deep Learning Vision for Heterogeneous Network Traffic Control Proposal, Challenges, and Future Perspective Author: Nei Kato, Zubair Md.
A SRAM-based Architecture for Trie-based IP Lookup Using FPGA
2019/10/9 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Jin-Li Ye, Yu-Huang Chu, Chien Chen.
Jennifer Rexford Princeton University
MEET-IP Memory and Energy Efficient TCAM-based IP Lookup
Towards TCAM-based Scalable Virtual Routers
2019/11/12 Efficient Measurement on Programmable Switches Using Probabilistic Recirculation Presenter:Hung-Yen Wang Authors:Ran Ben Basat, Xiaoqi Chen,
Presentation transcript:

2019/5/13 A Weighted ECMP Load Balancing Scheme for Data Centers Using P4 Switches Presenter:Hung-Yen Wang Authors:Peng Wang, George Trimponias, Hong Xu, Yanhui Geng Published in:2019 Transactions on Parallel and Distributed Systems Department of Computer Science and Information Engineering National Cheng Kung University, Taiwan R.O.C. CSIE CIAL Lab 1

2019/5/13 Introduction Recent work such as Presto proposes to break flows into small flowcells and load balance flowcells across available paths in a round-robin fashion. By transforming the heavy-tailed flows into many smaller flowcells, Presto can better balance the load and improve flow completion time (FCT) for medium and large flows. However, in practice most flows are small and only have a few flowcells. We find that in one production network 90 percent of the flows have less than 6 flowcells. This implies that a flow can only utilize a few random paths out of the hundreds available in typical large scale production networks . National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Introduction A more robust approach is congestion-aware load balancing advocated by CONGA and HULA. Switches monitor congestion levels for each path and direct a flow or flowlet to the least congested path. This is responsive to changing network conditions, and robust to failures and network asymmetry. To make the best load balancing decisions, prior work strives to collect congestion feedback for each path between the source and destination ToR switches. These omniscient schemes perform well in small scale enterprise networks with simple 2-tier leaf-spine topologies. The challenge is that they have serious scalability and overhead issues that impede the deployment potential in large-scale networks. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Introduction We explore a different direction: what if we use congestion information of just a few random paths for load balancing? To answer this question we design and evaluate Luopan,1 a sampling based load balancing protocol for large-scale data center networks. Inspired by the power of two choices in randomized load balancing, in Luopan each ToR switch sends probe packets to sample a few random paths towards each active destination ToR switch. It stores the information in a local table, directs flowcells to the least congested path, and periodically re-samples the network to refresh the table. Luopan greatly reduces the overhead of maintaining global congestion information, and is more scalable for practical deployment. It also offers a much simpler switch implementation compared to existing work. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Design Decisions The first key design decision is to adopt flowcells as the load balancing granularity. Simply balancing the number of flows does not balance the load on each path. Sub-flow level load balancing achieves better FCT by transforming the heavy-tailed flows into many data units of similar sizes. Presto uses round-robin to route flowcells. This congestion-agnostic approach balances the link load well in symmetric topologies and delivers good FCT performance for medium and large flows. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

Cons of congestion-agnostic method 2019/5/13 Cons of congestion-agnostic method National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

Sampling versus Omniscient 2019/5/13 Sampling versus Omniscient Existing congestion-aware load balancing schemes such as CONGA and HULA maintain congestion information for all paths connecting a ToR switch pair. This is done by either frequently (every tens of microseconds) flooding probes to all paths as in HULA, or opportunistically piggybacking information on data packets as in CONGA. This omniscient approach works well in small-scale enterprise networks with 2-tier leaf-spine topologies. It however suffers from serious scalability challenges that make it difficult to deploy in practice. Thus, Luopan uses randomized sampling to practically exploit congestion feedback in a large-scale topology. Sampling has much less overhead and scales better than the omniscient approach. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

Sampling versus Omniscient 2019/5/13 Sampling versus Omniscient Existing congestion-aware load balancing schemes such as CONGA and HULA maintain congestion information for all paths connecting a ToR switch pair. This is done by either frequently (every tens of microseconds) flooding probes to all paths as in HULA, or opportunistically piggybacking information on data packets as in CONGA. This omniscient approach works well in small-scale enterprise networks with 2-tier leaf-spine topologies. It however suffers from serious scalability challenges that make it difficult to deploy in practice. Thus, Luopan uses randomized sampling to practically exploit congestion feedback in a large-scale topology. Sampling has much less overhead and scales better than the omniscient approach. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Design: Path Sampling Luopan builds upon flowcells that are at most 64KB. Flowcells are more sensitive to queuing delay rather than link utilization, since each of them finishes quickly. Besides, transient congestion caused by collisions among flowcells happens in a short time period, which is hard to be captured by link utilization. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

Design: Path Selection 2019/5/13 Design: Path Selection Similar to Presto, a sending host in Luopan adds a sequentially increasing flowcell ID into each packet. Flowcell ID increases by 1 for every 64KB data sent out. The source MAC address field is used to hold the flowcell ID. Luopan recognizes the flowcell ID for each packet by matching the source MAC address field. Luopan makes path selection decisions on the first packet of each flowcell, and records the selected path ID in a flow table. Each flow table entry records flow ID (i.e., flow’s 5-tuple), flowcell ID, path ID, and time last seen. When a new packet arrives and matches both the flow ID and flowcell ID, it is a subsequent packet from a recorded flowcell and thus routed to the selected path recorded. If it only matches the flow ID, it belongs to the next flowcell and the new flowcell ID is recorded. If it does not match any entry in the flow table, it is the first packet of a flowcell from a new flow and a new entry is added to the flow table. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Evaluation We use a 12-pod fat-tree as the baseline topology. The fabric consists of 432 hosts and 36 core switches. There are 36 equal-cost paths between any pair of ToR switches across pods. Link capacity is 10Gbps, and the fabric RTT is 40ms. We use two realistic workloads from production data centers running web search and cache jobs. National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Evaluation National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Evaluation National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab

2019/5/13 Evaluation National Cheng Kung University CSIE Computer & Internet Architecture Lab CSIE CIAL Lab