Network Architecture for Joint Failure Recovery and Traffic Engineering Martin Suchara in collaboration with: D. Xu, R. Doverspike, D. Johnson and J. Rexford.

Slides:



Advertisements
Similar presentations
Failure Resilient Routing Simple Failure Recovery with Load Balancing Martin Suchara in collaboration with: D. Xu, R. Doverspike, D. Johnson and J. Rexford.
Advertisements

Greening Backbone Networks Shutting Off Cables in Bundled Links Will Fisher, Martin Suchara, and Jennifer Rexford Princeton University.
Responsive Yet Stable Traffic Engineering Srikanth Kandula Dina Katabi, Bruce Davie, and Anna Charny.
APNOMS03 1 A Resilient Path Management for BGP/MPLS VPN Jong T. Park School of Electrical Eng. And Computer Science Kyungpook National University
1 Praveen K. Muthuswamy Electrical Computer and Systems Engineering Rensselaer Polytechnic Institute In collaboration with Koushik Kar, Aparna Gupta (RPI)
Deployment of MPLS VPN in Large ISP Networks
1 Traffic Engineering (TE). 2 Network Congestion Causes of congestion –Lack of network resources –Uneven distribution of traffic caused by current dynamic.
1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
1 EL736 Communications Networks II: Design and Algorithms Class8: Networks with Shortest-Path Routing Yong Liu 10/31/2007.
Resource Pooling A system exhibits complete resource pooling if it behaves as if there was a single pooled resource. The Internet has many mechanisms for.
Projects Related to Coronet Jennifer Rexford Princeton University
Can Congestion Control and Traffic Engineering be at Odds? Jiayue He, Mung Chiang, Jennifer Rexford Princeton University November 30 th, 2006.
Ashish Gupta Under Guidance of Prof. B.N. Jain Department of Computer Science and Engineering Advanced Networking Laboratory.
Traffic Engineering With Traditional IP Routing Protocols
Traffic Engineering Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
December 20, 2004MPLS: TE and Restoration1 MPLS: Traffic Engineering and Restoration Routing Zartash Afzal Uzmi Computer Science and Engineering Lahore.
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
Rethinking Traffic Management: Using Optimization Decomposition to Derive New Architectures Jennifer Rexford Princeton University Jiayue He, Ma’ayan Bresler,
Rethinking Internet Traffic Management Using Optimization Theory Jennifer Rexford Princeton University Joint work with Jiayue He, Martin Suchara, Ma’ayan.
A General approach to MPLS Path Protection using Segments Ashish Gupta Ashish Gupta.
Path Protection in MPLS Networks Using Segment Based Approach.
Rethinking Internet Traffic Management: From Multiple Decompositions to a Practical Protocol Jiayue He Princeton University Joint work with Martin Suchara,
On Multi-Path Routing Aditya Akella 03/25/02. What is Multi-Path Routing?  Dynamically route traffic Multiple paths to a destination Path taken dependant.
A General approach to MPLS Path Protection using Segments Ashish Gupta Ashish Gupta.
Multipath Protocol for Delay-Sensitive Traffic Jennifer Rexford Princeton University Joint work with Umar Javed, Martin Suchara, and Jiayue He
Rethinking Internet Traffic Management Using Optimization Theory Jennifer Rexford Princeton University Joint work with Jiayue He, Martin Suchara, Ma’ayan.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Wide-Area Traffic Management COS 597E: Software Defined Networking.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
SMUCSE 8344 Constraint-Based Routing in MPLS. SMUCSE 8344 Constraint Based Routing (CBR) What is CBR –Each link a collection of attributes (performance,
AGG-NANOG IP Network Traffic Engineering Albert Greenberg Internet and Networking Systems Research Lab AT&T Labs - Research; Florham Park, NJ See.
1 Latency Equalization: A Programmable Routing Service Primitive Minlan Yu Joint work with Marina Thottan, Li Li at Bell Labs.
Roadmap-Based End-to-End Traffic Engineering for Multi-hop Wireless Networks Mustafa O. Kilavuz Ahmet Soran Murat Yuksel University of Nevada Reno.
1 EL736 Communications Networks II: Design and Algorithms Class10: Restoration and Protection Design of Resilient Networks Yong Liu 11/28/2007.
Distributing Content Simplifies ISP Traffic Engineering Abhigyan Sharma* Arun Venkataramani* Ramesh Sitaraman*~ *University of Massachusetts Amherst ~Akamai.
Tomo-gravity Yin ZhangMatthew Roughan Nick DuffieldAlbert Greenberg “A Northern NJ Research Lab” ACM.
Cost-Performance Tradeoffs in MPLS and IP Routing Selma Yilmaz Ibrahim Matta Boston University.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jennifer Rexford Princeton University With Jiayue He, Rui Zhang-Shen, Ying Li,
1 EL736 Communications Networks II: Design and Algorithms Class2: Network Design Problems -- Notation and Illustrations Yong Liu 09/12/2007.
Internet Traffic Engineering by Optimizing OSPF Weights Bernard Fortz (Universit é Libre de Bruxelles) Mikkel Thorup (AT&T Labs-Research) Presented by.
Lecture 15. IGP and MPLS D. Moltchanov, TUT, Spring 2008 D. Moltchanov, TUT, Spring 2015.
Profiles and Multi-Topology Routing in Highly Heterogeneous Ad Hoc Networks Audun Fosselie Hansen Tarik Cicic Paal Engelstad Audun Fosselie Hansen – Poster,
Shannon Lab 1AT&T – Research Traffic Engineering with Estimated Traffic Matrices Matthew Roughan Mikkel Thorup
Central Control over Distributed Routing fibbing.net SIGCOMM Stefano Vissicchio 18th August 2015 UCLouvain Joint work with O. Tilmans (UCLouvain), L. Vanbever.
Scalable Multi-Class Traffic Management in Data Center Backbone Networks Amitabha Ghosh (UtopiaCompression) Sangtae Ha (Princeton) Edward Crabbe (Google)
Minimax Open Shortest Path First (OSPF) Routing Algorithms in Networks Supporting the SMDS Service Frank Yeong-Sung Lin ( 林永松 ) Information Management.
IP fast reroute s olutions and challenges Amund Kvalbein.
DaVinci: Dynamically Adaptive Virtual Networks for a Customized Internet Jiayue He, Rui Zhang-Shen, Ying Li, Cheng-Yen Lee, Jennifer Rexford, and Mung.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
6 December On Selfish Routing in Internet-like Environments paper by Lili Qiu, Yang Richard Yang, Yin Zhang, Scott Shenker presentation by Ed Spitznagel.
Fast recovery in IP networks using Multiple Routing Configurations Amund Kvalbein Simula Research Laboratory.
On Selfish Routing In Internet-like Environments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang (AT&T Labs – Research) Scott.
Tunable QoS-Aware Network Survivability Jose Yallouz Joint work with Ariel Orda Department of Electrical Engineering, Technion.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Inter-area MPLS TE Architecture and Protocol Extensions
1 Slides by Yong Liu 1, Deep Medhi 2, and Michał Pióro 3 1 Polytechnic University, New York, USA 2 University of Missouri-Kansas City, USA 3 Warsaw University.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
Separating Routing From Routers Jennifer Rexford Princeton University
System & Network Reading Group On Selfish Routing In Internet-Like Evironments Lili Qiu (Microsoft Research) Yang Richard Yang (Yale University) Yin Zhang.
Multiprotocol Label Switching (MPLS) Routing algorithms provide support for performance goals – Distributed and dynamic React to congestion Load balance.
ECE 544: Traffic engineering (supplement)
Optimal Configuration of OSPF Aggregates
Frank Yeong-Sung Lin (林永松) Information Management Department
Kevin Lee & Adam Piechowicz 10/10/2009
Label Switched VPNs – Scalability and Performance Analysis
Backbone Traffic Engineering
Frank Yeong-Sung Lin (林永松) Information Management Department
Achieving Resilient Routing in the Internet
Presentation transcript:

Network Architecture for Joint Failure Recovery and Traffic Engineering Martin Suchara in collaboration with: D. Xu, R. Doverspike, D. Johnson and J. Rexford

2 Failure Recovery and Traffic Engineering in IP Networks  Uninterrupted data delivery when links or routers fail  Goal: re-balance the network load after failure  Failure recovery essential for Backbone network operators Large datacenters Local enterprise networks

3 Challenges of Failure Recovery  Existing solutions reroute traffic to avoid failures Can use, e.g., MPLS local or global protection  Drawbacks of the existing solutions Local path protection highly suboptimal Global path protection often requires dynamic recalculation of the tunnels primary tunnel backup tunnel primary tunnel backup tunnel local global

4 Overview I.Architecture: goals and proposed design II.Optimizations: of routing and load balancing III.Evaluation: using synthetic and realistic topologies IV.Conclusion

5 Architectural Goals 3.Detect and respond to failures 1.Simplify the network Allow use of minimalist cheap routers Simplify network management 2.Balance the load Before, during, and after each failure

6 The Architecture – Components  Management system Knows topology, approximate traffic demands, potential failures Sets up multiple paths and calculates load splitting ratios  Minimal functionality in routers Path-level failure notification Static configuration No coordination with other routers

The Architecture topology design list of shared risks traffic demands t s fixed paths splitting ratios

The Architecture t s link cut fixed paths splitting ratios

The Architecture t s link cut fixed paths splitting ratios path probing 9

The Architecture t s link cut path probing fixed paths splitting ratios

11 Overview I.Architecture: goals and proposed design II.Optimizations: of routing and load balancing III.Evaluation: using synthetic and realistic topologies IV.Conclusion

12 Goal I: Find Paths Resilient to Failures  A working path needed for each allowed failure state (shared risk link group) Example of failure states: S = {e 1 }, { e 2 }, { e 3 }, { e 4 }, { e 5 }, {e 1, e 2 }, {e 1, e 5 } e1e1 e3e3 e2e2 e4e4 e5e5 R1 R2

13 Goal II: Minimize Link Loads minimize ∑ s w s ∑ e Φ(u e s ) while routing all traffic link utilization u e s cost Φ(u e s ) aggregate congestion cost weighted for all failures: links indexed by e u e s =1 Cost function is a penalty for approaching capacity failure state weight failure states indexed by s

14 Possible Solutions capabilities of routers congestion Suboptimal solution Solution not scalable Good performance and practical?  Overly simple solutions do not do well  Diminishing returns when adding functionality

15 The Idealized Optimal Solution: Finding the Paths  Assume edge router can learn which links failed  Custom splitting ratios for each failure FailureSplitting Ratios -0.4, 0.4, 0.2 e4e4 0.7, 0, 0.3 e 1 & e 2 0, 0.6, 0.4 …… configuration: e4e4 e3e3 e1e1 e2e2 e5e5 e6e6 one entry per failure

16 The Idealized Optimal Solution: Finding the Paths  Solve a classical multicommodity flow for each failure case s: min load balancing objective s.t. flow conservation demand satisfaction edge flow non-negativity  Decompose edge flow into paths and splitting ratios

17 1. State-Dependent Splitting: Per Observable Failure  Edge router observes which paths failed  Custom splitting ratios for each observed combination of failed paths FailureSplitting Ratios -0.4, 0.4, 0.2 p20.6, 0, 0.4 …… configuration: p1p1 p2p2 p3p3  NP-hard unless paths are fixed at most 2 #paths entries

18 1. State-Dependent Splitting: Per Observable Failure  If paths fixed, can find optimal splitting ratios:  Heuristic: use the same paths as the idealized optimal solution min load balancing objective s.t. flow conservation demand satisfaction path flow non-negativity

19 2. State-Independent Splitting: Across All Failure Scenarios  Edge router observes which paths failed  Fixed splitting ratios for all observable failures p1, p2, p3: 0.4, 0.4, 0.2 configuration:  Non-convex optimization even with fixed paths p1p1 p2p2 p3p3

20 2. State-Independent Splitting: Across All Failure Scenarios  Heuristic to compute splitting ratios Averages of the idealized optimal solution weighted by all failure case weights  Heuristic to compute paths Paths from the idealized optimal solution r i = ∑ s w s r i s fraction of traffic on the i-th path

21 The Two Solutions 1.State-dependent splitting 2.State-independent splitting How well do they work in practice? How do they compare to the idealized optimal solution?

22 Overview I.Architecture: goals and proposed design II.Optimizations: of routing and load balancing III.Evaluation: using synthetic and realistic topologies IV.Conclusion

23 Simulations on a Range of Topologies  Single link failures  Shared risk failures for Tier-1 topology 954 failures, up to 20 links simultaneously

24 Congestion Cost – Tier-1 IP Backbone with SRLG Failures increasing load Additional router capabilities improve performance up to a point objective value network traffic State-dependent splitting indistinguishable from optimum State-independent splitting not optimal but simple How do we compare to OSPF? Use optimized OSPF link weights [Fortz, Thorup ’02].

25 Congestion Cost – Tier-1 IP Backbone with SRLG Failures increasing load OSPF uses equal splitting on shortest paths. This restriction makes the performance worse. objective value network traffic OSPF with optimized link weights can be suboptimal

26 Average Traffic Propagation Delay in Tier-1 Backbone  Service Level Agreements guarantee 37 ms mean traffic propagation delay  Need to ensure mean delay doesn’t increase much

27 Number of Paths (Tier-1) Number of paths almost independent of the load number of paths cdfnumber of paths For higher traffic load slightly more paths

28 Traffic is Highly Variable (Tier-1) number of paths relative traffic volume (%) Time (GMT) No single traffic peak (international traffic) Can a static configuration cope with the variability?

29 Performance with Static Router Configuration (Tier-1) State dependent splitting again nearly optimal number of paths objective valuetime (GMT)

30 Overview I.Architecture: goals and proposed design II.Optimizations: of routing and load balancing III.Evaluation: using synthetic and realistic topologies IV.Conclusion

31 Conclusion  Simple mechanism combining path protection and traffic engineering  Favorable properties of state-dependent splitting algorithm:  Path-level failure information is just as good as complete failure information

Thank You! 32

33 Size of Routing Tables (Tier-1) Size after compression: use the same label for routes that share path number of paths routing table sizenetwork traffic Largest routing table among all routers