Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),

Slides:



Advertisements
Similar presentations
1 Interdomain Traffic Engineering with BGP By Behzad Akbari Spring 2011 These slides are based on the slides of Tim. G. Griffin (AT&T) and Shivkumar (RPI)
Advertisements

1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
© J. Liebeherr, All rights reserved 1 Border Gateway Protocol This lecture is largely based on a BGP tutorial by T. Griffin from AT&T Research.
Internet Routing Instability
1 BGP Anomaly Detection in an ISP Jian Wu (U. Michigan) Z. Morley Mao (U. Michigan) Jennifer Rexford (Princeton) Jia Wang (AT&T Labs)
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
Internet and Overlay Networks Ram Keralapura ECE Dept
TIE Breaking: Tunable Interdomain Egress Selection Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie with Tim Griffin.
On the Geographic Location of Internet Resources CSCI 780, Fall 2005.
1 Finding a Needle in a Haystack: Pinpointing Significant BGP Routing Changes in an IP Network Jian Wu (University of Michigan) Z. Morley Mao (University.
Traffic Engineering With Traditional IP Routing Protocols
Internet Routing (COS 598A) Today: Addressing and Routing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
1 Path Splicing Author: Murtaza Motiwala, Megan Elmore, Nick Feamster and Santosh Vempala Publisher: SIGCOMM’08 Presenter: Hsin-Mao Chen Date:2009/12/09.
Interdomain Routing and The Border Gateway Protocol (BGP) Courtesy of Timothy G. Griffin Intel Research, Cambridge UK
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
1 Policy-Based Path-Vector Routing Reading: Sections COS 461: Computer Networks Spring 2006 (MW 1:30-2:50 in Friend 109) Jennifer Rexford Teaching.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Network Protocols Designed for Optimizability Jennifer Rexford Princeton University
A Measurement Framework for Pin-Pointing Routing Changes Renata Teixeira (UC San Diego) with Jennifer Rexford (AT&T)
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Wresting Control from BGP: Scalable Fine-grained Route Control UCSD / AT&T Research Usenix —June 22, 2007 Dan Pei, Tom Scholl, Aman Shaikh, Alex C. Snoeren,
Internet Routing (COS 598A) Today: Interdomain Traffic Engineering Jennifer Rexford Tuesdays/Thursdays.
1 Design and implementation of a Routing Control Platform Matthew Caesar, Donald Caldwell, Nick Feamster, Jennifer Rexford, Aman Shaikh, Jacobus van der.
Internet Routing (COS 598A) Today: Hot-Potato Routing Jennifer Rexford Tuesdays/Thursdays 11:00am-12:20pm.
Impact of BGP Dynamics on Intra-Domain Traffic Patterns in the Sprint IP Backbone Sharad Agarwal, Chen-Nee Chuah, Supratik Bhattacharyya, Christophe Diot.
Routing Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Network Monitoring for Internet Traffic Engineering Jennifer Rexford AT&T Labs – Research Florham Park, NJ 07932
Routing.
1 Interdomain Routing Policy Reading: Sections plus optional reading COS 461: Computer Networks Spring 2008 (MW 1:30-2:50 in COS 105) Jennifer Rexford.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Backbone Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
Hot Potatoes Heat Up BGP Routing Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman Shaikh, and.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira, Aman.
1 Network Topology Measurement Yang Chen CS 8803.
Computer Networks Layering and Routing Dina Katabi
1 Meeyoung Cha (KAIST) Sue Moon (KAIST) Chong-Dae Park (KAIST) Aman Shaikh (AT&T Labs – Research) IEEE INFOCOM 2005 Poster Session Positioning Relay Nodes.
1 Meeyoung Cha, Sue Moon, Chong-Dae Park Aman Shaikh Placing Relay Nodes for Intra-Domain Path Diversity To appear in IEEE INFOCOM 2006.
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
IP is a Network Layer Protocol Physical 1 Network DataLink 1 Transport Application Session Presentation Network Physical 1 DataLink 1 Physical 2 DataLink.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
A Case Study in Understanding OSPFv2 and BGP4 Interactions Using Efficient Experiment Design David Bauer†, Murat Yuksel‡, Christopher Carothers† and Shivkumar.
Dynamics of Hot-Potato Routing in IP Networks Jennifer Rexford AT&T Labs—Research Joint work with Renata Teixeira (UCSD),
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
BGP topics to be discussed in the next few weeks: –Excessive route update –Routing instability –BGP policy issues –BGP route slow convergence problem –Interaction.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
On Understanding of Transient Interdomain Routing Failures Feng Wang, Lixin Gao, Jia Wang, and Jian Qiu Department of Electrical and Computer Engineering.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
BGP Routing Stability of Popular Destinations Jennifer Rexford, Jia Wang, Zhen Xiao, and Yin Zhang AT&T Labs—Research Florham Park, NJ All flaps are not.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
1 Chapter 4: Internetworking (IP Routing) Dr. Rocky K. C. Chang 16 March 2004.
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
Michael Schapira, Princeton University Fall 2010 (TTh 1:30-2:50 in COS 302) COS 561: Advanced Computer Networks
Traffic-aware Inter-Domain Routing for Improved Internet Routing Stability Zhenhai Duan Florida State University 1.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
Placing Relay Nodes for Intra-Domain Path Diversity Meeyoung Cha Sue Moon Chong-Dae Park Aman Shaikh Proc. of IEEE INFOCOM 2006 Speaker 游鎮鴻.
BGP Routing Stability of Popular Destinations
Jian Wu (University of Michigan)
COS 561: Advanced Computer Networks
Interdomain Traffic Engineering with BGP
Introduction to Internet Routing
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
COS 461: Computer Networks
BGP Instability Jennifer Rexford
Presentation transcript:

Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel), and Geoff Voelker (UCSD) SIGCOMM’04 – Portland, OR

SIGCOMM’04 2 Internet Routing Architecture UCSD Sprint AT&T Verio AOL interdomain routing (BGP) intradomain routing (OSPF,IS-IS) User Web Server End-to-end performance depends on all ASes along the path Changes in one AS may impact traffic and routing in other ASes

SIGCOMM’04 3 Hot-Potato Routing San Francisco Dallas New York Hot-potato routing = route to closest egress point when there is more than one route to destination ISP network 9 10 dst multiple connections to the same peer -All traffic from customer to peers -All traffic to customer prefixes with multiple connections

SIGCOMM’04 4 Hot-Potato Disruption San Francisco Dallas New York ISP network dst failure - planned maintenance - traffic engineering 11 Routes to thousands of destinations switch exit point!!! 11

SIGCOMM’04 5 Consequences of Hot-Potato Disruptions  Transient forwarding instability  Up to three minutes convergence delay  Normal internal changes take a couple of seconds  Traffic shift  Responsible for largest traffic matrix variations  Interdomain routing changes  Around 2 – 5% of a router’s external BGP updates

SIGCOMM’04 6 What to do about it?  Engineer network to minimize disruptions  Network operator: operational practices to avoid changes  Network designer: designs that minimize sensitivity  Need a vocabulary and metrics to evaluate impact of internal changes  Compare possible network designs  Identify critical events  Take special care during maintenance or traffic engineering

SIGCOMM’04 7 Modeling Hot-Potato Routing  Model of egress selection in backbone networks  Internal topology and link weights  Set of egress routers for each destination prefix  Apply topology changes  Link or router failures  Link weight changes  Evaluate impact of topology changes  For a router what fraction of prefixes shifts  Most critical link failure  …

SIGCOMM’04 8 Modeling Egress Selection A B C D G E F dst Egress set for a destination prefix (dst) = set of border nodes that learn routes to dst ({A,B}) A B Region of egress node A = nodes that are closer to A than B Region of A Region of B

SIGCOMM’04 9 Modeling Topology Changes C D G E F Region of A Region of B A B Topology change = edge or node deletion, link weight change dst C D G E F Region of A Region of B A B C shifts from region of A to B dst

SIGCOMM’04 10 Generalizing to All Prefixes  Fraction of prefixes at a router that change egresses after a single topology change  Routing-shift function (H RM ) A B C D G E F A B X (10,000 prefixes) Z (4,000 prefixes) Y (1,000 prefixes) Routing-shift at C when CF is deleted = 10,000/15,000 (i.e. 2/3)

SIGCOMM’04 11 All Prefixes, Routers, and Topology Changes routers topology changes C failure of CF fraction of prefixes at C that changes egress after the failure of link CF: 2/3 routing-shift function

SIGCOMM’04 12 Node Routing Sensitivity Metrics (  RM ) routers topology changes C  Node routing sensitivity  Expected fraction of route shifts experienced by a node  Worst case  Maximum route shift experienced by a node

SIGCOMM’04 13 Routing Impact of a Graph Transformation (  RM )  Impact of graph transformations  Average fraction of route shifts across all nodes  Worst case  Maximum route shift caused by each graph transformation routers topology changes failure of CF

SIGCOMM’04 14 Case Study: A Large ISP Backbone Network  Obtaining input for the model  Topology – intradomain routing messages  Egress sets – collection of BGP tables  Set of graph transformations Single link failures Single router failures  Probability distribution for graph transformations Uniform

SIGCOMM’04 15 Order failures according to average impact Which failures are most disruptive? routers single router failures fraction of failures Routing Impact of Failures router failures link failures Most failures cause no hot-potato disruptions Operators can focus on most disruptive failures

SIGCOMM’04 16 Which routers are most sensitive? routers single router failures Order routers according to average sensitivity router failures link failures fraction of routers Node Routing Sensitivity Very few hot-potato changes on average, but there are many failures that cause no shift High variance among routers

SIGCOMM’04 17 What is the largest routing shift for each router? routers single router failures or single link failures Order routers according to worst case sensitivity Worst Case Node Routing Sensitivity fraction of routers Very disruptive failures for some routers

SIGCOMM’04 18 Conclusion  Contributions  Model of hot-potato disruptions  Basis for a sensitivity analysis tool  Robustness should be a first-order metric  As important as traditional performance metrics  Network should have small reactions to small changes  Two approaches  Engineer the system: our model  Redesign routing interaction: on-going work

SIGCOMM’04 19 Single Link vs. Single Router Failures A B C D E dst

SIGCOMM’04 20 Single Link vs. Single Router Failures A B C D E dst

SIGCOMM’04 21 Minimizing Disruptions  Reconfiguration of routing protocols  Link and node redundancy  Selection of peering locations