BGP Issues Geoff Huston.

Slides:



Advertisements
Similar presentations
NOPEER Route Attribute Propose a well-known transitive advisory scope attribute Applied by originating AS to route prefixes Interpretable as advice to.
Advertisements

Introduction to IP Routing Geoff Huston. Routing How do packets get from A to B in the Internet? A B Internet.
BGP01 An Examination of the Internets BGP Table Behaviour in 2001 Geoff Huston Telstra.
Allocation vs Announcement A comparison of RIR IPv4 Allocation Records with Global Routing Announcements Geoff Huston February 2004 (Supported by APNIC)
BGP Table Analysis (even more!) Geoff Huston. Interpretation There is no synchronized overview of the entire connectivity and policy state of all parts.
Tracking the Internets BGP Table Geoff Huston Telstra January 2001.
ARIN Public Policy Meeting
Measuring IPv6 Deployment Geoff Huston George Michaelson
1 An Update on Multihoming in IPv6 Report on IETF Activity IPv6 Technical SIG 1 Sept 2004 APNIC18, Nadi, Fiji Geoff Huston.
Routing Table Status Report Geoff Huston February 2005 APNIC.
Routing Items from IAB Utrecht Workshop Geoff Huston IAB.
Routing Table Status Report November 2005 Geoff Huston APNIC.
Update Damping in BGP Geoff Huston Chief Scientist, APNIC.
Number of ASs Observations ASs grew by 25% over the year Note span of visible ASs (11,200 – 12,500) Not every AS is visible to all other ASs But there.
4-Byte AS Numbers The view from the old BGP world Geoff Huston October 2006 APNIC.
BGP Issues Geoff Huston May What is BGP? The Internet is composed of a collection of networks Each network is autonomously managed The Internet.
IPv4 Unallocated Address Space Exhaustion Geoff Huston Chief Scientist APNIC APNIC 24, September 2007.
BGP Status Update Geoff Huston September What Happening (AS4637) Date.
Experimental Measurement of Delayed Convergence Abha Ahuja Internap/Merit Network, Inc. Craig Labovitz Microsoft Research/Merit Network, Inc. Farnam Jahanian,
Multihoming and Multi-path Routing
Multihoming and Multi-path Routing
Routing System Stability draft-dimitri-grow-rss-01.txt IETF71 - Philadelphia.
Scale Free Networks.
A S I A P A C I F I C N E T W O R K I N F O R M A T I O N C E N T R E APNIC Open Policy Meeting Address Policy (Procedures) SIG 1 March 2001.
Comparing IPv4 and IPv6 from the perspective of BGP dynamic activity Geoff Huston APNIC February 2012.
© J. Liebeherr, All rights reserved 1 Border Gateway Protocol This lecture is largely based on a BGP tutorial by T. Griffin from AT&T Research.
2006 – (Almost another) BGP Year in Review A BRIEF update to the 2005 report 18 October 2006 IAB Routing Workshop Geoff Huston APNIC.
1 Interdomain Routing Protocols. 2 Autonomous Systems An autonomous system (AS) is a region of the Internet that is administered by a single entity and.
BGP in 2009 Geoff Huston APNIC May Conventional BGP Wisdom IAB Workshop on Inter-Domain routing in October 2006 – RFC 4984: “routing scalability.
CSE5803 Advanced Internet Protocols and Applications (7) Introduction The IP addressing scheme discussed in Chapter 2 are classful and can be summarised.
Shivkumar Kalyanaraman Rensselaer Polytechnic Institute 1 Exterior Gateway Protocols: EGP, BGP-4, CIDR Shivkumar Kalyanaraman Rensselaer Polytechnic Institute.
Graphs and Topology Yao Zhao. Background of Graph A graph is a pair G =(V,E) –Undirected graph and directed graph –Weighted graph and unweighted graph.
COS 420 Day 16. Agenda Finish Individualized Project Please Have Grading sheets to me by Tomorrow Group Project Discussion Assignment 3 moved back to.
Routing and Routing Protocols
Allocations vs Announcements A comparison of RIR IPv4 Allocation Records with Global Routing Announcements Geoff Huston May 2004 (Activity supported by.
Information-Centric Networks07b-1 Week 7 / Paper 2 NIRA: A New Inter-Domain Routing Architecture –Xiaowei Yang, David Clark, Arthur W. Berger –IEEE/ACM.
1 Introducing Routing 1. Dynamic routing - information is learned from other routers, and routing protocols adjust routes automatically. 2. Static routing.
M.Menelaou CCNA2 ROUTING. M.Menelaou ROUTING Routing is the process that a router uses to forward packets toward the destination network. A router makes.
Interconnectivity Density Compare number of AS’s to average AS path length A uniform density model would predict an increasing AS Path length (“Radius”)
Architecting the Network Part 3 Geoff Huston Chief Scientist, Internet Telstra ISOC Workshop.
Lecture 4: BGP Presentations Lab information H/W update.
IPv4 Unallocated Address Space Exhaustion Geoff Huston Chief Scientist APNIC November 2007.
CAIA Seminar – 18 August 2007 – Taming BGP An incremental approach to improving the dynamic properties of BGP Geoff Huston.
BGP in 2011 Geoff Huston APNIC. Conventional (Historical) BGP Wisdom IAB Workshop on Inter-Domain routing in October 2006 – RFC 4984: “routing scalability.
David Wetherall Professor of Computer Science & Engineering Introduction to Computer Networks Hierarchical Routing (§5.2.6)
Inter-Domain Routing Trends Geoff Huston APNIC March 2007.
Routing and Routing Protocols
Routing Table Status Report Geoff Huston November 2004 APNIC.
Routing in the Inernet Outcomes: –What are routing protocols used for Intra-ASs Routing in the Internet? –The Working Principle of RIP and OSPF –What is.
1 Auto-Detecting Hijacked Prefixes? Routing SIG 7 Sep 2005 APNIC20, Hanoi, Vietnam Geoff Huston.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v3.2—5-1 Customer-to-Provider Connectivity with BGP Connecting a Multihomed Customer to a Single Service.
Routing Table Status Report Geoff Huston August 2004 APNIC.
Routing Table Status Report August 2005 Geoff Huston.
Tracking the Internet’s BGP Table Geoff Huston Telstra December 2000.
Inter-domain Routing Outline Border Gateway Protocol.
BGP Validation Russ White Rule11.us.
Routing 2015 Scaling BGP Geoff Huston APNIC May 2016.
Border Gateway Protocol
Measuring BGP Geoff Huston.
COMP 3270 Computer Networks
Architecting the Network Part 3
Intra-Domain Routing Jacob Strauss September 14, 2006.
Routing Table Status Report
Geoff Huston September 2002
RIPE October 2005 Geoff Huston APNIC
Routing Table Status Report
2005 – A BGP Year in Review February 2006 Geoff Huston
Routing Table Status Report
Routing Table Status Report
Design Expectations vs. Deployment Reality in Protocol Development
Presentation transcript:

BGP Issues Geoff Huston

Why measure BGP? BGP describes the structure of the Internet, and an analysis of the BGP routing table can provide information to help answer the following questions: What is changing in the deployment environment? Are these changes sustainable? How do address allocation policies, BGP and the Internet inter-relate? Are current address allocation policies still relevant? What are sensible objectives for address allocation policies?

Techniques Passive Measurement Takes measurements from a default-free router at the edge of the local network Easily configured Single (Filtered) view of the larger Internet What you see is a collection of best paths from your immediate neighbours eBGP Measurement Point Local AS

Techniques Multiple Passive measurement points Measure a number of locations simultaneously Can be used to infer policy AS2 AS1 AS3 Measurement Points

Techniques Single passive measurement point with multiple route feeds Best example: Route-views.oregon-ix.net Operating since 1995 42 peers Uses eBGP multihop to pull in route views

Techniques Active Measurement Tests Convergence Announcement and withdrawal Reporting Points Monitoring Unit AS1 Route Injection Point Internet AS2

Interpretation BGP is not a link state protocol There is no synchronized overview of the entire connectivity and policy state Every BGP viewing point contains a filtered view of the network Just because you can’t see it does not mean that it does not exist BGP metrics are sample metrics

BGP Table Growth – 12 year history

BGP Table Growth – 2 year history

BGP Table Growth – 2 year & 6 month trends

BGP Table Growth – Projections

Prefix distribution in the BGP table

/24 is the fastest growing prefix length

/25 and smaller are the fastest growing prefixes in relative terms

Prefixes by AS Distribution of originating address sizes per AS Address advertisements are getting smaller Non-Hierarchical Advertisements Number of AS’s Prefix Length

Multi-homing on the rise? Track rate of CIDR “holes” – currently 41% of all route advertisements are routing ‘holes” This graph tracks the number of address prefix advertisements which are part of an advertised larger address prefix

Proportion of BGP advertisements which are more specific advertisements of existing aggregates

OOPS Program bug! The number is larger than that. More specific advertisement of existing aggregates account for 54% of the BGP selected route table from the perspective of AS1221 56,799 entries from a total of 103,561 Older (mid Jan) data from AS286 has the number at 53,644 from a total of 95,036 (56%)

Routed Address Space Large fluctuation is due to announcement / withdrawals of /8 prefixes 12 months of data does not provide clear longer growth characteristic

Routed Address Space (/8 Corrected) Annual compound growth rate is 7% p.a. Most address consumption today appears to be ocurring behind NATs /8 Corrected Data

AS Number Growth

AS Number Use - Extrapolation Continued exponential growth implies AS number exhaustion in 2005

Average size of a routing table entry /18.1 /18.5 The BGP routing tale is growing at a faster rate than the rate of growth of announced address space

Denser Internet Structure Dec-2000 Reachable Addresses Feb-2001 AS Hops

Denser Internet Structure 90% point Address Span Feb-2001 Dec-2000 AS Hops

Internet ‘Shape’ The network is becoming less ‘stringy’ and more densely interconnected i.e. Transit depth is getting smaller Distance Distance Span Span

Aggregation and Specifics Is the prevalence of fine-grained advertisements the result of deliberate configuration or inadvertent leakage of advertisements?

Publicity helps ? Efforts to illustrate the common problem of unconstrained table growth appear to have had an impact on growth of the table, as seen on the edge of AS1221 since Dec 2000

But - the view from KPNQwest Data from James Aldridge, KPNQwest - http://www.mcvax.org/~jhma/routing/

Different Views

Different Views Route views in prefix-length-filtered parts of the net do not show the same recent reduction in the size of the routing table. It is likely that the reduction in routes seen by AS1221 appears to be in the prefix-length filtered ranges Either more transit networks are prefix length filtering or origin AS’s are filtering at the edge, or both The underlying growth trend in BGP table size remains strong

Aggregation possibilities What if all advertisements were maximally aggregated* ? 27% reduction (103126 -> 74427) using AS Path aggregation 33% reduction (103126 -> 68504) using AS Origin aggregation This assumes that the specific advertisements are not matched by other specific advertisements which have been masked out closer to the origin AS – this is not a terribly good assumption, so these numbers are optimistic to some extent

Aggregation Potential from AS1221 AS Path AS Origin

The aggregation potential view from KPNQwest Data from James Aldridge, KPNQwest - http://www.mcvax.org/~jhma/routing/ AS Path AS Origin

A Longer Term View from AS286

Different Views Similar AS Origin, but different AS Path aggregation outcomes Prevalence of the use of specifics for local inter-domain traffic engineering

Aggregatability? A remote view of aggregation has two potential interpretations: Propose aggregation to the origin AS Propose a self-imposed proxy aggregation ruleset Any aggregation reduces the information content in the routing table. Any such reduction implies a potential change in inter-domain traffic patterns. Aggregation with preserved integrity of traffic flows is different from aggregation with potential changes in traffic flow patters

Aggregatability Origin AS aggregation is easier to perform at the origin, but harder to determine remotely IF traffic flows are to be preserved Proxy Aggregation is only possible IF you know what your neighbors know Yes this is a recursive statement If an AS proxy aggregates will it learn new specifics in response?

BGP as a Routing Protocol How quickly can the routing system converge to a consistent state following dynamic change? Is this time interval changing over time?

Increased convergence time intervals for BGP Measured time to withdraw route: Up to 2 minutes Measured time to advertise new route: Up to 30 minutes

How long until routes return? (From A Study of Internet Failures) What is happening here?

Withdraw Convergence AS1 AS2 AS3 AS4

Withdraw Convergence Probability distribution Providers exhibit different, but related convergence behaviors 80% of withdraws from all ISPs take more than a minute For ISP4, 20% withdraws took more than three minutes to converge

Failures, Fail-overs and Repairs

Failures, Fail-overs and Repairs Bad news does not travel fast… Repairs (Tup) exhibit similar convergence properties as long-short ASPath fail-over Failures (Tdown) and short-long fail-overs (e.g. primary to secondary path) also similar Slower than Tup (e.g. a repair) 60% take longer than two minutes Fail-over times degrade the greater the degree of multi-homing!

Conjectures…. BGP table size will continue to rise exponentially Multi-homing at the edge of the Internet is on the increase The interconnectivity mesh is getting denser The number of AS paths is increasing faster than the number of AS’s Average AS path length remains constant AS number deployment growth will exhaust 64K AS number space in August 2005 if current growth trends continue

More conjecturing…. Inter-AS Traffic Engineering is being undertaken through routing discrete prefixes along different paths -- globally (the routing mallet!) AS Origin aggregation < AS Path aggregation RIR allocation policy (/19, /20) is driving one area of per-prefix length growth in the aggregated prefix area of the table BUT - NAT is a very common deployment tool NAT, multihoming and Traffic Engineering is driving even larger growth in the /24 prefix area

And while we are having such a good time conjecturing… Over 12 months average prefix length in the table has shifted from /18.1 to /18.5 More noise (/25 and greater) in the table, but the absolute level of noise is low (so far) Most routing table flux is in the /24 to /32 prefix space – as this space gets relatively larger so will total routing table flux levels “Flux” here is used to describe the cumulative result of the withdrawals and announcements This space appears to be susceptible to social pressure – at present

This is fun – lets have even more conjectures… CIDR worked effectively for four years, but its effective leverage to support long term dampened route table growth and improved table stability has now finished Provider-based service aggregation hierarchies as a model of Internet deployment structure is more theoretic than real these days i.e. provider based route aggregation is leaking like a sieve!

Commentary draft-iab-bgparch-00.txt Exponential growth of BGP tables has resumed AS number space exhaustion Convergence issues Traffic Engineering in a denser mesh What are the inter-domain routing protocol evolutionary requirements?

Objectives and Requirements Supporting a larger and denser interconnection topology Scale by x100 over current levels in number of discrete policy entities Fast Convergence Security Integration of Policy and Traffic Engineering as an overlay on basic connectivity Control entropy / noise inputs

Available Options Social Pressure on aggregation Economic Pressure on route advertisements Tweak BGP4 behavior Revise BGP4 community attributes BGPng New IDR protocol(s) New IP routing architecture

Social Pressure Social pressure can reduce BGP noise Social pressure cannot reduce pressures caused by Denser interconnection meshing Increased use of multi-homing Traffic engineering of multiple connections Limited utility and does not address longer term routing scaling

Economic Pressure on Routing Charge for route advertisements Upstream charges a downstream per route advertisements Peers charge each other This topic is outside an agenda based on technology scope Raises a whole set of thorny secondary issues: Commercial National Regulatory International Such measures would attempt to make multi-homing less attractive economically. It does not address why multi-homing is attractive from a perspective of enhanced service resilience.

Tweaking BGP4 Potential tweak to BGP-4 Auto-Proxy-Aggregation Automatically proxy aggregate bitwise aligned route advertisements Cleans up noise – but reduces information Cannot merge multi-homed environments unless the proxy aggregation process makes sweeping assumptions, or unless there is an overlay aggregation protocol to control proxy aggregation (this is then no longer a tweak)

Extend BGP4 Communities We already need to extend community attributes to take on the 2 / 4 octet AS number transition. Can we add further community attribute semantics to allow proxy aggregation and proxy sublimation under specified conditions? Extend commonly defined transitive community attributes to allow further information to be attached to a routing advertisement Limit of ‘locality’ of propagation Aggregation conditions or constraints If we could do this, will this be enough? Can this improve Scaling properties convergence properties

BGPng Preserve: AS concept, prefix + AS advertisements, distance vector operation, AS policy “opaqueness” Alter: convergence algorithm (DUAL?), advertisement syntax (AS + prefix set + specifics + constraints), BGP processing algorithm Issues: Development time Potential to reach closure on specification Testing of critical properties Deployment considerations Transition mechanisms

IDR A different IDR protocol? Can we separate connectivity maintenance, application of policy constraints and sender- and/or receiver- managed traffic engineering? SPF topology maintenance Inter-Domain Policy Protocol to communicate policy preferences between policy “islands” Multi-domain path maintenance to support traffic engineering requirements Eliminate the need to advertise specifics to undertake traffic engineering Multi-homing may still be an issue – is multi-homing a policy issue within an aggregate or a new distinct routing “entity”? Can SPF scale? Will SPF routing hierarchies impose policy on the hierarchy elements?

New IP Routing Architecture Separate Identity, Location and Path at an architectural level? Identity How do you structure an entirely new unique identity label space? How do you construct the “identity lookup” mechanism? Location How can location be specified independent of network topology? Path: Is multi-homing an internal attribute within the network driven by inter-domain policies, or is multi-homing an end-host switching function

New IP Routing Architecture Other approaches? Realms and RSIP Inter-Domain CRLDP approaches where policy is the constraint