The Sprint IP Monitoring Project and Traffic Dynamics at a Backbone POP Supratik Bhattacharyya Sprint ATL

Slides:



Advertisements
Similar presentations
Traffic Dynamics at a Commercial Backbone POP Nina Taft Sprint ATL Co-authors: Supratik Bhattacharyya, Jorjeta Jetcheva, Christophe Diot.
Advertisements

1 EL736 Communications Networks II: Design and Algorithms Class3: Network Design Modeling Yong Liu 09/19/2007.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
TIE Breaking: Tunable Interdomain Egress Selection Renata Teixeira Laboratoire d’Informatique de Paris 6 Université Pierre et Marie Curie with Tim Griffin.
Trajectory Sampling for Direct Traffic Observation Matthias Grossglauser joint work with Nick Duffield AT&T Labs – Research.
Traffic Engineering With Traditional IP Routing Protocols
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Traffic Engineering in IP Networks Jennifer Rexford Computer Science Department Princeton University; Princeton, NJ
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
1 Deriving Traffic Demands for Operational IP Networks: Methodology and Experience Anja Feldmann*, Albert Greenberg, Carsten Lund, Nick Reingold, Jennifer.
Traffic Measurement for IP Operations Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Traffic Measurement for IP Operations Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Wresting Control from BGP: Scalable Fine-grained Route Control UCSD / AT&T Research Usenix —June 22, 2007 Dan Pei, Tom Scholl, Aman Shaikh, Alex C. Snoeren,
Traffic Matrix Estimation: Existing Techniques and New Directions A. Medina (Sprint Labs, Boston University), N. Taft (Sprint Labs), K. Salamatian (University.
1 Design and implementation of a Routing Control Platform Matthew Caesar, Donald Caldwell, Nick Feamster, Jennifer Rexford, Aman Shaikh, Jacobus van der.
Impact of BGP Dynamics on Intra-Domain Traffic Patterns in the Sprint IP Backbone Sharad Agarwal, Chen-Nee Chuah, Supratik Bhattacharyya, Christophe Diot.
Measurement and Monitoring Nick Feamster Georgia Tech.
Rethinking Internet Traffic Management: From Multiple Decompositions to a Practical Protocol Jiayue He Princeton University Joint work with Martin Suchara,
Network Monitoring for Internet Traffic Engineering Jennifer Rexford AT&T Labs – Research Florham Park, NJ 07932
1 Deriving Traffic Demands for Operational IP Networks: Methodology and Experience Anja Feldmann*, Albert Greenberg, Carsten Lund, Nick Reingold, Jennifer.
Backbone Networks Jennifer Rexford COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
1 Traffic Engineering for ISP Networks Jennifer Rexford IP Network Management and Performance AT&T Labs - Research; Florham Park, NJ
Measuring ISP topologies with Rocketfuel Ratul Mahajan Neil Spring David Wetherall University of Washington ACM SIGCOMM 2002.
AGG-NANOG IP Network Traffic Engineering Albert Greenberg Internet and Networking Systems Research Lab AT&T Labs - Research; Florham Park, NJ See.
Bandwidth DoS Attacks and Defenses Robert Morris Frans Kaashoek, Hari Balakrishnan, Students MIT LCS.
Network-Wide Traffic Models for Managing IP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Internet Traffic Management Prafull Suryawanshi Roll No - 04IT6008.
Identifying and Using Energy Critical Paths Nedeljko Vasić with Dejan Novaković, Satyam Shekhar, Prateek Bhurat, Marco Canini, and Dejan Kostić EPFL, Switzerland.
Cost-Performance Tradeoffs in MPLS and IP Routing Selma Yilmaz Ibrahim Matta Boston University.
Coarse-Grained Traffic Analysis in ISP Networks A Router-Based Approach Christian Martin Verizon.
Network Sensitivity to Hot-Potato Disruptions Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
Chapter 4. After completion of this chapter, you should be able to: Explain “what is the Internet? And how we connect to the Internet using an ISP. Explain.
1 © 2003, Cisco Systems, Inc. All rights reserved. CCNA 2 Module 6 Routing and Routing Protocols.
Internet Traffic Management. Basic Concept of Traffic Need of Traffic Management Measuring Traffic Traffic Control and Management Quality and Pricing.
SIGCOMM 2002 New Directions in Traffic Measurement and Accounting Focusing on the Elephants, Ignoring the Mice Cristian Estan and George Varghese University.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Using Measurement Data to Construct a Network-Wide View Jennifer Rexford AT&T Labs—Research Florham Park, NJ
NetFlow: Digging Flows Out of the Traffic Evandro de Souza ESnet ESnet Site Coordinating Committee Meeting Columbus/OH – July/2004.
Tony McGregor RIPE NCC Visiting Researcher The University of Waikato DAR Active measurement in the large.
Traffic Engineering for ISP Networks Jennifer Rexford Internet and Networking Systems AT&T Labs - Research; Florham Park, NJ
Metadata Management of Terabyte Datasets from an IP Backbone Network: Experience and Challenges Sue B. Moon and Timothy Roscoe.
Network Measurement Tools ESnet Site Coordinators Meeting 26 April 2000 Tracie Monk, UCSD/SDSC/CAIDA -
1 Network Measurement Summary ESCC, Feb Joe Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
April 4th, 2002George Wai Wong1 Deriving IP Traffic Demands for an ISP Backbone Network Prepared for EECE565 – Data Communications.
Trajectory Sampling for Direct Traffic Oberservation N.G. Duffield and Matthias Grossglauser IEEE/ACM Transactions on Networking, Vol. 9, No. 3 June 2001.
Intradomain Traffic Engineering By Behzad Akbari These slides are based in part upon slides of J. Rexford (Princeton university)
Measurement in the Internet Measurement in the Internet Paul Barford University of Wisconsin - Madison Spring, 2001.
Taming Internet Traffic Some notes on modeling the wild nature of OD flows Augustin Soule Kavé Salamatian Antonio Nucci Nina Taft Univ. Paris VI Sprintlabs.
정하경 MMLAB Fundamentals of Internet Measurement: a Tutorial Nevil Brownlee, Chris Lossley, “Fundamentals of Internet Measurement: a Tutorial,” CMG journal.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
Advanced Technology Laboratories 8 December 2000 page 1 Characterization of Traffic at a Backbone POP Nina Taft Supratik Bhattacharyya Jorjeta Jetcheva.
The New Policy for Enterprise Networking Robert Bays Chief Scientist June 2002.
TeXCP: Protecting Providers’ Networks from Unexpected Failures & Traffic Spikes Dina Katabi MIT - CSAIL nms.csail.mit.edu/~dina.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
1 Monitoring: from research to operations Christophe Diot and the IP Sprintlabs ipmon.sprintlabs.com.
CSci5221: Intra-Domain Traffic Engineering 1 Intra-Domain Traffic Engineering Traffic Engineering (TE) – MPLS and traffic engineering (will go over very.
1 Netflow Collection and Aggregation in the AT&T Common Backbone Carsten Lund.
Distributed Network Monitoring in the Wisconsin Advanced Internet Lab Paul Barford Computer Science Department University of Wisconsin – Madison Spring,
1 Deploying Measurement Systems in ESnet Joint Techs, Feb Joseph Metzger ESnet Engineering Group Lawrence Berkeley National Laboratory.
Working at a Small-to-Medium Business or ISP – Chapter 6
Routing and Routing Protocols: Routing Static
Detection of Routing Loops and Analysis of Its Causes
Routing and Routing Protocols: Routing Static
Netscope: Traffic Engineering for IP Networks
Backbone Networks Mike Freedman COS 461: Computer Networks
Working at a Small-to-Medium Business or ISP – Chapter 6
An Empirical Evaluation of Wide-Area Internet Bottlenecks
Presentation transcript:

The Sprint IP Monitoring Project and Traffic Dynamics at a Backbone POP Supratik Bhattacharyya Sprint ATL

The IP Group at Sprintlabs Charter :  Investigate IP technologies for robust, efficient, QOS-enabled networks  Anticipate and evaluate new services and applications Major Projects :  Monitoring Sprint’s IP Backbone  Service Platform

Talk Overview  The IPMon Project  Routing and Traffic Dynamics

IP Backbone : POP-to-POP view POP OC-48 POP : Point of Presence, typically a metropolitan area OC-12 OC-3

Motivation: Need for Monitoring Current network is over-provisioned, over- engineered, best-effort…  Diagnosis:  detect and report problems at IP level  Management  configuration problems, traffic engineering  resource provisioning, network dimensioning  Value-added service  feedback to customers (performance, traffic characteristics)  Detect attacks and anomalies

Existing Measurement Efforts  Passive measurements  SNMP-based tools  Netflow (Cisco proprietary)  OC3MON, OC12MON  Active Measurements  ping, traceroute, NIMI, MINC, Surveyor  Skitter, Keynote, Matrix  Integrated Approach  AT&T Netscope Network topology and routes Traffic at flow level granularity Delay and loss statistics

Our approach  Passive monitoring  Capture header (44 bytes) from every packet  full TCP/IP headers, no http information  Use GPS time stamping - allows accurate correlating of packets on different links  Day long traces  Simultaneously monitor multiple links and sites.  Collect routing information along with packet traces.  Traces archived for future use

Applications  Data from a commercial Tier-1 IP backbone  Applications of data:  traffic modeling  traffic engineering  provisioning  pricing, SLAs  hardware design in collaboration with vendors  denial-of-service

Measurement Facilities  IPMON System  Collects packet traces by passively tapping onto the fiber using optical splitters  supports OC-3 to OC-48 data rates  Data Repository  Large tape library to archive data  Analysis Platform  Initially 17 nodes computing cluster  SAN under deployment

IPMON Architecture Linux PC with multiple PCI buses

Monitoring links at a POP

Current Status of IPMONs  Currently operational in one major west coast POP on OC3 links  Under way in two major east coast POPs for OC3 and OC12 -- (we hope by July 2001)  OC48 in preparation for 1 east coast POP and 1 west coast POP -- summer 2001  Future: Sprint Dial-Up Network, more POPs, European network

Practical Constraints  Difficult to monitor operational network :  Complex procedure for deploying equipment   POPs evolve too fast  Too costly to be ubiquitous  Technology limitations (PCs, disks, etc.)  Only off-line analysis is possible  Are 44 bytes enough?

Ongoing Projects  Routing and Traffic Dynamics  Delay measurement across a router  TCP flow analysis  Denial of service  Bandwidth provisioning and pricing

Routing and Traffic Dynamics Project  Part 1: what are the traffic demands between pairs of POPs?  How stable is this demand?  Part 2: what are the paths taken by those demands?  Are link utilizations levels similar throughout the backbone?  Part 3: is there a better way to spread the traffic across paths?  At what level of traffic granularity should traffic be split up?

Motivation Understand traffic demands between POP pairs

POP-to-POP Traffic Matrix City A City B City C City A City B City C Measure traffic over different timescales Divide traffic per destination prefix, protocol, etc. For every ingress POP : Identify total traffic to each egress POP Further analyze this traffic

Applications  Intra-domain routing  Analyzing routing anomalies  Verify BGP Peering  Capacity planning and dimensioning  POP architecture

Generating POP-POP traffic matrices

The Mapping Problem What is the egress POP for a packet entering the a given ingress POP?

Mapping BGP destinations to POPs BGP table Find best Next-Hop Get Unique Next-Hops Map to POP (Dst,Next-Hop) Unique Next-Hops (Next-Hop, Last Sprint Hop) (Next-Hop, POP map) (BGP Dst,POP) Map Dst to POP Recursive BGP lookup to find last Sprint hop

Data Processing  Step 1: Use BGP tables to generate [prefix, egress POP] map  Step 2: Run IP lookup software on packet trace using above map  Output : single trace file for each egress-POP, e.g. all packets headed to POP k from monitored POP  Step 3: Use our traffic analysis tool for statistics evaluation.

Monitored links at a single POP Core Peer 2 Access web hosting ISP Peer 1

Data  5 traces collected on Aug 9, 2000 Access Link Type Trace Length (hours) Webhost 1 Webhost 2 Peer 1 Peer 2 ISP

Traffic Fanout: POP level granularity

Fanout: web host links

Time-of-Day for POP level granularity

Day-Night Variation : Webhost #1 % reduction at night between 20-50% depending upon access link

Summary  Wide disparity in “traffic demands” among egress POPs  POPs can be roughly categorized as : small, medium, large; and they maintain their rank during the day.  Traffic is heterogeneous in space yet stable in time.  Traffic varies by (access link, egress POP pair)  Hard to characterize time-of-day behaviour  20-50% reduction at night

Routing and Traffic Dynamics Project  Part 1: what are the traffic demands between pairs of POPs?  How stable is this demand?  Part 2: what are the paths taken by those demands?  Are link utilizations levels similar throughout the backbone?  Part 3: is there a better way to spread the traffic across paths?  At what level of traffic granularity should traffic be split up?

IS-IS Routing Practices

Is backbone traffic balanced?

What we’ve seen so far Wide disparity in traffic demands between (ingress, egress) POP pairs + Wide disparity in link utilization levels, plus many underutilized routes + Routing Policies concentrate traffic on few paths Question: Can we divert some traffic to the lightly loaded paths?

Routing and Traffic Dynamics Project  Part 1: what are the traffic demands between pairs of POPs?  How stable is this demand?  Part 2: what are the paths taken by those demands?  Are link utilizations levels similar throughout the backbone?  Part 3: is there a better way to spread the traffic across paths?  At what level of traffic granularity should traffic be split up?

Creating traffic aggregates  To address issues of splitting traffic over multiple paths, need to define “streams” within traffic  How should packets be aggregated into streams?  Coarse granularity: POP-to-POP  Very fine granularity: use 5-tuple  Initial criterion : destination address prefix

Elephants and Mice among /8 streams Stream : all packets in a group with same /8 destination address prefix Traffic grouped by egress POPs Ingress : Webhost Link

Stability of prefix-based aggregates

Observations about prefix-based streams Recursive : /8 elephant has a few /16 elephants and many mice, likewise at /24 level Phenomenon is less pronounced at /24 level Qn : Are elephants stable?  Definition:  R i (n) = the rank of flow i at time slot n   i,n,k = | R i (n) - R i (n+k) |  each time slot corresponds to 30 minutes

Frequency of Rank Changes Conclusion : For load balancing, route elephants along different paths

Conclusions  Monitoring and measurement is key to better network design  IPMon : a passive monitoring system for packet-level information  We have used our data to build components of traffic matrices for traffic engineering  Backbone traffic can be better load- balanced : destination-prefix is a possible (simple) criterion

Ongoing Work  Intra-domain Routing :  Choosing ISIS link weights  Load balancing in the backbone  Flow Characterization  Building Traffic Matrices  POP modeling