What Lies Beneath: Understanding Internet Congestion Leiwen Deng Aleksandar Kuzmanovic Northwestern University Bruce Davie, Cisco Systems

Slides:



Advertisements
Similar presentations
1 Experimental Study of Internet Stability and Wide-Area Backbone Failure Craig Labovitz, Abha Ahuja Merit Network, Inc Presented by Changchun Zou.
Advertisements

A Flexible Model for Resource Management in Virtual Private Networks Presenter: Huang, Rigao Kang, Yuefang.
Consensus Routing: The Internet as a Distributed System John P. John, Ethan Katz-Bassett, Arvind Krishnamurthy, and Thomas Anderson Presented.
PROMISE: Peer-to-Peer Media Streaming Using CollectCast Mohamed Hafeeda, Ahsan Habib et al. Presented By: Abhishek Gupta.
Detecting Traffic Differentiation in Backbone ISPs with NetPolice Ying Zhang Zhuoqing Morley Mao Ming Zhang.
James 1:5 If any of you lacks wisdom, he should ask God, who gives generously to all without finding fault, and it will be given to him.
Network Architecture for Joint Failure Recovery and Traffic Engineering Martin Suchara in collaboration with: D. Xu, R. Doverspike, D. Johnson and J. Rexford.
A Comparison of Layering and Stream Replication Video Multicast Schemes Taehyun Kim and Mostafa H. Ammar.
Server-based Inference of Internet Performance V. N. Padmanabhan, L. Qiu, and H. Wang.
An Algebraic Approach to Practical and Scalable Overlay Network Monitoring Yan Chen, David Bindel, Hanhee Song, Randy H. Katz Presented by Mahesh Balakrishnan.
1 A General Introduction to Tomography & Link Delay Inference with EM Algorithm Presented by Joe, Wenjie Jiang 21/02/2004.
Computer Networks Fall, 2007 Prof Peterson. CIS 235: Networks Fall, 2007 Western State College  What is “store and forward”?  What is a buffer / queue?
NetQuest: A Flexible Framework for Internet Measurement Lili Qiu Joint work with Mike Dahlin, Harrick Vin, and Yin Zhang UT Austin.
Multiple constraints QoS Routing Given: - a (real time) connection request with specified QoS requirements (e.g., Bdw, Delay, Jitter, packet loss, path.
1 Network Tomography Venkat Padmanabhan Lili Qiu MSR Tab Meeting 22 Oct 2001.
Spatial Reuse Ring Networks Chun-Hung Chen Department of Computer Science and Information Engineering National Taipei University of Technology
Monitoring Persistently Congested Internet Links Leiwen (Karl) Deng Aleksandar Kuzmanovic Northwestern University
Delayed Internet Routing Convergence Craig Labovitz, Abha Ahuja, Abhijit Bose, Farham Jahanian Presented By Harpal Singh Bassali.
Dynamics of Hot-Potato Routing in IP Networks Renata Teixeira (UC San Diego) with Aman Shaikh (AT&T), Tim Griffin(Intel),
1 TCP-LP: A Distributed Algorithm for Low Priority Data Transfer Aleksandar Kuzmanovic, Edward W. Knightly Department of Electrical and Computer Engineering.
1 End-to-End Detection of Shared Bottlenecks Sridhar Machiraju and Weidong Cui Sahara Winter Retreat 2003.
Cumulative Violation For any window size  t  Communication-Efficient Tracking for Distributed Cumulative Triggers Ling Huang* Minos Garofalakis.
A Routing Control Platform for Managing IP Networks Jennifer Rexford Princeton University
FTDCS 2003 Network Tomography based Unresponsive Flow Detection and Control Authors Ahsan Habib, Bharat Bhragava Presenter Mohamed.
Network Monitoring for Internet Traffic Engineering Jennifer Rexford AT&T Labs – Research Florham Park, NJ 07932
Network Tomography (A presentation for STAT 593E) Mingyan Li Radha Sampigethaya.
1 Interdomain Routing Policy Reading: Sections plus optional reading COS 461: Computer Networks Spring 2008 (MW 1:30-2:50 in COS 105) Jennifer Rexford.
Diagnosing Spatio-Temporal Internet Congestion Properties Leiwen Deng Aleksandar Kuzmanovic EECS Department Northwestern University
Ningning HuCarnegie Mellon University1 A Measurement Study of Internet Bottlenecks Ningning Hu (CMU) Joint work with Li Erran Li (Bell Lab) Zhuoqing Morley.
Root cause analysis of BGP routing dynamics Matt Caesar, Lakshmi Subramanian, Randy H. Katz.
Wide Web Load Balancing Algorithm Design Yingfang Zhang.
End-to-End Issues. Route Diversity  Load balancing o Per packet splitting o Per flow splitting  Spill over  Route change o Failure o policy  Route.
Jennifer Rexford Princeton University MW 11:00am-12:20pm Wide-Area Traffic Management COS 597E: Software Defined Networking.
A Machine Learning-based Approach for Estimating Available Bandwidth Ling-Jyh Chen 1, Cheng-Fu Chou 2 and Bo-Chun Wang 2 1 Academia Sinica 2 National Taiwan.
1 Meeyoung Cha (KAIST) Sue Moon (KAIST) Chong-Dae Park (KAIST) Aman Shaikh (AT&T Labs – Research) IEEE INFOCOM 2005 Poster Session Positioning Relay Nodes.
EQ-BGP: an efficient inter- domain QoS routing protocol Andrzej Bęben Institute of Telecommunications Warsaw University of Technology,
Introduction 1-1 Chapter 1: roadmap 1.1 What is the Internet? 1.2 Network edge  end systems, access networks, links 1.3 Network core  circuit switching,
Authors Renata Teixeira, Aman Shaikh and Jennifer Rexford(AT&T), Tim Griffin(Intel) Presenter : Farrukh Shahzad.
Alok Shriram and Jasleen Kaur Presented by Moonyoung Chung Empirical Evaluation of Techniques for Measuring Available Bandwidth.
1 Computer Communication & Networks Lecture 22 Network Layer: Delivery, Forwarding, Routing (contd.)
Introduction 1-1 Chapter 1 Part 2 Network Core These slides derived from Computer Networking: A Top Down Approach, 6 th edition. Jim Kurose, Keith Ross.
“Intra-Network Routing Scheme using Mobile Agents” by Ajay L. Thakur.
Scalable and Efficient Data Streaming Algorithms for Detecting Common Content in Internet Traffic Minho Sung Networking & Telecommunications Group College.
A Routing Underlay for Overlay Networks Akihiro Nakao Larry Peterson Andy Bavier SIGCOMM’03 Reviewer: Jing lu.
Tony McGregor RIPE NCC Visiting Researcher The University of Waikato DAR Active measurement in the large.
Scalable Multi-Class Traffic Management in Data Center Backbone Networks Amitabha Ghosh (UtopiaCompression) Sangtae Ha (Princeton) Edward Crabbe (Google)
CS 447 Networks and Data Communication Department of Computer Science Southern Illinois University Edwardsville Fall, 2015 Dr. Hiroshi Fujinoki
CS551: End-to-End Packet Dynamics Paxon’99 Christos Papadopoulos (
Paper # – 2009 A Comparison of Heterogeneous Video Multicast schemes: Layered encoding or Stream Replication Authors: Taehyun Kim and Mostafa H.
Towards Efficient Large-Scale VPN Monitoring and Diagnosis under Operational Constraints Yao Zhao, Zhaosheng Zhu, Yan Chen, Northwestern University Dan.
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
A Light-Weight Distributed Scheme for Detecting IP Prefix Hijacks in Real-Time Lusheng Ji†, Joint work with Changxi Zheng‡, Dan Pei†, Jia Wang†, Paul Francis‡
A Practical Approach for Providing QoS: MPLS and DiffServ
Detection of Routing Loops and Analysis of Its Causes Sue Moon Dept. of Computer Science KAIST Joint work with Urs Hengartner, Ashwin Sridharan, Richard.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
R-BGP: Staying Connected in a Connected World Nate Kushman Srikanth Kandula, Dina Katabi, and Bruce Maggs.
N. Hu (CMU)L. Li (Bell labs) Z. M. Mao. (U. Michigan) P. Steenkiste (CMU) J. Wang (AT&T) Infocom 2005 Presented By Mohammad Malli PhD student seminar Planete.
Low-Rate TCP-Targeted DoS Attack Disrupts Internet Routing Ying Zhang Z. Morley Mao Jia Wang Presented in NDSS07 Prepared by : Hale Ismet.
정하경 MMLAB Fundamentals of Internet Measurement: a Tutorial Nevil Brownlee, Chris Lossley, “Fundamentals of Internet Measurement: a Tutorial,” CMG journal.
Interconnect Networks Basics. Generic parallel/distributed system architecture On-chip interconnects (manycore processor) Off-chip interconnects (clusters.
© 2005 Cisco Systems, Inc. All rights reserved. BGP v3.2—1-1 Course Introduction.
Internet Measurement and Analysis Vinay Ribeiro Shriram Sarvotham Rolf Riedi Richard Baraniuk Rice University.
1 Effective Diagnosis of Routing Disruptions from End Systems Ying Zhang Z. Morley Mao Ming Zhang.
Access Link Capacity Monitoring with TFRC Probe Ling-Jyh Chen, Tony Sun, Dan Xu, M. Y. Sanadidi, Mario Gerla Computer Science Department, University of.
PATH DIVERSITY WITH FORWARD ERROR CORRECTION SYSTEM FOR PACKET SWITCHED NETWORKS Thinh Nguyen and Avideh Zakhor IEEE INFOCOM 2003.
1 On the Impact of Route Monitor Selection Ying Zhang* Zheng Zhang # Z. Morley Mao* Y. Charlie Hu # Bruce M. Maggs ^ University of Michigan* Purdue University.
Monitoring Persistently Congested Internet Links
Monitoring Network Bias
Pong: Diagnosing Spatio-Temporal Internet Congestion Properties
COS 561: Advanced Computer Networks
Presentation transcript:

What Lies Beneath: Understanding Internet Congestion Leiwen Deng Aleksandar Kuzmanovic Northwestern University Bruce Davie, Cisco Systems

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 2 Common Wisdom and Our Key Results No congestion in the Internet core –Links are over-provisioned, hence no congestion No correlation among congestion events in the Internet –Diversity of traffic and links make large and long- lasting link congestion dependence unlikely Our key results –There is a subset of links (both inter-AS and intra- AS) that exhibit strong congestion intensity –Congestion events in the core can be highly correlated (up to 3 ASes)

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 3 Why Do We Care? Congestion in the core –Can depend on upon internal network policies or complex inter-AS relationships –Variable queuing delay can lead to jitter, affecting VoIP or streaming applications Correlation –Guidelines for re-routing systems –Most tomography models assume link congestion independence

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 4 Challenges Scalability –How to concurrently monitor a large number of Internet links? Need a light monitoring tool Need a triggered monitoring system Our approach –Pong: a light monitoring tool Per-path overhead 18 kbps –TPong: a triggered monitoring system Capable of monitoring up to 8,000 links concurrently

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 5 Congestion Events Congestion Intensity –How frequently does queue build-ups happen over 30 seconds time scales? We focus on persistent congestion events: –Intensity > 5%; duration > 2 minutes

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 6 Coordinated Probing SD Probe f s d b 4-p probing: a symmetric path scenario Combines e2e and router-targeted probing f probeb probes probed probe,,,

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 7 Pong: Coordinated Probing SD f s d b ΔfsΔfs ΔfdΔfd Half-path queuing delay Locating Congestion Points Tracing Congestion Status Probe ΔdΔd ΔbΔb ΔfΔf ΔsΔs

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 8 Pong: Methodology Highlights Coordinated probing –Send 4, 3, or 2 packets from two endpoints Quality of Measurability (QoM) –Able to deterministically detect its own inaccuracy Self-adaptivity –Switch among different probing schemes based on QoM and path properties

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 9 Vantage Point Selection Problem How to select vantage points to accurately measure congestion at a given link? Link measurability score –How well are we able to measure a specific link from a specific pair of endpoints; a function of: Quality of measurability (QoM) for a given node Queuing-delay threshold quality Observability score –Avoid paths that “see” multiple congested links concurrently

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 10 Triggered Monitoring System Paths usedPath selection algorithm Probing method Probing rateObjective All pathsNo selection, full mesh Low-rate probing Once every 5 minutesTrack topology and path reachability TMon paths – a subset of all paths Greedy TMon path selection Fast-rate probing 5 probes/secMonitor end-to-end congestion Pong paths – a subset of TMon paths upon triggering Priority-based Pong path allocation Coordinated probing 10 probes/sec for e2e probing, 2 probes/sec for router-targeted probing Locate and monitor link-level congestion Greedy algorithm to determine a subset of links Covered 65% (7,800) links with 4.9% (1,750) paths Limit the per-node measurement overhead Priority-based Pong path allocation Maximize quality of measurability

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 11 Coverage & Overhead Statistics We observe ~ 36,000 paths –N^2, N = 191 nodes –Expose ~ 12,100 links at a time Due to routing changes, we are able to observe ~ 29,000 links in total TMon paths: –Up to 2,000 paths running fast-rate probing concurrently –Cover up to 8,000 links concurrently 4.9% paths cover 65% of total links Pong paths –Up to 30 Pong paths; cover up to 350 links concurrently Overhead per node: –Average: 30 kbps, Peak: 68 kbps

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 12 Measurement Quality How good is our vantage-point selection algorithm? –Link Measurability Score: % of measurement samples have non-zero score 80% of measurements is better than fair 60% of measurements is better than good –The key point is that we know how good or bad we are doing

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 13 Key Findings Time-invariant hot spots Strong spatial correlation among congested links Root-cause analysis

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 14 Time-invariant Hot Spots Time-of-day effects for the number of congestion events Small number of links show strong time- invariant congestion intensity

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 15 Time-invariant Hot Spots Most of the links are not inter-continental links as we initially hypothesized Inter-AS links between large backbone networks as well as intra-AS links within these networks AS #Description 174Cogent Communications, a large Tier-2 ISP. 1299TeliaNet Global Network, a large Tier-2 ISP GEANT, a main European multi-gigabit computer network for research and education purposes, Tier Time Warner Telecom, a Tier-2 ISP in US. 3356Level 3 Communications, a Tier-1 ISPs. 237Merit, a Tier-2 network in US. 6461Abovenet Communications, a large Tier-2 ISP RedCLARA, a backbone connects the Latin-American National Research and Education Networks to Europe. 6453Teleglobe, a Tier-2 ISP. 2914NTT America, a Tier-1 ISPs. 3549Global Crossing, a Tier-1 ISPs Abilene, an Internet2 backbone network in US. 4538China Education and Research Network.

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 16 Pair-wise correlation –Percent of time 2 links are concurrently congested –Pair-wise correlation can be quite extensive E.g., 20% of pairs has correlation greater than 0.7 –Correlation: weekend > weekdays Overall congestion level smaller during weekends –Distance between correlated link pairs up to 3 ASes Congestion Correlation

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 17 Hypothesis: –When upstream traffic converges to a relatively thin aggregation point, then traffic surges in an upstream link are likely to create congestion at a thin downstream aggregation link Insights: –Aggregation points correspond to time-invariant hot spots –Interaction between an aggregation point and an upstream link causes link-level correlation Aggregation Effect Hypothesis Aggregation link

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 18 Root-cause Analysis: Example 10Gbps 622Mbps

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 19 Final Statistics RankNetworkPeers 1UUNET2,346 2AT&T WorldNet2,092 3Level 3 Comm.1,742 5Cogent Comm.1,642 7Global Crossing1,041 8Time Warner918 9Abovenet798 RankISP 1Level 3 Comm. 2UUNET 3AT&T WorldNet 6Cogent Comm. 9Global Crossing RankISP 1Level 3 Comm. 2TeliaNet Global Network 4Global Crossing 8Teleglobe RankISP 2NTT America 6UUNET 8AT&T WorldNet 9Level 3 Comm. 10Teleglobe Table 1: Matched locations in the top ten networks defined by the number of peers Table 2: Matched locations in the top ten ISPs that most aggressively promote customer access North America Europe Asia

Aleksandar Kuzmanovic What Lies Beneath: Understanding Internet Congestion 20 Conclusions Triggered monitoring system –Measuring congestion in a scalable way –Key feature: Select vantage points to measure congestion as a function of the measurement quality Key findings –A subset of links experience time-invariant high congestion intensity –There is strong correlation among congestion events at different links (up to 3 ASes) –Root cause: aggregation effect some links thinner than others