Lecture 13 – Network Mapping

Slides:



Advertisements
Similar presentations
T. S. Eugene Ng Mellon University1 Towards Global Network Positioning T. S. Eugene Ng and Hui Zhang Department of Computer.
Advertisements

Topology-Aware Overlay Construction and Server Selection Sylvia Ratnasamy Mark Handley Richard Karp Scott Shenker Infocom 2002.
Intel Research Internet Coordinate Systems - 03/03/2004 Internet Coordinate Systems Marcelo Pias Intel Research Cambridge
Scalable Content-Addressable Network Lintao Liu
Authors: Venkata N. Padmanabhan and Lakshminarayanan Subramanian Publisher: SIGCOMM 2001 Presenter: Chai-Yi Chu Date: 2013/03/06 1.
1 LINK STATE PROTOCOLS (contents) Disadvantages of the distance vector protocols Link state protocols Why is a link state protocol better?
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
A Network Measurement Architecture for Adaptive Networked Applications Mark Stemm* Randy H. Katz Computer Science Division University of California at.
Predicting Communication Latency in the Internet Dragan Milic Universität Bern.
Measurement in the Internet. Outline Internet topology Bandwidth estimation Tomography Workload characterization Routing dynamics.
T. S. Eugene Ng Mellon University1 Global Network Positioning: A New Approach to Network Distance Prediction Tze Sing Eugene.
Network Measurement Bandwidth Analysis. Why measure bandwidth? Network congestion has increased tremendously. Network congestion has increased tremendously.
CS 6401 Efficient Addressing Outline Addressing Subnetting Supernetting.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks Locations.
How the Internet Works Acknowledgment and Disclaimer: This presentation is supported in part by the National Science Foundation under Grant Any.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
David Wetherall Professor of Computer Science & Engineering Introduction to Computer Networks Hierarchical Routing (§5.2.6)
1 Vivaldi: A Decentralized Network Coordinate System Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris Presented by: Chen Qian.
A Light-Weight Distributed Scheme for Detecting IP Prefix Hijacks in Real-Time Lusheng Ji†, Joint work with Changxi Zheng‡, Dan Pei†, Jia Wang†, Paul Francis‡
Internet Tomography and Geography What is this area all about? Related work in the area Main Paper WEBMAPPER’s features How WEBMAPPER works WEBMAPPER results.
L-24 Adaptive Applications 1. State of the Art – Manual Adaptation Objective: automating adaptation ? CaliforniaNew York 2.
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
Determining the Geographic Location of Internet Hosts Venkata N. Padmanabhan Microsoft Research Lakshminarayanan Subramanian University of California at.
1 Evaluating NGI performance Matt Mathis
Network Coordinates : Internet Distance Estimation Jieming ZHU
Network Computing Laboratory 1 Vivaldi: A Decentralized Network Coordinate System Authors: Frank Dabek, Russ Cox, Frans Kaashoek, Robert Morris MIT Published.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Chapter 25 Internet Routing. Static Routing manually configured routes that do not change Used by hosts whose routing table contains one static route.
Internet Traffic Engineering Motivation: –The Fish problem, congested links. –Two properties of IP routing Destination based Local optimization TE: optimizing.
15-829A/18-849B/95-811A/19-729A Internet-Scale Sensor Systems: Design and Policy Review.
1 IP2Geo: Locating Internet Hosts Geographically Venkat Padmanabhan Microsoft Research Joint work with L. Subramanian (UC Berkeley)
Matchmaking for Online Games and Other Latency-Sensitive P2P Systems
Delay-Tolerant Networks (DTNs)
Multicast Outline Multicast Introduction and Motivation DVRMP.
Vivaldi: A Decentralized Network Coordinate System
Pastry Scalable, decentralized object locations and routing for large p2p systems.
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
CFA: A Practical Prediction System for Video Quality Optimization
COMP 3270 Computer Networks
A Comparison of Overlay Routing and Multihoming Route Control
LECTURE 34: WEB PROGRAMMING FOR SCALE
Introduction to Networks
Intra-Domain Routing Jacob Strauss September 14, 2006.
Routing.
CMPE 252A : Computer Networks
Phillipa Gill University of Toronto
LECTURE 32: WEB PROGRAMMING FOR SCALE
LECTURE 33: WEB PROGRAMMING FOR SCALE
High Throughput Route Selection in Multi-Rate Ad Hoc Wireless Networks
COS 561: Advanced Computer Networks
Routers Routing algorithms
Dynamic Routing and OSPF
COS 561: Advanced Computer Networks
COS 561: Advanced Computer Networks
Data and Computer Communications
IT351: Mobile & Wireless Computing
Content Distribution Networks
CMPE 252A : Computer Networks
COS 561: Advanced Computer Networks
BGP Interactions Jennifer Rexford
EE 122: Lecture 22 (Overlay Networks)
An Empirical Evaluation of Wide-Area Internet Bottlenecks
An Engineering Approach to Computer Networking
Computer Networks Protocols
Routing.
LECTURE 33: WEB PROGRAMMING FOR SCALE
Performance Evaluation of a Communication Round over the Internet
No-Jump-into-Latency in China's Internet
Evaluating the Running Time of a Communication Round over the Internet
Presentation transcript:

Lecture 13 – Network Mapping 15-829A/18-849B/95-811A/19-729A Internet-Scale Sensor Systems: Design and Policy Lecture 13 – Network Mapping

Why Predict Network Performance? In any large distributed system must assign responsibility to nodes Want to maximize performance, availability, etc. How to choose? Communication patterns are often localized Need to find nodes that are near each other Need geographic & network mapping 02-25-03 Lecture #13

Difficult to find nearby nodes quickly and efficiently Network Mapping Difficult to find nearby nodes quickly and efficiently Huge number of paths to measure TCP bandwidth and RTT probes are time-consuming No clean mapping of IP address  location (geographic or network topology) 02-25-03 Lecture #13

Outline IP2Geo IDMaps SPAND GNP 02-25-03 Lecture #13

IP2Geo Techniques GeoTrack GeoPing GeoCluster DNS-based Latency-based BGP-based 02-25-03 Lecture #13

GeoTrack Traceroute to host DNS names for routers near host Typically city, airport, country names embedded in names Need ISP-specific heuristics to reduce false positives Codes used and location in name Impact of proxies Always returns location of proxy Client may be anywhere 02-25-03 Lecture #13

GeoTrack Performance FooTV CDF looks identical to proxy location PDF Most FooTV customers use a proxy Median Error: 590km University hosts work well No proxy, always on, traceroute always completes Median error: 102km 02-25-03 Lecture #13

GeoPing Can latency be converted to distance? Distance to different locations grouped by latency While good at short latencies  error grows for longer latencies Can’t break speed of light, but can get slow route to nearby places 02-25-03 Lecture #13

GeoPing Performance Match host to other nearby nodes Delay map: vector of delays to known n landmarks Matching: most similar delay map Euclidean distance in n dimensional space Use matched host’s location 02-25-03 Lecture #13

GeoCluster Classify all hosts in an address cluster as same location BGP address prefixes as starting point Address allocation in CIDR blocks Basic granularity of announcements BGP announcements may be subdivided to reflect different attributes (e.g. policy) 100,000 address prefixes announced  from 12,000 ASes Address prefixes may still be large entities May span large geographic areas 02-25-03 Lecture #13

GeoCluster Sub-clustering When prefix is large  look at consensus on location If consensus is weak  split prefix in half Repeat if needed until too few samples Proxies result in lack of consensus and therefore no location report 02-25-03 Lecture #13

GeoCluster Performance Without sub-clustering University data set Median error: 28km No report for 12% of hosts bCentral data set Median error: 685km Much larger ISPs cause problems 23% with no answer 02-25-03 Lecture #13

GeoCluster Performance bCentral poor performance due to large size of prefixes Can subclustering help?  yes Note that even /24 clusters don’t work well  surprising given the address allocation of Internet 02-25-03 Lecture #13

Outline IP2Geo IDMaps SPAND GNP 02-25-03 Lecture #13

Network Distance Round-trip propagation and transmission delay Reflects Internet topology and routing A good first order performance optimization metric Helps achieve low communication delay A reasonable indicator of TCP throughput Can weed out most bad choices But the O(N2) network distances are also hard to determine efficiently in Internet-scale systems 02-25-03 Lecture #13

Active Measurements Network distance can be measured with ping-pong messages But active measurement does not scale 02-25-03 Lecture #13

Scaling Alternatives 02-25-03 Lecture #13

Sharing Measurements IDMaps [Francis et al ’99] Server A/B 50ms Probe A B 02-25-03 Lecture #13

Assumptions Probe nodes approximate direct path May require large number Careful placement may help Requires that distance between end-points is approximated by sum Triangle inequality must hold (i.e., (a,c) > (a,b) + (b,c) 02-25-03 Lecture #13

Triangle Inequality in the Internet 02-25-03 Lecture #13

Outline IP2Geo IDMaps SPAND GNP 02-25-03 Lecture #13

SPAND Design Choices Measurements are shared Measurements are passive Hosts share performance information by placing it in a per-domain repository Measurements are passive Application-to-application traffic is used to measure network performance Measurements are application-specific When possible, measure application response time, not bandwidth, latency, hop count, etc. 3 basic design choices that SPAND uses 02-25-03 Lecture #13

SPAND Architecture Internet Client Packet Capture Host Performance Data Performance Server Perf. Reports Perf Query/ Response Client 02-25-03 Lecture #13

SPAND Assumptions Geographic Stability: Performance observed by nearby clients is similar  works within a domain Amount of Sharing: Multiple clients within domain access same destinations within reasonable time period  strong locality exists Temporal Stability: Recent measurements are indicative of future performance  true for 10’s of minutes 02-25-03 Lecture #13

Prediction Accuracy Packet capture trace of IBM Watson traffic Cumulative Probability Ratio of Predicted to Actual Throughput Packet capture trace of IBM Watson traffic Compare predictions to actual throughputs 02-25-03 Lecture #13

Outline IP2Geo IDMaps SPAND GNP 02-25-03 Lecture #13

First Key Insight With millions of hosts, “What are the O(N2) network distances?” may be the wrong question Instead, could we ask: “Where are the hosts in the Internet?” What does it mean to ask “Where are the hosts in the Internet?” Do we need a complete topology map? Can we build an extremely simple geometric model of the Internet? 02-25-03 Lecture #13

New Fundamental Concept: “Internet Position” Using GNP, every host can have an “Internet position” O(N) positions, as opposed to O(N2) distances Accurate network distance estimates can be rapidly computed from “Internet positions” (x2,y2,z2) y “Internet position” is a local property that can be determined before applications need it Can be an interface for independent systems to interact (x1,y1,z1) x z (x4,y4,z4) (x3,y3,z3) 02-25-03 Lecture #13

Vision: Internet Positioning Service (2,0) (6,0) (1,3) (2,4) (5,4) (7,3) 65.4.3.87 33.99.31.1 128.2.254.36 12.5.222.1 123.4.22.54 126.93.2.34 Enable every host to independently determine its Internet position Internet position should be as fundamental as IP address “Where” as well as “Who” 02-25-03 Lecture #13

Global Network Positioning (GNP) Coordinates Model the Internet as a geometric space (e.g. 3-D Euclidean) Characterize the position of any end host with geometric coordinates Use geometric distances to predict network distances (x2,y2,z2) y (x1,y1,z1) x z (x4,y4,z4) (x3,y3,z3) 02-25-03 Lecture #13

Landmark Operations (Basic Design) y x (x2,y2) (x1,y1) (x3,y3) L1 L2 L3 L1 L2 L3 Measure inter-Landmark distances Use minimum of several round-trip time (RTT) samples Internet Compute coordinates by minimizing the discrepancy between measured distances and geometric distances Cast as a generic multi-dimensional minimization problem, solved by a central node 02-25-03 Lecture #13

Ordinary Host Operations (Basic Design) (x2,y2) (x1,y1) (x3,y3) L2 L1 L3 y (x4,y4) L1 L3 x L2 Internet Each host measures its distances to all the Landmarks Compute coordinates by minimizing the discrepancy between measured distances and geometric distances Cast as a generic multi-dimensional minimization problem, solved by each host 02-25-03 Lecture #13

Overall Accuracy 0.1 0.28 02-25-03 Lecture #13

Why the Difference? IDMaps overpredicts IDMaps GNP (1-dimensional model) IDMaps overpredicts 02-25-03 Lecture #13

Next Lecture Lecturer: Brad Karp Topic: distributed systems Readings Fault tolerance Replication Logging/monitoring Readings Ivy Byzantine Fault Tolerance 02-25-03 Lecture #13