Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley C. Cranor, M. Rabinovich,

Slides:



Advertisements
Similar presentations
Topology Modeling via Cluster Graphs Balachander Krishnamurthy and Jia Wang AT&T Labs Research.
Advertisements

Information-Centric Networks05c-1 Week 5 / Paper 3 Democratizing content publication with Coral –Michael J. Freedman, Eric Freudenthal, David Mazières.
Geographic Locality of IP Prefixes Mythili Vutukuru Joint work with Michael J. Freedman, Nick Feamster and Hari Balakrishnan Internet Measurement Conference.
EDNS0 Client-Subnet for DNS based CDNs
Challenges of OTT video delivery in the dual-stacked world
Using Mobility Support for Request-Routing in IPv6 CDNs Arup Acharya and Anees Shaikh TJ Watson Research Center Presented by Renu Tewari.
RB-Seeker: Auto-detection of Redirection Botnet Presenter: Yi-Ren Yeh Authors: Xin Hu, Matthew Knysz, Kang G. Shin NDSS 2009 The slides is modified from.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
Outline Web measurement motivation Challenges of web measurement Web measurement tools Current web measurements Web properties Web traffic data gathering.
A Taxonomy and Survey of Content Delivery Networks Meng-Huan Wu 2011/10/26 1.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
King : Estimating latency between arbitrary Internet end hosts Krishna Gummadi, Stefan Saroiu Steven D. Gribble University of Washington Presented by:
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley Chuck Cranor, Fred Douglis,
Flash Crowds And Denial of Service Attacks: Characterization and Implications for CDNs and Web Sites Aaron Beach Cs395 network security.
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Internet-Scale Research at Universities Panel Session SAHARA Retreat, Jan 2002 Prof. Randy H. Katz, Bhaskaran Raman, Z. Morley Mao, Yan Chen.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Network-Aware Clustering of Web Clients Advanced IP Topics Seminar, Fall 2000 Supervisor: Anat Bremler Speaker: Zotenko Elena.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao Chuck Cranor, Fred Douglis, Misha Rabinovich, Oliver Spatscheck,
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
Active Network Applications Tom Anderson University of Washington.
Sipat Triukose, Zhihua Wen, Michael Rabinovich WWW 2011 Presented by Ye Tian for Course CS05112.
Redirection and Load Balancing
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Oasis: Anycast for Any Service Michael J. Freedman Karthik Lakshminarayanan David Mazières in NSDI 2006 Presented by: Sailesh Kumar.
Krerk Piromsopa. Advance Net-Centric Computing Technology Krerk Piromsopa. Department of Computer Engineering. Chulalongkorn University.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Lecture 8 Page 1 Advanced Network Security Review of Networking Basics: Internet Architecture, Routing, and Naming Advanced Network Security Peter Reiher.
Application Measurements: Web Measurement. Motivation Web is the single most popular Internet application. Measurement can be very useful.
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
Application-Layer Anycasting By Samarat Bhattacharjee et al. Presented by Matt Miller September 30, 2002.
Paper Presentation – CAP Page 2 Outline Review - DNS Proposed Solution Simulation Results / Evaluation Discussion.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
Advanced Networking Lab. Given two IP addresses, the estimation algorithm for the path and latency between them is as follows: Step 1: Map IP addresses.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Hour 7 The Application Layer 1. What Is the Application Layer? The Application layer is the top layer in TCP/IP's protocol suite Some of the components.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Not So Fast Flux Networks for Concealing Scam Servers Theodore O. Cochran; James Cannady, Ph.D. Risks and Security of Internet and Systems (CRiSIS), 2010.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
On Network-Aware Clustering of Web Clients Balachander Krishnamurthy AT&T Labs-Research, Florham Park, NJ, USA Jia Wang
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
1 A Framework for Measuring and Predicting the Impact of Routing Changes Ying Zhang Z. Morley Mao Jia Wang.
Geographic Locality of IP Prefixes Mythili Vutukuru Joint work with Michael Freedman, Nick Feamster and Hari Balakrishnan.
On the Impact of Clustering on Measurement Reduction May 14 th, D. Saucez, B. Donnet, O. Bonaventure Thanks to P. François.
Yaping Zhu with: Jennifer Rexford (Princeton University) Aman Shaikh and Subhabrata Sen (ATT Research) Route Oracle: Where Have.
Information-Centric Networks Section # 5.3: Content Distribution Instructor: George Xylomenos Department: Informatics.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
BGP Routing Stability of Popular Destinations Jennifer Rexford, Jia Wang, Zhen Xiao, and Yin Zhang AT&T Labs—Research Florham Park, NJ All flaps are not.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
A Measurement Study on the Impact of Routing Events on End-to-End Internet Path Performance Feng Wang 1, Zhuoqing Morley Mao 2 Jia Wang 3, Lixin Gao 1,
Content Distribution Networks (CDNs)
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Coral: A Peer-to-peer Content Distribution Network
Content Distribution Networks
Early Measurements of a Cluster-based Architecture for P2P Systems
Group 3: Olena Hunsicker and Divya Josyula
Presentation transcript:

Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley C. Cranor, M. Rabinovich, O. Spatscheck, and J. Wang AT&T Labs-Research F. Douglis IBM Research

Motivation Content Distribution Networks (CDNs) Attempt to deliver content from servers close to users Internet Clients Cache server Origin servers Cache server

DNS based server selection Originator problem Assumes that clients are close to their local DNS servers Verify the assumption that clients are close to their local DNS servers Client.myisp.net Local DNS Server ns.myisp.net Authoritative DNS server ns.service.com A.GTLD-SERVERS.NET ns.service.com Server IP address

Measurement setup Three components 1x1 pixel embedded transparent GIF image A specialized authoritative DNS server Allows hostnames to be wild-carded An HTTP redirector Always responds with “302 Moved Temporarily” Redirect to a URL with client IP address embedded 1x1 transparent GIF

Embedded image request sequence Client [ ] Redirector for xxx.rd.example.com Local DNS server Content server for the image Name server for *.cs.example.com 1. HTTP GET request for the image 2. HTTP redirect to IP cs.example.com 3. Request to resolve IP cs.example.com 4. Request to resolve IP cs.example.com 5. Reply: IP address of content server 6. Reply: content server IP address 7. HTTP GET request for the image 8. HTTP response

Measurement Data SiteParticipantImage hit count Duration 1att.com20,816,9272 months 2,3Personal pages (commercial domain)1,7433 months 4AT&T research212,8143 months 5-7University sites4,367,0763 months 8-19Personal pages (university domain)26,5633 months

Measurement statistics Data typeCount Unique client-LDNS associations4,253,157 HTTP requests25,425,123 Unique client IPs3,234,449 Unique LDNS IPs157,633 Client-LDNS associations where Client and LDNS have the same IP address56,086

Proximity metrics: AS clustering Network clustering Traceroute divergence Roundtrip time correlation

AS clustering Autonomous System (AS) A single administrative entity with unified routing policy Observes if client and LDNS belong to the same AS

Network clustering [Krishnamurthy,Wang sigcomm00] Based on BGP routing information using the longest prefix match Each prefix identifies a network cluster Observes if client and LDNS belong to the same network cluster

Traceroute divergence Probe machine client Local DNS server [Shaikh et al. infocom00] Use the last point of divergence Traceroute divergence: Max(3,4)= a b

Roundtrip time correlation Correlation between message roundtrip times from a probe site to the client and its LDNS server The probe site represents a potential cache server location A crude metric, highly dependent on the probe site

Aggregate statistics of AS/network clustering More than 13,000 ASes Close to 75% total ASes 440,000 unique prefixes Close to 25% of all possible network clusters  We have a representative data set Metrics# client clusters # LDNS clusters Total # clusters AS clustering9,2158,5909,570 Network clustering98,00153,321104,950

Proximity analysis: AS, network clustering MetricsClient IPsHTTP requests AS cluster64%69% Network cluster16%24% AS clustering: coarse-grained Network clustering: fine-grained Most clients not in the same routing entity as their LDNS Clients with LDNS in the same cluster slightly more active

Proximity analysis: Traceroute divergence Probe sites: NJ(UUNET), NJ(AT&T), Berkeley(Calren), Columbus(Calren) Sampled from top half of busy network clusters Median divergence: 4 Mean divergence: Ratio of common to disjoint path length 72%-80% pairs traced have common path at least as long as disjoint path

Improved local DNS configuration For client-LDNS associations not in the same cluster, do we know a LDNS in the client’s cluster? MetricsOriginalImprovedOriginalImproved AS cluster64%88%69%92% Network cluster16%66%24%70% Client IPsHTTP requests

Impact on commercial CDNs Data set Client-LDNS associations LDNS-CDN associations Available CDN servers Client w/ CDN server in cluster Verifiable clients: w/ responsive LDNS Misdirected clients: directed to a cache not in client’s cluster Clients with LDNS not in same cluster

Impact on commercial CDNs AS clustering CDNCDN XCDN YCDN Z Clients with CDN server in cluster 1,679,5151,215,372618,897 Verifiable clients1,324,022961,382516,969 Misdirected clients (% of verifiable clients) 809,683 (60%) 752,822 (77%) 434,905 (82%) Clients with LDNS not in client’s cluster (% of misdirected clients) 443,394 (55%) 354,928 (47%) 262,713 (60%)

Impact on commercial CDNs Network clustering CDNCDN XCDN YCDN Z Clients with cache server in cluster 264,743156,507103,448 Verifiable clients221,440132,56790,264 Misdirected clients (% of verifiable clients) 154,198 (68%) 125,449 (94%) 87,486 (96%) Clients with LDNS not in client’s cluster (% of misdirected clients) 145,276 (94%) 116,073 (93%) 84,737 (97%) Less than 10% of all clients

Conclusion Novel technique for finding client and local DNS associations Fast, non-intrusive, and accurate DNS based server selection works well for coarse-grained load-balancing 64% associations in the same AS 16% associations in the same network cluster Server selection can be inaccurate if server density is high

Related work Measurement methodology 1. IBM (Shaikh et al.) Time correlation of DNS and HTTP requests from DNS and Web server logs 2. Univ of Boston (Bestavros et al.) Assigning multiple IP addresses to a Web server Differences from our work: Our methodology: efficient, accurate, nonintrusive 3. Web bugs Proximity metrics Cisco’s Boomerang protocol: uses latency from cache servers to the LDNS