Sipat Triukose, Zhihua Wen, Michael Rabinovich WWW 2011 Presented by Ye Tian for Course CS05112.

Slides:



Advertisements
Similar presentations
Information-Centric Networks05c-1 Week 5 / Paper 3 Democratizing content publication with Coral –Michael J. Freedman, Eric Freudenthal, David Mazières.
Advertisements

EDNS0 Client-Subnet for DNS based CDNs
Building Cloud-ready Video Transcoding System for Content Delivery Networks(CDNs) Zhenyun Zhuang and Chun Guo Speaker: 饒展榕.
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
Jan 17, 2001CSCI {4,6}900: Ubiquitous Computing1 Announcements.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
1 Routing and Scheduling in Web Server Clusters. 2 Reference The State of the Art in Locally Distributed Web-server Systems Valeria Cardellini, Emiliano.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
HTTP and Web Content Delivery COS 461: Computer Networks Spring 2011 Mike Freedman
Flash Crowds And Denial of Service Attacks: Characterization and Implications for CDNs and Web Sites Aaron Beach Cs395 network security.
Drafting Behind Akamai (Travelocity-Based Detouring) Aleksandar Kuzmanovic Northwestern University Joint work with: A. Su, D. Choffnes, and F. Bustamante.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao UC Berkeley C. Cranor, M. Rabinovich,
EEC-484/584 Computer Networks Lecture 7 Wenbing Zhao (Part of the slides are based on Drs. Kurose & Ross ’ s slides for their Computer.
Evaluation of the Proximity between Web Clients and their Local DNS Servers Z. Morley Mao Chuck Cranor, Fred Douglis, Misha Rabinovich, Oliver Spatscheck,
DOMAIN NAME SYSTEM. Introduction  There are several applications that follow client server paradigm.  The client/server programs can be divided into.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Distribution Networks (CDNs) Mike Freedman COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
CSCI-1680 Web Performance and Content Distribution Based partly on lecture notes by Scott Shenker and John Jannotti Rodrigo Fonseca.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Information-Centric Networks05a-1 Week 5 / Paper 1 On the use and performance of content distribution networks –Balachander Krishnamurthy, Craig Wills,
Content Distribution Networks CPE 401 / 601 Computer Network Systems Modified from Ravi Sundaram, Janardhan R. Iyengar, and others.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
1 Caching  Temporary storage of frequently accessed data (duplicating original data stored somewhere else)  Reduces access time/latency for clients 
Active Network Applications Tom Anderson University of Washington.
CSE 534 – Fundamentals of Computer Networks Lecture 11: Content Delivery Networks (Over 1 billion served … each day) Based on slides by D. NEU.
1. 1.Charting the CDNs(locating all their content and DNS servers). 2.Assessing their server availability. 3.Quantifying their world-wide delay performance.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
Chapter 1: Introduction to Web Applications. This chapter gives an overview of the Internet, and where the World Wide Web fits in. It then outlines the.
Akamai vs. Flash Crowds and Distributed Denial of Service Akamai Technologies & Carnegie Mellon Bruce Maggs.
Company LOGO mDNS (ICM3400) Proposal for Hierarchical Multicast Session Directory Architecture Piyush Harsh & Richard Newman.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
October 8, 2015 University of Tulsa - Center for Information Security Microsoft Windows 2000 DNS October 8, 2015.
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Distributed Computing COEN 317 DC2: Naming, part 1.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
An Efficient Approach for Content Delivery in Overlay Networks Mohammad Malli Chadi Barakat, Walid Dabbous Planete Project To appear in proceedings of.
Web Caching and Content Distribution: A View From the Interior Syam Gadde Jeff Chase Duke University Michael Rabinovich AT&T Labs - Research.
The Inter-network is a big network of networks.. The five-layer networking model for the internet.
CDN: Content Distribution Networks  References:  CS613 textbook, “Computer Networking – A Top-Down Approach”, 6 th edition. Chapter  The text.
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
Drafting Behind Akamai (Travelocity-Based Detouring) Dr. Yingwu Zhu.
Information-Centric Networks Section # 3.2: DNS Issues Instructor: George Xylomenos Department: Informatics.
Information-Centric Networks Section # 5.3: Content Distribution Instructor: George Xylomenos Department: Informatics.
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Overlay Networks : An Akamai Perspective
Information-Centric Networks Section # 5.1: Content Distribution Instructor: George Xylomenos Department: Informatics.
Content Distribution Networks (CDNs)
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
Content Distribution Networks
Content Distribution Networks
Caching Temporary storage of frequently accessed data (duplicating original data stored somewhere else) Reduces access time/latency for clients Reduces.
CloudFront: Living on the Edge
Content Distribution Networks
Engineering a Content Delivery Network
Engineering a Content Delivery Network
Presentation transcript:

Sipat Triukose, Zhihua Wen, Michael Rabinovich WWW 2011 Presented by Ye Tian for Course CS05112

Overview Background and Motivation Methodology Performance of Akamai CDN Performance of Consolidated Akamai CDN Review

Background A CDN operator deploys a large number of edge servers throughout the Internet. “outsources” certain hostnames to the CDN’s domain name service

HTTP End-user cnn.com (content provider) DNS root server 12 Nearby Akamai cluster GET index. html 4 HTTP Akamai cluster Akamai global DNS server Akamai regional DNS server

HTTP End-user cnn.com (content provider) DNS root server 12 Nearby Akamai cluster 5 DNS lookup cache.cnn.com Akamai cluster 3 4 ALIAS: g.akamai.net Akamai global DNS server Akamai regional DNS server

HTTP End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster 6 Akamai cluster ALIAS a73.g.akamai.net DNS lookup g.akamai.net

HTTP End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster 7 Akamai cluster DNS a73.g.akamai.net Address

HTTP End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster 8 Akamai cluster GET /foo.jpg Host: cache.cnn.com

HTTP End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster 9 Akamai cluster GET /foo.jpg Host: cache.cnn.com GET foo.jpg

HTTP End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster 10 Akamai cluster

Background Example: outsourcing “images.firm-x.com” Step 1-2: firm-x’s DNS server redirects DNS queries for outsourced hostnames to CDN by returning a CNAME; e.g., images.firm-x.com.CDNname.net Step 3: The user resolves the canonical name, with a query that will arrive at the DNS system for the CDN-name.net domain, operated by the CDN Step 4-5: selects an appropriate edge server (typically close to the client as long as server) and responds to the query with the selected server IP address, enacting the content download from the selected server

Example nslookup “ Example AExample B Sourceplanetlab9.millennium.be rkeley.edu planetlab2.dit.upm.es Source locationBerkeley, CA, USAMadrid, Spain Local DNS CNAMEa1105.b.akamai.net Edge Server IP Edge Server LocationSan Francisco, CA, USAMadrid, Spain

From Berkeley From Madrid

Motivation Two CDN design approaches Akamai-like design: Akamai platform in 2007 spanned more than 3,000 locations in over 750 cities in over 70 countries, with each location having on average less than ten servers Digital Island-like design: utilizes massive data centers, comprising thousands of servers, but in many fewer locations. Limelight currently lists 20 data centers on its web site Question: How many locations is enough from the client- observed performance perspective?

Overview Background and Motivation Methodology Performance of Akamai CDN Performance of Consolidated Akamai CDN Review

Edge Server Discovery Needs to harvest a large collection of edge servers since this requires hostname lookups resolutions from different geographical locations. Utilize DipZoom: Typically more then 400 active MPs, mostly on PlanetLab nodes but also on some academic and residential hosts.

Edge Server Discovery Compiled canonical names outsourced to Akamai of the 95 Akamai customers listed on the Akamai’s Web site. Performed DNS resolution of these names from a large number of network locations over a period of 13 weeks. By clustering these edge servers by city and autonomous system, we conservatively estimate we discovered at least 308 locations.

Youku’s CDN Huajun He, Yang Zhao, Jinfu Wu and Ye Tian, Cost-Aware Capacity Provisioning for Internet Video Streaming CDNs, The Computer Journal (Oxford), 2015.

Overriding CDN Edge Server Selection Why? To measure download performance from a particular edge server rather than the server of Akamai’s choosing. How? Connect to the desired edge server directly using its raw IP address rather than the DNS hostname from the URL.

Controlling Edge Server Caching Why? Compare the download performance from origin server and from the edge server cache. How? Use HTTP’s Cache-Control header. Not working Modern caches use the entire URL strings, including the search string as the cache key. On the other hand, origin servers ignore unexpected search strings in otherwise valid URLs. A request for “foo.jpg?randomstring” will return the valid foo.jpg image from the origin server.

Controlling Edge Server Caching 1, 4, 9 from origin server 2, 3,5, 6, 7, 8 from edge server Precisely control the caching behavior of Akamai’s edge servers by appending search strings to image URLs.

Assessing Client-Side Caching Bias How HTTP download performance from a given Akamai edge server depends on its ping round-trip latency. Perform a set of 4 curl downloads, 5 pings, and a traceroute from each MP to various edge servers. Spearman’s rank correlation coefficient of − indicating a strong dependency.

Overview Background and Motivation Methodology Performance of Akamai CDN Performance of Consolidated Akamai CDN Review

Does a CDN Enhance Performance? Compare download performance Cache vs. non-cache (Note: non-cache is still outsourced) Cache vs. origin CDN delivery outperforms both non-cache and origin delivery in 98% and 96% of cases respectively. Missing penalty can be significant Non-cache: first from CDN edge server to cache; then from cache to the client

How Good Is Akamai Server Selection? How good is the Akamai selection algorithm from the client performance perspective. For each measuring point, the percentage of time that the Akamai-selected server performed better than the alternative server CDNs rarely select the best edge server but successfully avoid the worst ones. In roughly 75% of the MPs, the Akamai-selected server outperformed half of the alternatives.

How Good Is Akamai Server Selection? The average difference, the maximum gain, and the maximum loss of throughput of the Akamai-selected server vs. the alternatives. Maximum gain and maximum loss are the throughput differences of the Akamai- selected server over, resp., the slowest and fastest alternative server. There is a substantial room for further improvements.

Overview Background and Motivation Methodology Performance of Akamai CDN Performance of Consolidated Akamai CDN Review

Data Center Consolidation Group edge servers that are close to each other into hypothetical consolidated data centers Refer to as big clusters. “Place” each big cluster into the location of a central server in the cluster called the representative of the cluster. For a given client, we replace the server selected by Akamai, S akam with the representative of the cluster to which S akam belongs.

Data Center Consolidation Estimate the pairwise network distances between all the servers using the dynamic triangles method Use the so-called hierarchical clustering with complete linkage method, following by the cut-the-tree procedure

DipZoom Experiment From each measurement point, downloaded the object of a given size from the Akamai-selected server and from the center of its cluster in the consolidated platform. Group 10,231 edge servers into 150, 100, 60, 40, and 20 data centers

DipZoom Experiment 150, 100, and 60 data centers: consolidated and current platforms show performance advantage over each other with equal probability. 40 and 20 data centers: the original platform starts outperforming the consolidated configuration

DipZoom Experiment Group the MPs by the maximum download throughput

DipZoom Experiment The existing Akamai configuration with large number of data centers favors well-connected users. Compare the performance of existing vs. consolidated configurations for all measurement points with bandwidth below 1.5Mbps A typical download bandwidth for DSL users.

A Live Study Built a Web page with an AJAX application that measures latency to the server that Akamai selects for this browser as well as to a list of other Akamai edge servers, and reports the results to us. Pick a CNAME (a1694.g.akamai.net) which is mapped by Akamai to 979 edge servers. Partitioned the 979 servers into 10, 50, and 100 clusters.

A Live Study Consider the differences between the measured latencies from each client to its closest consolidated data center and the Akamai-selected server. Negative indicates that the consolidated platform is better Positive reflects existing Akamai platform is better Even with significant consolidation, better server selection can make up for any performance losses from consolidation.

Summary Two CDN design approaches? Example. How CDN redirects its clients to one of its edge server? From user performance perspective, can we consolidate the Akamai CDN platform? To what extent?