Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.

Slides:



Advertisements
Similar presentations
CHAPTER 15 WEBPAGE OPTIMIZATION. LEARNING OBJECTIVES How to test your web-page performance How browser and server interactions impact performance What.
Advertisements

Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
Amazon CloudFront An introductory discussion. What is Amazon CloudFront? 5/31/20122© e-Zest Solutions Ltd. Amazon CloudFront is a web service for content.
1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
1 Configuring Internet- related services (April 22, 2015) © Abdou Illia, Spring 2015.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
2/23/2004 Load Balancing February 23, /23/2004 Assignments Work on Registrar Assignment.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
The Internet Useful Definitions and Concepts About the Internet.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
What’s a Web Cache? Why do people use them? Web cache location Web cache purpose There are two main reasons that Web cache are used:  to reduce latency.
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
1 Drafting Behind Akamai (Travelocity-Based Detouring) AoJan Su, David R. Choffnes, Aleksandar Kuzmanovic, and Fabian E. Bustamante Department of Electrical.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
Web Content Delivery Networks Yogesh Bhumralkar. CDN: Motivations zCongestion in the Internet. zWeb Servers sometimes become overloaded due to too many.
Technologies for Building Content Delivery Networks Pei Cao Cisco Systems, Inc.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.
Content Distribution Networks (CDNs) Mike Freedman COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
Content Distribution Networks CPE 401 / 601 Computer Network Systems Modified from Ravi Sundaram, Janardhan R. Iyengar, and others.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
1 Caching  Temporary storage of frequently accessed data (duplicating original data stored somewhere else)  Reduces access time/latency for clients 
Content Distribution March 8, : Application Layer1.
Active Network Applications Tom Anderson University of Washington.
Sipat Triukose, Zhihua Wen, Michael Rabinovich WWW 2011 Presented by Ye Tian for Course CS05112.
Redirection and Load Balancing
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
SMUCSE 4344 application layer. SMUCSE 4344 application vs. application-layer protocols application-layer protocol is just one piece –how the end hosts.
CP476 Internet Computing Lecture 5 : HTTP, WWW and URL 1 Lecture 5. WWW, HTTP and URL Objective: to review the concepts of WWW to understand how HTTP works.
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
Web Page Design I Basic Computer Terms “How the Internet & the World Wide Web (www) Works”
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Making the Best of the Best-Effort Service (2) Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot.
Ch 2. Application Layer Myungchul Kim
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
Setup and Management for the CacheRaQ. Confidential, Page 2 Cache Installation Outline – Setup & Wizard – Cache Configurations –ICP.
Web Design and Development. World Wide Web  World Wide Web (WWW or W3), collection of globally distributed text and multimedia documents and files 
Server Setup and Configuration Babu Ram Dawadi 1.
Globally Distributed Content Delivery Presenter: Baoning Wu 03/25/2003.
© 2007 Cisco Systems, Inc. All rights reserved.Cisco Public ITE PC v4.0 Chapter 1 1 Application Layer Functionality and Protocols.
Content Distribution Network, Proxy CDN: Distributed Environment
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Content Distribution Networks (CDNs)
Ch 2. Application Layer Myungchul Kim
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
(class #2) CLICK TO CONTINUE done by T Batchelor.
19 – Multimedia Networking
Content Distribution Networks
Content Distribution Networks
Web Caching? Web Caching:.
Utilization of Azure CDN for the large file distribution
Internet Applications
ECE 671 – Lecture 16 Content Distribution Networks
Content Distribution Networks
Configuring Internet-related services
Content Distribution Networks
The Internet and Electronic mail
AKAMAI Content Delivery Services
Presentation transcript:

Caching and Content Distribution Networks

Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy cache request response request response Web server browser Web server request response

Web Browser Caching r Web browsers have their own caches. When a page is downloaded from a site the web page is put into the browser cache. r This is especially useful in those cases when the back button is pressed.  If a new copy is needed then a “ refresh ” can be done. r No page stays permanently in the cache. There is limited room. m A replacement algorithm is needed to determine which cached page should be purged.

Why Web Server Caching r Latency m Reduce latency m Request does not require going to the server m Request is served from the client side which means that network communication is avoided r Reduce traffic

Consistency r What if the page changes after saved in the cache? m This means that cached copy is out of date m The copy and the original are not consistent r There are different strategies for dealing with this

Web Browser Caching r Client pull m The server provides the content with instructions on when the client should ask for a refreshed copy of the content or if the content should be cached. r Server push m The server transmits page information to the screen. m The browser application displays the information and leaves the connection to the server open. m With an open connection, the server can continue to push updated pages for your screen to display on an ongoing basis. You can close the connection by closing the page. m The server is in control u Browser caches are different from proxy caches (discussed next).

Web Caching r Proxy caches (also called proxy server) m Intercepts HTTP requests from client Serves object if in its cache and the date is still valid If not go to object ’ s home server –On behalf of user, gets the object and possibly deposits in its cache before returning to user Usually deployed at edges of a network –Wide area bandwidth savings, improved response time and increased availability of static web-based objects r A browser may have to be configured to point to the proxy server. r Usually a proxy cache is purchased and installed by an organization

Web Caching r Not all web pages can be cached m If the Last-Modified tag then page can be cached r Refresh is often done when m There is a request; and m Expiry time has passed

Cooperative Caching r Caching infrastructure can have multiple web proxies m Proxies can be arranged in a hierarchy or other structures m Proxies can cooperate with one another Answer client requests Propagate server notifications m Uses a combination of HTTP and ICP (Internet Caching Protocol). ICP can be used by one cache to quickly ask another cache if it has an object. HTTP is used to actually retrieve the object.

Problems r Caching proxies do not serve all Internet users r Content providers (say, Web servers) cannot rely on existence and correct implementation of caching proxies. r Accounting issues with caching proxies: m Example: needs to know the number of hits to the advertisements displayed on the web page.

Content Distribution Networks (CDN) r Business Model: A content provider such as or Yahoo pays a CDN company (such as Akamai) to get its content to the requesting users with short delays. r A CDN provides a mechanism for m Replicating content on multiple servers in the Internet m Providing clients with a means to determine the servers that can deliver the content fastest.

Terminology r Content: Any publicly accessible combination of text, images, applets, frames, MP3, video, flash, virtual reality objects, etc. r Content Provider: Any individual, organization, or company that has content that it wishes to make available to users.  Origin Server: Content provider ’ s server, where the content is first uploaded.  Surrogate Server (sometimes called edge server): Content distributor ’ s server, where the replicated content is kept.

Players Content Provider H/W and S/W Vendor Content Distributor Hosting Provider Yahoo, MSNBC, CNN CBC Cisco, Oracle- Sun Akamai, Bell Sells servers Send content Install servers

CDN Distribution r Content providers are CDN customers Content replication r CDN company installs thousands of servers throughout Internet m In large datacenters m Or, close to users r CDN replicates customers’ content r When provider updates content, CDN updates servers origin server in North America CDN distribution node CDN server in S. America CDN server in Europe CDN server in Asia 14

CDN: Functional Components r Distribution Service r Redirection Service r Accounting and Billing system

CDN:Distribution Service r The content provider determines which of its objects it wants the CDN to distribute. r The content provider tags and then pushes this content to a CDN node, which in turn replicates and pushes the content to all its CDN servers.

CDN: Redirection  When a browser in a user ’ s host is instructed to retrieve a specific object (specified using a URL), how does the browser determine whether it should retrieve the object from the origin server or from one of the CDN servers? r As an example, suppose the hostname of the content provider is

How Akamai Works End-user cnn.com (content provider) DNS root server 12 Nearby Akamai cluster GET index. html 18 cnn.com/af/cnn.com/foo.jpg HTTP Akamai cluster Akamai global DNS server Akamai regional DNS server

CDN: Redirection r Users get an html document from this could be index.html r The file index.html uses a modified URL for content that has been replicated. r Example: If the jpeg files are what has been replicated then may be modified as follows: r The browser needs to resolve a73.g.akamai.net hostname for replicated content.

CDN: Redirection r What does this mean? m host part: a73.g.akamai.net m Akamai control part: /7/23 m Content URL: /af/foo.jpg

CDN: Redirection r DNS is configured so that all queries about g.akamai.net that arrive at a DNS server are sent to an authoritative DNS server for g.akamai.net. r This is referred to as a Akamai DNS server (authoritative DNS server)

How Akamai Works End-user cnn.com (content provider)DNS root server 12 Nearby Akamai cluster DNS lookup cache.cnn.com Akamai cluster 3 4 ALIAS: g.akamai.net Akamai global DNS server Akamai regional DNS server

CDN: Redirection r DNS is configured so that all queries about g.akamai.net that arrive at a DNS server are sent to an authoritative DNS server for g.akamai.net. This is referred to as a Akamai DNS server (authoritative DNS server) r When the Akamai DNS server receives the query, it extracts the IP address of the requesting browser..

P P How Akamai Works End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster Akamai cluster ALIAS a73.g.akamai.net DNS lookup g.akamai.net

CDN: Redirection r Based on the IP address and information that it has about the Internet (called a map), the IP address of an Akamai regional server is returned to the requesting browser based on policy m e.g., select the server that is the fewest hops away. r The regional server may choose a surrogate server for content retrieval

HTTP How Akamai Works End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster Akamai cluster DNS a73.g.akamai.net Address

HTTP How Akamai Works End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster Akamai cluster GET /foo.jpg Host: cache.cnn.com

HTTP How Akamai Works End-user cnn.com (content provider) DNS root server 12 Akamai global DNS server Akamai regional DNS server Nearby Akamai cluster Akamai cluster GET /foo.jpg Host: cache.cnn.com GET foo.jpg

CDN Redirection r The Akamai DNS server IP address is now in the cache of the local DNS server. m This implies that it is not always necessary to go to the root DNS server. r The TTL associated with the IP address of an Akamai server(surrogate) is relatively small. m This is done for performance reasons. r Akamai content distribution servers are caches

CDN Redirection r What if content is not there? m If the request content is not found then the surrogate will ask other surrogates within a specified region for information. m If requested information is still not found or is stale, then a request is made to the original web site.

CDN Selection r The tricky issue is selecting which local content server to use for a particular request m Want to spread load evenly m Want minimal impact if server is added or removed. r In Akamai, each surrogate server sends measurement results to the Network Operations Communications Center (NOCC). m Measurement results include number of active TCP connections, HTTP request arrival rate, bandwidth availability, etc m This information is used by the Akamai DNS server.

Accounting Mechanism r Accounting mechanisms collect and track information related to request routing, distribution and delivery. r Information is gathered in real time and put into log files for each CDN component. r This gets sent to the Network Operations Communications Center (NOCC).

Full Site Delivery vs. Partial Site Delivery r Full Site Delivery : All the contents are delivered by the CDN (including HTML, images, and other objects). r Partial Site delivery: Only images, streaming media and other bandwidth intensive objects delivered by the CDN.

Current Akamai Customers

Summary r We have examined replication and issues related to the design and implementation of a replicated system. r Many choices and tradeoffs to consider