September 2008 Josilene Aires Moreira.  Overview  CDN Topology  CDNs nowadays  Contructing a CDN ◦ Basic model ◦ Modules ◦ Characteristics  References.

Slides:



Advertisements
Similar presentations
Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.
Advertisements

1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
Akamai networks,48000 servers and 70 countries in the world.
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
CompSci 356: Computer Network Architectures Lecture 21: Content Distribution Chapter 9.4 Xiaowei Yang
19 Historical overview Main challenge: How to distribute content in high quality over the Internet cost-effectively? • Traditional “Best-effort” model:
A Taxonomy and Survey of Content Delivery Networks Meng-Huan Wu 2011/10/26 1.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Web Servers How do our requests for resources on the Internet get handled? Can they be located anywhere? Global?
What’s a Web Cache? Why do people use them? Web cache location Web cache purpose There are two main reasons that Web cache are used:  to reduce latency.
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
Anycast Jennifer Rexford Advanced Computer Networks Tuesdays/Thursdays 1:30pm-2:50pm.
Web Caching Schemes For The Internet – cont. By Jia Wang.
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Tradeoffs in CDN Designs for Throughput Oriented Traffic Minlan Yu University of Southern California 1 Joint work with Wenjie Jiang, Haoyuan Li, and Ion.
Technologies for Building Content Delivery Networks Pei Cao Cisco Systems, Inc.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
Web Server Load Balancing/Scheduling Asima Silva Tim Sutherland.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Caching and Content Distribution Networks. Some Interesting Observations r Top 1 % of all documents account for 20% - 35% of proxy requests r Top 10%
Content Distribution March 8, : Application Layer1.
Active Network Applications Tom Anderson University of Washington.
{ Content Distribution Networks ECE544 Dhananjay Makwana Principal Software Engineer, Semandex Networks 5/2/14ECE544.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
SCAN: a Scalable, Adaptive, Secure and Network-aware Content Distribution Network Yan Chen CS Department Northwestern University.
INFORMATION AND COMMUNICATION SYSTEMS MERIT 2008 Research Symposium Melbourne Engineering Graduates Look to the Future System Architecture An internetworking.
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
CPSC 441: Multimedia Networking1 Outline r Scalable Streaming Techniques r Content Distribution Networks.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
The Eyeblaster ACM Advertising Campaign Management.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Application of Content Computing in Honeyfarm Introduction Overview of CDN (content delivery network) Overview of honeypot and honeyfarm New redirection.
Making the Best of the Best-Effort Service (2) Advanced Multimedia University of Palestine University of Palestine Eng. Wisam Zaqoot Eng. Wisam Zaqoot.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Internet Real-Time Laboratory Arezu Moghadam and Suman Srinivasan Columbia University in the city of New York 7DS System Design 7DS system is an architecture.
Content distribution networks (CDNs) r The content providers are the CDN customers. Content replication r CDN company installs hundreds of CDN servers.
CONTENT DELIVERY NETWORKS
Computer Science Lecture 14, page 1 CS677: Distributed OS Last Class: Concurrency Control Concurrency control –Two phase locks –Time stamps Intro to Replication.
Overlay Networks: An Akamai Perspective Ramesh K. Sitaraman, mangesh kasbekar, Woody Lichtenstein, and Manish Jain Akamai Technologies Inc Univerisy of.
 Cachet Technologies 1998 Cachet Technologies Technology Overview February 1998.
Distributed Computing Systems CSCI 4780/6780. Scalability ConceptExample Centralized servicesA single server for all users Centralized dataA single on-line.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
1 Reforming Software Delivery Using P2P Technology Purvi Shah Advisor: Jehan-François Pâris Department of Computer Science University of Houston Jeffrey.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
Content Distribution Networks (CDNs)
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
Drafting Behind Akamai (Travelocity-Based Detouring) Ao-Jan Su, David R. Choffnes, Aleksandar Kuzmanovic and Fabián E. Bustamante Department of EECS Northwestern.
Content Distribution Networks
Coral: A Peer-to-peer Content Distribution Network
Content Distribution Networks
Mohammad Malli Chadi Barakat, Walid Dabbous Alcatel meeting
Utilization of Azure CDN for the large file distribution
Internet Networking recitation #12
Content Distribution Networks
DotSlash: An Automated Web Hotspot Rescue System
Content Distribution Networks
AWS Cloud Computing Masaki.
AKAMAI Content Delivery Services
Presentation transcript:

September 2008 Josilene Aires Moreira

 Overview  CDN Topology  CDNs nowadays  Contructing a CDN ◦ Basic model ◦ Modules ◦ Characteristics  References

 Origin Servers  Surrogate Servers (mirrow servers)

 First evolved in 1998, replicate content over many web servers to deal with flash crowds  Geographically distributed web servers to improve performance and scalability  Improves network performance through content replication ◦ Maximizing bandwidth ◦ Improving accessbility ◦ Mantaining correctness  Moves content to edge servers located close to the users

 Host thirdy-part content  Any digital content  Static ◦ Html pages ◦ Images ◦ Documents ◦ Software  Streaming media ◦ Audio ◦ Video  Content services ◦ E-commerce service Source of the content Large enterprises Web service providers Media companies New broadcasters

 Origin Server  Surrogate Servers ◦ A set of replica servers that cache the origin’s server content  Routers and network elements ◦ Deliver content requests to the optimal location and the optimal surrogate server  Accounting mechanism ◦ Provides logs and information to the origin server

 Advantages ◦ Reducing the customer need to invest in Website infrastructure ◦ Decreasing the operational effort and costs of managing such infrastructure ◦ Bypassing traffic jams  Data is closer to the user  There is no needs to transverse all congested pipes ◦ Improving content delivery quality, speed and reliability ◦ Reducing the load on origin servers

 Akamai Technologies ( is the market leader ◦ (80% of the overall CDN market) in providing content delivery services. It owns more than 12,000 servers over 1,000 networks in 62 countries.  Mirror Image Internet, Inc. ( ◦ supports surrogate servers located in 22 cities around the world (North America, Europe, and Asia), which provide a range of value-added services, from content distribution to media streaming and managed caching.  Inktomi, a Yahoo Company ( ◦ Provides managed services for global load balancing, failover, content delivery, and streaming media using more than 1,000 surrogate servers worldwide.  LimeLight Network ( ◦ provides a suite of services (including music download and subscription services, video game developers and distributors, movie/video download services, and so forth) and supports surrogate servers located in 72 locations around the world (Asia, the U.S., and Europe).

 Codeen ◦ Codeen is developed at Princeton University, USA. It provides caching of Web content and redirection of HTTP requests. ◦ It is an academic testbed content distribution network built on top of PlanetLab. This testbed consists of a network of high performance proxy servers. ◦ Each of these proxy servers acts both as request redirectors and surrogate servers. They provide fast and robust Web content delivery service through cooperation and collaboration.  Coral CDN ◦ Coral is a free, peer-to-peer content distribution network designed to mirror web content. ◦ Coral is designed to use the bandwidth of volunteers to avoid congestion and to reduce the load on websites and other web content providers in general.  Globule ◦ Globule is an open-source collaborative content delivery network developed at the Vrije Universiteit in Amsterdam. Globule aims to allow Web content providers to organize together and operate their own world-wide hosting platform. It provides replication of content, monitoring of servers and redirecting of client requests to available replicas.

 Origin  Surrogate  Content manager  Redirector  Clients  Access network On Content Delivery Network Implementation, Molina Moreno et al, 2006

 Replica servers ◦ Acts as proxy/cache servers ◦ Store and delivery content  Components ◦ Portal = http-based web-server  Provides access to the content stored in the CDN ◦ Streaming server  A media streaming server to distribute multimedia content ◦ Surrogate Database (DB)  Contains a list of all available streaming sessions, the objects stored in the surrogate and information for the management of the CDN  Research Questions ◦ Number of Surrogate Servers ◦ Surrogate Placement Algorithms (Greedy, Hot-spot, Tree-based,…)

 Research Questions ◦ Number of Surrogate Servers ◦ Surrogate Placement Algorithms  Placement Algorithms  Content distribution includes the placement of surrogate servers to some strategic positions, close to the clients.  Some theoretical approaches model the problem as the “center placement problem”: for the placement of a given number of centers, minimize the maximum distance between a node and the nearest center. ◦ Greedy ◦ Hot spot ◦ Tree-based

 Control the media objects stored in each Surrogate  Provide this information to the Redirector, to get each client served by the most suitable Surrogate or Peer  Content Locator Determines ◦ Number of replicas of a media object ◦ In which surrogate servers a new object has to be stored ◦ Elimination of non-popular objects from the surrogates ◦ Interaction of the CDN within the origin servers ◦ Update media objects in the surrogates ◦ Move media objects among surrogates  DB Content ◦ Stores the information managed by the Content Locator

 Research Questions  Content Outsourcing  Related to the relationship between surrogates ◦ Cooperative push-based  The content is initially pushed from Origin Server  The surrogate servers cooperate to reduce replication  This approach has not yet been adopted by a CDN provider ◦ Uncooperative pull-based  Clients’ requests are directed to their closest Surrogate Server  If there is a cache miss, the request is redirected to another Surrogate or to the Origin Server  Akamai and Mirror Image use this approach ◦ Cooperative pull-based  Client requests are directed to their closest surrogate  In case of cache misses, the surrogate servers use a distributed index to find nearby copies of the requested objects  Coral (academic CDN) has implemented this approach

 Research Questions (cont.)  Content Selection Related to how the content is replicated at the surrogates ◦ Entire replication  High disk prices  Web objects are increasing  Updating is a problem ◦ Partial site content selection and delivery  Empirical  Uses heuristics to decide the content to be replicated at the edge servers (uncertainty in choosing right heuristics)  Popularity-based  Most popular objects are replicated  Object-based  Content is replicated in units of objects that gives the maximum performance gain (high complexity to implement on real applications)  Cluster-based  Content is grouped based on correlation or access frequency and replicated in units of clusters (high complexity)

 Research Questions (cont.)  Caching Techniques Related to how the caches manage a cache miss ◦ Intra-cluster caching  Query-based  On a cache miss, a CDN server broadcasts a query to others cooperating CDN servers (significant query traffic)  Digest-based  Each CDN server has a map of content held by the other cooperating surrogates (update traffic overhead)  Directory-based  A centralized server keeps information of all the cooperating surrogates in a cluster (potential bottleneck = single point of failure)  Hashing-based / Semi-hashing based for streaming  Cooperating CDN servers maintain the same hash function  More efficient, highest content sharing efficiency

 Research Questions (cont.)  Cache update ◦ Periodic Update  Most common method ◦ Update propagation  An updated version is delivered to all caches whenever a change is made into the origin server ◦ On-demand update  The latest copy of a document is propagated to the surrogate cache server based on prior request for that content. The content is not updated until be requested ◦ Invalidation approach  An invalidation message is sent to all surrogate caches when there is a change at the origin server, and the caches need to fetch an updated version later

 Provides intelligence to the system ◦ Estimates the most adequate surrogate server for each different client request  CDN DNS ◦ Accept requests from the client local DNS and route the client to the most adequate surrogate (several approaches, Akamai uses servers in two hierarchies) ◦ DB DNS  Stores information about addresses and names of the surrogate servers, and some additional information

Redirector (Request-routing)  Monitor Module (Main knowledge gatherer of the system) ◦ Gets statistical information from different key elements of the CDN architecture ◦ Conducts a variety of measurements to obtain information about the network and the CDN components ◦ Uses SNMP to get data from the surrogates to estimate the RTT between Clients and Surrogates ◦ DB Monitor : Stores all information  Redirection Algorithm ◦ Selects the optimal surrogate server on the basis of the information gathered by the Monitor

 Research Questions  Which one is the appropriate surrogate server? ◦ The replica server ‘closest’ to the client  Decision based on Metrics (‘closest’) ◦ Network proximity ◦ Client perceived latency ◦ Distance ◦ Replica server load  Request Routing Algorithm ◦ Non-adaptive  Round robin, prediction on load of servers (number of requests), client-server distance ◦ Adaptive  Network proximity (path length), Client perceived latency  Akamai uses a complex combination of server load, reliability of paths, bandwidth available to a replica server  Cisco DistributedDirector uses inter-AS distance, intra-AS distance, end-to-end latency

 Pallis, G. and Vakali, A., “Insight and Perspectives for Content Delivery Networks”, Communications of the ACM, vol. 49, no. 1, ACM Press, NY, USA, January  Moreno, B. M.; Salvador, C. P.; Domingo, M. E.; Peña, I. A. & Extremera, V. R. (2006), 'On content delivery network implementation', Computer communications 29(12),  Pathan, A. K. & Buyya, R. (2007), 'A Taxonomy and Survey of Content Delivery Networks', Technical report, The University of Melbourne, Australia