Democratizing Content Publication with Coral Mike Freedman Eric Freudenthal David Mazières New York University NSDI 2004.

Slides:



Advertisements
Similar presentations
Information-Centric Networks05c-1 Week 5 / Paper 3 Democratizing content publication with Coral –Michael J. Freedman, Eric Freudenthal, David Mazières.
Advertisements

1 Server Selection & Content Distribution Networks (slides by Srini Seshan, CS CMU)
On the Effectiveness of Measurement Reuse for Performance-Based Detouring David Choffnes Fabian Bustamante Fabian Bustamante Northwestern University INFOCOM.
Experiences with CoralCDN A Five-Year Operational View Michael J. Freedman Princeton University
Content Distribution Networks Costin Raiciu Advanced Topics in Distributed Systems Fall 2012.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Small-world Overlay P2P Network
P561: Network Systems Week 8: Content distribution Tom Anderson Ratul Mahajan TA: Colin Dixon.
Spring 2003CS 4611 Content Distribution Networks Outline Implementation Techniques Hashing Schemes Redirection Strategies.
Distributed hash tables Protocols and applications Jinyang Li.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
CS162 Operating Systems and Systems Programming Lecture 23 HTTP and Peer-to-Peer Networks April 20, 2011 Ion Stoica
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Topics in Reliable Distributed Systems Lecture 2, Fall Dr. Idit Keidar.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
CDNs & Replication Prof. Vern Paxson EE122 Fall 2007 TAs: Lisa Fowler, Daniel Killebrew, Jorge Ortiz.
Flash Crowds And Denial of Service Attacks: Characterization and Implications for CDNs and Web Sites Aaron Beach Cs395 network security.
Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
Content Networking - CON Content Overlay Network Vishal Kumar Singh Eilon Yardeni April, 28 th 2005.
Content Delivery Networks. History Early 1990s sees 100% growth in internet traffic per year 1994 o Netscape forms and releases their first browser.
AKAMAI Content Delivery Services AKAMAI Content Delivery Services CIS726 : PRESENTATION Avinash Ponugoti Avinash Ponugoti Nagarjuna Nagulapati Sathish.
Caching and Content Distribution Networks. Web Caching r As an example, we use the web to illustrate caching and other related issues browser Web Proxy.
Content Delivery Networks (CDN) Dr. Yingwu Zhu Reverse Proxy Reverse Proxy Reverse Proxy Intranet Web Cache Architecure Browser Local ISP cache L4 Switch.
Content Distribution Networks (CDNs) Mike Freedman COS 461: Computer Networks Lectures: MW 10-10:50am in Architecture N101
CSCI-1680 Web Performance and Content Distribution Based partly on lecture notes by Scott Shenker and John Jannotti Rodrigo Fonseca.
Content Distribution Network (CDN) Performance Punit Shah CSE581 Internet Technologies OGI, OHSU 2002, Jan 16th.
CS 4700 / CS 5700 Network Fundamentals Lecture 17.5: Project 5 Hints (Getting a job at Akamai) Revised 3/31/2014.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
On the Use and Performance of Content Distribution Networks Balachander Krishnamurthy Craig Wills Yin Zhang Presenter: Wei Zhang CSE Department of Lehigh.
Content Distribution March 8, : Application Layer1.
Oasis: Anycast for Any Service Michael J. Freedman Karthik Lakshminarayanan David Mazières in NSDI 2006 Presented by: Sailesh Kumar.
Real World Examples – Part II 7/26/2013Miro Remias, Sr. Solution Architect.
Ao-Jan Su, David R. Choffnes, Fabián E. Bustamante and Aleksandar Kuzmanovic Department of EECS Northwestern University Relative Network Positioning via.
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
CDN Brokering* Presented By Nick Arnold Authors Alexandros Biliris, et. Al.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
1 On the Placement of Web Server Replicas Lili Qiu, Microsoft Research Venkata N. Padmanabhan, Microsoft Research Geoffrey M. Voelker, UCSD IEEE INFOCOM’2001,
Information-Centric Networks06a-1 Week 6 / Paper 1 Untangling the Web from DNS –Michael Walfish, Hari Balakrishnan and Scott Shenker –Networked Systems.
Orbited Scaling Bi-directional web applications A presentation by Michael Carter
CDN: Content Distribution Networks  References:  CS613 textbook, “Computer Networking – A Top-Down Approach”, 6 th edition. Chapter  The text.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
DotSlash – or how to deal with 15 minutes of fame Weibin Zhao Henning Schulzrinne Columbia University CATT/WICAT Annual Research Review November 14, 2003.
Information-Centric Networks Section # 3.2: DNS Issues Instructor: George Xylomenos Department: Informatics.
Information-Centric Networks Section # 5.3: Content Distribution Instructor: George Xylomenos Department: Informatics.
Content Distribution Network, Proxy CDN: Distributed Environment
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
Content Distribution Networks (CDNs)
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
John S. Otto Mario A. Sánchez John P. Rula Fabián E. Bustamante Northwestern, EECS.
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
MICROSOFT AJAX CDN (CONTENT DELIVERY NETWORK) Make Your ASP.NET site faster to retrieve.
Content Distribution Networks
Coral: A Peer-to-peer Content Distribution Network
Web Caching? Web Caching:.
Utilization of Azure CDN for the large file distribution
Co* Projects : CoDNS, CoDeploy, CoMon
ECE 671 – Lecture 16 Content Distribution Networks
Content Distribution Networks
Content Distribution Networks + P2P File Sharing
Content Distribution Networks + P2P File Sharing
Presentation transcript:

Democratizing Content Publication with Coral Mike Freedman Eric Freudenthal David Mazières New York University NSDI 2004

A problem… Feb 3: Google linked banner to “julia fractals” Users clicking directed to Australian University web site …University’s network link overloaded, web server taken down temporarily…

The problem strikes again! Feb 4: Slashdot ran the story about Google …Site taken down temporarily…again

The response from down under… Feb 4, later…Paul Bourke asks: “They have hundreds (thousands?) of servers worldwide that distribute their traffic load. If even a small percentage of that traffic is directed to a single server … what chance does it have?” → Help the little guy ←

Existing approaches Client-side proxying Squid, Summary Cache, hierarchical cache, CoDeeN, Squirrel, Backslash, PROOFS, … Problem: Not 100% coverage Throw money at the problem Load-balanced servers, fast network connections Problem: Can’t afford or don’t anticipate need Content Distribution Networks (CDNs) Akamai, Digital Island, Mirror Image Centrally managed, needs to recoup costs

Coral’s solution… Implement an open CDN Allow anybody to contribute Works with unmodified clients CDN only fetches once from origin server Origin Server Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Browser Pool resources to dissipate flash crowds

Coral’s solution… Strong locality without a priori knowledge No hotspots in CDN Should all work automatically with nobody in charge Origin Server Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Browser Pool resources to dissipate flash crowds Coral httpprx dnssrv Coral httpprx dnssrv

Contributions Self-organizing clusters of nodes NYU and Columbia prefer one another to Germany Rate-limiting mechanism Everybody caching and fetching same URL does not overload any node in system Decentralized DNS Redirection Works with unmodified clients No centralized management or a priori knowledge of proxies’ locations or network configurations

Using CoralCDN Rewrite URLs into “Coralized” URLs → Directs clients to Coral, which absorbs load Who might “Coralize” URLs? Web server operators Coralize URLs Coralized URLs posted to portals, mailing lists Users explicitly Coralize URLs

httpprx dnssrv Browser Resolver DNS Redirection Return proxy, preferably one near client Cooperative Web Caching CoralCDN components httpprx Fetch data from nearby ? ? Origin Server 

Functionality needed DNS: Given network location of resolver, return a proxy near the client put (network info, self) get (resolver info) → {proxies} HTTP: Given URL, find proxy caching object, preferably one nearby put (URL, self) get (URL) → {proxies}

Supports put/get interface using key-based routing Problems with using DHTs as given Use a DHT? Lookup latency Transfer latency Hotspots NYUColumbia Germany Japan NYC

Coral distributed index Insight: Don’t need hash table semantics Just need one well-located proxy put (key, value, ttl) Avoid hotspots get (key) Retrieves some subset of values put under key Prefer values put by nodes near requestor Hierarchical clustering groups nearby nodes Expose hierarchy to applications Rate-limiting mechanism distributes puts

httpprx dnssrv Browser Resolver DNS Redirection Return proxy, preferably one near client Cooperative Web Caching CoralCDN components httpprx Fetch data from nearby

Coral httpprx dnssrv Coral httpprx dnssrv Browser Resolver DNS Redirection Return proxy, preferably one near client Cooperative Web Caching CoralCDN components Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Coral httpprx dnssrv Fetch data from nearby get

Key-based XOR routing 000… 111… Distance to key None < 60 ms < 20 ms Thresholds Minimizes lookup latency Prefer values stored by nodes within faster clusters

Prevent insertion hotspots NYU Halt put routing at full and loaded node Full → M vals/key with TTL > ½ insertion TTL Loaded → β puts traverse node in past minute Store at furthest, non-full node seen Store value once in each level cluster Always storing at closest node causes hotspot … (log n) β reqs / min

Coral lacks… Central management A priori knowledge of network topology Anybody can join system Any special tools (e.g., BGP feeds) Coral has… Large # of vantage points to probe topology Distributed index in which to store network hints Each Coral node maps nearby networks to self Challenges for DNS Redirection

Coral DNS server probes resolver Once local, stay local When serving requests from nearby DNS resolver Respond with nearby Coral proxies Respond with nearby Coral DNS servers → Ensures future requests remain local Else, help resolver find local Coral DNS server Coral’s DNS Redirection

Return servers within appropriate cluster e.g., for resolver RTT = 19 ms, return from cluster < 20 ms Use network hints to find nearby servers i.e., client and server on same subnet Otherwise, take random walk within cluster DNS measurement mechanism Resolver Browser Coral httpprx dnssrv Server probes client (2 RTTs) Coral httpprx dnssrv

Experimental results Consider requests to Australian web site: Does Coral absorb flash crowds? Does clustering help latency? Does Coral form sensible clusters? Does Coral prevent hotspots? Experimental setup 166 PlanetLab hosts; Coral node and client on each Twelve 41-KB files on 384 Kb/sec (DSL) web server (0.6 reqs / sec) / client → 32,800 Kb/sec aggregate

Solves flash-crowd problem Local caches begin to handle most requests Coral hits in 20 ms cluster Hits to origin web server

Benefits end-to-end client latency

Finds natural clusters Nodes share letter→in same < 60 ms cluster Size of letter →number of collocated nodes in same cluster

Prevents put hotspots Nodes aggregate put/get rate: ~12 million/ min Rate-limit per node (β): 12/ min RPCs at closest leaked through 7 others: 83/ min 494 nodes 3 β 2 β 1 β

Conclusions Coral indexing infrastructure Provides non-standard P2P storage abstraction Stores network hints and forms clusters Exposes hierarchy and hints to applications Prevents hotspots Use Coral to build fully decentralized CDN Solves Slashdot effect Popular data → widely replicated → highly available Democratizes content publication

For more information…