Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol Li Fan, Pei Cao and Jussara Almeida University of Wisconsin-Madison Andrei Broder Compaq/DEC.

Slides:



Advertisements
Similar presentations
Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Advertisements

L.N. Bhuyan Adapted from Patterson’s slides
Virtual Hierarchies to Support Server Consolidation Michael Marty and Mark Hill University of Wisconsin - Madison.
Taming User-Generated Content in Mobile Networks via Drop Zones Ionut Trestian Supranamaya Ranjan Aleksandar Kuzmanovic Antonio Nucci Northwestern University.
1 The ns-2 Network Simulator H Plan: –Discuss discrete-event network simulation –Discuss ns-2 simulator in particular –Demonstration and examples: u Download,
Services Course Windows Live SkyDrive Participant Guide.
The University of Adelaide, School of Computer Science
A KTEC Center of Excellence 1 Cooperative Caching for Chip Multiprocessors Jichuan Chang and Gurindar S. Sohi University of Wisconsin-Madison.
Web Cache Characterizing Roles of Front-end Servers in End-to-End Performance of Dynamic Content Distribution  Li ZHANG  Dakuo WANG.
Modelling and Analysing of Security Protocol: Lecture 10 Anonymity: Systems.
Pervasive Web Content Delivery with Efficient Data Reuse Chi-Hung Chi and Cao Yang School of Computing National University of Singapore
Cooperative Caching of Dynamic Content on a Distributed Web Server Vegard Holmedahl, Ben Smith, Tao Yang Speaker: SeungLak Choi, DB Lab., CS Dept.
SIGMOD 2006University of Alberta1 Approximately Detecting Duplicates for Streaming Data using Stable Bloom Filters Presented by Fan Deng Joint work with.
Hit or Miss ? !!!.  Cache RAM is high-speed memory (usually SRAM).  The Cache stores frequently requested data.  If the CPU needs data, it will check.
Bloom Filters Kira Radinsky Slides based on material from:
EEC-484/584 Computer Networks Lecture 6 Wenbing Zhao
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Disconnected Operation in the Coda File System James J. Kistler and M. Satyanarayanan Carnegie Mellon University Presented by Deepak Mehtani.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol By Abuzafor Rasal and Vinoth Rayappan.
EEC-484/584 Computer Networks Discussion Session for HTTP and DNS Wenbing Zhao
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
OCT1 Principles From Chapter One of “Distributed Systems Concepts and Design”
Collaborative Web Caching Based on Proxy Affinities Jiong Yang, Wei Wang in T. J.Watson Research Center Richard Muntz in Computer Science Department of.
Internet Networking Spring 2002 Tutorial 13 Web Caching Protocols ICP, CARP.
1 Lecture 5: Directory Protocols Topics: directory-based cache coherence implementations.
1Bloom Filters Lookup questions: Does item “ x ” exist in a set or multiset? Data set may be very big or expensive to access. Filter lookup questions with.
Web Caching Schemes For The Internet – cont. By Jia Wang.
Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
CS252/Patterson Lec /28/01 CS 213 Lecture 10: Multiprocessor 3: Directory Organization.
Application Layer  We will learn about protocols by examining popular application-level protocols  HTTP  FTP  SMTP / POP3 / IMAP  Focus on client-server.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
 Proxy Servers are software that act as intermediaries between client and servers on the Internet.  They help users on private networks get information.
1 Shared-memory Architectures Adapted from a lecture by Ian Watson, University of Machester.
Multiprocessor Cache Coherency
VPresent Collaborative Presentation System on Mobile Devices.
Supporting Strong Cache Coherency for Active Caches in Multi-Tier Data-Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan, S. Krishnamoorthy,
New Protocols for Remote File Synchronization Based on Erasure Codes Utku Irmak Svilen Mihaylov Torsten Suel Polytechnic University.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
Web Prefetching Between Low-Bandwidth Clients and Proxies : Potential and Performance Li Fan, Pei Cao and Wei Lin Quinn Jacobson (University of Wisconsin-Madsion)
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Author : Ozgun Erdogan and Pei Cao Publisher : IEEE Globecom 2005 (IJSN 2007) Presenter : Zong-Lin Sie Date : 2010/12/08 1.
1 Lecture 11: Bloom Filters, Final Review December 7, 2011 Dan Suciu -- CSEP544 Fall 2011.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
CH1. Hardware: CPU: Ex: compute server (executes processor-intensive applications for clients), Other servers, such as file servers, do some computation.
Web Performance 성민영 SNU Computer Systems lab.. 2 차례 4 Modeling the Performance of HTTP Over Several Transport Protocols. 4 Summary Cache : A Scaleable.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
Increasing Web Server Throughput with Network Interface Data Caching October 9, 2002 Hyong-youb Kim, Vijay S. Pai, and Scott Rixner Rice Computer Architecture.
Data Communications and Computer Networks Chapter 2 CS 3830 Lecture 8 Omar Meqdadi Department of Computer Science and Software Engineering University of.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Authors: Haowei Yuan and Patrick Crowley Publisher: 2013 Proceedings IEEE INFOCOM Presenter: Chia-Yi Chu Date: 2013/08/14 1.
ICP and the Squid Web Cache Duanc Wessels k Claffy August 13, 1997 元智大學系統實驗室 宮春富 2000/01/26.
ICP and the Squid Web Cache Duane Wessels and K. Claffy 산업공학과 조희권.
Hint-based Acceleration of Web Proxy Cache Daniela Rosu Arun Iyengar Daniel Dias IBM T.J.Watson Research Center Unversity of Yuan Ze,Syslab Mike Tien
1 Efficient System-on-Chip Energy Management with a Segmented Counting Bloom Filter Mrinmoy Ghosh- Georgia Tech Emre Özer- ARM Ltd Stuart Biles- ARM Ltd.
Bloom Filters. Lecture on Bloom Filters Not described in the textbook ! Lecture based in part on: Broder, Andrei; Mitzenmacher, Michael (2005), "Network.
Cache Digest Alex Rousskov Duane Wessels National Laboratory for Applied Network Research April 17, 1998 元智大學 資訊工程研究所 系統實驗室 陳桂慧 February 9, 1999.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
This document is provided for informational purposes only and Microsoft makes no warranties, either express or implied, in this document. Information.
Ahoy: A Proximity-Based Discovery Protocol Robbert Haarman.
Presented by Tashana Landray
Internet Networking recitation #12
COS 518: Advanced Computer Systems Lecture 9 Michael Freedman
CS510 - Portland State University
Hash Functions for Network Applications (II)
Lecture 1: Bloom Filters
Presentation transcript:

Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol Li Fan, Pei Cao and Jussara Almeida University of Wisconsin-Madison Andrei Broder Compaq/DEC System Research Center

Why Web Caching One of the most important techniques to improve scalability of the Web Proxy caches are particularly effective

Why Cache Sharing? Proxy Caches Users Regional Network Rest of Internet Bottleneck...

Cache Sharing via ICP –When one proxy has a cache miss, send queries to all siblings (and parents): do you have the URL? –Whoever responds first with Yes, send a request to fetch the file –If no Yes response within certain time limit, send request to Web server Parent Cache (optional)

Overhead of ICP #_of_queries = (#_of_proxies * average_miss_ratio) * #_of_proxies Experiments –4 Squid proxies running on dual-processor 64MB SPARC20s linked with 100BaseT Ethernet links –Workloads: traces and synthetic benchmarks Compared with no cache sharing, ICP: –increases total network packets to each proxy by 8-29% –increases CPU overhead by 13-32% –increases user latency by 2-12%

Alternatives to ICP Force all users to go through the same cache or the same array of caches –Difficult in a wide-area environment Central directory server –Directory server can be a bottleneck Ideally, one wants a protocol: keeps the total cache hit ratio high minimizes inter-proxy traffic scales to a large number of proxies

Summary Cache Basic idea: –Let each proxy keep a directory of what URLs are cached in every other proxy, and use the directory as a filter to reduce number of queries Problem 1: keeping the directory up to date –Solution: delay and batch the updates => directory can be slightly out of date Problem 2: DRAM requirement –Solution: compress the directory => imprecise, but inclusive directory

Errors Tolerated Suppose A and B share caches, A has a request for URL r that misses in A, –false misses: r is cached at B, but A didnt know Effect: lower total cache hit ratio –false hits: r is not cached at B, but A thought it is Effect: wasted query messages –stale hits: r is cached at B, but Bs copy is stale Effect: wasted query messages

Effect of Delay in Directory Updates Method: delay the updates until a certain percentage of the cached documents are new

Compressing the Directories Requirements: –Inclusive –Low false positives –Concise we call the compressed directories summaries First try: use server URLs only –Problem: too many false hits, leading to too many messages between proxies

The Problem abc.com/index.html xyz.edu/ Place A Place B Compact Representation arbitrary URI ?

Bloom Filters Support membership test for a set of keys Key a Bit Vector v k hash functions H (a) = P m bits

Bloom Filters: the Math Given n keys, how to choose m and k? Bit Vector v m bits Suppose m is fixed (>2n), choose k: k is optimal when exactly half of the bits are 0 => optimal k = ln(2) * m/n False positive ratio under optimal k is (1/2) k => false positive ratio = (1/2) ln2 * m/n = (0.62) m/n

Bloom Filters: the Practice Choosing hash functions –bits from MD5 signatures of URLs Maintaining the summary –the proxy maintains an array of counters –for each bit, the counter records how many times the bit is set to 1 Updating the summary –either the whole bit array or the positions of changed bits (delta encoding)

Result: Inter-Proxy Traffic

Result: Total Hit Ratio

Enhancing ICP with Summary Cache Prototype implemented in Squid Repeating the 4-proxy experiments, the new ICP: –Reduces UDP messages by a factor of 12 to 50 compared with the old ICP –Little increase in network packets over no cache sharing –increase CPU time by 2 - 7% –reduce user latency up to 4% with remote cache hits

Conclusions Summary cache enables caches to share contents with low overheads over wide area An alternative implementation called Cache Digest is in Squid Many other applications of bloom filters Technical report version available at: