Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.

Slides:



Advertisements
Similar presentations
Squirrel: A peer-to- peer web cache Sitaram Iyer (Rice University) Joint work with Ant Rowstron (MSR Cambridge) Peter Druschel (Rice University) PODC 2002.
Advertisements

Squirrel: A peer-to-peer web cache Sitaram Iyer Joint work with Ant Rowstron (MSRC) and Peter Druschel.
Evaluating scalability Peer-to-Peer File Sharing Networks of Sayantan Mitra Vibhor Goyal.
Scalable Content-Addressable Network Lintao Liu
Peer-to-Peer (P2P) Distributed Storage 1Dennis Kafura – CS5204 – Operating Systems.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Information-Centric Networks03c-1 Week 3 / Paper 3 The design and implementation of a next generation name service for the Internet –Venugopalan Ramasubramanian.
Peer-to-Peer Networks as a Distribution and Publishing Model Jorn De Boever (june 14, 2007)
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Name Services Jessie Crane CPSC 550. History ARPAnet – experimental computer network (late 1960s) hosts.txt – a file that contained all the information.
Storage Management and Caching in PAST, a large-scale, persistent peer- to-peer storage utility Authors: Antony Rowstorn (Microsoft Research) Peter Druschel.
Cis e-commerce -- lecture #6: Content Distribution Networks and P2P (based on notes from Dr Peter McBurney © )
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
The Google File System. Why? Google has lots of data –Cannot fit in traditional file system –Spans hundreds (thousands) of servers connected to (tens.
Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
presented by Hasan SÖZER1 Scalable P2P Search Daniel A. Menascé George Mason University.
Pastry And Squirrel Presented by Eirik T. Laberg Håvard Semundseth Orri G. Pálsson.
Caching And Prefetching For Web Content Distribution Presented By:- Harpreet Singh Sidong Zeng ECE Fall 2007.
Wide-area cooperative storage with CFS
Web Caching Schemes For The Internet – cont. By Jia Wang.
Web Caching and CDNs March 3, Content Distribution Motivation –Network path from server to client is slow/congested –Web server is overloaded Web.
1-1 Incentive Mechanisms for Large Collaborative Resource Sharing Objectives:  Why Resource harnessing  Resource sharing  Assumptions  Considerations.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Distributed Data Stores – Facebook Presented by Ben Gooding University of Arkansas – April 21, 2015.
Distributing Content Simplifies ISP Traffic Engineering Abhigyan Sharma* Arun Venkataramani* Ramesh Sitaraman*~ *University of Massachusetts Amherst ~Akamai.
1 The Google File System Reporter: You-Wei Zhang.
PIC: Practical Internet Coordinates for Distance Estimation Manuel Costa joint work with Miguel Castro, Ant Rowstron, Peter Key Microsoft Research Cambridge.
Distributed Systems Concepts and Design Chapter 10: Peer-to-Peer Systems Bruce Hammer, Steve Wallis, Raymond Ho.
Thesis Proposal Data Consistency in DHTs. Background Peer-to-peer systems have become increasingly popular Lots of P2P applications around us –File sharing,
Overcast: Reliable Multicasting with an Overlay Network CS294 Paul Burstein 9/15/2003.
 A P2P IRC Network Built on Top of the Kademlia Distributed Hash Table.
2: Application Layer1 Chapter 2 outline r 2.1 Principles of app layer protocols r 2.2 Web and HTTP r 2.3 FTP r 2.4 Electronic Mail r 2.5 DNS r 2.6 Socket.
Caching and Data Consistency in P2P Dai Bing Tian Zeng Yiming.
Efficient Peer to Peer Keyword Searching Nathan Gray.
Module 11: Implementing ISA Server 2004 Enterprise Edition.
Storage Management and Caching in PAST A Large-scale persistent peer-to-peer storage utility Presented by Albert Tannous CSE 598D: Storage Systems – Dr.
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
DISTRIBUTED COMPUTING Introduction Dr. Yingwu Zhu.
Peer-to-Peer Supported Cache System for File Transfer Joonbok Lee
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park
PROP: A Scalable and Reliable P2P Assisted Proxy Streaming System Computer Science Department College of William and Mary Lei Guo, Songqing Chen, and Xiaodong.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
ITGS Network Architecture. ITGS Network architecture –The way computers are logically organized on a network, and the role each takes. Client/server network.
Squirrel: A decentralized peer-to- peer web cache Paper by Sitaram Iyer, Antony Rowstron and Peter Druschel (© 2002) Presentation* by Alexander Prohaska.
CS 6401 Overlay Networks Outline Overlay networks overview Routing overlays Resilient Overlay Networks Content Distribution Networks.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Algorithms and Techniques in Structured Scalable Peer-to-Peer Networks
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
09/13/04 CDA 6506 Network Architecture and Client/Server Computing Peer-to-Peer Computing and Content Distribution Networks by Zornitza Genova Prodanoff.
Large Scale Sharing Marco F. Duarte COMP 520: Distributed Systems September 19, 2004.
The Design and Implementation of a Next Generation Name Service for the Internet V. Ramasubramanian, E. Gun Sirer Cornell Univ. SIGCOMM 2004 Ciprian Tutu.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
Performance Evaluation of Redirection Schemes in Content Distribution Networks Jussi Kangasharju, Keith W. Ross Institut Eurecom Jim W. Roberts France.
Clustered Web Server Model
Cluster-Based Scalable
Coral: A Peer-to-peer Content Distribution Network
Energy Conservation in Content Distribution Networks
Plethora: Infrastructure and System Design
Early Measurements of a Cluster-based Architecture for P2P Systems
COS 518: Advanced Computer Systems Lecture 9 Michael Freedman
Be Fast, Cheap and in Control
by Mikael Bjerga & Arne Lange
Lecture 1: Bloom Filters
Caching 50.5* + Apache Kafka
Presentation transcript:

Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003

Outline Overview Design Evaluation Discussion

Traditional Web Caching Goals Reduce browser latency Reduce aggregate bandwidth Reduce load on web servers Deployment Dedicated centralized machines Placed at local network boundaries

Squirrel Web Caching Decentralized caching Desktops cooperate in a peer-to-peer fashion Mutual sharing between hosts Hosts browse and cache

Pros Centralized Dedicated Hardware Cost Administration Handling load bursts Single point of failure Decentralized No additional hardware More users  more resources Automatic scaling Self organizing Easy deployment

Assumptions Cooperative hosts No security issues Link and node failures Nodes are in single geographic location Low internal network latencies

Outline Overview Design Evaluation Discussion

Design Goals Target environment: ,000 machines Goal: Achieve performance comparable to centralized cache

Design Overview Built on top of Pastry Objects have 128-bit objectIds SHA-1 hash of URL Mapped to home node with closest nodeId Requests: GET – new request cGET – conditional Two schemes Home-store Directory

Home-store Objects stored at client cache and home node External requests come through home node Cache replacement All objects are considered a. home node fresh b. home node stale

Directory Home node keeps a directory of pointers Randomly redirect to delegates a. no directory, add new delegate b. cGET not modified c. delegate fresh, get from delegate d. cGET and stale, update e. GET and stale, update

Outline Overview Design Evaluation Discussion

Evaluation Characteristics Compare two schemes and dedicated cache Performance Latency External bandwidth Hit ratio Overhead Load Storage Fault Tolerance

Trace Characteristics

Bandwidth and Hit ratio Bytes transferred to origin servers and back correlated with hit rate Centralized cache with infinite storage 100MB cache per node achieves optimal rates 10MB in-memory cache is reasonable Directory scheme Active nodes suffer from eviction Distributed LRU is worse than centralized Home-store More total storage required

Latency User-perceived time for a response With comparable hit ratios, only consider internal hops Many requests can be satisfied locally, with 0 hops Directory scheme latency is up to one hop greater Some requests can be satisfied by home node Squirrel Latency Based on Pastry hops on cache hit Overshadowed on cache miss

Load on Nodes(1/2) Bursty behavior observations Max objects served per second Up to 48 and 55 objects per second served for the two traces Directory scheme One delegate can get bombarded with requests from many home nodes Home-store scheme Replicate objects at request threshold

Load on Nodes(2/2) Sustained load measurements Max objects/minute Average load in any second or minute: 0.31 objects/minute Redmond trace, both models

Fault Tolerance Internet connection loss Internal partitioning Individual failure Desktop shutdown or reboot Graceful shutdown Pastry aided content transfer Directory scheme More vulnerable to failures

Results The home-store models seems to outperform the directory model Hit ratio Load balancing Internal network latency Compared to centralized cache?

Outline Overview Design Evaluation Discussion

Would this be deployed in a corporate network?