Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.

Slides:



Advertisements
Similar presentations
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Advertisements

Clayton Sullivan PEER-TO-PEER NETWORKS. INTRODUCTION What is a Peer-To-Peer Network A Peer Application Overlay Network Network Architecture and System.
Serverless Network File Systems. Network File Systems Allow sharing among independent file systems in a transparent manner Mounting a remote directory.
Cache Coherent Distributed Shared Memory. Motivations Small processor count –SMP machines –Single shared memory with multiple processors interconnected.
Adding scalability to legacy PHP web applications Overview Mario A. Valdez-Ramirez.
Latency-sensitive hashing for collaborative Web caching Presented by: Xin Qi Yong Yang 09/04/2002.
NETWORK LOAD BALANCING NLB.  Network Load Balancing (NLB) is a Clustering Technology.  Windows Based. (windows server).  To scale performance, Network.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Internet Networking Spring 2006 Tutorial 12 Web Caching Protocols ICP, CARP.
CSE 190: Internet E-Commerce Lecture 16: Performance.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Adaptive Web Caching: Towards a New Caching Architecture Authors and Institutions: Scott Michel, Khoi Nguyen, Adam Rosenstein and Lixia Zhang UCLA Computer.
Locality-Aware Request Distribution in Cluster-based Network Servers 1. Introduction and Motivation --- Why have this idea? 2. Strategies --- How to implement?
Analysis of Web Caching Architectures: Hierarchical and Distributed Caching Pablo Rodriguez, Christian Spanner, and Ernst W. Biersack IEEE/ACM TRANSACTIONS.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
What’s a Web Cache? Why do people use them? Web cache location Web cache purpose There are two main reasons that Web cache are used:  to reduce latency.
1 Spring Semester 2007, Dept. of Computer Science, Technion Internet Networking recitation #13 Web Caching Protocols ICP, CARP.
Submitting: Barak Pinhas Gil Fiss Laurent Levy
Internet Networking Spring 2002 Tutorial 13 Web Caching Protocols ICP, CARP.
Object Naming & Content based Object Search 2/3/2003.
Web Caching1 By Amisha Thakkar Alpa Shah. Web Caching2 Overview What is a Web Cache ? Caching Terminology Why use a cache? Disadvantages of Web Cache.
Web Caching1 By Amisha Thakkar. Web Caching2 Overview What is a Web Cache ? Caching Terminology Why use a cache? Disadvantages of Web Cache Other Features.
Adaptive Web Caching Lixia Zhang, Sally Floyd, and Van Jacob-son. In the 2nd Web Caching Workshop, Boulder, Colorado, April 25, System Laboratory,
Implementing ISA Server Caching. Caching Overview ISA Server supports caching as a way to improve the speed of retrieving information from the Internet.
Caching And Prefetching For Web Content Distribution Presented By:- Harpreet Singh Sidong Zeng ECE Fall 2007.
Wide-area cooperative storage with CFS
1 Web Content Delivery Reading: Section and COS 461: Computer Networks Spring 2007 (MW 1:30-2:50 in Friend 004) Ioannis Avramopoulos Instructor:
Web Caching Schemes For The Internet – cont. By Jia Wang.
Capacity planning for web sites. Promoting a web site Thoughts on increasing web site traffic but… Two possible scenarios…
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
.NET Mobile Application Development Introduction to Mobile and Distributed Applications.
1 ENHANCHING THE WEB’S INFRASTUCTURE: FROM CACHING TO REPLICATION ECE 7995 Presented By: Pooja Swami and Usha Parashetti.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
 Proxy Servers are software that act as intermediaries between client and servers on the Internet.  They help users on private networks get information.
Web Cache. Introduction what is web cache?  Introducing proxy servers at certain points in the network that serve in caching Web documents for faster.
FIREWALL TECHNOLOGIES Tahani al jehani. Firewall benefits  A firewall functions as a choke point – all traffic in and out must pass through this single.
1 Content Distribution Networks. 2 Replication Issues Request distribution: how to transparently distribute requests for content among replication servers.
Department of Computer Science Southern Illinois University Edwardsville Dr. Hiroshi Fujinoki and Kiran Gollamudi {hfujino,
Server Load Balancing. Introduction Why is load balancing of servers needed? If there is only one web server responding to all the incoming HTTP requests.
Storage Allocation in Prefetching Techniques of Web Caches D. Zeng, F. Wang, S. Ram Appeared in proceedings of ACM conference in Electronic commerce (EC’03)
Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.
Local Area Networks (LAN) are small networks, with a short distance for the cables to run, typically a room, a floor, or a building. - LANs are limited.
Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.
CH2 System models.
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
Unit – I CLIENT / SERVER ARCHITECTURE. Unit Structure  Evolution of Client/Server Architecture  Client/Server Model  Characteristics of Client/Server.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
NetCache Architecture and Deployment Peter Danzig Network Appliance, Santa Clara, CA 元智大學 系統實驗室 陳桂慧
Understanding the Performance of Web Caching System with an Analysis Model and Simulation Xiaosong Hu Nur Zincir-Heywood Sep
Web Cache Redirection using a Layer-4 switch: Architecture, issues, tradeoffs, and trends Shirish Sathaye Vice-President of Engineering.
Kiew-Hong Chua a.k.a Francis Computer Network Presentation 12/5/00.
Adaptive Web Caching CS411 Dynamic Web-Based Systems Flying Pig Fei Teng/Long Zhao/Pallavi Shinde Computer Science Department.
World Wide Web Caching CS457 Seminar Yutao Zhong 11/13/2001.
Computer Science Lecture 14, page 1 CS677: Distributed OS Last Class: Concurrency Control Concurrency control –Two phase locks –Time stamps Intro to Replication.
1 Secure Peer-to-Peer File Sharing Frans Kaashoek, David Karger, Robert Morris, Ion Stoica, Hari Balakrishnan MIT Laboratory.
Setup and Management for the CacheRaQ. Confidential, Page 2 Cache Installation Outline – Setup & Wizard – Cache Configurations –ICP.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
Content Delivery Networks: Status and Trends Speaker: Shao-Fen Chou Advisor: Dr. Ho-Ting Wu 5/8/
1 COMP 431 Internet Services & Protocols HTTP Persistence & Web Caching Jasleen Kaur February 11, 2016.
Overview on Web Caching COSC 513 Class Presentation Instructor: Prof. M. Anvari Student name: Wei Wei ID:
/ Fast Web Content Delivery An Introduction to Related Techniques by Paper Survey B Li, Chien-chang R Sung, Chih-kuei.
1 Traffic Engineering By Kavitha Ganapa. 2 Introduction Traffic engineering is concerned with the issue of performance evaluation and optimization of.
Improving the WWW: Caching or Multicast? Pablo RodriguezErnst W. BiersackKeith W. Ross Institut EURECOM 2229, route des Cretes. BP , Sophia Antipolis.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
Web Caching? Web Caching:.
Internet Networking recitation #12
An Introduction to Computer Networking
EE 122: Lecture 22 (Overlay Networks)
Database System Architectures
Presentation transcript:

Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang

Web Caching Schemes2 Agenda The World Wide Web Problem and solution (caching) Proxy servers Advantages of web caching Disadvantages of web caching Elements of A WWW caching system Desirable properties of WWW caching system Problems in designing caching systems for the WWW Caching architecture

Web Caching Schemes3 The World Wide Web The WWW can be considered as a large distributed information system. Exponential growth in size. On may 1999 included 600 millions of static web pages. Increases 15% per month. Very popular.

Web Caching Schemes4 SIZE OF DISTINCT STATIC WEB PAGES

Web Caching Schemes5 The World Wide Web Usage is relatively inexpensive Accessing information is very fast Documents appeal to a wide range of interests But …..

Web Caching Schemes6 The World Wide Web Network congestion Server overloading

Web Caching Schemes7 Problem Internet backbone capacity increases 60% per year. Bandwidth is not growing fast enough. Without solution WWW will become too congested and its entire appeal would be lost.

Web Caching Schemes8 Solution Caching: Placing popular objects at locations close to the clients.

Web Caching Schemes9 proxy servers HTTP servers handled by companies for security reasons. The bottleneck of the connection between the client and the internet. Shared by all clients inside the firewall.

Web Caching Schemes10

Web Caching Schemes11 proxy servers Belonging to same organization, clients share common interests. They probably access the same set of documents.

Web Caching Schemes12 thus On the proxy server, a previously requested and cached documents would likely result in future hits.

Web Caching Schemes13 proxy severs Caching most popular web pages on the proxy server can: Save network bandwidth Lower access latency for the client

Web Caching Schemes14 Advantages of web caching Reduces bandwidth consumption Decreases network traffic Lessens network congestion Access latency: frequently used docs are cached nearby less traffic  shorter delay for docs not cached

Web Caching Schemes15 Advantages of web caching (cont.) Reduces workload of remote server Data can be accessed when remote server is down (enhanced robustness). Allows analysis of organization usage patterns  cooperation between caches increases efficiency.

Web Caching Schemes16 Disadvantages of web caching Data not updated automatically Cache miss can cause increase in latency (extra proxy processing). Bottleneck effect – limit # of clients per proxy. A single proxy is a single point of failure Information providers can not monitor # of visits per site.

Web Caching Schemes17 Elements of A WWW caching system Documents can be cached at the clients, the proxies and the servers.

Web Caching Schemes18 Elements of a WWW caching system

Web Caching Schemes19 Desirable properties of WWW caching system fast access robustness transparency scalability efficiency adaptivity stability load balance ability to deal with heterogeneity simplicity

Web Caching Schemes20 Fast access Reduce web access latency to a minimum. Especially comparing to other servers not using caching techniques.

Web Caching Schemes21 Robustness Robustness = Availability to user eliminate single point failure in case of failure – fall down gracefully easy to recover from failure

Web Caching Schemes22 Transparency Transparent to the user The user should only notice: Faster response Higher availability

Web Caching Schemes23 Scalability Scale well along the increasing size and density of the network. All protocols should be as lightweight as possible.

Web Caching Schemes24 Efficiency impose minimal additional burden on the network (in control & data packets) do not adopt any scheme which leads to under-utilization of the network

Web Caching Schemes25 Adaptivity adapt to dynamic changing in the user demand and network environment achieve optimal performance

Web Caching Schemes26 Stability Do not introduce instabilities into the network

Web Caching Schemes27 Load balancing distribute load evenly through the entire network no bottlenecks / hot-spots

Web Caching Schemes28 Ability to deal with heterogeneity Adapt to a range of network architecture (hardware & software)

Web Caching Schemes29 Simplicity Mechanism simple to deploy simpler schemes are easier to implement and likely to be accepted as international standards

Web Caching Schemes30 What Problems do we face in designing caching systems for the WWW ???

Web Caching Schemes31 Problems in designing caching systems for the WWW Caching system architecture how cache proxies are organized – hierarchically, distributed or hybrid.

Web Caching Schemes32 Problems in designing caching systems for the WWW Proxy placement were to place a cache proxy in order to optimize performance

Web Caching Schemes33 Problems in designing caching systems for the WWW Caching contents What can be cached in the caching system

Web Caching Schemes34 Problems in designing caching systems for the WWW Proxy cooperation How do proxies cooperate with each other

Web Caching Schemes35 Problems in designing caching systems for the WWW Data sharing what kind of data/information can be shared among among cooperative proxies

Web Caching Schemes36 Problems in designing caching systems for the WWW Cache resolution/routing how does a proxy decide where to fetch a page requested by a client.

Web Caching Schemes37 Problems in designing caching systems for the WWW Prefetching How does a proxy decide what and when to prefetch from webservers or other proxies to reduce access latency.

Web Caching Schemes38 Problems in designing caching systems for the WWW Cache placement/ replacement how the proxy decides which page to be stored in its cache and which page to be removed from it.

Web Caching Schemes39 Problems in designing caching systems for the WWW Cache coherency how does a proxy maintain data consistency

Web Caching Schemes40 Problems in designing caching systems for the WWW Control information distribution how is the control information (e.g URL) distributed among proxies.

Web Caching Schemes41 Problems in designing caching systems for the WWW Dynamic data caching how to deal with data that is not cachable

Web Caching Schemes42 Caching architecture Hierarchical Caches are placed at multiple levels of the network. national regional institutional bottom

Web Caching Schemes43 Hierarchical architecture Bottom – clients/browsers caches. national regional institutional bottom web page not found

Web Caching Schemes44 Hierarchical architecture after web page is found national regional institutional bottom forward page, leave copy

Web Caching Schemes45 Hierarchical architecture Advantages: Bandwidth efficient – especially when cache servers are slow. Allows to efficiently diffuse popular web pages towards the demand.

Web Caching Schemes46 Hierarchical architecture Disadvantages Cache server needs to be placed at key access points of the network  requires coordination among caches. Each level adds a delay. High levels are bottlenecks. multiple copies at different cache levels.

Web Caching Schemes47 Distributed architecture Caches at the bottom level only. No other intermediate caching levels. Each cache server contains meta-data on the data stored on other servers. Hierarchy used only for distributing information about location of the copy. No copying of actual documents.

Web Caching Schemes48 Advantages: Traffic flows through low network levels which are less congested. No additional disk space required for intermediate network levels. Better load sharing. More fault tolerant. Distributed architecture

Web Caching Schemes49 Disadvantages: High connection times Higher bandwidth usage Administrative issues. Distributed architecture

Web Caching Schemes50 Examples ICP – Internet Cache Protocol (Harvest group) Retrieve data from neighboring caches + parent caches CARP – Cache Array Routing Protocol URL space divided to an array of caches. Each cache stores only documents whose URL are hashed to it. Distributed architecture

Web Caching Schemes51 Hybrid architecture Caches may cooperate with other caches at the same level or at a higher level using distributed caching. ICP is an example: the document is fetched from a parent/neighbor cache that has the lowest RTT.

Web Caching Schemes52 Performance of architectures Hierarchical caching has shorter connection times than distributed caching. Additional copies at intermediate level reduces retrieval latency for small documents. Distributed caching has shorter transmission times & higher bandwidth usage. “ Well configured ” hybrid scheme can reduce both connection time and transmission time.