Cooperative Caching of Dynamic Content on a Distributed Web Server Vegard Holmedahl, Ben Smith, Tao Yang Speaker: SeungLak Choi, DB Lab., CS Dept.

Slides:



Advertisements
Similar presentations
Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol Li Fan, Pei Cao and Jussara Almeida University of Wisconsin-Madison Andrei Broder Compaq/DEC.
Advertisements

Split Databases. What is a split database? Two databases Back-end database –Contains tables (data) only –Resides on server Front-end database –Contains.
1 Scoped and Approximate Queries in a Relational Grid Information Service Dong Lu, Peter A. Dinda, Jason A. Skicewicz Prescience Lab, Dept. of Computer.
Indications in green = Live content Indications in white = Edit in master Indications in blue = Locked elements Indications in black = Optional elements.
LYU0101 Wireless Digital Library on PDA Lam Yee Gordon Yeung Kam Wah Supervisor Prof. Michael Lyu First semester FYP Presentation 2001~2002.
Scalable Content-aware Request Distribution in Cluster-based Network Servers Jianbin Wei 10/4/2001.
1 Content Delivery Networks iBAND2 May 24, 1999 Dave Farber CTO Sandpiper Networks, Inc.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Summary Cache: A Scalable Wide-Area Web Cache Sharing Protocol By Abuzafor Rasal and Vinoth Rayappan.
Chapter 2: Application Layer
Other File Systems: LFS and NFS. 2 Log-Structured File Systems The trend: CPUs are faster, RAM & caches are bigger –So, a lot of reads do not require.
1 CAPS: A Peer Data Sharing System for Load Mitigation in Cellular Data Networks Young-Bae Ko, Kang-Won Lee, Thyaga Nandagopal Presentation by Tony Sung,
A Distributed Proxy Server for Wireless Mobile Web Service Kisup Kim, Hyukjoon Lee, and Kwangsue Chung Information Network 2001, 15 th Conference.
Data Sharing in OSD Environment Dingshan He September 30, 2002.
Distributed Resource Management: Distributed Shared Memory
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
1 Awareness Services for Digital Libraries Arturo Crespo Hector Garcia-Molina Stanford University.
1 Seminar: Information Management in the Web Gnutella, Freenet and more: an overview of file sharing architectures Thomas Zahn.
What Can Databases Do for Peer-to-Peer Steven Gribble, Alon Halevy, Zachary Ives, Maya Rodrig, Dan Suciu Presented by: Ryan Huebsch CS294-4 P2P Systems.
1 The Mystery of Cooperative Web Caching 2 b b Web caching : is a process implemented by a caching proxy to improve the efficiency of the web. It reduces.
CS252/Patterson Lec /28/01 CS 213 Lecture 10: Multiprocessor 3: Directory Organization.
World Wide Web Caching: Trends and Technology Greg Barish and Katia Obraczka USC Information Science Institute IEEE Communications Magazine, May 2000 Presented.
RAID-x: A New Distributed Disk Array for I/O-Centric Cluster Computing Kai Hwang, Hai Jin, and Roy Ho.
A Scalable Content Distribution Service for Dynamic Web Content Seejo Sebastine Department of Computer Science University of Virginia.
File Systems and N/W attached storage (NAS) | VTU NOTES | QUESTION PAPERS | NEWS | VTU RESULTS | FORUM | BOOKSPAR ANDROID APP.
Kanika Chawla Parth Shah Sowmith Boyanpalli MEMORY MANAGEMENT IN MOBILE ENVIRONMENT.
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
1 Design and Performance of a Web Server Accelerator Eric Levy-Abegnoli, Arun Iyengar, Junehwa Song, and Daniel Dias INFOCOM ‘99.
5 Chapter Five Web Servers. 5 Chapter Objectives Learn about the Microsoft Personal Web Server Software Learn how to improve Web site performance Learn.
Consistency And Replication
Module 10: Monitoring ISA Server Overview Monitoring Overview Configuring Alerts Configuring Session Monitoring Configuring Logging Configuring.
World Wide Web Caching: Trends and Technologys Gerg Barish & Katia Obraczka USC Information Sciences Institute, USA,2000.
Web Caching By Neeraj Agrawal. Caching Caching is widely used for improving performance in many context( e.g processor caches in hardware, buffer pool.
Scalable Web Server on Heterogeneous Cluster CHEN Ge.
Scaling Dynamic Content Applications through Data Replication - Opportunities for Compiler Optimizations Cristiana Amza UofT.
CS 5204 (FALL 2005)1 Leases: An Efficient Fault Tolerant Mechanism for Distributed File Cache Consistency Gray and Cheriton By Farid Merchant Date: 9/21/05.
NetCache Architecture and Deployment Peter Danzig Network Appliance, Santa Clara, CA 元智大學 系統實驗室 陳桂慧
A Measurement Based Memory Performance Evaluation of High Throughput Servers Garba Isa Yau Department of Computer Engineering King Fahd University of Petroleum.
Web Performance 성민영 SNU Computer Systems lab.. 2 차례 4 Modeling the Performance of HTTP Over Several Transport Protocols. 4 Summary Cache : A Scaleable.
Architecture for Caching Responses with Multiple Dynamic Dependencies in Multi-Tier Data- Centers over InfiniBand S. Narravula, P. Balaji, K. Vaidyanathan,
Dr. Yingwu Zhu Summary Cache : A Scalable Wide- Area Web Cache Sharing Protocol.
Introduction to DFS. Distributed File Systems A file system whose clients, servers and storage devices are dispersed among the machines of a distributed.
Module 9: Implementing Caching. Overview Caching Overview Configuring General Cache Properties Configuring Cache Rules Configuring Content Download Jobs.
Authors: Haowei Yuan and Patrick Crowley Publisher: 2013 Proceedings IEEE INFOCOM Presenter: Chia-Yi Chu Date: 2013/08/14 1.
Fast Crash Recovery in RAMCloud. Motivation The role of DRAM has been increasing – Facebook used 150TB of DRAM For 200TB of disk storage However, there.
Computer Science Lecture 19, page 1 CS677: Distributed OS Last Class: Fault tolerance Reliable communication –One-one communication –One-many communication.
On The Cooperation of Web Clients and Proxy Caches Yiu Fai Sit, Francis C.M. Lau, Cho-Li Wang Department of Computer Science The University of Hong Kong.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Topic 3 Analysing network traffic
Measuring the Capacity of a Web Server USENIX Sympo. on Internet Tech. and Sys. ‘ Koo-Min Ahn.
1 MSRBot Web Crawler Dennis Fetterly Microsoft Research Silicon Valley Lab © Microsoft Corporation.
Hint-based Acceleration of Web Proxy Cache Daniela Rosu Arun Iyengar Daniel Dias IBM T.J.Watson Research Center Unversity of Yuan Ze,Syslab Mike Tien
Adaptive Load Sharing for Clustered Digital Library Servers Song, yong-joo System Software Lab, EECS, KAIST 9. 13, 2000.
CS Spring 2014 CS 414 – Multimedia Systems Design Lecture 37 – Introduction to P2P (Part 1) Klara Nahrstedt.
System Software Lab. A Scalable Web Cache Consistency Architecture Kim Sangyup SSLAB. EE. KAIST SIGCOMM ’ 99 Haobo Yu, Lee Breslau.
Communication and Information Systems Lab. Application-Level Differentiated Services for Web Servers Tue. KAIST Dept. EECS CISLAB 송지영
WebScan: Implementing QueryServer 2.0 Karl Geiger, Amgen Inc. BRS NA UG August 1999.
Computer Science Lecture 19, page 1 CS677: Distributed OS Last Class: Fault tolerance Reliable communication –One-one communication –One-many communication.
Cofax Scalability Document Version Scaling Cofax in General The scalability of Cofax is directly related to the system software, hardware and network.
CS Spring 2010 CS 414 – Multimedia Systems Design Lecture 24 – Introduction to Peer-to-Peer (P2P) Systems Klara Nahrstedt (presented by Long Vu)
DISTRIBUTED FILE SYSTEM- ENHANCEMENT AND FURTHER DEVELOPMENT BY:- PALLAWI(10BIT0033)
Clustered Web Server Model
Lecture 18: Coherence and Synchronization
Memory Management for Scalable Web Data Servers
Edge computing (1) Content Distribution Networks
CS510 - Portland State University
Database System Architectures
Distributed Resource Management: Distributed Shared Memory
Lecture 1: Bloom Filters
Presentation transcript:

Cooperative Caching of Dynamic Content on a Distributed Web Server Vegard Holmedahl, Ben Smith, Tao Yang Speaker: SeungLak Choi, DB Lab., CS Dept.

Contents Introduction Access log analysis Design of the Swala Experiments Conclusions Critiques

Introduction (1/2) Web caching reduces mainly the network delay Processor bottleneck is also important in sites serving extensive requests to dynamic contents

Introduction (2/2) Swala Distributed Web server Cooperatively caches the results of CGI requests Alexandria Digital Library (ADL) system at UCSB Accessing multi-resolution images Spatial database queries

Access Log Analysis (1/2) There is significant potential for reducing response time by optimizing CGI Total # of requests69,337 # of CGI requests28,663 (41.3%) Total response time of CGI requests 97% of total response time

Access Log Analysis (2/2) By caching a few number of CGI results, achieve significant response time reduction Time threshold #long requests Total # repeats # uniq. repeats Time saved Saved % 0.5 sec sec sec sec

Design of the Swala (1/3) Module design HTTP Module Handling HTTP requests Cache module Updates local directory Send contents to other node Delete expired cache entries

Design of the Swala (2/3) Cache table consistency Intra-node consistency Avoiding corruption from simultaneous updates Locking granularity – directory, table, entry Inter-node consistency Weak consistency Broadcast update information to the other nodes False cache miss, false cache hit

Design of the Swala (3/3) Content consistency Weak consistency – TTL Reasonable for dynamic contents with infrequent updates Store the cache directory in main memory Store CGI results in disk System administrator must specify uncacheable CGI

Experiments (1/6) Environment Sun Ultra MHz Sun Ultra MHz 64 or 128 MB Connected by 100Mbps Ethernet WebStone

Experiments (2/6) Single-node performance and overhead of cache fetch 24 clients request nullcgi that does no work and produces <1000 bytes output

Experiments (3/6) Multi-node performance

Experiments (4/6) Cache insertion overhead Each unique request generates a cache miss

Experiments (5/6) Cache directory maintenance overhead Generate update messages to Swala

Experiments (6/6) A comparison between cooperative and stand-alone caching Cache size 2000 Cache size 20

Conclusions Caching CGI results on a cluster of workstations Significant potential for time saving through caching CGI results Overhead of cooperative cache management is small Cooperative caching of CGI results substantially improves the response time

Critiques Strengths Attack the important problem occurred in common web sites CGI programs don’t have to be rewritten Weaknesses Weak content consistency Remote cache fetch can incur network congestion in heavy network traffic Contents hashing of requests?