Storage Allocation in Prefetching Techniques of Web Caches D. Zeng, F. Wang, S. Ram Appeared in proceedings of ACM conference in Electronic commerce (EC’03)

Slides:

Advertisements

Similar presentations

2000 Making DADS distributed a Nordunet2 project Jochen Hollmann Chalmers University of Technology.

Advertisements

A Cloud Data Center Optimization Approach using Dynamic Data Interchanges Prof. Stephan Robert University of Applied Sciences.

A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:

Consistency and Replication Chapter 7 Part II Replica Management & Consistency Protocols.

Dave Bradley Rick Harper Steve Hunter 4/28/2003 CoolRunnings.

1 Efficient and Robust Streaming Provisioning in VPNs Z. Morley Mao David Johnson Oliver Spatscheck Kobus van der Merwe Jia Wang.

Cloud Computing Resource provisioning Keke Chen. Outline  For Web applications statistical Learning and automatic control for datacenters  For data.

A SLA Framework for QoS Provisioning and Dynamic Capacity Allocation Rahul Garg (IBM India Research Lab), R. S. Randhawa (Stanford University), Huzur Saran.

All Hands Meeting, 2006 Title: Grid Workflow Scheduling in WOSE (Workflow Optimisation Services for e- Science Applications) Authors: Yash Patel, Andrew.

OPTIMIZATION Lecture 24. Optimization Uses sophisticated mathematical modeling techniques for the analysis Multi-step process Provides improved benefit.

Complexity 16-1 Complexity Andrei Bulatov Non-Approximability.

1 School of Computing Science Simon Fraser University, Canada Modeling and Caching of P2P Traffic Mohamed Hefeeda Osama Saleh ICNP’06 15 November 2006.

Kuang-Hao Liu et al Presented by Xin Che 11/18/09.

Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.

Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.

Peer-to-Peer Based Multimedia Distribution Service Zhe Xiang, Qian Zhang, Wenwu Zhu, Zhensheng Zhang IEEE Transactions on Multimedia, Vol. 6, No. 2, April.

Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.

Prefix Caching assisted Periodic Broadcast for Streaming Popular Videos Yang Guo, Subhabrata Sen, and Don Towsley.

1 Caching/storage problems and solutions in wireless sensor network Bin Tang CSE 658 Seminar on Wireless and Mobile Networking.

Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,

ICNP'061 Benefit-based Data Caching in Ad Hoc Networks Bin Tang, Himanshu Gupta and Samir Das Department of Computer Science Stony Brook University.

1 Probabilistic Models for Web Caching David Starobinski, David Tse UC Berkeley Conference and Workshop on Stochastic Networks Madison, Wisconsin, June.

Virtual Memory BY JEMINI ISLAM. What is Virtual Memory Virtual memory is a memory management system that gives a computer the appearance of having more.

Caching And Prefetching For Web Content Distribution Presented By:- Harpreet Singh Sidong Zeng ECE Fall 2007.

Web Caching Schemes For The Internet – cont. By Jia Wang.

1 Efficient Management of Data Center Resources for Massively Multiplayer Online Games V. Nae, A. Iosup, S. Podlipnig, R. Prodan, D. Epema, T. Fahringer,

Efficient Algorithms for Locating Web Proxies Copyright, 1996 © Dale Carnegie & Associates, Inc. Li-Chuan Chen The MITRE Corporation Co-author:

By Ravi Shankar Dubasi Sivani Kavuri A Popularity-Based Prediction Model for Web Prefetching.

HeteroPar 2013 Optimization of a Cloud Resource Management Problem from a Consumer Perspective Rafaelli de C. Coutinho, Lucia M. A. Drummond and Yuri Frota.

A User Experience-based Cloud Service Redeployment Mechanism KANG Yu.

Energy Efficiency in Cloud Data Centers: Energy Efficient VM Placement for Cloud Data Centers Doctoral Student : Chaima Ghribi Advisor : Djamal Zeghlache.

Achieving Load Balance and Effective Caching in Clustered Web Servers Richard B. Bunt Derek L. Eager Gregory M. Oster Carey L. Williamson Department of.

DISKS IS421. DISK  A disk consists of Read/write head, and arm  A platter is divided into Tracks and sector  The R/W heads can R/W at the same time.

Bargaining Towards Maximized Resource Utilization in Video Streaming Datacenters Yuan Feng 1, Baochun Li 1, and Bo Li 2 1 Department of Electrical and.

© 2006 IBM Corporation Adaptive Self-Tuning Memory in DB2 Adam Storm, Christian Garcia-Arellano, Sam Lightstone – IBM Toronto Lab Yixin Diao, M. Surendra.

Design and Implement an Efficient Web Application Server Presented by Tai-Lin Han Date: 11/28/2000.

Section 15.1 Identify Webmastering tasks Identify Web server maintenance techniques Describe the importance of backups Section 15.2 Identify guidelines.

M i SMob i S Mob i Store - Mobile i nternet File Storage Platform Chetna Kaur.

Infrastructure for Better Quality Internet Access & Web Publishing without Increasing Bandwidth Prof. Chi Chi Hung School of Computing, National University.

Network Aware Resource Allocation in Distributed Clouds.

DELAYED CHAINING: A PRACTICAL P2P SOLUTION FOR VIDEO-ON-DEMAND Speaker : 童耀民 MA1G Authors: Paris, J.-F.Paris, J.-F. ; Amer, A. Computer.

Service Architecture of Grid Faults Diagnosis Expert System Based on Web Service Wang Mingzan, Zhang ziye Northeastern University, Shenyang, China.

Distributing Layered Encoded Video through Caches Authors: Jussi Kangasharju Felix HartantoMartin Reisslein Keith W. Ross Proceedings of IEEE Infocom 2001,

Workshop PRIXNET Distributed Virtual Circuit Switching protocol with auction pricing Loubna ECHABBI Dominique BARTH Laboratoire PRISM.

A Dynamic Data Grid Replication Strategy to Minimize the Data Missed Ming Lei, Susan Vrbsky, Xiaoyan Hong University of Alabama.

The Design and Implementation of Log-Structure File System M. Rosenblum and J. Ousterhout.

Group 3 Sandeep Chinni Arif Khan Venkat Rajiv. Delay Tolerant Networks Path from source to destination is not present at any single point in time. Combining.

The Application of The Improved Hybrid Ant Colony Algorithm in Vehicle Routing Optimization Problem International Conference on Future Computer and Communication,

1 ACTIVE FAULT TOLERANT SYSTEM for OPEN DISTRIBUTED COMPUTING (Autonomic and Trusted Computing 2006) Giray Kömürcü.

1 S ystems Analysis Laboratory Helsinki University of Technology Flight Time Allocation Using Reinforcement Learning Ville Mattila and Kai Virtanen Systems.

Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.

Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.

ASSIGNMENT, DISTRIBUTION AND QOS PROVISIONING IN COMMUNICATION NETWORKS.

Content caching and scheduling in wireless networks with elastic and inelastic traffic Group-VI 09CS CS CS30020 Performance Modelling in Computer.

Best Available Technologies: External Storage Overview of Opportunities and Impacts November 18, 2015.

Web Prefetching Lili Qiu Microsoft Research March 27, 2003.

Author Utility-Based Scheduling for Bulk Data Transfers between Distributed Computing Facilities Xin Wang, Wei Tang, Raj Kettimuthu,

Video Caching in Radio Access network: Impact on Delay and Capacity

On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.

Quality Is in the Eye of the Beholder: Meeting Users ’ Requirements for Internet Quality of Service Anna Bouch, Allan Kuchinsky, Nina Bhatti HP Labs Technical.

Management of Broadband Media Assets on Wide Area Networks Lars-Olof Burchard.

Resource Optimization for Publisher/Subscriber-based Avionics Systems Institute for Software Integrated Systems Vanderbilt University Nashville, Tennessee.

The Impact of Replacement Granularity on Video Caching

Section 15.1 Section 15.2 Identify Webmastering tasks

Server Allocation for Multiplayer Cloud Gaming

ISP and Egress Path Selection for Multihomed Networks

Load Balancing in Distributed Systems

Group Based Management of Distributed File Caches

Presented By: Darlene Banta

Chapter 6. Large Scale Optimization

Presentation transcript:

Storage Allocation in Prefetching Techniques of Web Caches D. Zeng, F. Wang, S. Ram Appeared in proceedings of ACM conference in Electronic commerce (EC’03) San Diego June 9-12, 2003 Presented by Laura D. Goadrich

The Web Large-scale distributed information system where data Objects are published and accessible by users Problems caused by the demand of increased web capacity: Network traffic congestion Web server overloads Solution: web caching

Web caching: Benefits: Improves web performance (reduces access latency) Increases web capacity Alleviate traffic congestion (reducing network bandwidth consumption) Reducing number of client requests (workload) Possibly improve failure tolerance and robustness of Web (maintaining cached copies of web objects for unreachable networks) Prefetching: Anticipate users’ future needs This research: Focuses on making cache-related storage capacity decisions (storage capacity limits the number of prefetched web objects) Therefore allocate cache storage in prefetching The authors state this focus has not been researched**

Ideas: Current research: Predict user web accesses without considering cache storage limit This research: optimization based models Maximize hit rate Maximize byte hit rate Minimize access latency (first 2 are primary goals of web caching: maximize) Benefit of this research: guide the operations of a prefetching system

Web prefetching techniques Client-initiated policies User A is likely to access URL U2 right after URL U1 Patterns learned via Markov algorithms Server-initiated policies Anticipate future requests based on server logs and proactively send the corresponding Web objects to participating cache servers or client browsers Top-n algorithm Hybrid policies Combine user access patterns from clients and general statistics from servers to improve the quality of prediction Failing of policies: how to make decisions of which Web objects to prefetch considering storage capacity

Assumptions/Notation Cmaximum amount of storage space is available to store prefetched Web objects iURL of potential interest PiPi Predicted probability with which URL i will be visited (i, P i )Prediction of users’ future accesses NSet of all URLs of potential interest S i э (S i <C)Size of each Web object referred to by i

Hit Rate (HR) Model (1) (2) (3)

Byte Hit Rate (BHR) Model (4) (2) (3)

Byte Hit Rate (BHR) Model (7) (2) (3) αiαi # of seconds to establish the network connection between the client machine and the Web server hosting i βiβi # of seconds per byte to transmit i over the network

Transforming HR, BHR & AL into the Knapsack problem Benefits of Knapsack problem Well studied “easiest” NP-hard problem Can solve optimally by a pseudo-polynomial algorithm based on dynamic programming A fully polynomial approximation is possible Focus on greedy algorithm (due to paper length limits)

Greedy Algorithm: 1. Sort all URLs into a sequence 2. Determine a threshold k defined as: 3. Prefetch Web objects referred to by URLs

Other Allocation Policies Tested Optimal policy using CPLEX Disadvantages Complex Increased implementation time Difficult to implement Top-n Developed for Web usage prediction Used to regulate storage allocations by appropriately setting n Equivalent to Greedy BHR relying only on P i

Simulations SmallLarge |N|50200 repTextmultimedia C100,000 α/ β 5,000 (slow)30,000 (fast) LN(μ,σ)= lognormal distribution with mean e μ and shape σ a. b.

Performance Comparison Experimental Condition Hit RateByte Hit Rate% Savings in Access Latency OptG-HRTop-nOptG-HRTop-nOptG-HRTop-n a=50, LN(10,.05), b= a=50, LN(10,.05) ), b= a=50, LN(10,1) ), b= a=50, LN(10,1) ), b= a=200, LN(10,.05) ), b= a=200, LN(10,.05) ), b= a=200, LN(10,1) ), b= a=200, LN(10,1) ), b=

Results Greedy algorithms and Top-n in general achieve reasonable performance Greedy algorithms outperform Top-n with respect to hit rate and access latency There exists a relatively large performance gap between an optimal approach and fast heuristic methods when Web objects vary greatly in size Suggests the need for developing more sophisticated allocation policies such as a dynamic programming-based approach

Contributions: Focus: stress importance of effective storage allocation in prefetching Paper contributions: 1. Provide new formulations for prefetching storage allocation 2. Create computationally efficient allocation policies based on storage allocations solved by the knapsack problem 3. Models created lead to more precise understanding of the applicability and effectiveness of Top-n policy

Future Work Trace-based simulation Actual web access logs More realistic environment Modeling Integrate allocation models with caching storage management models i.e. Cache replacement

Changes- Recommendations Not renaming the same constraints More resources (5 articles, 2 books) Discuss feasible solve times (opt) Test/Hypothesize implementation strategies for real application