1 School of Computing Science Simon Fraser University, Canada Modeling and Caching of P2P Traffic Mohamed Hefeeda Osama Saleh ICNP’06 15 November 2006.

Slides:



Advertisements
Similar presentations
A Measurement Study of Peer-to-Peer File Sharing Systems Presented by Cristina Abad.
Advertisements

A Survey of Web Cache Replacement Strategies Stefan Podlipnig, Laszlo Boszormenyl University Klagenfurt ACM Computing Surveys, December 2003 Presenter:
Web Server Benchmarking Using the Internet Protocol Traffic and Network Emulator Carey Williamson, Rob Simmonds, Martin Arlitt et al. University of Calgary.
October 15, 2002MASCOTS WebTraff: A GUI for Web Proxy Cache Workload Modeling and Analysis Nayden Markatchev Carey Williamson Department of Computer.
Cloud Download : Using Cloud Utilities to Achieve High-quality Content Distribution for Unpopular Videos Yan Huang, Tencent Research, Shanghai, China Zhenhua.
1 School of Computing Science Simon Fraser University, Canada PCP: A Probabilistic Coverage Protocol for Wireless Sensor Networks Mohamed Hefeeda and Hossein.
Measurement, Modeling and Analysis of a Peer-to-Peer File-Sharing Workload Krishna Gummadi, Richard Dunn, Stefan Saroiu Steve Gribble, Hank Levy, John.
GlobeTraff A traffic workload generator for the performance evaluation of ICN architectures K.V. Katsaros, G. Xylomenos, G.C. Polyzos A.U.E.B. (presented.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Analysis of Multimedia Authentication Schemes Mohamed Hefeeda (Joint work.
Cross-Layer Optimization for Video Streaming in Single- Hop Wireless Networks Cheng-Hsin Hsu Joint Work with Mohamed Hefeeda MMCN ‘09January 19, 2009 Simon.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada ISP-Friendly Peer Matching without ISP Collaboration Mohamed Hefeeda (Joint.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Multimedia Streaming in Dynamic Peer-to-Peer Systems and Mobile Wireless.
Web Caching Schemes1 A Survey of Web Caching Schemes for the Internet Jia Wang.
Beneficial Caching in Mobile Ad Hoc Networks Bin Tang, Samir Das, Himanshu Gupta Computer Science Department Stony Brook University.
An Analysis of Internet Content Delivery Systems Stefan Saroiu, Krishna P. Gommadi, Richard J. Dunn, Steven D. Gribble, and Henry M. Levy Proceedings of.
Measurement, Modeling, and Analysis of a Peer-to-Peer File sharing Workload Krishna P. Gummadi, Richard J. Dunn, Stefan Saroiu, Steven D. Gribble, Henry.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies Dilley, J.; Arlitt, M. A journal paper of IEEE Internet Computing, Volume: 3.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
Improving Proxy Cache Performance: Analysis of Three Replacement Policies John Dilley and Martin Arlitt IEEE internet computing volume3 Nov-Dec 1999 Chun-Fu.
Exploiting Content Localities for Efficient Search in P2P Systems Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang 1 1 College of William and Mary,
PROMISE: Peer-to-Peer Media Streaming Using CollectCast M. Hefeeda, A. Habib, B. Botev, D. Xu, and B. Bhargava ACM Multimedia 2003, November 2003.
1 CAPS: A Peer Data Sharing System for Load Mitigation in Cellular Data Networks Young-Bae Ko, Kang-Won Lee, Thyaga Nandagopal Presentation by Tony Sung,
Quality-Aware Segment Transmission Scheduling in Peer-to-Peer Streaming Systems Cheng-Hsin Hsu Senior Research Scientist Deutsche Telekom R&D Lab USA Los.
Adaptive Content Management in Structured P2P Communities Jussi Kangasharju Keith W. Ross David A. Turner.
Looking at the Server-side of P2P Systems Yi Qiao, Dong Lu, Fabian E. Bustamante and Peter A. Dinda Department of Computer Science Northwestern University.
Internet Cache Pollution Attacks and Countermeasures Yan Gao, Leiwen Deng, Aleksandar Kuzmanovic, and Yan Chen Electrical Engineering and Computer Science.
A Distributed Search Service for Peer-to-Peer File Sharing in Mobile Application Presented by Tony Sung On Loy, MC Lab, CUHK IE 1 A Distributed Search.
Web Caching Robert Grimm New York University. Before We Get Started  Illustrating Results  Type Theory 101.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
A Hybrid Caching Strategy for Streaming Media Files Jussara M. Almeida Derek L. Eager Mary K. Vernon University of Wisconsin-Madison University of Saskatchewan.
Caching And Prefetching For Web Content Distribution Presented By:- Harpreet Singh Sidong Zeng ECE Fall 2007.
Web Caching Schemes For The Internet – cont. By Jia Wang.
On-Demand Media Streaming Over the Internet Mohamed M. Hefeeda, Bharat K. Bhargava Presented by Sam Distributed Computing Systems, FTDCS Proceedings.
Web Caching and Content Delivery. Caching for a Better Web Performance is a major concern in the Web Proxy caching is the most widely used method to improve.
Storage management and caching in PAST PRESENTED BY BASKAR RETHINASABAPATHI 1.
P2P Architecture Case Study: Gnutella Network
1 One-Click Hosting Services: A File-Sharing Hideout Demetris Antoniades Evangelos P. Markatos ICS-FORTH Heraklion,
Storage Allocation in Prefetching Techniques of Web Caches D. Zeng, F. Wang, S. Ram Appeared in proceedings of ACM conference in Electronic commerce (EC’03)
SAINT ‘01 Proactive DNS Caching: Addressing a Performance Bottleneck Edith Cohen AT&T Labs-Research Haim Kaplan Tel-Aviv University.
Web Cache Replacement Policies: Properties, Limitations and Implications Fabrício Benevenuto, Fernando Duarte, Virgílio Almeida, Jussara Almeida Computer.
1 Towards Cinematic Internet Video-on-Demand Bin Cheng, Lex Stein, Hai Jin and Zheng Zhang HUST and MSRA Huazhong University of Science & Technology Microsoft.
Segment-Based Proxy Caching of Multimedia Streams Authors: Kun-Lung Wu, Philip S. Yu, and Joel L. Wolf IBM T.J. Watson Research Center Proceedings of The.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Video Streaming over Cooperative Wireless Networks Mohamed Hefeeda (Joint.
1 CS 425 Distributed Systems Fall 2011 Slides by Indranil Gupta Measurement Studies All Slides © IG Acknowledgments: Jay Patel.
Mohamed Hefeeda 1 School of Computing Science Simon Fraser University, Canada Optimal Partitioning of Fine-Grained Scalable Video Streams Mohamed Hefeeda.
Design and Analysis of Advanced Replacement Policies for WWW Caching Kai Cheng, Yusuke Yokota, Yahiko Kambayashi Department of Social Informatics Graduate.
Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Kaleidoscope – Adding Colors to Kademlia Gil Einziger, Roy Friedman, Eyal Kibbar Computer Science, Technion 1.
Efficient P2P Search by Exploiting Localities in Peer Community and Individual Peers A DISC’04 paper Lei Guo 1 Song Jiang 2 Li Xiao 3 and Xiaodong Zhang.
Performance of Web Proxy Caching in Heterogeneous Bandwidth Environments IEEE Infocom, 1999 Anja Feldmann et.al. AT&T Research Lab 발표자 : 임 민 열, DB lab,
Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.
Jennifer Rexford Fall 2014 (TTh 3:00-4:20 in CS 105) COS 561: Advanced Computer Networks TCP.
NTMS 2012 GlobeTraff: a traffic workload generator for the performance evaluation of future Internet architectures K.V. Katsaros, G. Xylomenos, G.C. Polyzos.
An Overview of Proxy Caching Algorithms Haifeng Wang.
MiddleMan: A Video Caching Proxy Server NOSSDAV 2000 Brian Smith Department of Computer Science Cornell University Ithaca, NY Soam Acharya Inktomi Corporation.
On the Placement of Web Server Replicas Yu Cai. Paper On the Placement of Web Server Replicas Lili Qiu, Venkata N. Padmanabhan, Geoffrey M. Voelker Infocom.
An Analysis of Internet Content Delivery Systems 19 rd November, 2007 Youngsub CSE, SNU.
#16 Application Measurement Presentation by Bobin John.
1 Internet Traffic Measurement and Modeling Carey Williamson Department of Computer Science University of Calgary.
On the scale and performance of cooperative Web proxy caching 2/3/06.
Plethora: A Locality Enhancing Peer-to-Peer Network Ronaldo Alves Ferreira Advisor: Ananth Grama Co-advisor: Suresh Jagannathan Department of Computer.
Modeling and Caching of P2P Traffic Osama Saleh Thesis Defense and Seminar 21 November 2006.
1 Evaluation of Cooperative Web Caching with Web Polygraph Ping Du and Jaspal Subhlok Department of Computer Science University of Houston presented at.
Proxy Caching for Peer-to-Peer Live Streaming The International Journal of Computer Networks, 2010 Ke Xu, Ming Zhang, Mingjiang Ye Dept. of Computer Science,
The Impact of Replacement Granularity on Video Caching
Web Caching? Web Caching:.
Plethora: Infrastructure and System Design
Evaluating Proxy Caching Algorithms in Mobile Environments
A Measurement Study of Napster and Gnutella
Presentation transcript:

1 School of Computing Science Simon Fraser University, Canada Modeling and Caching of P2P Traffic Mohamed Hefeeda Osama Saleh ICNP’06 15 November 2006

2 Motivations  P2P traffic is a major fraction of Internet traffic  … and it is increasing [Karagiannis 04]  Negative consequences -increased load on networks  -higher cost on ISPs (and users!), and -more congestion  Can traffic caching help ?

3 Our Problem  Design an effective caching scheme for P2P traffic  Main objective: -Reduce WAN traffic  reduce cost & congestion

4 Our Solution Approach  Measure and model P2P traffic characteristics relevant to caching, i.e., -seen by cache deployed in an autonomous systems (AS)  Then, develop a caching algorithm

5 Why not use Web/Video Caching Algorithms?  Different traffic characteristics: -P2P vs. Web: P2P objects are large, immutable and have different popularity models -P2P vs. Video: P2P objects do not impose any timing constraints  Different caching objectives: -Web: minimize latency, make users happy -Video: minimize start-up delay, latency and enhance quality -P2P: minimize bandwidth consumption

6 Related Work  Several P2P measurement studies, e.g., -[Gummadi 03]: Object popularity is not Zipf, but no closed-form model is given, conducted in one network domain -[Klemm 04]: Query popularity follows mixture of two Zipf distributions, we use popularity of actual object transfers -[Leibowitz 02] [Karagiannis 04]: highlight potential of caching P2P traffic, no caching algorithms presented -All provide useful insights, but they were not explicitly designed to study caching P2P traffic  P2P caching algorithms -[Wierzbicki 04]: proposed two P2P caching algorithms, we compare against the best of them (LSB) -We also compare against LRU, LFU, and GDS

7 Measurement Study  Modified Limewire (Gnutella) to: -Run in super peer mode -Maintain up to 500 concurrent connections (70% with other super nodes -Log all query and queryhit messages  Measure and model -Object popularity -Popularity dynamics -Object sizes  Why Gnutella? -Supports passive measurements -Open source: easy to modify -One of the top-three most popular protocols [Zhao 06]

8 Measurement Study: Stats  Is it representative for P2P traffic? We believe so. -Traffic characteristics are similar in different P2P systems [Gummadi 03]: Non-Zipf traffic in Kazza, same as ours [Saroiu 03]: Napster and Gnutella have similar session duration, host up time, #files shared [Pouwelse 04]: Similar host up time and object properties in BitTorrent Measurement PeriodJan 06 – Sep 06 Unique Objects17 M Unique IPs39 M ASes with more than 100,000 downloads 127 Total traffic volume6,262 Tera Bytes

9 Measurement Study: Object Popularity Notice the flattened head, unlike Zipf

10 Modeling Object Popularity  We propose a Mandelbrot-Zipf (MZipf) model for P2P object popularity: -α: skewness factor, same as Zipf-like distributions -q: plateau factor, controls the plateau shape (flattened head) near the lowest ranked objects -Larger q values  more flattened head  Validation across top 20 ASes (in terms of traffic) -Sample in previous slide

11 Zipf vs. Mandelbrot-Zipf  Zipf over-estimates popularity of objects at lowest ranks  Which are the good candidates for caching AS Zipf

12 Effect of MZipf on Caching Simple analysis using LFU policy Significant bye hit rate loss at realistic cache sizes (e.g., 10%) (H Zipf - H MZipf ) / H Zipf

13 Effect of MZipf on Caching (cont’d) Trace-based simulation using Optimal policy in two ASes larger q (more flattened head)  smaller byte hit rate

14 When is q large?  In ASes with small number of hosts -Immutable objects  download at most once behavior  -Object popularity bounded by number of hosts  large q q values for 20 ASes

15 P2P Caching Algorithm: Basic Idea  Proportional Partial Caching -Cache fraction of the object proportional to its popularity -Motivated by the Mandelbrot-Zipf popularity model -Minimizes the effect of caching large unpopular objects  Segmentation -Divide objects into segments of different sizes -Motivated by the existence of multiple workloads  Replacement -Replace segments of the object with the least number of served bytes normalized by its cached fraction

16 Trace-based Performance Evaluation  Algorithms Implemented -Web policies: LRU, LFU, Greedy-Dual Size (GDS) -P2P policies: Least Sent Bytes (LSB) [Wierzbicki 04] -Offline Optimal Policy (OPT): looks at entire trace, caches objects that maximize byte hit rate  Scenarios -With and without aborted downloads -Various degrees of temporal locality (popularity, temporal correlation)  Performance -Byte Hit Rate (BHR) in top 10 ASes -Importance of partial caching -Sensitivity of our algorithm to: segment size, plateau and skewness factors

17 Byte Hit Rate: No Aborted Downloads  BHR of our algorithm is close to the optimal, much better than LRU, LFU, GDS, LSB AS 397

18 Byte Hit Rate: No Aborted Downloads (cont’d)  Our algorithm consistently outperforms all others in top 10 ASes Top 10 ASes

19 Byte Hit Rate: Aborted Downloads  Same traces as before, adding 2 partial transactions for every complete transaction [Gummadi 03]  Performance gap is even wider -BHR is at least 40% more, and -At most triple the BHR of other algorithms Top 10 ASesAS 397

20 Importance of Partial Caching (1)  Compare our algorithm with and without partial caching -Keeping everything else fixed  Performance of our algorithm degrades without partial caching in all top 10 ASes

21 Importance of Partial Caching (2)  Compare against an optimal policy that does not do partial caching  MKP = store Most K Popular full objects that fill the cache  Our policy outperforms MKP in 6 out of 10 top ASes, and close to it in the others  MKP: optimal, no partial caching  P2P: heuristic with partial caching

22 Importance of Partial Caching (3)  Now, given that our P2P partial caching algorithm -Outperforms LRU, LFU, GDS (all full caching) -Is close to the offline OPT (maximizes byte hit rate) -Outperforms the offline MKP (stores most K-popular objects) -Suffers when we remove partial caching  It is reasonable to believe that Partial caching is critical in P2P systems, because of large object sizes and MZipf popularity

23 Conclusions  Conducted eight-month study to measure and model P2P traffic characteristics relevant caching  Found that object popularity can be modeled by Mandelbrot-Zipf distribution (flattened head)  Proposed a new proportional partial caching algorithm for P2P traffic -Outperforms other algorithms by wide margins, -Robust against different traffic patterns

24 Thank You! Questions??  Some of the presented results are available in the extended version of the paper  All traces are available:

25 Future Work  Implement a P2P proxy cache prototype  Extend measurement study to include other P2P protocols  Analytically analyze our P2P caching algorithm  Use cooperative caching between proxy caches at different ASes