Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science.

Presentation transcript:

Quantitative Evaluation of Unstructured Peer-to-Peer Architectures Fabrício Benevenuto José Ismael Jr. Jussara M. Almeida Department of Computer Science Federal University of Minas Gerais Brazil

Motivation
- P2P systems are responsible for a large portion of Internet traffic
- First-generation unstructured P2P systems are decreasing in popularity due to poor scalability
  - Ex: original Gnutella protocol (v. 0.4)
- New popular hybrid unstructured P2P systems
  - Exploit the heterogeneity inherent to peers
    - Super-peers: highly available and powerful peers
  - Intuitively more scalable
    - Ex: KaZaA, Gnutella 2

Goals
- Quantify the main performance benefits provided by each individual feature of super-peer architectures
- Provide insights to guide the design of future P2P systems

Outline
- Overview of unstructured P2P architectures
  - Message flooding architecture
  - Hybrid super-peer architecture
- Evaluation methodology
  - Simulation environment
  - Performance metrics
- Results
- Conclusions and future work

Overview of Unstructured Peer-to-Peer Architectures
- Message Flooding Architecture
  - First generation: Gnutella 0.4
  - Poor scalability due to network overload
- Super-Peer Architectures
  - Exploit peer heterogeneity: Gnutella 2.0, KaZaA
    - Super-peers: typically more powerful and available
  - Intuitively better scalability due to several new features

Content Location in Message Flooding Architecture Gnutella 0.4
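The flooding search pictured on this slide can be approximated with a short simulation. This is only a sketch under stated assumptions: the overlay `EXAMPLE_OVERLAY`, the function name `flood_query`, and the rule that a peer never sends the query back to the neighbor it came from are illustrative choices, not details taken from the presentation or from the Gnutella 0.4 specification.

```python
from collections import deque

# Hypothetical five-peer overlay: peer -> directly connected neighbors.
EXAMPLE_OVERLAY = {
    "a": ["b", "c"],
    "b": ["a", "c", "d"],
    "c": ["a", "b", "d"],
    "d": ["b", "c", "e"],
    "e": ["d"],
}


def flood_query(overlay, source, holders, ttl):
    """Flood a query up to `ttl` hops from `source`.

    Returns (peers holding the file that were reached, query messages sent).
    Duplicate copies are counted as messages but are not forwarded again.
    """
    seen = {source}
    hits = set()
    messages = 0
    queue = deque([(source, None, ttl)])
    while queue:
        peer, sender, remaining = queue.popleft()
        if peer != source and peer in holders:
            hits.add(peer)
        if remaining == 0:
            continue  # TTL exhausted: this peer does not forward
        for neighbor in overlay[peer]:
            if neighbor == sender:
                continue  # assumption: never echo the query to its sender
            messages += 1  # every forwarded copy loads the receiving peer
            if neighbor not in seen:
                seen.add(neighbor)
                queue.append((neighbor, peer, remaining - 1))
    return hits, messages
```

Even in this tiny overlay, raising the TTL from 2 to 3 is what lets the query reach peer `e`, at the cost of additional messages, several of which are duplicates dropped on arrival.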

Content Location in Super-Peer Architecture Gnutella 2.0

Features of Gnutella 2.0 Architecture
- Super-peer backbone speeds up search
  - A super-peer that receives a query from a leaf or initiates a new query only forwards it to other super-peers directly connected to it (one hop away)
- Content-aware query routing mechanism
  - A super-peer only forwards a query to other super-peers or leaves where there is a chance the file is stored
    - Super-peers maintain local query hash tables
- User-controlled query retransmission
  - User may restart the query from other super-peers, hoping to increase the number of hits and reduce download time
- Swarm download
  - User downloads file pieces from multiple peers: expect reduced download time (feature in other systems as well)
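The content-aware routing idea can be sketched as follows. This is a simplified illustration, not the Gnutella 2.0 query hash table format: the table size `TABLE_SLOTS`, the use of SHA-1, whitespace keyword splitting, and all function names are assumptions made for the example.

```python
import hashlib

TABLE_SLOTS = 1024  # hypothetical table size, chosen only for the example


def keyword_slot(keyword):
    """Map a keyword to one slot of the table."""
    digest = hashlib.sha1(keyword.lower().encode("utf-8")).digest()
    return int.from_bytes(digest[:4], "big") % TABLE_SLOTS


def build_table(file_names):
    """Summarize the keywords of a peer's shared files as a set of slots."""
    return {keyword_slot(word) for name in file_names for word in name.split()}


def may_match(table, query):
    """True if every query keyword hits a marked slot.

    False positives are possible (hash collisions); false negatives are not.
    """
    keywords = query.split()
    return bool(keywords) and all(keyword_slot(k) in table for k in keywords)


def route_query(neighbor_tables, query):
    """Return only the neighbors whose table says the file might be there."""
    return [n for n, table in neighbor_tables.items() if may_match(table, query)]
```

A super-peer holding one such table per neighbor would call `route_query` instead of forwarding to every connection. Because the tables can yield false positives but not false negatives, this filtering removes messages without hiding any file that a neighbor has advertised.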

How much performance benefit does each such feature provide over the original message-flooding based Gnutella 0.4 protocol?

Evaluation Methodology
- Simulators
  - Previous: optimized message flooding Gnutella with communities
    - Communities exploit locality of interests among peers
    - Content is searched first in a peer's community; if not found, the original message flooding mechanism is used
    - Significant system load reductions [BCAA04]
  - New: super-peer Gnutella 2.0 protocol (specification)
  - Both simulators: heterogeneous aspects found in real systems
- Performance Metrics
  - System load: average # messages processed by each node
  - Query latency: time until download starts
  - Query success rate: % queries successfully responded
  - Download time: average download time
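The four metrics above could be computed from a simulation trace roughly as shown here. The record layout `QueryRecord`, the function `summarize`, and the choice to treat a query as successful when a download started are assumptions for illustration; the presentation does not describe the simulators' internal data structures.

```python
from dataclasses import dataclass
from statistics import mean
from typing import Optional


@dataclass
class QueryRecord:
    issued_at: float                       # simulation time the query was sent
    download_started_at: Optional[float]   # None if the query got no response
    download_finished_at: Optional[float]  # None if no download took place


def summarize(messages_per_node, queries):
    """Compute the four metrics from per-node counters and query records."""
    answered = [q for q in queries if q.download_started_at is not None]
    return {
        # System load: average number of messages processed by each node.
        "system_load": mean(messages_per_node.values()),
        # Query success rate: percentage of queries that were responded to.
        "success_rate_pct": 100.0 * len(answered) / len(queries),
        # Query latency: time from issuing the query until download starts.
        "query_latency": mean(
            q.download_started_at - q.issued_at for q in answered
        ) if answered else None,
        # Download time: averaged over completed downloads.
        "download_time": mean(
            q.download_finished_at - q.download_started_at for q in answered
        ) if answered else None,
    }
```

Averaging system load over all nodes, as in the metric definition, hides how the load is split between leaves and super-peers; a per-role breakdown would need a second counter.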

Most Relevant Results
- Message flooding + peer community vs. super-peer backbone
- Content-aware query routing mechanism
- User-controlled query retransmission
- Swarm download

Message-Flooding + Community vs. Super-peer Backbone
[Figure: System Load]
- Super-peer: average # messages processed by a peer drops by roughly 95%
- 50% reduction in latency: limited traffic over the backbone (paper)
- Shorter average download times (paper)
- Query success rate is the same for both architectures (~90%)

Content-Aware Query Routing
- Further reductions in average system load
  - Content-aware query routing: system load drops by a factor of 41
[Figure: System Load; series: No Query Routing and Query Routing, each with 0 and 2 query retransmissions]

User-Controlled Query Retransmission
- Query success rate: one or two retransmissions deliver most of the performance gains (96-98% success rate), with diminishing returns afterwards
- Average download time: significant reductions for small music files (more popular workload)
- Query latency and system load increase linearly with # retransmissions (paper)
[Figures: Query Success Rate (%) and Reduction in Average Download Time (%) vs. # Retransmissions]

User-Controlled Query Retransmission
[Figure: System Load]
- System load increases linearly with # retransmissions, mainly due to query and query-hit messages, as expected

Swarm Download
[Figure: Average Download Time (sec) vs. # Simultaneous Downloads; labels: All possible, Video, TV Show, Music workloads]
- Only reduces download time significantly for small files
- Higher probability of downloading from low-bandwidth or highly utilized peers as the number of simultaneous downloads increases
- Bottleneck especially critical for large files
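The bottleneck effect can be illustrated with a deliberately simple model. It assumes the file is split into equal pieces, one per source, that pieces are never reassigned, and that the download finishes when the slowest piece arrives. The function name `swarm_download_time` and the bandwidth figures are hypothetical; the simulated system may schedule pieces differently.

```python
def swarm_download_time(file_size_mb, source_bandwidths_mbps):
    """Seconds to fetch a file split evenly across the given sources.

    Sizes are in megabytes, bandwidths in megabits per second. The slowest
    source determines the completion time under this static, equal split.
    """
    if not source_bandwidths_mbps:
        raise ValueError("at least one download source is required")
    piece_mb = file_size_mb / len(source_bandwidths_mbps)
    return max(piece_mb * 8 / bandwidth for bandwidth in source_bandwidths_mbps)
```

Under these assumptions a 700 MB file takes 700 s from a single 8 Mbps source and 350 s from two such sources, but 2800 s if the second source offers only 1 Mbps. Adding sources without regard to their bandwidth can therefore make the download slower, which is consistent with the slide's point about low-bandwidth peers.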

Conclusions and Future Work
- Conclusions
  - The super-peer architecture itself provides much better scalability than the optimized message flooding protocol
    - 95% system load reduction with the same query success rate
  - Content-aware query routing provides further load reductions
  - One or two query retransmissions should be enough to provide an almost maximal query success rate and a 40% download time reduction for small files, while keeping latency and load at low levels
  - Swarm download may be detrimental to performance if download sources are not carefully selected
- Future Work
  - Extend the performance evaluation to allow peers to dynamically join and leave the system
  - Design new peer selection policies that exploit locality of interest (peer communities) and peer characteristics