Friendships that last: Peer lifespan and its role in P2P protocols

Presentation transcript:

Friendships that last: Peer lifespan and its role in P2P protocols
Fabián E. Bustamante & Yi Qiao
Department of Computer Science, Northwestern University
{fabianb,yqiao}@cs.northwestern.edu
www.aqualab.cs.northwestern.edu

P2P and heterogeneity
- P2P computing: sharing of computer resources and services by direct exchange between participants
- Purest form: all peers are equal
- Problem: clash between assumption and reality. Peer populations show high variation in storage, bandwidth, latency, degree of sharing, uptime, …
- Notes: the P2P idea; its purest form; the problem: a clash between the assumption and a heterogeneous, transient population

Transient peers and P2P systems
- Peers define an overlay network
  - Set of connections to other peers (their “friends”)
  - Maintenance protocol that repairs the overlay
- Degree of peer transiency
  - Median uptime ~70 minutes
- Implications
  - Maintenance-related messages
  - Plus degree of replication, effectiveness of caches, spread of queries, overall system scalability, …

Our approach
- Part of the problem is whom one befriends
- One solution: pick those that will live/stay long
- Without knowing the future, can we predict it?
  - Yes; peer lifespan follows a Pareto distribution!
- Given a good prediction, how should it be used in P2P protocols? Can it really help?

Determining lifespan distribution
- In Gnutella, using a modified client, between March 1st and 8th, 2003
- Some details:
  - Attempt a Gnutella connection setup
  - 20 monitoring peers for fine probe granularity
  - First-time found peers only recorded with Time-When-Found
  - Peer considered dead when
    - Connection attempt fails for the 3rd time
    - Unexpected response is received
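The death rule on this slide can be sketched as a small per-peer record. This is only an illustration under stated assumptions: the names `PeerRecord`, `record_probe`, and `observed_lifespan` are hypothetical, the slide does not say whether the three failures must be consecutive (the sketch assumes they must), and the observed lifespan is taken as time of death minus Time-When-Found.

```python
from dataclasses import dataclass
from typing import Optional

MAX_FAILURES = 3  # slide: peer is dead when the connection attempt fails the 3rd time


@dataclass
class PeerRecord:
    """Hypothetical bookkeeping for one probed peer (not the authors' client)."""
    time_when_found: float                 # first time a monitor saw the peer
    failures: int = 0                      # failed connection attempts in a row
    time_of_death: Optional[float] = None  # set once the peer is declared dead

    @property
    def alive(self) -> bool:
        return self.time_of_death is None

    def record_probe(self, now: float, connected: bool,
                     expected_response: bool = True) -> None:
        """Update the record with the outcome of one connection attempt."""
        if not self.alive:
            return
        if connected and not expected_response:
            # Slide: an unexpected response also marks the peer as dead.
            self.time_of_death = now
        elif connected:
            # Assumption: a successful probe resets the failure count.
            self.failures = 0
        else:
            self.failures += 1
            if self.failures >= MAX_FAILURES:
                self.time_of_death = now

    def observed_lifespan(self) -> Optional[float]:
        """Seconds between first sighting and declared death, if dead."""
        if self.alive:
            return None
        return self.time_of_death - self.time_when_found
```

With 20 monitoring peers probing in turn, the gap between probes bounds how precisely `time_of_death` can be placed, which is presumably why the slide stresses probe granularity.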

Peer lifespan distribution
- 500,000 peers, ~1 million peers’ lifespans
- Create-based method to handle the limited scope of the sample
- Figures show the RCDF of peers with lifespan in [1,300 sec, 3.5 days]
- Pareto distribution of the form λT^k (k < 0)
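A minimal sketch of how an RCDF and a λT^k fit could be computed from a list of lifespans. The slide does not say how the authors fitted the curve; ordinary least squares on the log-log points is assumed here for illustration, and the function names `rcdf` and `fit_power_law` are hypothetical.

```python
import math
from bisect import bisect_left
from typing import List, Sequence, Tuple


def rcdf(lifespans: Sequence[float]) -> List[Tuple[float, float]]:
    """Return (t, fraction of peers with lifespan >= t) for each distinct t."""
    xs = sorted(lifespans)
    n = len(xs)
    return [(t, (n - bisect_left(xs, t)) / n) for t in sorted(set(xs))]


def fit_power_law(points: Sequence[Tuple[float, float]]) -> Tuple[float, float]:
    """Fit p = lam * t**k by least squares on (log t, log p); return (lam, k)."""
    lx = [math.log(t) for t, _ in points]
    ly = [math.log(p) for _, p in points]
    n = len(points)
    mx, my = sum(lx) / n, sum(ly) / n
    sxx = sum((x - mx) ** 2 for x in lx)
    sxy = sum((x - mx) * (y - my) for x, y in zip(lx, ly))
    k = sxy / sxx                    # slope of the log-log line
    lam = math.exp(my - k * mx)      # intercept, mapped back from log space
    return lam, k
```

A straight line on a log-log plot of the RCDF corresponds to the form λT^k, and a negative fitted `k` matches the slide's condition k < 0.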

Peer lifespan and P2P protocols
- Choosing among “acquaintances”:
  - When deciding whom to befriend
  - When responding to requests for references
- In most P2P protocols: random selection
- Peer lifespan fits a Pareto distribution
  - Pareto distributions ∈ UBNE class (Used Better than New in Expectation)
  - A peer’s expected remaining lifetime is directly proportional to its current age
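The proportionality claim can be checked with a short derivation. It is a sketch under two assumptions that the slides do not state: the survival function is exactly P(T > t) = λt^k for every age beyond t, and k < -1 so that the integral converges. The measured fit only covers lifespans in [1,300 sec, 3.5 days], so the result is indicative rather than exact.

```latex
\begin{align*}
E[\,T - t \mid T > t\,]
  &= \int_t^{\infty} \frac{P(T > u)}{P(T > t)}\,du
   = \int_t^{\infty} \left(\frac{u}{t}\right)^{k} du \\
  &= t^{-k}\left[\frac{u^{k+1}}{k+1}\right]_{t}^{\infty}
   = \frac{t}{-k-1}, \qquad k < -1 .
\end{align*}
```

The expected remaining lifetime grows linearly with the current age t, which is the UBNE property the slide relies on. For -1 ≤ k < 0 the integral diverges, so the expectation is unbounded on an untruncated Pareto tail.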

Some of the questions …
- How could we incorporate lifespan-based ideas into P2P systems?
- Potential gains in reduced maintenance overhead
- Effects on application performance
- …

Lifespan-based protocols
- Increased dependency as commitment to the community becomes clear
- Table (Protocol / Connect? / Recommend?), as far as it survives in the transcript:
  - LSPAN-1: Oldest / Random
  - LSPAN-2
  - LSPAN-3: Oldest & more available connections
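The two selection policies named for LSPAN-1 (connect to the oldest peers, recommend at random) can be sketched as follows. The cache layout (a mapping from peer id to age in seconds) and the function names are assumptions made for illustration; the slides do not describe the data structures used.

```python
import random
from typing import Dict, List, Optional


def pick_oldest(ages: Dict[str, float], count: int) -> List[str]:
    """Lifespan-based choice: the `count` peers with the largest current age."""
    return sorted(ages, key=lambda peer: ages[peer], reverse=True)[:count]


def pick_random(ages: Dict[str, float], count: int,
                rng: Optional[random.Random] = None) -> List[str]:
    """Baseline choice used by most P2P protocols: uniform random selection."""
    rng = rng or random.Random()
    peers = list(ages)
    return rng.sample(peers, min(count, len(peers)))


# Hypothetical cache of known peers: peer id -> age in seconds.
cache = {"a": 100.0, "b": 5000.0, "c": 900.0, "d": 40.0}

to_connect = pick_oldest(cache, 2)     # whom to befriend
to_recommend = pick_random(cache, 2)   # what to answer when asked for references
```

Preferring older peers follows from the UBNE property: under a Pareto lifespan, the peers that have already stayed longest are expected to stay longest from now on.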

Experimental setup
- Trace-driven simulation: the P2P simulator includes membership management and various query distribution, cache, and replication strategies
- Runs of one of the 20 collected traces for a period of 510,000 sec., ~36,577 peers
- Cold start; warm-up of ~80,000 sec. excluded
- ~1,000 peers under stable conditions
- Newer results were obtained using 4 traces (instead of 1)

Alternative protocols compared
- Unstructured Decentralized Protocol (UDP) ~ early Gnutella
  - Separate pools for cached pongs (per connection)
  - Pong replies include a random set of entries from the cache
- Hierarchical Decentralized Protocol (HDP) ~ new Gnutella, KaZaa
  - Leaf- and ultra-peers: leaves can only connect to ultras; ultras to anybody
  - To decide a peer’s role: trace information
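The HDP connection rule stated above is small enough to write down directly. This is a sketch of that one rule only; the role strings and the function name `can_connect` are hypothetical, and nothing here models how roles are assigned from the trace.

```python
ROLES = {"leaf", "ultra"}


def can_connect(initiator: str, target: str) -> bool:
    """HDP rule from the slide: leaves connect only to ultras; ultras to anybody."""
    if initiator not in ROLES or target not in ROLES:
        raise ValueError("role must be 'leaf' or 'ultra'")
    if initiator == "leaf":
        return target == "ultra"
    return True
```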

Comparing connection breakdowns
- Indicator of stability
- Lifespan-based protocols: more selective → fewer breakdowns
- Reductions
  - 42-43%: LSPAN-2
  - 26-30%: LSPAN-1 and LSPAN-3
- Saw-tooth shape → time-of-day patterns

Comparing connection rejections
- Does a preference for long-lived peers have to mean high rejection rates?
- True for LSPAN-2, although this may be a reasonable “cost”
- Still, for LSPAN-1 and LSPAN-3 the rate is low enough to be ignored
  - LSPAN-3: about 1 rejection per 17.58 hrs!

Comparing number of connections
- … not just rejections, what about the number of connections?
- LSPAN-1 and LSPAN-3: higher ratio of connections per peer
- Little benefit from checking available connections

A preview: Effects on applications
- Gains in scalability
- With random walkers & NCU (Neighboring Caching)
- Lifespan-based: 5 walkers; random topology: 16 walkers

Conclusions and future work
- Peer lifespan fits a Pareto distribution: current age can be used to predict lifespan
- Illustrative lifespan-based protocols
- Advantages of considering peers’ age in P2P protocols
- Possible research paths
  - Effect on query distribution and cache strategies
  - Lifespan-based strategies
  - Determining a peer’s age in decentralized P2P systems
  - Lifespan and DHTs