The Impact of DHT Routing Geometry on Resilience and Proximity Krishna Gummadi, Ramakrishna Gummadi, Sylvia Ratnasamy, Steve Gribble, Scott Shenker, Ion.

Slides:



Advertisements
Similar presentations
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
Advertisements

Peer to Peer and Distributed Hash Tables
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Kademlia: A Peer-to-peer Information System Based on the XOR Metric.
1 Greedy Forwarding in Dynamic Scale-Free Networks Embedded in Hyperbolic Metric Spaces Dmitri Krioukov CAIDA/UCSD Joint work with F. Papadopoulos, M.
Topologically-Aware Overlay Construction and Server Selection Sylvia Ratnasamy, Mark Handly, Richard Karp and Scott Shenker Presented by Shreeram Sahasrabudhe.
Kademlia: A Peer-to-peer Information System Based on the XOR Metric Petar Mayamounkov David Mazières A few slides are taken from the authors’ original.
Lecture 5 - Routing On the Flat Labels M.Sc Ilya Nikolaevskiy Helsinki Institute for Information Technology (HIIT)
Xiaowei Yang CompSci 356: Computer Network Architectures Lecture 22: Overlay Networks Xiaowei Yang
Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.
Presented by Elisavet Kozyri. A distributed application architecture that partitions tasks or work loads between peers Main actions: Find the owner of.
1 Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan (Stanford University) Krishna Gummadi (U. of Washington) Hector Garcia-Molina.
The Impact of DHT Routing Geometry on Resilience and Proximity New DHTs constantly proposed –CAN, Chord, Pastry, Tapestry, Plaxton, Viceroy, Kademlia,
Topics in Reliable Distributed Systems Lecture 2, Fall Dr. Idit Keidar.
Looking Up Data in P2P Systems Hari Balakrishnan M.Frans Kaashoek David Karger Robert Morris Ion Stoica.
P2P: Advanced Topics Filesystems over DHTs and P2P research Vyas Sekar.
The Impact of DHT Routing Geometry on Resilience and Proximity Krishna Gummadi, Ramakrishna Gummadi, Sylvia Ratnasamy, Steve Gribble, Scott Shenker, Ion.
SkipNet: A Scalable Overlay Network with Practical Locality Properties Nick Harvey, Mike Jones, Stefan Saroiu, Marvin Theimer, Alec Wolman Microsoft Research.
Idit Keidar, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Lecture 2: Peer-to-Peer.
Aggregating Information in Peer-to-Peer Systems for Improved Join and Leave Distributed Computing Group Keno Albrecht Ruedi Arnold Michael Gähwiler Roger.
1 Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan (Stanford University) Krishna Gummadi (U. of Washington) Hector Garcia-Molina.
Topics in Reliable Distributed Systems Fall Dr. Idit Keidar.
1 CS 194: Distributed Systems Distributed Hash Tables Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer.
1 Routing as a Service Karthik Lakshminarayanan (with Ion Stoica and Scott Shenker) Sahara/i3 retreat, January 2004.
1 Characterizing Selfishly Constructed Overlay Routing Networks March 11, 2004 Byung-Gon Chun, Rodrigo Fonseca, Ion Stoica, and John Kubiatowicz University.
File Sharing : Hash/Lookup Yossi Shasho (HW in last slide) Based on Chord: A Scalable Peer-to-peer Lookup Service for Internet ApplicationsChord: A Scalable.
Topology-Aware Overlay Networks By Huseyin Ozgur TAN.
Internet Indirection Infrastructure (i3) Ion Stoica, Daniel Adkins, Shelley Zhuang, Scott Shenker, Sonesh Surana UC Berkeley SIGCOMM 2002.
Distributed Hash Table Systems Hui Zhang University of Southern California.
1 PASTRY. 2 Pastry paper “ Pastry: Scalable, decentralized object location and routing for large- scale peer-to-peer systems ” by Antony Rowstron (Microsoft.
CS An Overlay Routing Scheme For Moving Large Files Su Zhang Kai Xu.
DHTs and Peer-to-Peer Systems Supplemental Slides Aditya Akella 03/21/2007.
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks IPTPS Feb. 25, 2005 Byung-Gon Chun, Ben Y. Zhao, and John Kubiatowicz.
The Impact of DHT Routing Geometry on Resilience and Proximity K. Gummadi, R. Gummadi..,S.Gribble, S. Ratnasamy, S. Shenker, I. Stoica.
Optimisation des DHT à partir des propriétés physiques, logiques et sociologiques des clients Pierre Fraigniaud CNRS LRI, Univ. Paris-Sud
Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan(Stanford University) Krishna Gummadi(University of Washington) Hector Garcia-Molina(Stanford.
Canon in G Major: Designing DHTs with Hierarchical Structure Prasanna Ganesan, Krishna Gummadi, Hector Garcia-Molina ICDCS2004 報告者 : 陳奕鈞.
Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
Paper Survey of DHT Distributed Hash Table. Usages Directory service  Very little amount of information, such as URI, metadata, … Storage  Data, such.
1 Distributed Hash Table CS780-3 Lecture Notes In courtesy of Heng Yin.
Idit Keidar, Principles of Reliable Distributed Systems, Technion EE, Spring Principles of Reliable Distributed Systems Lecture 2: Distributed Hash.
Location aware CHORD Ashwin, Vivek, Manu CS-7460 Project Presentation.
A Comparative Study of the DNS Design with DHT-Based Alternatives 95/08/31 Chen Chih-Ming.
Peer to Peer Network Design Discovery and Routing algorithms
CS 268: Computer Networking
Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.
A Simulation-Based Study of Overlay Routing Performance CS 268 Course Project Andrey Ermolinskiy, Hovig Bayandorian, Daniel Chen.
15-744: Computer Networking L-16 P2P/DHT. 2 Overview P2P Lookup Overview Centralized/Flooded Lookups Bittorrent Routed Lookups – Chord Comparison of DHTs.
Plethora: A Locality Enhancing Peer-to-Peer Network Ronaldo Alves Ferreira Advisor: Ananth Grama Co-advisor: Suresh Jagannathan Department of Computer.
Data Structure of Chord  For each peer -successor link on the ring -predecessor link on the ring -for all i ∈ {0,..,m-1} Finger[i] := the peer following.
CS694 - DHT1 Distributed Hash Table Systems Hui Zhang University of Southern California.
Peer-to-Peer Protocol (P2PP) Salman Baset, Henning Schulzrinne Columbia University.
Incrementally Improving Lookup Latency in Distributed Hash Table Systems Hui Zhang 1, Ashish Goel 2, Ramesh Govindan 1 1 University of Southern California.
Distributed Hash Tables (DHT) Jukka K. Nurminen *Adapted from slides provided by Stefan Götz and Klaus Wehrle (University of Tübingen)
Chord: A Scalable Peer-to-Peer Lookup Service for Internet Applications * CS587x Lecture Department of Computer Science Iowa State University *I. Stoica,
Internet Indirection Infrastructure (i3)
Distributed Hash Tables
Scott Shenker and Ion Stoica April 11, 2005
Impact of Neighbor Selection on Performance and Resilience of Structured P2P Networks Sushma Maramreddy.
Improving and Generalizing Chord
Turning Heterogeneity into an Advantage in Overlay Routing
DHT Routing Geometries and Chord
15-744: Computer Networking
Exploiting Routing Redundancy via Structured Peer-to-Peer Overlays
Consistent Hashing and Distributed Hash Table
P2P: Distributed Hash Tables
Kademlia: A Peer-to-peer Information System Based on the XOR Metric
Presentation transcript:

The Impact of DHT Routing Geometry on Resilience and Proximity Krishna Gummadi, Ramakrishna Gummadi, Sylvia Ratnasamy, Steve Gribble, Scott Shenker, Ion Stoica Presented by Karthik Lakshminarayanan at P2P Systems class (Slides liberally borrowed from Krishna’s SIGCOMM talk)

Motivation New DHTs constantly proposed –CAN, Chord, Pastry, Tapestry, Viceroy, Kademlia, Skipnet, Symphony, Koorde, Apocrypha, Land, Bamboo, ORDI … Each is extensively analyzed but in isolation Each DHT has many algorithmic details making it difficult to compare Goals: a)Separate fundamental design choices from algorithmic details b)Understand their effect on reliability and efficiency

Approach:Component-based analysis Break DHT design into independent components Analyze impact of each component choice separately –compare with black-box analysis: benchmark each DHT implementation rankings of existing DHTs vs. hints on better designs

Different components of analysis Two types of components –Routing-level : neighbor & route selection –System-level : caching, replication, querying policy etc. Separating “routing” and “system” level issues –Good to understand them in isolation –Cons of this approach?

Outline DHT Design Compare DHT Routing Geometries Geometry’s impact on Resilience Geometry’s impact on Proximity

Three aspects of a DHT design 1)Geometry: a graph structure that inspires a DHT design –Tree, Hypercube, Ring, Butterfly, Debruijn 2)Distance function: captures a geometric structure –d(id1, id2) for any two node identifiers 3)Algorithm: rules for selecting neighbors and routes using the distance function

Chord DHT has Ring Geometry

Chord Distance function captures Ring Nodes are points on a clock-wise Ring d(id1, id2) = length of clock-wise arc between ids = (id2 – id1) mod N d(100, 111) =

CAN => Hypercube Geometry d(id1, id2) = #differing bits between id1 and id2 Nodes are the corners of a hypercube d(001, 111)= 2

PRR => Tree Nodes are leaves in a binary tree d(id1, id2) = height of smallest sub-tree with ids = logN – length of prefix_match(id1, id2) d(000, 011) = h = 2 2

Geometry Vs Algorithm Algorithm : exact rules for selecting neighbors, routes –Chord, CAN, PRR, Tapestry, Pastry etc. –Inspired by geometric structures like Ring, Hyper-cube, Tree Geometry : an algorithm’s underlying structure –Distance function is the formal representation of Geometry –Chord, Symphony => Ring –Many algorithms can have same geometry

Is the notion of Geometry clear? Notion of geometry is vague (as the authors admit) It is really a distance function on an ID- space –Hypercube is a special case of XOR! Possible formal definitions?

Chord Neighbor and Route selection Algorithms Neighbor selection: i th neighbor at 2 i distance Route selection: pick neighbor closest to destination d(000, 001) = 1 d(000, 010) = 2 d(000, 001) = 4 110

Geometry => Flexibility => Performance Geometry captures flexibility in selecting algorithms Flexibility is important for routing performance –Flexibility in selecting routes leads to shorter, reliable paths –Flexibility in selecting neighbors leads to shorter paths

Outline Routing GeometryRouting Geometry Comparing DHT Geometries Geometry’s impact on Resilience Geometry’s impact on Proximity

Geometries considered GeometryAlgorithm RingChord, Symphony HypercubeCAN TreePRR Hybrid = Tree + RingTapestry, Pastry XOR d(id1, id2) = id1 XOR id2 Kademlia

Route selection flexibility allowed by Ring Geometry Chord algorithm picks neighbor closest to destination A different algorithm picks the best of alternate paths

Neighbor selection flexibility allowed by Ring Geometry Chord algorithm picks i th neighbor at 2 i distance A different algorithm picks i th neighbor from [2 i, 2 i+1 )

Metrics for flexibility FNS: Flexibility in Neighbor Selection = number of node choices for a neighbor FRS: Flexibility in Route Selection = avg. number of next-hop choices for all destinations Constraints for neighbors and routes –select neighbors to have paths of O(logN) –select routes so that each hop is closer to destination

Flexibility of Ring d(000, 111) = 7 d(001, 111) = 6 d(011, 111) = 4 d(101, 111) = 2 logN neighbors at exponential distances FNS = 2 i-1 for i th neighbor Route along the circle in clock-wise direction

Flexibility for Tree h = 2 h = 1 h = 3 logN neighbors in sub-trees of varying heights FNS = 2 i-1 for i th neighbor of a node Route to a smaller sub-tree with destination;FRS=1

Flexibility for Hypercube Routing to next hop fixes one bit FRS =Avg. (#bits destination differs in)=logN/2 logN neighbors differing in exactly one bit; FNS= d(000, 011) = 2d(001, 011) = 1 d(010, 011) = 1 d(010, 011) = 3

Summary of flexibility analysis FlexibilityOrdering of Geometries Neighbors (FNS) Hypercube << Tree, XOR, Ring, Hybrid (1) (2 i-1 ) Routes (FRS) Tree << XOR, Hybrid < Hypercube < Ring (1) (logN/2) (logN/2) (logN) How relevant is flexibility for DHT routing performance?

Outline Routing GeometryRouting Geometry Comparing DHT GeometriesComparing DHT Geometries Geometry’s impact on Resilience Geometry’s impact on Proximity

Static Resilience Two aspects of robust routing Dynamic Recovery : how quickly routing state is recovered after failures Static Resilience : how well the network routes before recovery finishes –captures how quickly recovery algorithms need to work –depends on FRS Evaluation: Fail a fraction of nodes, without recovering any state Metric: % Paths Failed

Does flexibility affect Static Resilience? Tree << XOR ≈ Hybrid < Hypercube < Ring

Static Resilience: Summary Tree << XOR ≈ Hybrid < Hypercube < Ring –What about trees with 2 neighbors? Addition of sequential neighbors helps resilience, but increases stretch Sequential neighbors offer more benefit, again at the cost of increased stretch Flexibility in Route Selection matters for Static Resilience

Outline Routing GeometryRouting Geometry Comparing flexibility of DHT GeometriesComparing flexibility of DHT Geometries Geometry’s impact on ResilienceGeometry’s impact on Resilience Geometry’s impact on Proximity –Overlay Path Latency –Local Convergence

Analysis of Overlay Path Latency Goal: Minimize end-to-end overlay path latency Both FNS and FRS can reduce latency –Tree has FNS, Hypercube has FRS, Ring & XOR have both Evaluation: Using Internet latency distributions

Problems with existing Network Models How to assign edge latencies to network topologies? –topology models: GT-ITM, Power-law, Mercator, Rocketfuel –no edge latency models, even for measured topologies Solution : A model using only latency distribution seen by a typical node

Simulations using latency distribution only SimulateCompute Simulated Overlay ≈ Computed Overlay Path Latency Distribution Path Latency Distribution 1) Topology, Edge Latencies 2) Latency Distribution

Which is more useful: FNS or FRS? Plain << FRS << FNS ≈ FNS+FRS Neighbor Selection is much better than Route Selection

Proximity results: Summary Using neighbor selection is much better than using route selection flexibility Performance of FNS/FRS is independent of geometry beyond its support for neighbor selection In absolute terms, proximity techniques perform well (stretch of <2)

Local convergence: Summary Flexibility in neighbor selection helps much better than that in route selection Relevance of FRS depends on whether FNS restricted to a k-random sample closely approximates ideal FNS

Limitations Notion of geometry is vague (as the authors admit) – it is really a distance function on an ID- space –Hypercube is a special case of XOR! Not considered other factors that might matter –algorithmic details, symmetry in routing table entries Metrics under consideration can bias results – eg. In ring, do not distinguish between OPT and slightly sub-optimal paths