A Scalable Content-Addressable Network

Slides:

Advertisements

Similar presentations

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.

Advertisements

CAN 1.Distributed Hash Tables a)DHT recap b)Uses c)Example – CAN.

P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.

Scalable Content-Addressable Network Lintao Liu

Peer-to-Peer Systems Chapter 25. What is Peer-to-Peer (P2P)? Napster? Gnutella? Most people think of P2P as music sharing.

CHORD – peer to peer lookup protocol Shankar Karthik Vaithianathan & Aravind Sivaraman University of Central Florida.

Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Schenker Presented by Greg Nims.

LightFlood: An Optimal Flooding Scheme for File Search in Unstructured P2P Systems Song Jiang, Lei Guo, and Xiaodong Zhang College of William and Mary.

Common approach 1. Define space: assign random ID (160-bit) to each node and key 2. Define a metric topology in this space,  that is, the space of keys.

A Scalable Content Addressable Network (CAN)

Peer to Peer File Sharing Huseyin Ozgur TAN. What is Peer-to-Peer?  Every node is designed to(but may not by user choice) provide some service that helps.

Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network (CAN) ACIRI U.C.Berkeley Tahoe Networks.

A Scalable Content Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, and Scott Shenker Presented by: Ilya Mirsky, Alex.

A Scalable Content-Addressable Network Authors: S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker University of California, Berkeley Presenter:

Distributed Lookup Systems

Overlay Networks EECS 122: Lecture 18 Department of Electrical Engineering and Computer Sciences University of California Berkeley.

1 A Scalable Content- Addressable Network S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Proceedings of ACM SIGCOMM ’01 Sections: 3.5 & 3.7.

SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.

1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.

Chord-over-Chord Overlay Sudhindra Rao Ph.D Qualifier Exam Department of ECECS.

1 CS 194: Distributed Systems Distributed Hash Tables Scott Shenker and Ion Stoica Computer Science Division Department of Electrical Engineering and Computer.

1 Load Balance and Efficient Hierarchical Data-Centric Storage in Sensor Networks Yao Zhao, List Lab, Northwestern Univ Yan Chen, List Lab, Northwestern.

Or, Providing Scalable, Decentralized Location and Routing Network Services Tapestry: Fault-tolerant Wide-area Application Infrastructure Motivation and.

Beacon Vector Routing: Scalable Point-to-Point Routing in Wireless Sensornets.

Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network ACIRI U.C.Berkeley Tahoe Networks 1.

Improving Data Access in P2P Systems Karl Aberer and Magdalena Punceva Swiss Federal Institute of Technology Manfred Hauswirth and Roman Schmidt Technical.

1CS 6401 Peer-to-Peer Networks Outline Overview Gnutella Structured Overlays BitTorrent.

Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker A Scalable, Content- Addressable Network ACIRI U.C.Berkeley Tahoe Networks 1.

Structured P2P Network Group14: Qiwei Zhang; Shi Yan; Dawei Ouyang; Boyu Sun.

1 A scalable Content- Addressable Network Sylvia Rathnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker Pirammanayagam Manickavasagam.

Roger ZimmermannCOMPSAC 2004, September 30 Spatial Data Query Support in Peer-to-Peer Systems Roger Zimmermann, Wei-Shinn Ku, and Haojun Wang Computer.

CONTENT ADDRESSABLE NETWORK Sylvia Ratsanamy, Mark Handley Paul Francis, Richard Karp Scott Shenker.

GeoGrid: A scalable Location Service Network Authors: J.Zhang, G.Zhang, L.Liu Georgia Institute of Technology presented by Olga Weiss Com S 587x, Fall.

Chord & CFS Presenter: Gang ZhouNov. 11th, University of Virginia.

Applied Research Laboratory David E. Taylor A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp, Scott Shenker.

Jonathan Walpole CSE515 - Distributed Computing Systems 1 Teaching Assistant for CSE515 Rahul Dubey.

Sylvia Ratnasamy (UC Berkley Dissertation 2002) Paul Francis Mark Handley Richard Karp Scott Shenker A Scalable, Content Addressable Network Slides by.

Using the Small-World Model to Improve Freenet Performance Hui Zhang Ashish Goel Ramesh Govindan USC.

Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.

1 Distributed Hash Tables (DHTs) Lars Jørgen Lillehovde Jo Grimstad Bang Distributed Hash Tables (DHTs)

Network Computing Laboratory Scalable File Sharing System Using Distributed Hash Table Idea Proposal April 14, 2005 Presentation by Jaesun Han.

Vincent Matossian September 21st 2001 ECE 579 An Overview of Decentralized Discovery mechanisms.

Ion Stoica, Robert Morris, David Karger, M. Frans Kaashoek, Hari Balakrishnan MIT and Berkeley presented by Daniel Figueiredo Chord: A Scalable Peer-to-peer.

Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.

A Scalable Content-Addressable Network (CAN) Seminar “Peer-to-peer Information Systems” Speaker Vladimir Eske Advisor Dr. Ralf Schenkel November 2003.

SIGCOMM 2001 Lecture slides by Dr. Yingwu Zhu Chord: A Scalable Peer-to-peer Lookup Service for Internet Applications.

Scalable Content- Addressable Networks Prepared by Kuhan Paramsothy March 5, 2007.

Peer to Peer A Survey and comparison of peer-to-peer overlay network schemes And so on… Chulhyun Park

CarNet/Grid: Scalable Ad-Hoc Geographic Routing Robert Morris MIT / LCS

P2P Group Meeting (ICS/FORTH) Monday, 28 March, 2005 A Scalable Content-Addressable Network Sylvia Ratnasamy, Paul Francis, Mark Handley, Richard Karp,

1 Distributed Hash Table CS780-3 Lecture Notes In courtesy of Heng Yin.

Plethora: Infrastructure and System Design. Introduction Peer-to-Peer (P2P) networks: –Self-organizing distributed systems –Nodes receive and provide.

Peer to Peer Network Design Discovery and Routing algorithms

Topologically-Aware Overlay Construction and Sever Selection Sylvia Ratnasamy, Mark Handley, Richard Karp, Scott Shenker.

Peer-to-Peer Networks 03 CAN (Content Addressable Network) Christian Schindelhauer Technical Faculty Computer-Networks and Telematics University of Freiburg.

LOOKING UP DATA IN P2P SYSTEMS Hari Balakrishnan M. Frans Kaashoek David Karger Robert Morris Ion Stoica MIT LCS.

Two Peer-to-Peer Networking Approaches Ken Calvert Net Seminar, 23 October 2001 Note: Many slides “borrowed” from S. Ratnasamy’s Qualifying Exam talk.

1 Distributed Hash Tables and Structured P2P Systems Ningfang Mi September 27, 2004.

P2P Search COP P2P Search Techniques Centralized P2P systems  e.g. Napster, Decentralized & unstructured P2P systems  e.g. Gnutella.

An overlay for latency gradated multicasting Anwitaman Datta SCE, NTU Singapore Ion Stoica, Mike Franklin EECS, UC Berkeley

CSCI 599: Beyond Web Browsers Professor Shahram Ghandeharizadeh Computer Science Department Los Angeles, CA

Zhichen Xu, Mallik Mahalingam, Magnus Karlsson

Early Measurements of a Cluster-based Architecture for P2P Systems

A Scalable content-addressable network

A Scalable, Content-Addressable Network

A Scalable Content Addressable Network

A Scalable, Content-Addressable Network

MIT LCS Proceedings of the 2001 ACM SIGCOMM Conference

Presentation transcript:

A Scalable Content-Addressable Network University of Athens Department of Informatics and Telecommunications Postgraduate Program Course: Distributed Systems Supervisor: Professor Alex Delis A Scalable Content-Addressable Network In Proceedings of ACM SIGCOMM 2001 S. Ratnasamy, P. Francis, M. Handley, R. Karp, S. Shenker Zacharias Boufidis boufidis@di.uoa.gr June 3rd, 2002

Scope CAN: Content-Addressable Network: A distributed infrastructure for providing hash table-like functionality on Internet-like scales A distributed indexing scheme for mapping file names to location Content-Addressable Network: storing content at a “point” / routing between points Department of Informatics and Telecommunications - University of Athens

Outline Motivation / Objectives CAN Design: insertion, retrieval, deletion Design Optimizations Performance Evaluation Related Work Discussion Department of Informatics and Telecommunications - University of Athens

Motivation - Applicability P2P file sharing systems - challenge: huge amount of resources - drawback: centralized indexing scheme (Napster: single point of failure/control, Gnutella: flooding => scalabilitty) Large-scale storage management systems (e.g OceanStore) - Efficient insertion and retrieval of content Location-independent naming schemes (vs. DNS) - Naming scheme decoupled from the name resolution process Application-layer multicasting - No need for distribution trees - Scalability (group sizes larger than 1000 nodes, no single source service model)

Objectives Indexing mechanism: totally distributed scalable (per-node storage and communication costs grow slowly as the network size grows) fault-tolerant self-organizing query speed (low-latency - efficient routing) locality (queries for nearby nodes stay local) guarantees for content retrieval Department of Informatics and Telecommunications - University of Athens

Design Overview Basic idea: a virtual d-dimensional Cartesian coordinate space dynamically partioned => zone per node Data stored as <key,value> pairs Insertion: Key k uniform hash function point P in a zone Retrieval: Key k uniform hash function point P in a zone or routing the request until it reaches the node in which zone P lies

Assumptions Associated DNS domain name - external mechanism upon joining A priori knowledge of the deterministic hash function Department of Informatics and Telecommunications - University of Athens

Design - CAN Routing Routing table: IP address & virtual coordinate zone of neighbours (coordinate zones adjoining) Greedy forwarding (recursively) of msg[P(k), dst_coordinates] to neighbour with coordinates closest to P Department of Informatics and Telecommunications - University of Athens

Design - CAN Construction (1) Node Insertion (of N1) 1. N1: find_anyCANnode(N2) 2. N1: choose_point(rand, P) 3. Route to N3 owning P 4. N1: JOIN request to N3 5. N3: zone_splitting(), send(<key,value> from half_zone) to N1, send(neighbout_info (IPaddr)) to N1, inform_neighrbours Pros: only neighbours affected => scalability

Design - CAN Construction (2) Node departure 1. Explicit handover of a zone and <k,v> pairs => possible merge of zones 2. Node failure: Robustness: immediate takeover based on metrics (smallest current zone, least-loaded node, connectivity, etc.) (-) loss of <k,v> pairs solution: periodic update messages Expanding ring seach first to avoid inconsistencies Background zone-reassignment => 1 zone/node

Design Considerations Low per-node state [O(d), independent of n] vs. short path lengths [O(d (n^1/d)) hops] d: dimensions, n: nodes Design Goal: lookup latency = (Avg. # CAN hops) x (Avg. per hop latency) comparable to IP latency => techniques for reducing the CAN routing latency => (+) robustness (routing, data availability) (-) per-node state (-) system complexity

Design Improvements (1) Multi-dimensioned coordinate spaces - More dimensions => more neighbours per node Goal: reduction of routing path length (# hops) => reduction of path latency (+) routing fault tolerance (-) more state per node (routing table) Multiple realities - More independent coordinate spaces => allocation of multiple different zones per node (replication of hash tables on every reality) => selection of min CAN hop route => reduction of Avg. path length =>path latency (+) data availability (+) enhanced routing fault tolerance (-) more state per node The first solution outperforms in terms of routing efficiency

Design Improvements (2) Refinement of CAN routing metrics RTT-weighted routing: - Goal: reduction of per-hop latency metric: progress(Cartesian distance) / RTT - Simulation results: 24-40% improvement Department of Informatics and Telecommunications - University of Athens

Design Improvements (3) Overloading coordinate zones - When joining, zone sharing (if possible) instead of splitting - State info: neighbour list + peer list Selection of a peer’s neighbour based on measured lowest RTT - Hash tables: replication vs. Partitioning tradeoffs: availability, size of data stores, consistency (+) Reduced # hops (+) Reduced per hop latency (+) Improved fault tolerance (-) System complexity (-) Additional control traffic reduced path length - Simulation results: 45% improvement of per-hop latency

Design Improvements (4) Multiple hash functions - Mapping a single key to multiple nodes (replication) => parallel queries (+) Reduced query latency (+) Increased data availability (-) Increased size of the <key, value> database (-) Increased query traffic Topologically-sensitive CAN construction - Node insertion based on RTT from landmarks (instead of random insertion) metric: latency stretch = CAN latency / network level latency (+) Reduced path latency (-) Uneven load distribution => need for load balancing

Design Improvements (5) Uniform partitioning - Volume-based zone splitting (+) Achieves some form of load balancing (-) «Hot spot» problem: some <key, value> pairs are more popular Caching & Replication for «hot spot» management - Caching of recently accessed keys (belonging to other nodes) - Replication: active pushing out of popular keys to neighbours within a region (+) Increased data availability (+) Reduced query latency (+) Load balancing Department of Informatics and Telecommunications - University of Athens

Performance Evaluation (1) Critical factors: 1. increase in # of dimensions d => reduction of path length 2. Use of RTT-weighted routing => optimization of next-hop forwarding => reduction of path latency Department of Informatics and Telecommunications - University of Athens

Performance Evaluation (2) Effect of link delay distribution on latency stretch: Remarks: 1. Increase in # of nodes => slow increase of latency stretch (i.e., slow increase of total path latency) 2. Random delay: the largest latency stretch 3. Larger backbone => reduced density of CAN nodes => less effect of RTT-weighted routing => degraded gains Department of Informatics and Telecommunications - University of Athens

Related Work LS and DV routing Hierarchical routing (BGP) (-) Widespread, frequent (in case of topology changes) dissemination of topology info Hierarchical routing (BGP) (-) Fault tolerance CAN: Self-configured routing Plaxton’s Algorithm (~prefix-based routing) - Similar routing hops & routing table size (-) Neighbour discovery (global knowledge of topology) Geographic routing (-) Geographic forwarding needs real-world location service - difficulty in implementing neighbour relationships (no radio coverage feature available in CANs)

Conclusion Content-Addressable Networks - Added Value: Scalable routing and efficient indexing Completely distributed, self-organizing, fault tolerant Department of Informatics and Telecommunications - University of Athens

Discussion (1) Paper Evaluation Plus: - Novelty - Comparison to best rival scheme & other competing indexing schemes Minus: - Limited evaluation criteria - Drawbacks of the proposed solution? -“Chalk and Cheese”: indexing techniques & routing algorithms Department of Informatics and Telecommunications - University of Athens

Discussion (2) Evaluation of indexing technique - Open issues - Index length? - Index construction time? (Convergence speed) - Index construction = f(size of data store)? - Index update cost? - Extensibility? (types of queries supported, apart from keyword search) - Memory/processing requirements? - Effect of caches compared to “cold start”? - Stability (concurrent joins) &Consistencies: a) intermediate consistencies b) simultaneous updates? - Ease of implementation?

Discussion (3) Evaluation of routing algorithm - Open issues - Routing loops? - Control of routing paths? (recursive routing) - Routing of query responses? Department of Informatics and Telecommunications - University of Athens

Discussion (4) Other Issues - Security (DoS attacks, AAA) - Parameter tuning needed to achieve scalability (Cannot vary d as n increases - n not known by any node) - CAN maintenance protocol overhead? (Cost of update operation) - Accommodation of administrative boundaries? (handling of key value pairs?) - Initial knowledge of the deterministic hash function? Ways to be changed dynamically? Implications? (total reconstruction of the CAN?) - Specification of inter-update times, caching TTL values, etc.

Discussion (5) Enhancements - Initialization of alternate paths from intermediate nodes - Employment of hierarchical model or islands of CANs: - The uniform manipulation fo CAN nodes does not represent the state of the Internet - even P2P systems need not be purel P2P - some degree of loose organization might be required - Dynamic parameter tuning to achieve scalability & efficient routing - Landmark hierarchal routing or GLS spatial hierarchy? - Stronger coupling of routing algorithms to underlying topology and node capability (some knowledge about the network could really help) - Derivation of appropriate network-layer models - QoS-routing based on CANs (idea: hashing IPaddr & ToS field, use one reality per QoS metric, etc.)

Thank You! Department of Informatics and Telecommunications - University of Athens