Small-Scale Peer-to-Peer Publish/Subscribe

Slides:



Advertisements
Similar presentations
Peer-to-Peer Infrastructure and Applications Andrew Herbert Microsoft Research, Cambridge
Advertisements

Dynamic Replica Placement for Scalable Content Delivery Yan Chen, Randy H. Katz, John D. Kubiatowicz {yanchen, randy, EECS Department.
Efficient Event-based Resource Discovery Wei Yan*, Songlin Hu*, Vinod Muthusamy +, Hans-Arno Jacobsen +, Li Zha* * Chinese Academy of Sciences, Beijing.
Alex Cheung and Hans-Arno Jacobsen August, 14 th 2009 MIDDLEWARE SYSTEMS RESEARCH GROUP.
P2P data retrieval DHT (Distributed Hash Tables) Partially based on Hellerstein’s presentation at VLDB2004.
Pastry Peter Druschel, Rice University Antony Rowstron, Microsoft Research UK Some slides are borrowed from the original presentation by the authors.
Scalable Content-Addressable Network Lintao Liu
Management of Uncertainty in Publish/Subscribe Systems Haifeng Liu Department of Computer Sceince University of Toronto.
Dynamo: Amazon's Highly Available Key-value Store Distributed Storage Systems CS presented by: Hussam Abu-Libdeh.
Small-Scale Peer-to-Peer Publish/Subscribe
Transactional Mobility in Distributed Content-Based Publish/Subscribe Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese.
Applications over P2P Structured Overlays Antonino Virgillito.
©NEC Laboratories America 1 Hui Zhang Samrat Ganguly Sudeept Bhatnagar Rauf Izmailov NEC Labs America Abhishek Sharma University of Southern California.
M ERCURY : A Scalable Publish-Subscribe System for Internet Games Ashwin R. Bharambe, Sanjay Rao & Srinivasan Seshan Carnegie Mellon University.
Carnegie Mellon University Complex queries in distributed publish- subscribe systems Ashwin R. Bharambe, Justin Weisz and Srinivasan Seshan.
Efficient Content Location Using Interest-based Locality in Peer-to-Peer Systems Presented by: Lin Wing Kai.
SCALLOP A Scalable and Load-Balanced Peer- to-Peer Lookup Protocol for High- Performance Distributed System Jerry Chou, Tai-Yi Huang & Kuang-Li Huang Embedded.
Squirrel: A decentralized peer- to-peer web cache Paul Burstein 10/27/2003.
Hermes: A Distributed Event- Based Middleware Architecture Peter Pietzuch and Jean Bacon 1st DEBS Workshop, Vienna,
Project Mimir A Distributed Filesystem Uses Rateless Erasure Codes for Reliability Uses Pastry’s Multicast System Scribe for Resource discovery and Utilization.
Distributed Publish/Subscribe Network Presented by: Yu-Ling Chang.
Alex King Yeung Cheung and Hans-Arno Jacobsen University of Toronto June, 24 th 2010 ICDCS 2010 MIDDLEWARE SYSTEMS RESEARCH GROUP.
Effects of Routing Computations in Content-Based Routing Networks with Mobile Data Sources Vinod Muthusamy, Milenko Petrovic, Hans-Arno Jacobsen University.
Publisher Mobility in Distributed Publish/Subscribe Systems Vinod Muthusamy, Milenko Petrovic, Dapeng Gao, Hans-Arno Jacobsen University of Toronto June.
Chord & CFS Presenter: Gang ZhouNov. 11th, University of Virginia.
Gil EinzigerRoy Friedman Computer Science Department Technion.
Supporting Disconnected Operations in Publish/Subscribe Systems Vinod Muthusamy Joint work with Milenko Petrovic, Ioana Burcea, H.-Arno Jacobsen, Eyal.
Chord: A Scalable Peer-to-peer Lookup Protocol for Internet Applications Xiaozhou Li COS 461: Computer Networks (precept 04/06/12) Princeton University.
Content-Based Routing in Mobile Ad Hoc Networks Milenko Petrovic, Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 18, 2005 MobiQuitous 2005.
1 Distributed Hash Tables (DHTs) Lars Jørgen Lillehovde Jo Grimstad Bang Distributed Hash Tables (DHTs)
MIDDLEWARE SYSTEMS RESEARCH GROUP MSRG.ORG Total Order in Content-based Publish/Subscribe Systems Joint work with: Vinod Muthusamy, Hans-Arno Jacobsen.
Distributed Automatic Service Composition in Large-Scale Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese Academy of.
An IP Address Based Caching Scheme for Peer-to-Peer Networks Ronaldo Alves Ferreira Joint work with Ananth Grama and Suresh Jagannathan Department of Computer.
1 Peer-to-Peer Technologies Seminar by: Kunal Goswami (05IT6006) School of Information Technology Guided by: Prof. C.R.Mandal, School of Information Technology.
Presentation: Architectural Design for Distributed Systems Objektorienteret netværkskom.
MIDDLEWARE SYSTEMS RESEARCH GROUP Adaptive Content-based Routing In General Overlay Topologies Guoli Li, Vinod Muthusamy Hans-Arno Jacobsen Middleware.
Chord Fay Chang, Jeffrey Dean, Sanjay Ghemawat, Wilson C. Hsieh, Deborah A. Wallach, Mike Burrows, Tushar Chandra, Andrew Fikes, Robert E. Gruber Google,
Information-Centric Networks10b-1 Week 10 / Paper 2 Hermes: a distributed event-based middleware architecture –P.R. Pietzuch, J.M. Bacon –ICDCS 2002 Workshops.
Information-Centric Networks Section # 10.2: Publish/Subscribe Instructor: George Xylomenos Department: Informatics.
Peer to Peer Network Design Discovery and Routing algorithms
Data Indexing in Peer- to-Peer DHT Networks Garces-Erice, P.A.Felber, E.W.Biersack, G.Urvoy-Keller, K.W.Ross ICDCS 2004.
Peter R Pietzuch and Jean Bacon Peer-to-Peer Overlay Networks in an Event-Based Middleware DEBS’03, San Diego, CA, USA,
Distributed Automatic Service Composition in Large-Scale Systems Songlin Hu*, Vinod Muthusamy +, Guoli Li +, Hans-Arno Jacobsen + * Chinese Academy of.
INTERNET TECHNOLOGIES Week 10 Peer to Peer Paradigm 1.
Malugo – a scalable peer-to-peer storage system..
Querying the Internet with PIER CS294-4 Paul Burstein 11/10/2003.
Congestion Avoidance with Incremental Filter Aggregation in Content-Based Routing Networks Mingwen Chen 1, Songlin Hu 1, Vinod Muthusamy 2, Hans-Arno Jacobsen.
Brocade: Landmark Routing on Overlay Networks
Data Management on Opportunistic Grids
Architecture and Algorithms for an IEEE 802
A Framework for Object-Based Event Composition in Distributed Systems
COS 461: Computer Networks
Distribution and components
CHAPTER 3 Architectures for Distributed Systems
Plethora: Infrastructure and System Design
Navneet Kumar Pandey1 Stéphane Weiss1 Roman Vitenberg1
Early Measurements of a Cluster-based Architecture for P2P Systems
SCOPE: Scalable Consistency in Structured P2P Systems
CS5412: Using Gossip to Build Overlay Networks
A Case for Mutual Notification
AWS Cloud Computing Masaki.
Distributed Publish/Subscribe Network
Distributed Hash Tables
Foundations for Highly-Available Content-based Publish/Subscribe Overlays Young Yoon, Vinod Muthusamy and Hans-Arno Jacobsen.
Indirect Communication Paradigms (or Messaging Methods)
Indirect Communication Paradigms (or Messaging Methods)
COS 461: Computer Networks
Chord and CFS Philip Skov Knudsen
Consistent Hashing and Distributed Hash Table
CS5412: Using Gossip to Build Overlay Networks
Presentation transcript:

Small-Scale Peer-to-Peer Publish/Subscribe Vinod Muthusamy, Hans-Arno Jacobsen University of Toronto July 17, 2005 P2PKM 2005 San Diego, CA

Small-Scale P2P Publish/Subscribe Motivation Many distributed hash table (DHT)-based peer-to-peer applications Storage (OceanStore, Past) Database (PIER) Naming service (Chord-DNS) Publish/subscribe (Scribe, P2P-ToPSS, Meghdoot) Typically designed for and evaluated on a large-scale Small-scale performance is important Even large systems start small Benefits even on small scale Small-scale case study of a DHT-based pub/sub protocol July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Agenda Background Publish/subscribe model Distributed hash tables Small-scale benefits Publish/subscribe on DHT Evaluation Conclusions July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Publish/subscribe model TSX Stock markets NASDAQ NYSE Simple interface Decoupled interaction Network efficiency Publisher Publisher AMGN=58 Publications IBM=84 ORCL=12 JNJ=58 HON=24 INTC=19 MSFT=27 Broker Network Notification Subscriptions Subscriber Subscriber Subscriptions: IBM > 85 ORCL < 10 JNJ > 60 July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Distributed hash tables Example: Pastry Distributed version of a hash table data structure Store (key, value) pairs Each node receives at most K/N keys Each node has about O(logN) routing state Lookups resolved with O(logN) hops July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Small-scale benefits Distributed pub/sub DHT Large-scale benefits (traditional focus) Multicast Distributed matching Organic (infrastructureless) scaling Self-organizing Small-scale benefits (untapped) (also large-scale benefits) Decoupled interaction Declarative bindings Incremental service deployment Late binding Effective use of commodity components (fault tolerance, load balancing) Incremental scalability July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Service oriented architecture (SOA) An illustration of small-scale benefits SOA: a distributed architecture Loosely coupled reusable components Enterprise service bus (ESB) comm. fabric Can be realized as a pub/sub broker network Pub/sub support in IBM Websphere, Sonic ESBs Example Three-tier Web server Communication between components through pubs and subs Decoupled interaction Declarative bindings Simple to add a Monitor component Incremental service deployment Late binding Application Server Enterprise Service Bus User Products Monitor Database Database July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Pub/sub case study P2P-ToPSS A pub/sub system built over DHT Large-scale performance previously studied [under submission] Evaluate small-scale benefits (of the same algorithm) now Subscription load balance July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Distributed multidimensional matching (DMM) 1) Pub/sub domain: 3) Network domain: price < 50 weight > 50 Subscription price = 10 weight = 70 Publication - 1 2) Spatial domain: 100 00 01 10 11 01 11 1 weight 00 10 DHT network used to map from tree nodes to peers price 100 July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Distributing the tree Let region r have z-code z, and tree node n(r) Store n(r) at peer p(r), where p(r) is where DHT would store key z Each peer performs mapping locally Hash function ensures even distribution Network price weight Region Subscription 1101 z-code price = … weight = … p/w p_w_1101 name/code AF3483BQ1 peer-id hash July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Load balancing Peers can be overloaded due to Excessive subscriptions storage Excessive publication traffic Overloaded peer delegates subscriptions to children in the DMM tree A local decision Network 00 01 p/w July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Evaluation SimPastry simulator (Microsoft Research) Pastry DHT Simulate network latencies, message costs Nodes connected in a LAN 1ms latency between every pair of nodes Subscriptions Attribute name + range E.g.: “20 < weight < 40” Metric Storage load of subscriptions July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe DHT performance 1000 random keys DHT successfully balances load July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Pub/sub performance weight price 1000 fine-grained subscriptions Load balancing distributes storage load July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Pub/sub performance weight price 1000 coarse-grained subscriptions Load balancing distributes but also increases storage load July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Observation Load balancing technique assumes available resources somewhere in the network Leads to runaway subscription delegation Common assumption in DHT applications CAN DHT’s active caching and replication Meghdoot’s hotspot circumvention In DHT, no knowledge of aggregate available resources Fundamental assumption of DHT applications not valid in small-scale networks July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale P2P Publish/Subscribe Conclusions Pub/sub and DHTs Proven benefits in large-scale network Untapped benefits in small-scale networks Common assumption on DHT applications Sufficient available resources in the network Small-scale networks break this assumption Future work Implement algorithms to estimate global resource availability Curb runaway subscription delegation with this information Evaluate other DHT applications in small-scale networks July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe

Small-Scale Peer-to-Peer Publish/Subscribe Q&A Small-Scale Peer-to-Peer Publish/Subscribe www.msrg.toronto.edu July 17, 2005 (P2PKM ’05) Small-Scale P2P Publish/Subscribe