Navneet Kumar Pandey1 Stéphane Weiss1 Roman Vitenberg1

Presentation transcript:

Distributed event aggregation for content-based Publish/Subscribe systems
Navneet Kumar Pandey¹, Stéphane Weiss¹, Roman Vitenberg¹, Kaiwen Zhang², Hans-Arno Jacobsen²
¹University of Oslo, ²University of Toronto

Motivation: Intelligent Transport System (ITS)
Information providers: road sensors, crowdsourced mobile apps
Information seekers: commuters, police, first responders, radio networks, etc.
Aggregate subscriptions:
  Count the number of cars passing a street light per hour
  Average speed of cars on a road segment per day
Non-aggregate subscriptions:
  Accident reports
  Traffic violation reports
Image: http://www.wired.com/images_blogs/autopia/2012/08/12A914.jpg

Aggregation in pub/sub
Pub/sub is well known for efficient content filtering and dissemination for distributed event sources and sinks.
However, pub/sub does not support aggregation, which is required in emerging applications.
Our primary objective is to retain the traditional pub/sub focus on low communication cost, while adding support for aggregation.

Contributions: aggregation in pub/sub
We propose a framework and baseline approaches for aggregation in content-based pub/sub systems (CBPS).
We show how the relative performance of the baseline approaches varies with workload properties.
We propose a per-broker distributed adaptive approach.

Advertisement-based pub/sub model
[Figure: broker overlay with brokers Bp, Bq, BI and BS between publishers and a subscriber. Labels: advertisement A[val,>,4], publication P[val,8], subscription S[val,>,3], Subscription Delivery Tree (SDT).]

Comparison with stream processing
Aggregation in stream processing | Aggregation in pub/sub
Requires a global view of the topology | Topology is not known to individual broker nodes
Requires a priori knowledge of publication sources | Publication sources and sinks are dynamic
Needs a control layer | Brokers are loosely coupled
Usually has a static query plan | SDTs are dynamic and determined by the pub/sub implementation
Optimized for continuous data streams | Publications come at an irregular rate

Proposed aggregation framework
Publication filtering procedure (PFP)
Subscription: {RoadID = 101, speed > 10, op = 'avg', duration (ω) = 2 hours, shift size (δ) = 1 hour}
[Figure: timeline marked 1, 2, 3 after the subscription, showing notification window ranges (NWR) NWR1, NWR2, NWR3 and publications Pub1, Pub2, Pub3.]
A single publication can participate in several NWRs, even for the same subscription.
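The statement that one publication can fall into several NWRs can be illustrated with a minimal sketch. It assumes, beyond what the slide states, that windows are half-open intervals [k·δ, k·δ + ω) counted from the subscription time; the function name `matching_windows` is hypothetical.

```python
import math
from typing import List


def matching_windows(t: float, omega: float, delta: float) -> List[int]:
    """Return the indices k of all windows [k*delta, k*delta + omega) containing time t.

    Assumption (not from the slides): window 0 starts at the subscription
    time, which is taken as t = 0, and windows are half-open.
    """
    if t < 0:
        return []
    last = math.floor(t / delta)                          # latest window that has started
    first = max(0, math.floor((t - omega) / delta) + 1)   # earliest window not yet closed
    return list(range(first, last + 1))


# Subscription from the slide: duration 2 hours, shift size 1 hour.
for t in (0.5, 1.5, 2.5):
    print(t, matching_windows(t, omega=2.0, delta=1.0))
```

With ω = 2 and δ = 1, every publication after the first hour belongs to two overlapping windows, which is why the number of matching NWRs matters for the communication cost.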

Proposed aggregation framework
Publication filtering procedure (PFP)
Initial computation procedure (ICP)
[Figure: timeline marked 1, 2, 3 after the subscription, showing notification window ranges (NWR) NWR1, NWR2, NWR3 and publications Pub1, Pub2, Pub3.]
Outgoing messages: { avg(Pub1, Pub2, Pub3), avg(Pub2, Pub3) }
Outgoing messages: { avg(Pub1, Pub2), avg(Pub2), Pub3 }
Processing start time presents a trade-off between communication cost and end-to-end delay.

Proposed aggregation framework
Publication filtering procedure (PFP)
Initial computation procedure (ICP)
Recurrent processing procedure (RPP)
[Figure: brokers Bp and Bq produce avgp and avgq; broker BI produces avgpq after a collection delay.]
Collection delay is another parameter affecting the delay-communication trade-off.
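Combining avgp and avgq into avgpq needs more than the two averages themselves. A minimal sketch of one common way to do this, assuming (the slides do not say how partial results are encoded) that each broker ships a sum and a count; the class `PartialAvg` is hypothetical.

```python
from dataclasses import dataclass
from typing import Iterable


@dataclass(frozen=True)
class PartialAvg:
    """Mergeable partial result for 'avg': a running sum and a count."""
    total: float
    count: int

    @staticmethod
    def of(values: Iterable[float]) -> "PartialAvg":
        items = list(values)
        return PartialAvg(float(sum(items)), len(items))

    def merge(self, other: "PartialAvg") -> "PartialAvg":
        # Sums and counts add up, so merging is exact regardless of order.
        return PartialAvg(self.total + other.total, self.count + other.count)

    def value(self) -> float:
        return self.total / self.count


avg_p = PartialAvg.of([3, 5])   # computed at Bp
avg_q = PartialAvg.of([9])      # computed at Bq
avg_pq = avg_p.merge(avg_q)     # merged at BI after the collection delay
print(avg_pq.value())
```

Averaging the two averages directly would give (4 + 9) / 2 = 6.5 instead of 17 / 3, which is why the sketch carries the count along.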

Late aggregation approach
[Figure: publications P[val,3] and P[val,5] enter at Bp, P[val,9] and P[val,2] enter at Bq; brokers BI and BS lie on the path to the subscriber of Smin[val,>,2], who receives P[valmin,3].]
Messages exchanged in Late aggregation: 6
The Late approach aggregates messages at subscriber-edge brokers.

Early aggregation approach
[Figure: publications P[val,3] and P[val,5] enter at Bp, P[val,9] and P[val,2] enter at Bq; Bp emits P[valmin,3], Bq emits P[valmin,9], and P[valmin,3] travels on through BI and BS to the subscriber of Smin[val,>,2].]
Messages exchanged in Late aggregation: 6
Messages exchanged in Early aggregation: 3
The Early approach aggregates messages at publisher-edge brokers.

Early does not always outperform Late
[Figure: same publications as before (P[val,3], P[val,5] at Bp; P[val,9], P[val,2] at Bq) with three subscriptions Smin[val,>,2], Smax[val,>,2] and Scount[val,>,2]. Bp emits P[valmin,3], P[valmax,5], P[valcount,2]; Bq emits P[valmin,9], P[valmax,9], P[valcount,1]; the merged results are P[valmin,3], P[valmax,9], P[valcount,3].]
Late aggregation: 6 messages exchanged
Early aggregation: 9 messages exchanged
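The message counts for the two baselines can be reproduced with a small counting sketch. It assumes a fixed two-hop path (edge broker to BI, BI to BS) and counts only broker-to-broker messages; the function names and the tuple encoding of subscriptions are hypothetical.

```python
from typing import Callable, Dict, List, Tuple

Subscription = Tuple[str, Callable[[int], bool]]  # (aggregation operator, predicate)

# Publications arriving at each publisher-edge broker, as in the slide.
PUBLICATIONS: Dict[str, List[int]] = {"Bp": [3, 5], "Bq": [9, 2]}
HOPS = 2  # assumed path length: edge broker -> BI -> BS


def late_messages(subs: List[Subscription]) -> int:
    """Late: each publication matching any subscription crosses every hop unaggregated."""
    matching = sum(
        1
        for values in PUBLICATIONS.values()
        for v in values
        if any(pred(v) for _, pred in subs)
    )
    return matching * HOPS


def early_messages(subs: List[Subscription]) -> int:
    """Early: each edge broker sends one partial result per subscription it has
    matches for; BI then sends one merged result per subscription to BS."""
    edge_to_bi = 0
    active = set()
    for values in PUBLICATIONS.values():
        for index, (_, pred) in enumerate(subs):
            if any(pred(v) for v in values):
                edge_to_bi += 1
                active.add(index)
    return edge_to_bi + len(active)


def over_two(v: int) -> bool:
    return v > 2


ONE_SUB = [("min", over_two)]
THREE_SUBS = [("min", over_two), ("max", over_two), ("count", over_two)]
print(late_messages(ONE_SUB), early_messages(ONE_SUB))        # 6 3
print(late_messages(THREE_SUBS), early_messages(THREE_SUBS))  # 6 9
```

Late cost grows with the number of matching publications, while Early cost grows with the number of aggregate subscriptions, so adding subscriptions over the same publications tips the balance toward Late.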

Comparison between Early and Late
Several parameters affect the performance of our baselines:
Increasing parameter | Favors
Publication matching rate | Early
Matching number of NWRs | Late
Overlap among aggregate subscriptions |
Ratio between aggregate and regular subscriptions |
Reducing the communication cost requires an adaptive solution.

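Benefits of adaptive aggregation
[Figure: publications P[val,3] and P[val,5] enter at Bp, P[val,9] and P[val,2] enter at Bq; subscriptions Smin[val,>,2] and S[val,>,6] are served through BI and BS. Brokers are marked BA or BF; P[valmin,3], P[valmin,9] and P[val,9] appear on the links.]
Messages exchanged: Late 6, Early 5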

Benefits of adaptive aggregation
[Figure: same scenario with per-broker modes; Bp emits P[valmin,3], Bq forwards P[val,9], and P[valmin,3] and P[val,9] travel on toward BS for subscriptions Smin[val,>,2] and S[val,>,6].]
Messages exchanged: Late 6, Early 5, Adaptive 4
Per-broker adaptation reduces communication cost.
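The three totals can be reproduced with a per-broker simulation. This is a sketch under stated assumptions, not the authors' implementation: the message encoding, the `broker_output` rule and the mode assignment for the adaptive case (Bp and BI aggregate, Bq forwards) are inferred from the figure, and only the `min` operator is modelled.

```python
from dataclasses import dataclass
from typing import Dict, List, Tuple


@dataclass(frozen=True)
class Sub:
    name: str
    threshold: int    # predicate: val > threshold
    aggregate: bool   # True: 'min' aggregate subscription, False: regular

    def matches(self, val: int) -> bool:
        return val > self.threshold


# Messages are ("pub", value) or ("agg", subscription name, partial min).
SUBS = [Sub("Smin", 2, True), Sub("S", 6, False)]
PUBLICATIONS: Dict[str, List[int]] = {"Bp": [3, 5], "Bq": [9, 2]}


def broker_output(incoming: List[Tuple], aggregating: bool) -> List[Tuple]:
    raw = [m[1] for m in incoming if m[0] == "pub"]
    partials = [m for m in incoming if m[0] == "agg"]
    if not aggregating:
        # Forward mode: pass on matching publications and any partial results.
        return [("pub", v) for v in raw if any(s.matches(v) for s in SUBS)] + partials
    out: List[Tuple] = []
    for s in (s for s in SUBS if s.aggregate):
        # Folding a value in twice is harmless for min; other operators
        # (count, avg) would need to guard against double counting.
        values = [v for v in raw if s.matches(v)]
        values += [m[2] for m in partials if m[1] == s.name]
        if values:
            out.append(("agg", s.name, min(values)))
    regular = [s for s in SUBS if not s.aggregate]
    # Publications needed by regular subscriptions still travel unaggregated.
    out += [("pub", v) for v in raw if any(s.matches(v) for s in regular)]
    return out


def count_messages(modes: Dict[str, bool]) -> int:
    total, inbox_bi = 0, []
    for edge in ("Bp", "Bq"):
        out = broker_output([("pub", v) for v in PUBLICATIONS[edge]], modes[edge])
        total += len(out)
        inbox_bi += out
    return total + len(broker_output(inbox_bi, modes["BI"]))


LATE = {"Bp": False, "Bq": False, "BI": False}
EARLY = {"Bp": True, "Bq": True, "BI": True}
ADAPTIVE = {"Bp": True, "Bq": False, "BI": True}
print(count_messages(LATE), count_messages(EARLY), count_messages(ADAPTIVE))  # 6 5 4
```

The saving in the adaptive case comes from Bq: its only matching publication is needed unaggregated by S[val,>,6] anyway, so sending a separate partial minimum from Bq would be redundant.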

Adaptation process (MAPE-K)
Monitor:
  Matching publications within sampling period
  Changes in subscription set
Analyze:
  Compare the ratio of publications to NWRs
  Estimate the notification rate
Plan:
  Choose the suitable mode
  Transition between aggregate and forward mode
Execute:
  Start/stop aggregation at broker
Knowledge (information at a broker):
  Registered subscriptions
  Current execution mode
General framework with a parametric cost model
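One step of this loop at a single broker might look like the sketch below. The decision rule is an assumption: the slides mention a parametric cost model without giving it, so the sketch simply compares the publications a broker would forward against the notifications it would emit when aggregating. All names are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class BrokerKnowledge:
    """Knowledge kept at a broker; only the execution mode is modelled here."""
    mode: str = "forward"


def plan_mode(matching_publications: int, expected_notifications: int) -> str:
    """Analyze/Plan: aggregate only if it is expected to send fewer messages.

    Assumed rule, standing in for the parametric cost model: forwarding costs
    one message per matching publication, aggregating costs one message per
    notification (one per matching NWR). Ties keep the broker forwarding.
    """
    if expected_notifications < matching_publications:
        return "aggregate"
    return "forward"


def adapt(knowledge: BrokerKnowledge,
          matching_publications: int,
          expected_notifications: int) -> bool:
    """Execute: switch mode if needed; return True when a transition happened."""
    target = plan_mode(matching_publications, expected_notifications)
    changed = target != knowledge.mode
    knowledge.mode = target
    return changed


broker = BrokerKnowledge()
# Monitor: counts observed in one sampling period (illustrative numbers).
print(adapt(broker, matching_publications=2, expected_notifications=1), broker.mode)
print(adapt(broker, matching_publications=1, expected_notifications=1), broker.mode)
```

Because each broker decides from its own counts only, no global view of the topology is required, at the price of occasionally suboptimal local decisions.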

Experimental setup
Implemented in Java over the PADRES framework
Topology: 16 brokers; combination of publisher-edge only, subscriber-edge only and mixed brokers
Real-life datasets:
  Traffic dataset from the ONE-ITS service (http://one-its-webapp1.transport.utoronto.ca)
  Yahoo! Finance stock dataset
Metrics:
  Number of messages exchanged
  Processing overhead
  End-to-end delay

Results (Stock dataset)
[Plots: varying publications per second; varying number of subscriptions.]
The decision becomes more accurate when sufficient information is available.
Early performs better at high publication rates, whereas Late is better with a large number of subscriptions.
Adaptive aggregation performs close to the better of Early and Late in all settings.

Results (Traffic dataset)
[Plots: varying publications per second; varying number of subscriptions.]
Per-broker adaptation can cause individual brokers to make incorrect decisions.

Processing overhead (Stock)
[Plot: predicate matching cost and aggregation-related overhead.]
Adaptation overhead dominates the aggregation overhead.

Conclusions
We provide an aggregation framework for CBPS with baseline solutions.
We demonstrate that neither baseline is dominant: which one performs better depends on workload parameters.
We provide a generic adaptive aggregation framework.
We experimentally demonstrate that our distributed adaptive solution performs close to the best baseline across all settings.

Thank you!
For questions and comments, contact: navneet@ifi.uio.no