Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.

Slides:



Advertisements
Similar presentations
Sequoia: Virtual-Tree Models for Internet Path Metrics Rama Microsoft Research Also:Ittai Abraham (Hebrew Univ.) Mahesh Balakrishnan (Cornell) Archit Gupta.
Advertisements

Wavelet and Matrix Mechanism CompSci Instructor: Ashwin Machanavajjhala 1Lecture 11 : Fall 12.
Adopt Algorithm for Distributed Constraint Optimization
Fast Algorithms For Hierarchical Range Histogram Constructions
1 K-clustering in Wireless Ad Hoc Networks using local search Rachel Ben-Eliyahu-Zohary JCE and BGU Joint work with Ran Giladi (BGU) and Stuart Sheiber.
BOAT - Optimistic Decision Tree Construction Gehrke, J. Ganti V., Ramakrishnan R., Loh, W.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
Windows Scheduling Problems for Broadcast System 1 Amotz Bar-Noy, and Richard E. Ladner Presented by Qiaosheng Shi.
Routing, Anycast, and Multicast for Mesh and Sensor Networks Roland Flury Roger Wattenhofer RAM Distributed Computing Group.
Department of Computer Science, University of Maryland, College Park, USA TexPoint fonts used in EMF. Read the TexPoint manual before you delete this box.:
Minimum-Buffered Routing of Non- Critical Nets for Slew Rate and Reliability Control Supported by Cadence Design Systems, Inc. and the MARCO Gigascale.
Deterministic Wavelet Thresholding for Maximum-Error Metrics Minos Garofalakis Bell Laboratories Lucent Technologies 600 Mountain Avenue Murray Hill, NJ.
Ashish Gupta Under Guidance of Prof. B.N. Jain Department of Computer Science and Engineering Advanced Networking Laboratory.
1 Data Persistence in Large-scale Sensor Networks with Decentralized Fountain Codes Yunfeng Lin, Ben Liang, Baochun Li INFOCOM 2007.
1 Distributed localization of networked cameras Stanislav Funiak Carlos Guestrin Carnegie Mellon University Mark Paskin Stanford University Rahul Sukthankar.
Sharing Aggregate Computation for Distributed Queries Ryan Huebsch, UC Berkeley Minos Garofalakis, Yahoo! Research † Joe Hellerstein, UC Berkeley Ion Stoica,
Localized Techniques for Power Minimization and Information Gathering in Sensor Networks EE249 Final Presentation David Tong Nguyen Abhijit Davare Mentor:
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
Placement of Integration Points in Multi-hop Community Networks Ranveer Chandra (Cornell University) Lili Qiu, Kamal Jain and Mohammad Mahdian (Microsoft.
Multi-dimensional Range Query in Sensor Networks Xin Li,Young Jim Kim, Ramesh Govindan (University of Southern California ) Wei Hong (Intel Research Lab.
Operator Placement for In-Network Stream Query Processing.
Non-myopic Informative Path Planning in Spatio-Temporal Models Alexandra Meliou Andreas Krause Carlos Guestrin Joe Hellerstein.
Cache Placement in Sensor Networks Under Update Cost Constraint Bin Tang, Samir Das and Himanshu Gupta Department of Computer Science Stony Brook University.
I/O-Algorithms Lars Arge Spring 2009 March 3, 2009.
1 An Evaluation of Multi-resolution Storage for Sensor Networks D. Ganesan, B. Greenstein, D. Perelyubskiy, D. Estrin, J. Heidemann ACM SenSys 2003.
[1][1][1][1] Lecture 2-3: Coping with NP-Hardness of Optimization Problems in Practice May 26 + June 1, Introduction to Algorithmic Wireless.
Computability and Complexity 24-1 Computability and Complexity Andrei Bulatov Approximation.
Extending Network Lifetime for Precision-Constrained Data Aggregation in Wireless Sensor Networks Xueyan Tang School of Computer Engineering Nanyang Technological.
Online Data Gathering for Maximizing Network Lifetime in Sensor Networks IEEE transactions on Mobile Computing Weifa Liang, YuZhen Liu.
Performance guarantees for hierarchical clustering Sanjoy Dasgupta University of California, San Diego Philip Long Genomics Institute of Singapore.
CSE 550 Computer Network Design Dr. Mohammed H. Sqalli COE, KFUPM Spring 2007 (Term 062)
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Energy-efficient Self-adapting Online Linear Forecasting for Wireless Sensor Network Applications Jai-Jin Lim and Kang G. Shin Real-Time Computing Laboratory,
CS 580S Sensor Networks and Systems Professor Kyoung Don Kang Lecture 7 February 13, 2006.
Connected Dominating Sets in Wireless Networks My T. Thai Dept of Comp & Info Sci & Engineering University of Florida June 20, 2006.
Sensor Networks Storage Sanket Totala Sudarshan Jagannathan.
Efficient Gathering of Correlated Data in Sensor Networks
Network Aware Resource Allocation in Distributed Clouds.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
Sensor Network Databases1 Overview: Chapter 6  Sensor Network Databases  Sensor networks are conceptually a distributed DB  Store collected data  Indexes.
Topology aggregation and Multi-constraint QoS routing Presented by Almas Ansari.
A Polynomial Time Approximation Scheme For Timing Constrained Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, Charles J. Alpert** *Dept of Electrical.
Trust-Aware Optimal Crowdsourcing With Budget Constraint Xiangyang Liu 1, He He 2, and John S. Baras 1 1 Institute for Systems Research and Department.
De-Nian Young Ming-Syan Chen IEEE Transactions on Mobile Computing Slide content thanks in part to Yu-Hsun Chen, University of Taiwan.
Constructing Optimal Wavelet Synopses Dimitris Sacharidis Timos Sellis
Multi-Criteria Routing in Pervasive Environment with Sensors Santhanakrishnan, G., Li, Q., Beaver, J., Chrysanthis, P.K., Amer, A. and Labrinidis, A Department.
A Faster Approximation Scheme for Timing Driven Minimum Cost Layer Assignment Shiyan Hu*, Zhuo Li**, and Charles J. Alpert** *Dept of ECE, Michigan Technological.
Complexity of Bellman-Ford
DIST: A Distributed Spatio-temporal Index Structure for Sensor Networks Anand Meka and Ambuj Singh UCSB, 2005.
1 Shape Segmentation and Applications in Sensor Networks Xianjin Xhu, Rik Sarkar, Jie Gao Department of CS, Stony Brook University INFOCOM 2007.
A Method for Distributed Computation of Semi-Optimal Multicast Tree in MANET Eiichi Takashima, Yoshihiro Murata, Naoki Shibata*, Keiichi Yasumoto, and.
Dr. Sudharman K. Jayaweera and Amila Kariyapperuma ECE Department University of New Mexico Ankur Sharma Department of ECE Indian Institute of Technology,
Space-Efficient Online Computation of Quantile Summaries SIGMOD 01 Michael Greenwald & Sanjeev Khanna Presented by ellery.
Using local geometry for Topology Construction in Wireless Sensor Networks Sameera Poduri Robotic Embedded Systems Lab(RESL)
An Efficient CUDA Implementation of the Tree-Based Barnes Hut n-body Algorithm By Martin Burtscher and Keshav Pingali Jason Wengert.
1 Approximate XML Query Answers Presenter: Hongyu Guo Authors: N. polyzotis, M. Garofalakis, Y. Ioannidis.
1 An Arc-Path Model for OSPF Weight Setting Problem Dr.Jeffery Kennington Anusha Madhavan.
By: Gang Zhou Computer Science Department University of Virginia 1 Medians and Beyond: New Aggregation Techniques for Sensor Networks CS851 Seminar Presentation.
Network-Aware Query Processing for Stream- based Application Yanif Ahmad, Ugur Cetintemel - Brown University VLDB 2004.
Efficient Resource Allocation for Wireless Multicast De-Nian Yang, Member, IEEE Ming-Syan Chen, Fellow, IEEE IEEE Transactions on Mobile Computing, April.
1 Algorithmic aspects of radio access network design in 4G cellular networks David Amzallag Computer Science Department, Technion Joint work with Seffi.
Dynamic Proxy Tree-Based Data Dissemination Schemes for Wireless Sensor Networks Wensheng Zhang, Guohong Cao and Tom La Porta Department of Computer Science.
Models of Greedy Algorithms for Graph Problems Sashka Davis, UCSD Russell Impagliazzo, UCSD SIAM SODA 2004.
Construction of Optimal Data Aggregation Trees for Wireless Sensor Networks Deying Li, Jiannong Cao, Ming Liu, and Yuan Zheng Computer Communications and.
Spatial Data Management
Data Driven Resource Allocation for Distributed Learning
Near-optimal Observation Selection using Submodular Functions
Spatial Online Sampling and Aggregation
Design of Hierarchical Classifiers for Efficient and Accurate Pattern Classification M N S S K Pavan Kumar Advisor : Dr. C. V. Jawahar.
CSE 550 Computer Network Design
Presentation transcript:

Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein

Approximate Answer Queries Approximate representation of the world: Discrete locations Lossy communication Noisy measurements Applications do not expect accurate values (tolerance to noise) Example: Return the temperature at all locations ±1C, with 95% confidence Query Satisfaction: On expectation the requested portion of sensor values lies within the error range

In-network Decisions Query Use in-network models to make routing decisions No centralized planning

In-network Summaries Spanning tree T(V,E’) + Models M v for all nodes v M v represents the whole subtree rooted at v.

Model Complexity Need for compression Gaussian distributions at the leaves: good for modeling individual node measurements Gaussian distributions at the leaves: good for modeling individual node measurements

Talk “outline” Compression Traversal Construction In-network summaries

Collapsing Gaussian Mixtures Compress an m-size mixture to a k-size mixture. Look at simple case (k=1) Minimize KL- divergence? “Fake” mass

Quality of Compression Depends on query workload Query with acceptable error window W Query with acceptable error window W’<W

Compression Accurate mass inside interval No guarantee on the tails

Talk “outline” Compression Traversal Construction In-network summaries

Query Satisfaction A response R={r 1 …r n } satisfies query Q(w,δ) if: In expectation the values of at least δn nodes lie within [r i -w,r i +w] In-network summary Q R [r 1, r 2, r 3, r 4, r 5, r 6, r 7, r 8, r 9, r 10 ] Within error bounds

Optimal Traversal Given: tree and models Find: subtree such that Can be computed with Dynamic Programming response [μ leaves ]

Greedy Traversal If local model satisfies Return μ Else descend to child node More conservative solution: enforces query satisfiability on every subtree instead of the whole tree More conservative solution: enforces query satisfiability on every subtree instead of the whole tree

Traversal Evaluation

Talk “outline” Compression Traversal Construction In-network summaries

Optimal Tree Construction Given a structure, we know how to build the models But how do we pick the structure?

Traversal = cut Theorem: In a fixed fanout tree, the cost of the traversal is where |C| is the size of the cut, and F the fanout Theorem: In a fixed fanout tree, the cost of the traversal is where |C| is the size of the cut, and F the fanout Intuition: minimize cut size Group nodes into a minimum number of groups which satisfy the query constraints Clustering problem

Optimal Clustering Given a query Q(w,δ), optimal clustering is NP-hard Related to the Group Steiner Tree Problem Greedy algorithm with factor log(n) approximation Greedily pick max size cluster Issue: does not enforce connectivity of clusters

Greedy Clustering Include extra nodes to enforce connectivity Augment clusters only with accessible nodes (losing the logn guarantee)

Clustering comparison 2 distributed clustering algorithms are compared to the centralized greedy clustering

Talk “outline” Compression Traversal Construction In-network summaries Enriched models

Support more complex models k-mixtures Compress to a k-size mixture instead of a SGM Virtual nodes Every component of the k-size mixture is stored as a separate “virtual node” SGMs on multiple windows Maintain additional SGMs for different window sizes More space, more expensive model updates (SGM = Single Gaussian Model)

Evaluation of enriched models SGM surprisingly effective in representing the underlying data

Sensitivity analysis Talk “outline” Compression Traversal Construction In-network summaries

Tree Construction Parameters and Effect on Performance Confidence Performance for workloads of different confidence than the hierarchy design Error window Broader vs narrower ranges of window sizes Assignment of windows across tree levels Temporal changes How often should the models be updated

Confidence Workload of 0.95 confidence Design confidence does not have a big impact on performance

Error windows A wide range is not always better, because it forces the traversal of more levels

Model Updates

Sensitivity analysis Conclusions Analyzed compression schemes for in-network summaries Evaluated summary traversal Studied optimal hierarchy construction Studied increased complexity models Showed that simple SGM are sufficient Analyzed the effect on efficiency of various parameters Compression Traversal Construction In-network summaries Enriched models