Model-Driven Data Acquisition in Sensor Networks - Amol Deshpande et al., VLDB ‘04 Jisu Oh March 20, 2006 CS 580S Paper Presentation.

Slides:



Advertisements
Similar presentations
Copyright ©2004 Carlos Guestrin VLDB 2004 Efficient Data Acquisition in Sensor Networks Presented By Kedar Bellare (Slides adapted.
Advertisements

Bayesian Belief Propagation
Energy Efficient Data Collection In Distributed Sensor Environments Qi Han, Sharad Mehrotra, Nalini Venkatasubramanian {qhan, sharad,
QoS-based Management of Multiple Shared Resources in Dynamic Real-Time Systems Klaus Ecker, Frank Drews School of EECS, Ohio University, Athens, OH {ecker,
Online Filtering, Smoothing & Probabilistic Modeling of Streaming Data In short, Applying probabilistic models to Streams Bhargav Kanagal & Amol Deshpande.
Active Learning for Streaming Networked Data Zhilin Yang, Jie Tang, Yutao Zhang Computer Science Department, Tsinghua University.
CS479/679 Pattern Recognition Dr. George Bebis
Dynamic Bayesian Networks (DBNs)
1 Slides for the book: Probabilistic Robotics Authors: Sebastian Thrun Wolfram Burgard Dieter Fox Publisher: MIT Press, Web site for the book & more.
David Chu--UC Berkeley Amol Deshpande--University of Maryland Joseph M. Hellerstein--UC Berkeley Intel Research Berkeley Wei Hong--Arched Rock Corp. Approximate.
More MR Fingerprinting
Bayesian Robust Principal Component Analysis Presenter: Raghu Ranganathan ECE / CMR Tennessee Technological University January 21, 2011 Reading Group (Xinghao.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Paper Discussion: “Simultaneous Localization and Environmental Mapping with a Sensor Network”, Marinakis et. al. ICRA 2011.
On Computing Compression Trees for Data Collection in Wireless Sensor Networks Jian Li, Amol Deshpande and Samir Khuller Department of Computer Science,
Parameter Estimation: Maximum Likelihood Estimation Chapter 3 (Duda et al.) – Sections CS479/679 Pattern Recognition Dr. George Bebis.
Distributed Regression: an Efficient Framework for Modeling Sensor Network Data Carlos Guestrin Peter Bodik Romain Thibaux Mark Paskin Samuel Madden.
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
1 On the Statistical Analysis of Dirty Pictures Julian Besag.
Announcements  Homework 4 is due on this Thursday (02/27/2004)  Project proposal is due on 03/02.
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
Geographic Gossip: Efficient Aggregations for Sensor Networks Author: Alex Dimakis, Anand Sarwate, Martin Wainwright University: UC Berkeley Venue: IPSN.
An Optimal Learning Approach to Finding an Outbreak of a Disease Warren Scott Warren Powell
Nonlinear and Non-Gaussian Estimation with A Focus on Particle Filters Prasanth Jeevan Mary Knox May 12, 2006.
Exploiting Correlated Attributes in Acquisitional Query Processing Amol Deshpande University of Maryland Joint work with Carlos Sam
Probabilistic Robotics Introduction Probabilities Bayes rule Bayes filters.
Improving the Accuracy of Continuous Aggregates & Mining Queries Under Load Shedding Yan-Nei Law* and Carlo Zaniolo Computer Science Dept. UCLA * Bioinformatics.
Nearest Neighbor Retrieval Using Distance-Based Hashing Michalis Potamias and Panagiotis Papapetrou supervised by Prof George Kollios A method is proposed.
NSF-RPI Workshop on Pervasive Computing and Networking, April 29-30, Self Optimization in Wireless Sensor Networks Bhaskar Krishnamachari Autonomous.
Thanks to Nir Friedman, HU
Model-driven Data Acquisition in Sensor Networks Amol Deshpande 1,4 Carlos Guestrin 4,2 Sam Madden 4,3 Joe Hellerstein 1,4 Wei Hong 4 1 UC Berkeley 2 Carnegie.
Remote Sensing Laboratory Dept. of Information Engineering and Computer Science University of Trento Via Sommarive, 14, I Povo, Trento, Italy Remote.
Bayesian Filtering for Robot Localization
Using Probabilistic Models for Data Management in Acquisitional Environments Sam Madden MIT CSAIL With Amol Deshpande (UMD), Carlos Guestrin (CMU)
1 A Bayesian Method for Guessing the Extreme Values in a Data Set Mingxi Wu, Chris Jermaine University of Florida September 2007.
An Integration Framework for Sensor Networks and Data Stream Management Systems.
A Survey Based Seminar: Data Cleaning & Uncertain Data Management Speaker: Shawn Yang Supervisor: Dr. Reynold Cheng Prof. David Cheung
1 Robot Environment Interaction Environment perception provides information about the environment’s state, and it tends to increase the robot’s knowledge.
Mean Field Variational Bayesian Data Assimilation EGU 2012, Vienna Michail Vrettas 1, Dan Cornford 1, Manfred Opper 2 1 NCRG, Computer Science, Aston University,
Mixture Models, Monte Carlo, Bayesian Updating and Dynamic Models Mike West Computing Science and Statistics, Vol. 24, pp , 1993.
CS 782 – Machine Learning Lecture 4 Linear Models for Classification  Probabilistic generative models  Probabilistic discriminative models.
ICS280 Presentation by Suraj Nagasrinivasa (1) Evaluating Probabilistic Queries over Imprecise Data (SIGMOD 2003) by R Cheng, D Kalashnikov, S Prabhakar.
Bayesian Classification. Bayesian Classification: Why? A statistical classifier: performs probabilistic prediction, i.e., predicts class membership probabilities.
Approximate Dynamic Programming Methods for Resource Constrained Sensor Management John W. Fisher III, Jason L. Williams and Alan S. Willsky MIT CSAIL.
LANGUAGE MODELS FOR RELEVANCE FEEDBACK Lee Won Hee.
-Arnaud Doucet, Nando de Freitas et al, UAI
The famous “sprinkler” example (J. Pearl, Probabilistic Reasoning in Intelligent Systems, 1988)
Confidence Interval & Unbiased Estimator Review and Foreword.
Spectrum Sensing In Cognitive Radio Networks
Probabilistic Robotics
CS 547: Sensing and Planning in Robotics Gaurav S. Sukhatme Computer Science Robotic Embedded Systems Laboratory University of Southern California
NTU & MSRA Ming-Feng Tsai
1 Scalable Probabilistic Databases with Factor Graphs and MCMC Michael Wick, Andrew McCallum, and Gerome Miklau VLDB 2010.
The Unscented Particle Filter 2000/09/29 이 시은. Introduction Filtering –estimate the states(parameters or hidden variable) as a set of observations becomes.
Submitted by: Sounak Paul Computer Science & Engineering 4 th Year, 7 th semester Roll No:
Probabilistic Robotics Introduction Probabilities Bayes rule Bayes filters.
A Unified Approach to Ranking in Probabilistic Databases Jian Li, Barna Saha, Amol Deshpande University of Maryland, College Park, USA VLDB
Data-Driven Processing in Sensor Networks Adam Silberstein, Rebecca Braynard, Gregory Filpus, Gavino Puggioni, Alan Gelfand, Kamesh Munagala, Jun Yang.
Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability Primer Bayesian Brain Probabilistic Approaches to Neural Coding 1.1 A Probability.
Probabilistic Robotics Probability Theory Basics Error Propagation Slides from Autonomous Robots (Siegwart and Nourbaksh), Chapter 5 Probabilistic Robotics.
Lecture 1.31 Criteria for optimal reception of radio signals.
CS479/679 Pattern Recognition Dr. George Bebis
Probability Theory and Parameter Estimation I
Ch3: Model Building through Regression
Data-Streams and Histograms
State Estimation Probability, Bayes Filtering
CSE-490DF Robotics Capstone
Sam Madden MIT CSAIL With Amol Deshpande (UMD), Carlos Guestrin (CMU)
DATABASE HISTOGRAMS E0 261 Jayant Haritsa
Overview: Chapter 2 Localization and Tracking
Presentation transcript:

Model-Driven Data Acquisition in Sensor Networks - Amol Deshpande et al., VLDB ‘04 Jisu Oh March 20, 2006 CS 580S Paper Presentation

Problems – Sensornet is not a database! Databases - complete, authoritative sources of information - Answer a query “CORRECTLY” based upon all the available data Sensornets - Misrepresentations of data :only samples acquired, not random :need to complement the sensornet readings - Inefficient approximate queries :Existing query processing from a completist’s approach is costly

Solutions – Model-driven data acquisition Use a statistical model which maps the raw sensor readings onto physical reality - in order to robustly interpret sensor readings - and provide a framework for optimizing the acquisition of sensor readings

Proposed approaches - Architecture Architecture for model-based querying in sensor networks

Proposed approaches (cont.) - Probability density function (pdf) Probability density function (pdf) - Based on time-varying multivariate Gaussians - Estimates sensor readings in the current time period - Properties : correlation between different attributes : cost differential - Constraints: need historical data and training with them

Key concepts – Probabilistic queries BBQ query processing (static probabilistic model)  A user requests a range query that ask if an attribute Xi is in the range [ai, bi] with confidence (1-α)  Marginalize a prior density (probability density function), p(Xi, …, Xn) to a density over only attribute Xi, p(x i )  Compute P(Xi ∈ [a i, b i ]) = ∫ ai bi p(x i )dx i  Answer true if p>1-α, false if p> α  Otherwise, move on conditioning step

Key concepts – Probabilistic queries (cont.) BBQ query processing (conditioning)  Acquire new sensor readings, o, a set of observations  Marginalize a posterior density (conditional probability density function), p(X i, … X j-1, X j+1, … X n | x j ) to a density over only attribute X j p(x j |o)  Compute P(Xi ∈ [a i, b i ] | x j ) = ∫ ai, bi p(x i |o)dx i  Answer true if p>1-α, false if p> α

Key concepts – Probabilistic queries (cont.) BBQ query processing (dynamic probabilistic model)  describe the evolution of the system over time  for time evolved attributes  use transition model p(X 1 t+1,.. X n t+1 | X 1 t.. X n t ) for conditioning  marginalize transition model then obtain p(X 1 t+1,.. X n t+1 | O 1…t )

Key concepts (cont.) – Choosing on observation plan Choose a data acquisition plan for the sensornet to best refine the query anser.  Cost(O) = C a (O) + C t (O)  R i (O), statistical benefit of acquiring a reading  Minimize O ⊆ {1, …, n} C(O), such that R(O) ≥ 1-α

Contribution integrate a database system with a correlation- aware probabilistic model build the model from historical readings and improve it from current readings answer approximately SQL queries by consulting the model + shield from faulty sensors + reduce # expensive sensor readings + reduce # radio transmissions

Experiment results

Conclusions Probabilistic model-driven data acquisition Help to provide approximations with probabilistic confidences, significantly more efficient to compute in both time and energy Strong assumptions and constraints - The model must be trained - based on static network topology - all sensor nodes are well synchronized

Questions?