한국기술교육대학교 컴퓨터 공학 김홍연 TinyDB : An Acquisitional Query Processing System for Sensor Networks. - Samuel R. Madden, Michael J. Franklin, Joseph M. Hellerstein,

Slides:



Advertisements
Similar presentations
Analysis of : Operator Scheduling in a Data Stream Manager CS561 – Advanced Database Systems By Eric Bloom.
Advertisements

Sec-TEEN: Secure Threshold sensitive Energy Efficient sensor Network protocol Ibrahim Alkhori, Tamer Abukhalil & Abdel-shakour A. Abuznied Department of.
1 Routing Techniques in Wireless Sensor networks: A Survey.
Towards a Sensor Network Architecture: Lowering the Waistline Culler et.al. UCB.
1 Sensor Network Databases Ref: Wireless sensor networks---An information processing approach Feng Zhao and Leonidas Guibas (chapter 6)
Approximating Sensor Network Queries Using In-Network Summaries Alexandra Meliou Carlos Guestrin Joseph Hellerstein.
The Cougar Approach to In-Network Query Processing in Sensor Networks By Yong Yao and Johannes Gehrke Cornell University Presented by Penelope Brooks.
Dissemination protocols for large sensor networks Fan Ye, Haiyun Luo, Songwu Lu and Lixia Zhang Department of Computer Science UCLA Chien Kang Wu.
Aggregation in Sensor Networks NEST Weekly Meeting Sam Madden Rob Szewczyk 10/4/01.
Adaptive Sampling in Distributed Streaming Environment Ankur Jain 2/4/03.
Sensor Database: Querying Sensor Networks Yinghua Wu, Haiyong Xie.
A Survey of Wireless Sensor Network Data Collection Schemes by Brett Wilson.
Approximate data collection in sensor networks the appeal of probabilistic models David Chu Amol Deshpande Joe Hellerstein Wei Hong ICDE 2006 Atlanta,
Cost-Based Plan Selection Choosing an Order for Joins Chapter 16.5 and16.6 by:- Vikas Vittal Rao ID: 124/227 Chiu Luk ID: 210.
1 Acquisitional Query Processing in TinyDB Sam Madden UC Berkeley NEST Winter Retreat 2003.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
Efficient Search in Peer to Peer Networks By: Beverly Yang Hector Garcia-Molina Presented By: Anshumaan Rajshiva Date: May 20,2002.
Data Management for Sensor Networks Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems April 4, 2005.
TAG: A TINY AGGREGATION SERVICE FOR AD-HOC SENSOR NETWORKS Presented by Akash Kapoor SAMUEL MADDEN, MICHAEL J. FRANKLIN, JOSEPH HELLERSTEIN, AND WEI HONG.
T AG : A TINY AGGREGATION SERVICE FOR AD - HOC SENSOR NETWORKS Samuel Madden, Michael J. Franklin, Joseph Hellerstein, and Wei Hong Presented by – Mahanth.
Connecting LANs, Backbone Networks, and Virtual LANs
TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Paper By : Samuel Madden, Michael J. Franklin, Joseph Hellerstein, and Wei Hong Instructor :
Information Quality Aware Routing in Event-Driven Sensor Networks Hwee-Xian TAN 1, Mun Choon CHAN 1, Wendong XIAO 2, Peng-Yong KONG 2 and Chen-Khong THAM.
INF5100 Autumn 2007 © Ellen Munthe-Kaas and Jarle Søberg 1 Data Management in Sensor Networks Ellen Munthe-Kaas Jarle Søberg.
Context Tailoring the DBMS –To support particular applications Beyond alphanumerical data Beyond retrieve + process –To support particular hardware New.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong.
Sensor Network Databases1 Overview: Chapter 6  Sensor Network Databases  Sensor networks are conceptually a distributed DB  Store collected data  Indexes.
March 6th, 2008Andrew Ofstad ECE 256, Spring 2008 TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden, Michael J. Franklin, Joseph.
1 Pradeep Kumar Gunda (Thanks to Jigar Doshi and Shivnath Babu for some slides) TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden,
TAG: a Tiny Aggregation Service for Ad-Hoc Sensor Networks Authors: Samuel Madden, Michael Franklin, Joseph Hellerstein Presented by: Vikas Motwani CSE.
1 TAG: A Tiny Aggregation Service for Ad-Hoc Sensor Networks Samuel Madden UC Berkeley with Michael Franklin, Joseph Hellerstein, and Wei Hong December.
INT 598 Data Management for Sensor Networks Silvia Nittel Spatial Information Science & Engineering University of Maine Fall 2006.
Towards Low Overhead Provenance Tracking in Near Real-Time Stream Filtering Nithya N. Vijayakumar, Beth Plale DDE Lab, Indiana University {nvijayak,
Sensor Database System Sultan Alhazmi
1 EnviroStore: A Cooperative Storage System for Disconnected Operation in Sensor Networks Liqian Luo, Chengdu Huang, Tarek Abdelzaher John Stankovic INFOCOM.
The Design of an Acquisitional Query Processor for Sensor Networks CS851 Presentation 2005 Presented by: Gang Zhou University of Virginia.
CS542 Seminar – Sensor OS A Virtual Machine For Sensor Networks Oct. 28, 2009 Seok Kim Eugene Seo R. Muller, G. Alonso, and D. Kossmann.
Content Addressable Network CAN. The CAN is essentially a distributed Internet-scale hash table that maps file names to their location in the network.
1 Fjording The Stream An Architecture for Queries over Streaming Sensor Data Samuel Madden, Michael Franklin UC Berkeley.
한국기술교육대학교 컴퓨터 공학 김홍연 Habitat Monitoring with Sensor Networks DKE.
Query Processing for Sensor Networks Yong Yao and Johannes Gehrke (Presentation: Anne Denton March 8, 2003)
REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
1 REED: Robust, Efficient Filtering and Event Detection in Sensor Networks Daniel Abadi, Samuel Madden, Wolfgang Lindner MIT United States VLDB 2005.
8 1 Chapter 8 Advanced SQL Database Systems: Design, Implementation, and Management, Seventh Edition, Rob and Coronel.
Database Systems Design, Implementation, and Management Coronel | Morris 11e ©2015 Cengage Learning. All Rights Reserved. May not be scanned, copied or.
KAIS T Distributed cross-layer scheduling for In-network sensor query processing PERCOM (THU) Lee Cheol-Ki Network & Security Lab.
McGraw-Hill©The McGraw-Hill Companies, Inc., 2004 Connecting Devices CORPORATE INSTITUTE OF SCIENCE & TECHNOLOGY, BHOPAL Department of Electronics and.
Session 1 Module 1: Introduction to Data Integrity
Cross-Layer Scheduling for Power Efficiency in Wireless Sensor Networks Mihail L. Sichitiu Department of Electrical and Computer Engineering North Carolina.
Aggregation and Secure Aggregation. Learning Objectives Understand why we need aggregation in WSNs Understand aggregation protocols in WSNs Understand.
W. Hong & S. Madden – Implementation and Research Issues in Query Processing for Wireless Sensor Networks, ICDE 2004.
20 Copyright © 2008, Oracle. All rights reserved. Cache Management.
A Bandwidth Scheduling Algorithm Based on Minimum Interference Traffic in Mesh Mode Xu-Yajing, Li-ZhiTao, Zhong-XiuFang and Xu-HuiMin International Conference.
Embedded Computer - Definition When a microcomputer is part of a larger product, it is said to be an embedded computer. The embedded computer retrieves.
REED : Robust, Efficient Filtering and Event Detection in Sensor Network Daniel J. Abadi, Samuel Madden, Wolfgang Lindner Proceedings of the 31st VLDB.
INF5100 Autumn 2008 © Ellen Munthe-Kaas and Jarle Søberg 1 Data Management in Sensor Networks Ellen Munthe-Kaas Jarle Søberg.
Aggregation and Secure Aggregation. [Aggre_1] Section 12 Why do we need Aggregation? Sensor networks – Event-based Systems Example Query: –What is the.
Sep Multiple Query Optimization for Wireless Sensor Networks Shili Xiang Hock Beng Lim Kian-Lee Tan (ICDE 2007) Presented by Shan Bai.
On Mobile Sink Node for Target Tracking in Wireless Sensor Networks Thanh Hai Trinh and Hee Yong Youn Pervasive Computing and Communications Workshops(PerComW'07)
Building Wireless Efficient Sensor Networks with Low-Level Naming J. Heihmann, F.Silva, C. Intanagonwiwat, R.Govindan, D. Estrin, D. Ganesan Presentation.
The Design of an Acquisitional Query Processor For Sensor Networks Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, and Wei Hong Presentation.
TAG: a Tiny AGgregation service for ad-hoc sensor networks Authors: Samuel Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong Presenter: Mingwei.
Distributed database approach,
The Design of an Acquisitional Query Processor For Sensor Networks
Streaming Sensor Data Fjord / Sensor Proxy Multiquery Eddy
Interpret the execution mode of SQL query in F1 Query paper
Programmable Networks
Chapter 8 Advanced SQL.
Aggregation.
Eddies for Continuous Queries
Presentation transcript:

한국기술교육대학교 컴퓨터 공학 김홍연 TinyDB : An Acquisitional Query Processing System for Sensor Networks. - Samuel R. Madden, Michael J. Franklin, Joseph M. Hellerstein, Wei Hong DKE

 Acquisitional issues.  Query Optimization.  The significant costs of sampling sensors.  Query Dissemination.  The physical co-location of sampling and processing.  Query Execution.  Choices of when to sample.  Proposed.  Incorporate acquisitional techniques designed to minimize power consumption.  The structure of query is simple (or SQL-like).  Additional parameters.  Collects data from motes / Filters data / Aggregates data. Introduction.

 Basic architecture.  Queries are submitted at a powered PC, parsed, optimized and sent into the sensor network.  Disseminated and processed.  Results flowing back up the routing tree. Introduction.

 Features of TinyDB.  Declarative SQL-like query interface.  Metadata catalog management.  Multiple concurrent queries.  Network monitoring.  In-network, distributed query processing.  Extensible framework for attributes, commands and aggregates.  Goal.  The primary goal of TinyDB is to allow data-driven applications to be developed and deployed much more quickly. Introduction.

 Basic Language Features.  Queries in TinyDB, as in SQL, consist of a SELECT-FROM- WHERE-GROUPBY clause  supporting selection, join, projection, and aggregation.  Sensor data is viewed as a single table (sensors) with one column per sensor type (temperature, humidity, light, …).  Tuples are appended to this table periodically, at well-defined sample intervals.  For example,  SELECT nodeid, light, temp FROM sensors SAMPLE INTERVAL 1s FOR 10s  Meaning : Report light and temperature readings once per second for 10 seconds. Basic Language.

 Materialization point.  Stored table in the nodes.  For example,  CREATE STORAGE POINT recentLight SIZE 8 AS (SELECT nodeid, light FROM sensors SAMPLE PERIOD 10s  Meaning : Store the latest eight light readings, doing one reading every 10 seconds.  DROP clause. Basic Language.

 Joins.  Joins are allowed between two materialization points or between a materialization point and the sensors table.  For example,  SELECT COUNT(*) FROM sensors AS s, recentLight AS r WHERE r.nodeid = s.nodeid AND s.light < r.light SAMPLE PERIOD 10s  Meaning : Count the number of recent light readings (from 0 to 8 samples) that were brighter than the current reading. Each current reading collected during a time span of 10 seconds. Basic Language.

 Aggregation.  Aggregation can be performed on grouped values as in ordinary SQL. (Aggregation queries)  It reduces the quantity of data that must be transmitted through the network.  For example (using microphone sensors),  SELECT AVG(volume), room FROM sensors WHERE floor = 6 GROUP BY room HAVING AVG(volume) > threshold SAMPLE PERIOD 30s  Meaning : Find the rooms on floor 6 where the average volume is over some threshold (during a time span of 30 seconds). Basic Language.

 Assign query.  When a query is issued in TinyDB, it is assigned an identifier that is returned to the issuer.  This identifier can be used to stop a query.  Except these case.  A query to run for specific time period.  Include a stopping condition as an event. Basic Language.

 Event-Based Queries.  TinyDB supports events as a mechanism for initiating data collection.  Events in TinyDB are generated either by another query or by a lower-level part of the operation system.  For example,  ON EVENT bird-detect(loc): SELECT AVG(light), AVG(temp), event.loc FROM sensors AS s WHERE dist(s.loc, event.loc) < 10m SAMPLE PERIOD 2s FOR 30s  Meaning : Every time a bird-detect event occurs, the query is issued from the detecting node and the average light and temperature are collected from nearby nodes once every 2 seconds for 30 seconds. Event-Based Queries.

 Events are central in ACQP,  Events are central in ACQP, as they allow the system to be dormant until some external conditions occurs,  instead of continually polling or blocking on an iterator waiting for some data to arrive. Event-Based Queries.

 Generating an event from a query.  For example,  SELECT nodeid, temp FROM sensors WHERE temp > threshold OUTPUT ACTION SIGNAL hot(nodeid, temp) SAMPLE PERIOD 10s  Meaning : Signal the event ‘hot’ whenever the temperature goes above some threshold (during a time span of 10 seconds). Event-Based Queries.

Lifetime-Based Queries.

 Network health queries.  Meta-queries over the network itself.  For example,  SELECT nodeid, voltage WHERE voltage < threshold FROM sensors SAMPLE PERIOD 10 minutes  Actuation queries.  Users specifies an external command that should be invoked in response to a tuple satisfying the query.  For example,  SELECT nodeid, temp FROM sensors WHERE temp > threshold OUTPUT ACTION power-on(nodeid) SAMPLE PERIOD 30s Types of Queries in Sensor Networks.

 Using a simple cost-based optimizer.  To choose a query plan that will yield the lowest overall power consumption.  The cost of a plan.  The cost of a plan is dominated by the cost of  Sampling the physical sensors  Transmitting query results,  Rather than the cost of applying individual operators.  Focus on.  Optimizer focuses on ordering joins, selections, and sampling operations that run on individual nodes. Power-aware Optimization.

 Metadata Management.  Each node in TinyDB maintains a catalog of metadata the describes its local attributes, events, and user-defined functions.  This metadata is periodically copied to the root of the network for use by the optimizer. Power-aware Optimization.

 Ordering of Sampling and Predicates.  Sampling is often an expensive operation in terms of power.  If predicate discards a tuple of the sensors table, then subsequent predicates need not examine the tuple.  The metadata information is used in query optimization to order the sampling and predicates.  Besides predicates in the WHERE clause, expensive sampling operators must also be ordered appropriately with respect to the SELECT, GROUP BY, and HAVING clauses. Power-aware Optimization.

 Ordering of Sampling and Predicates.  For example,  Consider the query below:  SELECT accel, mag FROM sensors WHERE accel > c1 AND mag > c2 SAMPLEINTERVAL 1s Power-aware Optimization.

 Event Query Batching to Conserve Power.  It is possible for multiple instances of the internal query to be running at the same time. -> power waste.  Multi-query optimization technique based on rewriting to alleviate the burden of running multiple copies of the same identical query.  The advantage of this approach is that only one query runs at a time no matter how frequently the events of type ‘e’ are triggered.  For frequent event-based queries, rewriting them as a join between an event stream and the sensors stream  can significantly reduce the rate at which a sensor must acquire samples. Power-aware Optimization.

 Event Query Batching to Conserve Power. Power-sensitive Dissemination and Routing.

 Event Query Batching to Conserve Power. Power-sensitive Dissemination and Routing.

 After the query has been optimized,  As each node hears the query, it must decide if the query applies locally and/or needs to be broadcast to its children in the routing tree.  If a query does not apply at a particular node, and the node does not have any children for which the query applies,  Then the entire sub-tree rooted at that node can be excluded from the query,  Saving the costs of disseminating, executing, and forwarding results for the query across several nodes, significantly extending the node’s lifetime.  Proposed a data structure.  Semantic Routing Tree (SRT). Power sensitive dissemination and routing.

 Figure 8. A semantic routing tree in use for a query.  Gray arrows : flow of the query down the tree.  Gray nodes must produce or forward results in the query. Power sensitive dissemination and routing.

 Communication Scheduling & Aggregate Queries.  The basic idea is to Subdivide the epoch into a number of intervals, and assign nodes to intervals based on their position in the routing tree.  During a node’s interval, if it is aggregating, it computes the partial state record consisting of the combination of any child values.  After this computation, it transmits either its partial state record or raw sensor readings up the network. Processing Queries.

 Interval-based communication. Processing Queries.

 Prioritizing Data Delivery.  The system must decide if it should discard the overflow tuple, discard some other tuple already in the queue, or combine two tuples via some aggregation policy.  Policies for Selection Queries.  Naïve scheme.  No tuple is considered more valuable than any other.  FIFO.  Tuples are dropped if they do not fit in the queue.  Winavg scheme.  This works similarly, except that instead of dropping results when the queue fills, the two results at the head of the queue are averaged to make room for new results. Processing Queries.

 Figure 13. Processing Queries.

 RMS (Root Mean Square) error. Processing Queries.

 Policies for Aggregate Queries.  For example : MAX aggregation (Figure 14).  Here node 2’s value can be suppressed if it is less than the maximum value snooped from nodes 3, 4, and 5. Processing Queries.

 Adapting Rates.  When initially optimizing a query, TinyDB’s optimizer chooses a transmission and sample rate,  based on current network load conditions, and requested sample rates and life-times.  The need for adaptivity in two contexts:  Network contention & Power consumption.  Adaptive back-off : transmission and sample rate changes. Processing Queries.

 Adapting Rates. Processing Queries.

Summary.