Semantics and Evaluation Techniques for Window Aggregates in Data Stream Jin Li, David Maier, Kristin Tufte, Vassillis Papadimos, Peter Tucker. Presented.

Slides:



Advertisements
Similar presentations
Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter A. Tucker SIGMOD.
Advertisements

1 Quick recap of the SQL & the GUI way in Management Studio The Adwentureworks database from the book A script to create tables & insert data for the Amazon.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 5 More SQL: Complex Queries, Triggers, Views, and Schema Modification.
Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter A. Tucker SIGMOD.
CS240B Midterm Spring 2013 Your Name: and your ID: Problem Max scoreScore Problem 140% Problem 232% Problem 228% Total 100%
Exploring Microsoft Access 2003 Chapter 3 Information From the Database: Reports and Queries.
Copyright © 2004 Pearson Education, Inc.. Chapter 9 More SQL: Assertions, Views, and Programming Techniques.
1 Efficient Temporal Coalescing Query Support in Relational Database Systems Xin Zhou 1, Carlo Zaniolo 1, Fusheng Wang 2 1 UCLA, 2 Simens Corporate Research.
1 11. Streaming Data Management Chapter 18 Current Issues: Streaming Data and Cloud Computing The 3rd edition of the textbook.
3/13/2012Data Streams: Lecture 161 CS 410/510 Data Streams Lecture 16: Data-Stream Sampling: Basic Techniques and Results Kristin Tufte, David Maier.
Continuous Analytics Over Discontinuous Streams Sailesh Krishnamurthy, Michael Franklin, Jeff Davis, Daniel Farina, Pasha Golovko, Alan Li, Neil Thombre.
Exploring Microsoft Access
CREATE VIEW SYNTAX CREATE VIEW name [(view_col [, view_col …])] AS [WITH CHECK OPTION];
STREAM: The Stanford Data Stream Management System Rebuttal Team Mingzhu Wei Di Yang CS525s - Fall 2006.
Windows in Niagara Jin (Jenny) Li, David Maier, Vassilis Papadimos, Peter Tucker, Kristin Tufte.
Data Integration Aggregate Query Answering under Uncertain Schema Mappings Avigdor Gal, Maria Vanina Martinez, Gerardo I. Simari, VS Subrahmanian Presented.
Constraints and Triggers Foreign Keys Local and Global Constraints Triggers.
An Abstract Semantics and Concrete Language for Continuous Queries over Streams and Relations Presenter: Liyan Zhang Presentation of ICS
Title and Authors of the Paper you reviewed goes here Your name goes here.
Database Systems More SQL Database Design -- More SQL1.
STREAM The Stanford Data Stream Management System.
CSE314 Database Systems More SQL: Complex Queries, Triggers, Views, and Schema Modification Doç. Dr. Mehmet Göktürk src: Elmasri & Navanthe 6E Pearson.
Introduction to Accounting Information Systems
1 11/3/05CS360 Windows Programming Databases and Data Representation.
Query Processing, Resource Management, and Approximation in a Data Stream Management System.
Structured Query Language Chris Nelson CS 157B Spring 2008.
VICTORIA UNIVERSITY OF WELLINGTON Te Whare Wananga o te Upoko o te Ika a Maui SWEN 432 Advanced Database Design and Implementation Exam and Lecture Overview.
Bdbms: A Database System for Scientific Data Management Mohamed Y. Eltabakh, Mourad Ouzzani, Walid G. Aref, Ahmed Elmagarmid, Yasin Silva, Umer Arshad,
Data Streams: Lecture 101 Window Aggregates in NiagaraST Kristin Tufte, Jin Li Thanks to the NiagaraST PSU.
Integrating Scale Out and Fault Tolerance in Stream Processing using Operator State Management Author: Raul Castro Fernandez, Matteo Migliavacca, et al.
Constraints, Triggers and Views COMSATS INSTITUTE OF INFORMATION TECHNOLOGY, VEHARI.
JONATHAN LESSINGER A CRITIQUE OF CQL. PLAN 1.Background (How CQL, STREAM work) 2.Issues.
CS499 Project #3 XML mySQL Test Generation Members Erica Wade Kevin Hardison Sameer Patwa Yi Lu.
Constraints and Triggers. What’s IC? Integrity Constraints define the valid states of SQL-data by constraining the values in the base tables. –Restrictions.
Network Computing Laboratory A programming framework for Stream Synthesizing Service.
CS4432: Database Systems II Query Processing- Part 2.
Triggers and Streams Zachary G. Ives University of Pennsylvania CIS 650 – Database & Information Systems March 28, 2005.
1 Semantics and Evaluation Techniques for Window Aggregates in Data Streams Jin Li, David Maier, Kristin Tufte, Vassilis Papadimos, Peter Tucker This work.
Jennifer Widom Relational Databases The Relational Model.
CS240A: Databases and Knowledge Bases TSQL2 Carlo Zaniolo Department of Computer Science University of California, Los Angeles Notes From Chapter 6 of.
SQL and Query Execution for Aggregation. Example Instances Reserves Sailors Boats.
SQL: Interactive Queries (2) Prof. Weining Zhang Cs.utsa.edu.
What’s Next 2015 H2 – Improve cube algorithm Cube by segments, 30%-50% faster Build delay down to tens of minutes – Streaming cubing Analyze real-time.
The latte Stream-Archive Query Project - Exploring Stream+Archive Data in Intelligent Transportation Systems Jin Li (with Kristin Tufte, Vassilis Papadimos,
Data Streams COMP3017 Advanced Databases Dr Nicholas Gibbins –
IncApprox The marriage of incremental and approximate computing Pramod Bhatotia Dhanya Krishnan, Do Le Quoc, Christof Fetzer, Rodrigo Rodrigues* (TU Dresden.
1 Out of Order Processing for Stream Query Evaluation Jin Li (Portland State Universtiy) Joint work with Theodore Johnson, Vladislav Shkapenyuk, David.
More SQL: Complex Queries, Triggers, Views, and Schema Modification
S. Sudarshan CS632 Course, Mar 2004 IIT Bombay
Supporting Ranking and Clustering as Generalized Order-By and Group-By
COMP3211 Advanced Databases
Lecturer : Dr. Pavle Mogin
CS422 Principles of Database Systems Course Overview
Applying Control Theory to Stream Processing Systems
Relational Algebra - Part 1
SQL in Oracle.
Schedule Today: Next After that Subqueries, Grouping and Aggregation.
Data stream as an unbounded table
High-resolution air quality forecasting for Hong Kong
Relational Databases The Relational Model.
Relational Databases The Relational Model.
Question 1 Find four… 4 marks 5 minutes Paper 1
The Relational Model Textbook /7/2018.
CSE 491/891 Lecture 21 (Pig).
Adding Multiple Logical Table Sources
CS240B: Assignment1 Winter 2016.
Dop d d 1 2 reconst reconst sop P P 1 2.
Theppatorn rhujittawiwat
CS240A: Databases and Knowledge Bases TSQL2
Streams and Stuff Sirish and Sam and Mike.
Presentation transcript:

Semantics and Evaluation Techniques for Window Aggregates in Data Stream Jin Li, David Maier, Kristin Tufte, Vassillis Papadimos, Peter Tucker. Presented by: Venkatesh Raghvan Charudatta Wad CS 525 Class discussion

Overview  Background  Problem Statement  Window semantics  WID approach  Discussion

Background  Disorders Handling: Punctuations.  Aggregate Queries: In SQL? In CQL? (without WIDs)  In sliding windows, what causes an output?

Problem Statement  Lack of explicit window semantics.  Implementation efficiency.  Out of order arrival of data.

Running Example  Consider the example from the paper: Schema Query: SELECT seg-id, max(speed), min(speed) FROM Traffic [Range 300 seconds SLIDE 60 seconds WATTR ts] GROUP BY seg-id.

Running Example - This picture is taken from the paper itself.

Big Picture  Mapping of tuples to window extents and vice versa.  New Window semantics.  Window specifications: RANGE, SLIDE and WATTR.

Window specification  Time based query: Counting the number of vehicles in each segment for the past 1 hour, update the result every 20 min. SELECT seg-id, count(*) FROM Traffic [RANGE 60 minutes SLIDE 20 minutes WATTR ts] GROUP BY seg-id.

Window specification  Tuple-based query: Counting the number of vehicles in each segment for the past 100 rows, update the result every 10 rows. SELECT seg-id, count(*) FROM Traffic [RANGE 100 rows SLIDE 10 rows WATTR row-num] GROUP BY seg-id.

Window specification  Can we specify RANGE and SLIDE on different attributes:  YES!! SELECT seg-id, count(*) FROM Traffic [RANGE 300 seconds SLIDE 10 rows RATTR ts SATTR row-num] GROUP BY seg-id.

WID Approach  Explained by Venky.