Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.

Slides:



Advertisements
Similar presentations
CpSc 3220 File and Database Processing Lecture 17 Indexed Files.
Advertisements

1 Query Processing in Spatial Network Databases presented by Hao Hong Dimitris Papadias Jun Zhang Hong Kong University of Science and Technology Nikos.
Spatio-temporal Databases
Indexing and Range Queries in Spatio-Temporal Databases
Maintaining Sliding Widow Skylines on Data Streams.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
©Silberschatz, Korth and Sudarshan12.1Database System Concepts Chapter 12: Part C Part A:  Index Definition in SQL  Ordered Indices  Index Sequential.
Chapter 11 Indexing and Hashing (2) Yonsei University 2 nd Semester, 2013 Sanghyun Park.
STUN: SPATIO-TEMPORAL UNCERTAIN (SOCIAL) NETWORKS Chanhyun Kang Computer Science Dept. University of Maryland, USA Andrea Pugliese.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne.
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
1 Lecture 8: Data structures for databases II Jose M. Peña
Query Processing in Databases Dr. M. Gavrilova.  Introduction  I/O algorithms for large databases  Complex geometric operations in graphical querying.
2-dimensional indexing structure
Spatio-temporal Databases Time Parameterized Queries.
Spatio-Temporal Databases
Temporal Databases. Outline Spatial Databases Indexing, Query processing Temporal Databases Spatio-temporal ….
Computer Science Spatio-Temporal Aggregation Using Sketches Yufei Tao, George Kollios, Jeffrey Considine, Feifei Li, Dimitris Papadias Department of Computer.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Approximate querying about the Past, the Present, and the Future in Spatio-Temporal Databases Jimeng Sun, Dimitris Papadias, Yufei Tao, Bin Liu.
Temporal Databases. Outline Spatial Databases Indexing, Query processing Temporal Databases Spatio-temporal ….
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis.
Dieter Pfoser, LBS Workshop1 Issues in the Management of Moving Point Objects Dieter Pfoser Nykredit Center for Database Research Aalborg University, Denmark.
Introduction to Databases CIS 5.2. Where would you find info about yourself stored in a computer? College Physician’s office Library Grocery Store Dentist’s.
An Incremental Refining Spatial Join Algorithm for Estimating Query Results in GIS Wan D. Bae, Shayma Alkobaisi, Scott T. Leutenegger Department of Computer.
Advanced Querying OLAP Part 2. Context OLAP systems for supporting decision making. Components: –Dimensions with hierarchies, –Measures, –Aggregation.
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
The Tornado Model: Uncertainty Model for Continuously Changing Data Byunggu Yu 1, Seon Ho Kim 2, Shayma Alkobaisi 2, Wan Bae 2, Thomas Bailey 3 Department.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
1 Indexing Large Trajectory Data Sets With SETI V.Prasad Chakka Adam C.Everspaugh Jignesh M.Patel University of Michigan Presented by Guangyue Jia.
Moving Objects Databases Nilanshu Dharma Shalva Singh.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Dept. of Electrical Engineering and Computer Science, Northwestern University Context-Aware Optimization of Continuous Query Maintenance for Trajectories.
Cache-Conscious Performance Optimization for Similarity Search Maha Alabduljalil, Xun Tang, Tao Yang Department of Computer Science University of California.
R-Tree. 2 Spatial Database (Ia) Consider: Given a city map, ‘index’ all university buildings in an efficient structure for quick topological search.
1 The MV3R-Tree: A Spatio- Temporal Access Method for Timestamp and Interval Queries Yufei Tao and Dimitris Papadias Hong Kong University of Science and.
MySQL spatial indexing for GIS data in a web 2.0 internet application Brian Toone Samford University
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
August 30, 2004STDBM 2004 at Toronto Extracting Mobility Statistics from Indexed Spatio-Temporal Datasets Yoshiharu Ishikawa Yuichi Tsukamoto Hiroyuki.
ST Data-warehouse for trajectories Some preliminary ideas S. Orlando, R. Orsini, A. Raffaetà, A. Roncato.
Variant Indexes. Specialized Indexes? Data warehouses are large databases with data integrated from many independent sources. Queries are often complex.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Location-based Spatial Queries AGM SIGMOD 2003 Jun Zhang §, Manli Zhu §, Dimitris Papadias §, Yufei Tao †, Dik Lun Lee § Department of Computer Science.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Spatio-Temporal Databases. Term Project Groups of 2 students You can take a look on some project ideas from here:
Jeremy Iverson & Zhang Yun 1.  Chapter 6 Key Concepts ◦ Structures and access methods ◦ R-Tree  R*-Tree  Mobile Object Indexing  Questions 2.
CSIS7101 – Advanced Database Technologies Spatio-Temporal Data (Part 3) The MV3-Tree: A Spatio-Temporal Access Method for Timestamp and Interval Queries.
Dense-Region Based Compact Data Cube
Advanced Database Aggregation Query Processing
Spatio-Temporal Databases
Progressive Computation of The Min-Dist Optimal-Location Query
Spatio-temporal Pattern Queries
Spatial Online Sampling and Aggregation
Spatio-temporal Databases
Chapter 11: Indexing and Hashing
Spatio-Temporal Databases
Similarity Search: A Matching Based Approach
Spatio-temporal Databases
Chapter 11: Indexing and Hashing
Efficient Aggregation over Objects with Extent
Presentation transcript:

Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science and Technology Clear Water Bay, Hong Kong 26, Feb, 2002 This work was supported by grants HKUST 6081/01E and 6070/00E from Hong Kong RGC.

2 Outline Preliminary – Spatial data warehouses and aggregate trees Applications and motivation Solution for static objects Solution for dynamic objects Performance study Conclusion

3 Preliminary – Spatial Data Warehouses Each spatial object carries some sort of aggregate information (i.e., each landscape may involve the population). A common query is the window aggregate query, which specifies a window query and retrieves the aggregate sum of all objects intersecting it. –Analogy of the “group-by” in conventional data warehouses. Materialization techniques common in traditional data warehouses are of limited use since possible positions of queries are infinite. –Ad-hoc “group-by” R1R1 R2R2 R3R3 R4R qsqs

4 Preliminaries – Spatial Data Warehouse A better approach is to deploy aggregate trees to introduce the spatial hierarchy [Kline and Snodgrass, 1995, Papadias, et al, 2001, Lazaridis and Mehrotra, 2001]. R1R1 R2R2 R3R3 R4R4 R5R5 R6R R1R1 150 R2R2 75 R3R3 132 R4R4 12 R5R5 225 R6R6 144 qsqs Aggregation R-tree Retrieve the sum of aggregate of objects intersecting q s

5 Spatio-Temporal DW: Applications and Motivation Spatio-temporal databases deal with objects whose properties may change with time. Traditional studies in spatio-temporal databases focus on retrieving the actual objects that satisfy the query predicates. –Retrieve all vehicles that appear in the north district during 3pm to 5pm yesterday. A more useful type of queries may be to retrieve, instead of the actual object IDs, the number of objects that satisfy the query conditions. –Retrieve the (approximate) number of vehicles in the north district during 3pm-5pm yesterday. In the above example, the spatial objects (i.e., streets in the north district) that carry aggregate information (i.e., number of cars) are static. Other queries may involve dynamic objects. –The mobile phone antenna (i.e., the aggregate information = # of users served by the antenna) whose spatial extents (i.e., covering areas) may change over time.

6 Example (Static Objects) Query q s retrieve the aggregate sum (during time T 1 -T 4 ) of all rectangles that intersect it.

7 Traditional Methods Pre-materialization –Even more difficult than spatial DW due to the inclusion of the temporal dimension. Use an aggregation tree. –When the aggregate of a region changes, create a 3D box. An aggregate 3D R-tree is used to index all these boxes. –Problem: The spatial extent of a region must be duplicated many times although it does not change. 3D boxes for region R T3T3 T4T4 T5T5 T1T1

8 Aggregate RB-tree Spatial extents are stored only once.

9 Example (Dynamic Objects) Query q s retrieve the aggregate sum (during time T 1 -T 4 ) of all rectangles that intersect it. R1R1 R2R2 R3R3 R4R4 qsqs Situation during timestamps 1-4

10 Example (cont.) Query q s retrieve the aggregate sum (during time T 1 -T 4 ) of all rectangles that intersect it. R1R1 R2R2 R3R3 R4R4 qsqs change position at timestamp 5

11 Aggregate HRB-tree Integrates the previous idea with the spatio-temporal access method HR-trees. timestamp 5 timestamp 1-4

12 Aggregate 3D RB-tree Creates a 3D box only when the spatial extent of an object changes.

13 Managing Numerous B-trees If each B-tree is too small (i.e., the rates of spatial extent and aggregate changes are similar) –A block contains too few entries and much space is wasted. –Not suitable for caching. Our solution is to use a B-File, which “packs” numerous B-trees into a single file –Avoiding empty spaces in a disk page. –Maintaining the same query performance.

14 Performance Dataset settings –Number of spatial objects = 10,000 –History length = 1,000 timestamps –Aggregate agility – describes how fast the aggregate information changes (4%, 8%, 16%, 32%, 64%) –Region agility – describes how fast the spatial extents change 0% for static objects 0.01% for dynamic objects (capturing the fact that spatial dimension changes much slower than the aggregate data) –Datasets include 500,000 to 6,500,5000 records. Each query contains 2 parameters: (spatial extents and interval length).

15 Results (Static Objects)

16 Results (Static Objects)

17 Results (Dynamic Objects)

18 Results (Dynamic Objects)

19 Conclusion We propose indexing techniques that replace the data cube in spatio- temporal data warehouses and answer ad-hoc group-by queries very efficiently. –Both static and dynamic spatial dimensions are discussed. Extensions –Cost models that predict the performance of alternative structures. –Query optimization based on the cost models. –Complex query evaluation