Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne.

Slides:



Advertisements
Similar presentations
The Optimal-Location Query
Advertisements

Spatial Join Queries. Spatial Queries Given a collection of geometric objects (points, lines, polygons,...) organize them on disk, to answer point queries.
Spatial Join Yan Huang Spatial Join Given two sets of spatial data Find the pair of objects satisfying certain spatial predicate – e.g.
1 Top-k Spatial Joins
Indexing and Range Queries in Spatio-Temporal Databases
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 2) Efficient Processing of Spatial Joins Using R-trees Rollo Chan Chu Chung Man Mak Wai Yip Vivian Lee Eric.
COMP 451/651 B-Trees Size and Lookup Chapter 1.
Stabbing the Sky: Efficient Skyline Computation over Sliding Windows COMP9314 Lecture Notes.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
Indexing the Positions of Continuously Moving Objects Saltenis, Jensen, Leutenegger and Lopez.
2-dimensional indexing structure
Spatial Access Methods Chapter 26 of book Read only 26.1, 26.2, 26.6 Dr Eamonn Keogh Computer Science & Engineering Department University of California.
Spatio-Temporal Databases
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Spatial Queries Nearest Neighbor and Join Queries.
Hierarchical Constraint Satisfaction in Spatial Database Dimitris Papadias, Panos Kalnis And Nikos Mamoulis.
Chapter 3: Data Storage and Access Methods
Spatial Queries Nearest Neighbor Queries.
Spatio-Temporal Databases. Introduction Spatiotemporal Databases: manage spatial data whose geometry changes over time Geometry: position and/or extent.
Handling Location Imprecision in Moving Object Database Xinfa Hu March 2007.
R-Trees 2-dimensional indexing structure. R-trees 2-dimensional version of the B-tree: B-tree of maximum degree 8; degree between 3 and 8 Internal nodes.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
AALG, lecture 11, © Simonas Šaltenis, Range Searching in 2D Main goals of the lecture: to understand and to be able to analyze the kd-trees and.
Spatial Indexing. Spatial Queries Given a collection of geometric objects (points, lines, polygons,...) organize them on disk, to answer point queries.
Making B+-Trees Cache Conscious in Main Memory
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Join-Queries between two Spatial Datasets Indexed by a Single R*-tree Michael Vassilakopoulos.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Approximate XML Joins Huang-Chun Yu Li Xu. Introduction XML is widely used to integrate data from different sources. Perform join operation for XML documents:
Efficiently Processing Queries on Interval-and-Value Tuples in Relational Databases Jost Enderle, Nicole Schneider, Thomas Seidl RWTH Aachen University,
CSIS7101 – Advanced Database Technologies Spatio-Temporal Data (Part 1) On Indexing Mobile Objects Kwong Chi Ho Leo Wong Chi Kwong Simon Lui, Tak Sing.
Antonin Guttman In Proceedings of the 1984 ACM SIGMOD international conference on Management of data (SIGMOD '84). ACM, New York, NY, USA.
IKI 10100: Data Structures & Algorithms Ruli Manurung (acknowledgments to Denny & Ade Azurat) 1 Fasilkom UI Ruli Manurung (Fasilkom UI)IKI10100: Lecture17.
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Spatial Indexing Techniques Introduction to Spatial Computing CSE 5ISC Some slides adapted from Spatial Databases: A Tour by Shashi Shekhar Prentice Hall.
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
By: Gang Zhou Computer Science Department University of Virginia 1 Medians and Beyond: New Aggregation Techniques for Sensor Networks CS851 Seminar Presentation.
Efficient OLAP Operations in Spatial Data Warehouses Dimitris Papadias, Panos Kalnis, Jun Zhang and Yufei Tao Department of Computer Science Hong Kong.
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Location-based Spatial Queries AGM SIGMOD 2003 Jun Zhang §, Manli Zhu §, Dimitris Papadias §, Yufei Tao †, Dik Lun Lee § Department of Computer Science.
Spatio-Temporal Databases
23 1 Christian Böhm 1, Florian Krebs 2, and Hans-Peter Kriegel 2 1 University for Health Informatics and Technology, Innsbruck 2 University of Munich Optimal.
1 Reverse Nearest Neighbor Queries for Dynamic Databases SHOU Yu Tao Jan. 10 th, 2003 SIGMOD 2000.
Spatio-Temporal Databases. Term Project Groups of 2 students You can take a look on some project ideas from here:
Spatial Queries Nearest Neighbor and Join Queries Most slides are based on slides provided By Prof. Christos Faloutsos (CMU) and Prof. Dimitris Papadias.
Jeremy Iverson & Zhang Yun 1.  Chapter 6 Key Concepts ◦ Structures and access methods ◦ R-Tree  R*-Tree  Mobile Object Indexing  Questions 2.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
Indexing Multidimensional Data
Keogh, E. , Chakrabarti, K. , Pazzani, M. & Mehrotra, S. (2001)
Strategies for Spatial Joins
Spatial Queries Nearest Neighbor and Join Queries.
Spatio-Temporal Databases
Spatial Indexing.
Sameh Shohdy, Yu Su, and Gagan Agrawal
Nearest Neighbor Queries using R-trees
Spatio-Temporal Databases
Joining Interval Data in Relational Databases
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
Database Design and Programming
R-tree – Another Example (1/2)
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

Continuous Intersection Joins Over Moving Objects Rui Zhang University of Melbourne Dan Lin Purdue University Kotagiri Ramamohanarao University of Melbourne Elisa Bertino Purdue University

Outline Backgrounds Intersection Joins on moving objects Indexes for moving objects Algorithms Adapting existing algorithms Our approach Time constrained processing Improvement techniques Experiments

Motivation (Traditional) Intersection join Given two sets of spatial objects A and B, find all object pairs ‹i,j›, where i  A, j  B, such that i intersects j. Intersection join on moving objects Moving Continuous

Join Algorithms Nested loops join Basic Expensive Block nested loops join Efficient Dependent on buffer size Index nested loops join Efficient and robust Sort-merge join Efficient Difficult for spatial objects

Indexing Moving Objects Monitoring moving objects Sampling-based Trajectory-based p = p ( t ref ) + v (t - t ref ) T M : maximum update interval R-tree [SIGMOD’84] Minimum bounding rectangle (MBR) TPR-tree [SIGMOD’00] Add time parameters to the R-tree Other indexes: B x -tree [VLDB’04], STRIPES [SIGMOD’04] Only for points uuu u u u u N2N2 N1N1 N1N1 N2N2 N1N1 ACD N1N1 BE N2N2 F N3N3 N3N3 A B C D E F

Naive Algorithm (NaiveJoin) Join nodes from two TPR-trees recursively If intersected, check on children Otherwise, disregard it For an update, compute its join pairs and update the answer Join result ‹a 1,b 1 ›, [0,3] ‹a 2,b 2 ›, [1,4] ‹a 3,b 4 ›, [6,8] Node access (IO) roots, N 1, N 2, N 3, N 4 Comparison (CPU) root A vs root B, N 1 vs N 3, N 2 vs N 4

Extended TP-Join Algorithm (ETP-Join) Time Parameterized Join (TP-Join) [SIGMOD’02] Current result ‹a 1,b 1 › Expiry time 1 Event that causes the change ‹a 2,b 2 › Join result ‹a 1,b 1 ›, [0,3] ‹a 2,b 2 ›, [1,4] ‹a 3,b 4 ›, [6,8] root A vs root B, N 1 vs N 3 Comparison (CPU) roots, N 1, N 3 Node access (IO) For the 1 st TP-Join

Summary NaiveJoin One tree traversal per update, but expensive traversal ETP-Join Cheaper traversal, but too frequent traversals Node access (IO) roots, N 1, N 2, N 3, N 4 Comparison (CPU) root A vs root B, N 1 vs N 3, N 2 vs N 4 Node access (IO) roots, N 1, N 3 Comparison (CPU) root A vs root B, N 1 vs N 3 For the 1 st TP-Join Too longToo short

Key Problem Find a good time range for computing the join pairs Observation Consider object a and b Let the next update time for them be t a and t b Perfect time range for computing their join result is [t c, min(t a,t b )] How do we know t a or t b ? T M gives a bound for them Time range is cut from [t c,  ] to [t c, t c +T M ] Is this correct for all objects? Yes. Proof in technical report:

Time Constrained Processing (TC-Join) NaiveJoin with constrained processing time range [t c, t c +T M ] Join result ‹a 1,b 1 ›, [0,3] ‹a 2,b 2 ›, [1,4] ‹a 3,b 4 ›, [6,8] Node access (IO) roots, N 1, N 3 Comparison (CPU) root A vs root B, N 1 vs N 3

Further Optimization (MTB-Join) Many objects will not update at the time bound Put objects in time buckets Each time bucket has an associated TPR-tree An object is inserted into the tree whose time bucket contains the object’s latest update time t c is in [T M, 3/2T M ]

Improvement on the Basic Join Algorithm Plane Sweep Sorting based on the lower left corner in dimension x Two sequences: S a = ‹a 3, a 4, a 5 ›; S b = ‹ b 1, b 2, b 3, b 4 › Two essential components for PS Lower bound Upper bound

Other Improvements Sorting dimension selection Smaller average speed Intersection check First intersection check and then plane sweep

Experiments: setting Computer: 2.6G Pentium IV CPU, 1G RAM Datasets: Uniform, Gaussian, Battlefield Measure: IO and Time ParameterValue Node capacity113 Maximum update interval (T M )60, 120, 240 Maximum object speed1, 2, 3, 4, 5 Object size (% of space)0.5, 0.1, 0.2, 0.4, 0.8 Dataset size1K, 10K, 50K, 100K DatasetUniform, Gaussian, Battlefield

Experiments: TC processing Up to 15 times improvement

Experiments: Improvement techniques Up to 6 times improvement

Comparison: Initial Join MTB-Join outperforms others Half an hour for NaiveJoin

Comparison: Maintenance Up to 10 4 times improvement Time for processing the join for one second 1K10K100K MTB-Join0.03 millisecs0.05 secs6 secs ETP-Join6.3 secs15 minshours

Conclusion and future work Conclusion Time Constrained processing Further optimization by bucketing in time Improvement techniques Several orders of magnitude performance improvement Future work Applying TC processing to other queries

References R-tree [SIGMOD’04] Antonin Guttman. R-Trees: A Dynamic Index Structure for Spatial Searching. ACM SIGMOD Conference TPR-tree [SIGMOD ’ 00] S. Saltenis, C. S.Jensen, S. T. Leutenegger, and M. A. Lopez. Indexing the positions of continuously moving objects. ACM SIGMOD Conference B x -tree [VLDB ’ 04] C. Jensen, D. Lin, and B.C.Ooi. Query and update efficient B+-tree based indexing of moving objects. International conference on Very Large Databases, STRIPES [SIGMOD’04] J. M. Patel, Y. Chen, and V. P. Chakka. STRIPES: An efficient index for predicted trajectories. ACM SIGMOD Conference TP-Join [SIGMOD’02] Y. Tao and D. Papadias. Time-parameterized queries in spatio-temporal databases. ACM SIGMOD Conference 2002.

Questions Please send your questions to Rui Zhang