Reverse Furthest Neighbors in Spatial Databases Bin Yao, Feifei Li, Piyush Kumar Florida State University, USA.

Slides:



Advertisements
Similar presentations
The Optimal-Location Query
Advertisements

Voronoi-based Geospatial Query Processing with MapReduce
The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
Finding the Sites with Best Accessibilities to Amenities Qianlu Lin, Chuan Xiao, Muhammad Aamir Cheema and Wei Wang University of New South Wales, Australia.
Ranking Outliers Using Symmetric Neighborhood Relationship Wen Jin, Anthony K.H. Tung, Jiawei Han, and Wei Wang Advances in Knowledge Discovery and Data.
1 A FAIR ASSIGNMENT FOR MULTIPLE PREFERENCE QUERIES Leong Hou U, Nikos Mamoulis, Kyriakos Mouratidis Gruppo 10: Paolo Barboni, Tommaso Campanella, Simone.
1 Top-k Spatial Joins
Nearest Neighbor Queries using R-trees
Indexing and Range Queries in Spatio-Temporal Databases
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
A Generic Framework for Monitoring Continuous Spatial Queries over Moving Objects.
Distributed Indexing and Querying in Sensor Networks using Statistical Models Arnab Bhattacharya Indian Institute of Technology (IIT),
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.
Improving the Performance of M-tree Family by Nearest-Neighbor Graphs Tomáš Skopal, David Hoksza Charles University in Prague Department of Software Engineering.
1 NNH: Improving Performance of Nearest- Neighbor Searches Using Histograms Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research) Chen Li (UC Irvine)
Efficient Reverse k-Nearest Neighbors Retrieval with Local kNN-Distance Estimation Mike Lin.
Effectively Indexing Uncertain Moving Objects for Predictive Queries School of Computing National University of Singapore Department of Computer Science.
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Using Trees to Depict a Forest Bin Liu, H. V. Jagadish EECS, University of Michigan, Ann Arbor Presented by Sergey Shepshelvich 1.
Efficient Processing of Top-k Spatial Keyword Queries João B. Rocha-Junior, Orestis Gkorgkas, Simon Jonassen, and Kjetil Nørvåg 1 SSTD 2011.
Liang Jin (UC Irvine) Nick Koudas (AT&T) Chen Li (UC Irvine)
Optimization of Spatial Joins on Mobile Devices N. Mamoulis 1, P. Kalnis 2, S. Bakiras 3, X. Li 2 1 Department of Computer Science and Information Systems,
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Probabilistic Similarity Search for Uncertain Time Series Presented by CAO Chen 21 st Feb, 2011.
A New Point Access Method based on Wavelet Trees Nieves R. Brisaboa, Miguel R. Luaces, Diego Seco Database Laboratory University of A Coruña A Coruña,
Spatial Queries Nearest Neighbor Queries.
1 SINA: Scalable Incremental Processing of Continuous Queries in Spatio-temporal Databases Mohamed F. Mokbel, Xiaopeng Xiong, Walid G. Aref Presented by.
Scalable Network Distance Browsing in Spatial Database Samet, H., Sankaranarayanan, J., and Alborzi H. Proceedings of the 2008 ACM SIGMOD international.
Evaluation of Top-k OLAP Queries Using Aggregate R-trees Nikos Mamoulis (HKU) Spiridon Bakiras (HKUST) Panos Kalnis (NUS)
Trip Planning Queries F. Li, D. Cheng, M. Hadjieleftheriou, G. Kollios, S.-H. Teng Boston University.
Clustering Vertices of 3D Animated Meshes
Approximate Encoding for Direct Access and Query Processing over Compressed Bitmaps Tan Apaydin – The Ohio State University Guadalupe Canahuate – The Ohio.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Reverse Top-k Queries Akrivi Vlachou *, Christos Doulkeridis *, Yannis Kotidis #, Kjetil Nørvåg * *Norwegian University of Science and Technology (NTNU),
Learning Geographical Preferences for Point-of-Interest Recommendation Author(s): Bin Liu Yanjie Fu, Zijun Yao, Hui Xiong [KDD-2013]
Parallel dynamic batch loading in the M-tree Jakub Lokoč Department of Software Engineering Charles University in Prague, FMP.
Clustering Moving Objects in Spatial Networks Jidong Chen, Caifeng Lai, Xiaofeng Meng, Renmin University of China Jianliang Xu, and Haibo Hu Hong Kong.
Influence Zone: Efficiently Processing Reverse k Nearest Neighbors Queries Presented By: Muhammad Aamir Cheema Joint work with Xuemin Lin, Wenjie Zhang,
Nearest Neighbor Queries Chris Buzzerd, Dave Boerner, and Kevin Stewart.
Efficient Processing of Top-k Spatial Preference Queries
Zhuo Peng, Chaokun Wang, Lu Han, Jingchao Hao and Yiyuan Ba Proceedings of the Third International Conference on Emerging Databases, Incheon, Korea (August.
Spatio-temporal Pattern Queries M. Hadjieleftheriou G. Kollios P. Bakalov V. J. Tsotras.
Open Problem: Dynamic Planar Nearest Neighbors CSCE 620 Problem 63 from the Open Problems Project
On Computing Top-t Influential Spatial Sites Authors: T. Xia, D. Zhang, E. Kanoulas, Y.Du Northeastern University, USA Appeared in: VLDB 2005 Presenter:
9/2/2005VLDB 2005, Trondheim, Norway1 On Computing Top-t Most Influential Spatial Sites Tian Xia, Donghui Zhang, Evangelos Kanoulas, Yang Du Northeastern.
QED: A Novel Quaternary Encoding to Completely Avoid Re-labeling in XML Updates Changqing Li,Tok Wang Ling.
Exact indexing of Dynamic Time Warping
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
A FAIR ASSIGNMENT FOR MULTIPLE PREFERENCE QUERIES
R-Trees: A Dynamic Index Structure For Spatial Searching Antonin Guttman.
Database Management Systems, R. Ramakrishnan 1 Algorithms for clustering large datasets in arbitrary metric spaces.
Bin Yao, Feifei Li, Piyush Kumar Presenter: Lian Liu.
ICDE-2006 Subramanian Arumugam Christopher Jermaine Department of Computer Science University of Florida 22nd International Conference on Data Engineering.
A Spatial Index Structure for High Dimensional Point Data Wei Wang, Jiong Yang, and Richard Muntz Data Mining Lab Department of Computer Science University.
1 Reverse Nearest Neighbor Queries for Dynamic Databases SHOU Yu Tao Jan. 10 th, 2003 SIGMOD 2000.
Presenters: Amool Gupta Amit Sharma. MOTIVATION Basic problem that it addresses?(Why) Other techniques to solve same problem and how this one is step.
Rethinking Choices for Multi-dimensional Point Indexing You Jung Kim and Jignesh M. Patel University of Michigan.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
A Unified Framework for Efficiently Processing Ranking Related Queries
Influence sets based on Reverse Nearest Neighbor Queries
Spatio-temporal Pattern Queries
Fast Nearest Neighbor Search on Road Networks
Presented by: Mahady Hasan Joint work with
Efficient Processing of Top-k Spatial Preference Queries
Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research)
Presentation transcript:

Reverse Furthest Neighbors in Spatial Databases Bin Yao, Feifei Li, Piyush Kumar Florida State University, USA

A Novel Query Type Reverse Furthest Neighbors (RFN) Given a point q and a data set P, find the set of points in P that take q as their furthest neighbor Two versions :  Monochromatic Reverse Furthest Neighbors (MRFN)  Bichromatic Reverse Furthest Neighbors (BRFN)

Motivation and Related works Motivation: inspired by RNN Reverse Nearest Neighbor  Set of points taking query point as their NN.  Monochromatic & Bichromatic RNN Many applications that are behind the studies of the RNN have the corresponding “furthest” versions.

MRFN Application P: a set of sites of interest in a region For any site, it could find the sites that take itself as their furthest neighbors This has an implication that visitors to the RFN of a site are unlikely to visit this site because of the long distance. Ideally, it should put more efforts in advertising itself in those sites.

BRFN Application P: a set of customers Q: a set of business competitors offering similar products A distance measure reflecting the rating of customer(p) to competitor(q)’s product. A larger distance indicates a lower preference. For any competitor in Q, an interesting query is to discover the customers that dislike his product the most among all competing products in the market.

BRFN Example : customer : product

MRFN and BRFN MRFN for q and P: BRFN for a point q in Q and P are:

Outline MRFN  Progressive Furthest Cell Algorithm  Convex Hull Furthest Cell Algorithm  Dynamically updating to dataset BRFN

MRFN: Progressive Furthest Cell Algorithm (first algorithm) Lemma: Any point from the furthest Voronoi cell(fvc) of p takes p as its furthest neighbor among all points in P.

Progressive Furthest Cell Algorithm (PFC) PFC(Query q; R-tree T) Initialize two empty vectors and ; priority queue L with T’s root node; fvc(q)=S; While L is not empty do  Pop the head entry e of L  If e is a point then, update the fvc(q) If fvc(q) is empty, return; If e is in fvc(q), then Push e into ;  else If e fvc(q) is empty then push e to ; Else for every child u of node e  If u fvc(q) is empty, insert u into ;  Else insert u into L ; Update fvc(q) using points contained by entries in ; Filter points in using fvc(q);

Outline MRFN  Progressive Furthest Cell Algorithm  Convex Hull Furthest Cell Algorithm  Dynamically updating to dataset BRFN

MRFN: Convex Hull Furthest Cell Algorithm(second algorithm) Lemma: the furthest point for p from P is always a vertex of the convex hull of P. (i.e., only vertices of CH have RFN.) Find the convex hull of P; if, then return empty; else  Compute using ;  Set fvc(q,P*) equal to fvc(q, );  Execute a range query using fvc(q,P*) on T; CHFC(Query q; R-tree T (on P)) // compute only once

Outline MRFN  Progressive Furthest Cell Algorithm  Convex Hull Furthest Cell Algorithm  Dynamically updating to dataset BRFN

Dynamically updating to dataset PFC: update R-tree CHFC:  update R-tree& re-compute CH (expensive)  Qhull algorithm

Dynamically Maintaining CH: insertion

Dynamically Maintaining CH: deletion The qhull algorithm

Dynamically Maintaining CH Adapt qhull to R-tree

Outline MRFN  Progressive Furthest Cell Algorithm  Convex Hull Furthest Cell Algorithm  Dynamically updating to dataset BRFN

After resolving all the difficulties for the MRFN problem, solving the BRFN problem becomes almost immediate. Observations:  all points in P that are contained by fvc(q,Q) will have q as their furthest neighbor.  Only the vertexes of the convex hull have fvc.

BRFN algorithm BRFN(Query q, Q; R-tree T) Compute the convex hull of Q; If then return empty; Else  Compute fvc(q, );  Execute a range query using fvc(q, ) on T;

BRFN: Disk-Resident Query Group Limitation: query group size may not fit in memory Solution: Approximate convex hull of Q (Dudley’s approximation)

Experiment Setup Dataset:  Real dataset (Map: USA, CA, SF)  Synthetic dataset (UN, CB, R-Cluster) Measurement  Computation time  Number of IOs  Average of 1000 queries

MRFN algorithm CPU computation Number of IOs

BRFN algorithms CPU: vary A, Q=1000 IOs: vary A, Q=1000

Scalability of various algorithms MRFN number of IOs BRFN number of IOs

Conclusion Introduced a novel query (RFN) for spatial databases. Presented R-tree based algorithms for both versions of RFN that feature excellent pruning capability. Conducted a comprehensive experimental evaluation.

Thank you! Questions?

Datasets: San Francisco

Datasets: California

Datasets: North America

Datasets : uncorrelated uniform

Datasets : correlated bivariate

Datasets : random clusters