1 Hypersphere Dominance: An Optimal Approach Cheng Long, Raymond Chi-Wing Wong, Bin Zhang, Min Xie The Hong Kong University of Science and Technology Prepared.

Slides:



Advertisements
Similar presentations
Computer Science and Engineering Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search Chengyuan Zhang 1,Ying Zhang 1,Wenjie Zhang 1, Xuemin.
Advertisements

Spatio-temporal Databases
Probabilistic Skyline Operator over Sliding Windows Wenjie Zhang University of New South Wales & NICTA, Australia Joint work: Xuemin Lin, Ying Zhang, Wei.
Cleaning Uncertain Data with Quality Guarantees Reynold Cheng, Jinchuan Chen, Xike Xie 2008 VLDB Presented by SHAO Yufeng.
1 Finding Shortest Paths on Terrains by Killing Two Birds with One Stone Manohar Kaul (Aarhus University) Raymond Chi-Wing Wong (Hong Kong University of.
School of Computer Science and Engineering Finding Top k Most Influential Spatial Facilities over Uncertain Objects Liming Zhan Ying Zhang Wenjie Zhang.
Nearest Neighbor Queries using R-trees Based on notes from G. Kollios.
Jianzhong Qi Rui Zhang Lars Kulik Dan Lin Yuan Xue The Min-dist Location Selection Query University of Melbourne 14/05/2015.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Similarity Search in High Dimensions via Hashing
1 NNH: Improving Performance of Nearest- Neighbor Searches Using Histograms Liang Jin (UC Irvine) Nick Koudas (AT&T Labs Research) Chen Li (UC Irvine)
Pivoting M-tree: A Metric Access Method for Efficient Similarity Search Tomáš Skopal Department of Computer Science, VŠB-Technical.
Similarity Search for Adaptive Ellipsoid Queries Using Spatial Transformation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa (Nara.
Indexing the imprecise positions of moving objects Xiaofeng Ding and Yansheng Lu Department of Computer Science Huazhong University of Science & Technology.
Voronoi-based Nearest Neighbor Search for Multi-Dimensional Uncertain Databases Peiwu Zhang Reynold Cheng Nikos Mamoulis Yu Tang University of Hong Kong.
Indexing Network Voronoi Diagrams*
A Generic Framework for Handling Uncertain Data with Local Correlations Xiang Lian and Lei Chen Department of Computer Science and Engineering The Hong.
Nearest Neighbor Search in Spatial and Spatiotemporal Databases
Quantile-Based KNN over Multi- Valued Objects Wenjie Zhang Xuemin Lin, Muhammad Aamir Cheema, Ying Zhang, Wei Wang The University of New South Wales, Australia.
Spatio-temporal Databases Time Parameterized Queries.
Subscription Subsumption Evaluation for Content-Based Publish/Subscribe Systems Hojjat Jafarpour, Bijit Hore, Sharad Mehrotra, and Nalini Venkatasubramanian.
Spatial Indexing SAMs. Spatial Access Methods PAMs Grid File kd-tree based (LSD-, hB- trees) Z-ordering + B+-tree R-tree Variations: R*-tree, Hilbert.
Redundant Bit Vectors for the Audio Fingerprinting Server John Platt Jonathan Goldstein Chris Burges.
On Efficient Spatial Matching Raymond Chi-Wing Wong (the Chinese University of Hong Kong) Yufei Tao (the Chinese University of Hong Kong) Ada Wai-Chee.
1 Efficient Method for Maximizing Bichromatic Reverse Nearest Neighbor Raymond Chi-Wing Wong (Hong Kong University of Science and Technology) M. Tamer.
High-Dimensional Similarity Search using Data-Sensitive Space Partitioning ┼ Sachin Kulkarni 1 and Ratko Orlandic 2 1 Illinois Institute of Technology,
Spatial Queries Nearest Neighbor Queries.
Probabilistic Skyline Operator over sliding Windows Wan Qian HKUST DB Group.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept.
Indexing Spatio-Temporal Data Warehouses Dimitris Papadias, Yufei Tao, Panos Kalnis, Jun Zhang Department of Computer Science Hong Kong University of Science.
1 Efficient Algorithms for Optimal Location Queries in Road Networks Zitong Chen (Sun Yat-Sen University) Yubao Liu (Sun Yat-Sen University) Raymond Chi-Wing.
Fast Subsequence Matching in Time-Series Databases Christos Faloutsos M. Ranganathan Yannis Manolopoulos Department of Computer Science and ISR University.
Improving Min/Max Aggregation over Spatial Objects Donghui Zhang, Vassilis J. Tsotras University of California, Riverside ACM GIS’01.
AAU A Trajectory Splitting Model for Efficient Spatio-Temporal Indexing Presented by YuQing Zhang  Slobodan Rasetic Jorg Sander James Elding Mario A.
Da Yan and Wilfred Ng The Hong Kong University of Science and Technology.
The X-Tree An Index Structure for High Dimensional Data Stefan Berchtold, Daniel A Keim, Hans Peter Kriegel Institute of Computer Science Munich, Germany.
Towards Robust Indexing for Ranked Queries Dong Xin, Chen Chen, Jiawei Han Department of Computer Science University of Illinois at Urbana-Champaign VLDB.
Top-k Similarity Join over Multi- valued Objects Wenjie Zhang Jing Xu, Xin Liang, Ying Zhang, Xuemin Lin The University of New South Wales, Australia.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
Computer Science and Engineering Efficiently Monitoring Top-k Pairs over Sliding Windows Presented By: Zhitao Shen 1 Joint work with Muhammad Aamir Cheema.
Wei-Shinn Ku Slide 1 Auburn University Computer Science and Software Engineering Query Integrity Assurance of Location-based Services Accessing Outsourced.
1 On Optimal Worst-Case Matching Cheng Long (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and.
Euripides G.M. PetrakisIR'2001 Oulu, Sept Indexing Images with Multiple Regions Euripides G.M. Petrakis Dept. of Electronic.
The σ-neighborhood skyline queries Chen, Yi-Chung; LEE, Chiang. The σ-neighborhood skyline queries. Information Sciences, 2015, 322: 張天彥 2015/12/05.
Information Technology Selecting Representative Objects Considering Coverage and Diversity Shenlu Wang 1, Muhammad Aamir Cheema 2, Ying Zhang 3, Xuemin.
1 Finding Competitive Price Yu Peng (Hong Kong University of Science and Technology) Raymond Chi-Wing Wong (Hong Kong University of Science and Technology)
Trajectory Simplification: On Minimizing the Direction-based Error
1 CSIS 7101: CSIS 7101: Spatial Data (Part 1) The R*-tree : An Efficient and Robust Access Method for Points and Rectangles Rollo Chan Chu Chung Man Mak.
Progressive Computation of The Min-Dist Optimal-Location Query Donghui Zhang, Yang Du, Tian Xia, Yufei Tao* Northeastern University * Chinese University.
Spatial Range Querying for Gaussian-Based Imprecise Query Objects Yoshiharu Ishikawa, Yuichi Iijima Nagoya University Jeffrey Xu Yu The Chinese University.
23 1 Christian Böhm 1, Florian Krebs 2, and Hans-Peter Kriegel 2 1 University for Health Informatics and Technology, Innsbruck 2 University of Munich Optimal.
Shaoxu Song 1, Aoqian Zhang 1, Lei Chen 2, Jianmin Wang 1 1 Tsinghua University, China 2Hong Kong University of Science & Technology, China 1/19 VLDB 2015.
Privacy Preserving Outlier Detection using Locality Sensitive Hashing
High-Dimensional Data. Topics Motivation Similarity Measures Index Structures.
Da Yan, Raymond Chi-Wing Wong, and Wilfred Ng The Hong Kong University of Science and Technology.
1 Spatial Query Processing using the R-tree Donghui Zhang CCIS, Northeastern University Feb 8, 2005.
1 Introduction to Spatial Databases Donghui Zhang CCIS Northeastern University.
Fast Subsequence Matching in Time-Series Databases.
A Unified Algorithm for Continuous Monitoring of Spatial Queries
Abolfazl Asudeh Azade Nazi Nan Zhang Gautam DaS
Progressive Computation of The Min-Dist Optimal-Location Query
Spatio-temporal Databases
Chapter 4: Probabilistic Query Answering (2)
Introduction to Spatial Databases
Probabilistic Data Management
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
Probabilistic n-of-N Skyline Computation over Uncertain Data Streams
Spatio-temporal Databases
Fraction-Score: A New Support Measure for Co-location Pattern Mining
Donghui Zhang, Tian Xia Northeastern University
Presentation transcript:

1 Hypersphere Dominance: An Optimal Approach Cheng Long, Raymond Chi-Wing Wong, Bin Zhang, Min Xie The Hong Kong University of Science and Technology Prepared by Cheng Long Presented by Cheng Long 24 June, 2014

Hyperspheres A hypersphere in a d-dimensional space (center, radius) the set of all points that have their distances from the center bounded by the radius 2 2D: a disk 3D: a ball

Hyperspheres are commonly used Uncertain databases the location of an uncertain object Spatial databases SS-tree, SS + -tree, M-tree, VP-tree and SR-tree 3 SS-tree: similar to R-tree with hyperrectangles replaced by hyperspheres SS-tree based on A-H layout of 8 objects: A-H

Motivating example Scenario Ada has her location uncertain, but constrained in a disk S a. Bob has his location uncertain, but constrained in a disk S b. Connie has her location uncertain, but constrained in a disk S q. Question Is Ada always closer to Connie than Bob? 4 S b (Bob) S q (Connie) S b (Bob) No For this specification of the locations, Ada is closer to Connie than Bob In fact, for all specifications of the locations, Ada is closer to Connie than Bob Yes

Hypersphere dominance: definition 5 Dominance condition Basic operator used in many queries Probabilistic RkNN query [Lian and Chen, VLDBJ’09] AkNN query [Emrich et al., SSDBM’10] kNN query [Long et al., SIGMOD’14]

Hypersphere dominance: existing solutions—overview MinMax [Roussopoulos et al., SIGMOD Record’95; Hjaltason and Samet, TODS’99] MBR [Emrich et al., SIGMOD’10] GP [Lian and Chen, VLDBJ’09] Trigonometric [Emrich et al., SSDBM’10] 6

Hypersphere dominance: existing solutions—MinMax (1) 7

Hypersphere dominance: existing solutions—MinMax (2) 8 SbSb SqSq SbSb SqSq “false negative” correct

Hypersphere dominance: existing solutions--Insufficiency MethodsCorrect?Sound?Efficient? MinMaxYesNoYes MBRYesNoYes GPYesNoYes TrigonometricNoYes 9 Criteria of a method: 1. Correctness: No false positive 2. Soundness: No false negative 3. Efficiency: runs in O(d) where d is the number of dimensionality Our approach is the only one which is correct, sound and efficient! Our approach (Hyperbola) Yes

Our approach: major idea 10 For cases where it is easy to decide whether the dominance condition is true For cases where it is difficult to decide whether the dominance condition is true directly

Our approach: pre-checking 11 SbSb SqSq SbSb SqSq S b and S q overlap

Our approach: dominance checking (1) 12

Our approach: dominance checking (5) 13

Our approach: dominance checking (3) 14 Region R a Region R b SaSa SbSb caca cbcb SqSq cqcq

Our approach: dominance checking (4) 15 rqrq SaSa SbSb caca cbcb Region R a Region R b SqSq cqcq

Our approach (2) 16 correct sound efficient The condition (3) is equivalent to the dominance condition Each condition transformation takes O(d) time and the cost of LM is also O(d)

Empirical study: set-up Datasets: Real datasets: NBA, Color, Texture, and Forest Synthetic datasets Algorithms: MinMax, MBR, GP, Trigonometric, Hyperbola (our method) Measures: precision = TP/(TP+FP) recall = TP/(TP+FN) running time 17 A correct method has the precision always equal to 1 A sound method has the recall always equal to 1 Criteria of a method: 1. Correctness: No false positive (FP) 2. Soundness: No false negative (FN) 3. Efficiency: runs in O(d) where d is the number of dimensionality

Empirical study: results (precision, NBA) All algorithms except Trigonometric have precisions = MethodsCorrect?Sound?Efficient? MinMaxYesNoYes MBRYesNoYes GPYesNoYes TrigonometricNoYes Our approachYes

Empirical study: results (recall, NBA) Only our approach (Hyperbola) and Trigonometirc have recalls = MethodsCorrect?Sound?Efficient? MinMaxYesNoYes MBRYesNoYes GPYesNoYes TrigonometricNoYes Our approachYes

Empirical study: results (running time, NBA) MinMax < GP < Hyperbola (our method) < MBR < Trigonometric 20

Conclusion First solution for the hypersphere dominance problem, which is correct, sound and efficient for any dimension An application study: kNN Experiments 21

Q & A 22

The following slides are for backup use only 23

Hyperspheres in uncertain databases Song and Roussopoulos [SSTD’01] Cheng et al. [TKDE’04] Chen and Cheng [ICDE’07] Beskales et al. [PVLDB’08] 24

Our approach (1) 25 Dominance condition

An application study: kNN qeury 26

27 Region R a Region R b SqSq cqcq