Answering Similar Region Search Queries Chang Sheng, Yu Zheng.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

The A-tree: An Index Structure for High-dimensional Spaces Using Relative Approximation Yasushi Sakurai (NTT Cyber Space Laboratories) Masatoshi Yoshikawa.
DECISION TREES. Decision trees  One possible representation for hypotheses.
Computer Science and Engineering Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search Chengyuan Zhang 1,Ying Zhang 1,Wenjie Zhang 1, Xuemin.
Spatial Indexing SAMs. Spatial Indexing Point Access Methods can index only points. What about regions? Z-ordering and quadtrees Use the transformation.
Efficient Evaluation of k-Range Nearest Neighbor Queries in Road Networks Jie BaoChi-Yin ChowMohamed F. Mokbel Department of Computer Science and Engineering.
1 Storage of images for Efficient Retrieval  Representing IDB as relations  straightforward  Representing IDB with spatial data structures  represent.
Query Specific Fusion for Image Retrieval
Introduction to Spatial Database System Presented by Xiaozhi Yu.
Chapter 8 Content-Based Image Retrieval. Query By Keyword: Some textual attributes (keywords) should be maintained for each image. The image can be indexed.
Multimedia DBs. Multimedia dbs A multimedia database stores text, strings and images Similarity queries (content based retrieval) Given an image find.
Mining Query Subtopics from Search Log Data Date : 2012/12/06 Resource : SIGIR’12 Advisor : Dr. Jia-Ling Koh Speaker : I-Chih Chiu.
Stephan Gammeter, Lukas Bossard, Till Quack, Luc Van Gool.
Preferential top-k search over local data dissertation thesis RNDr. Martin Šumák supervisor: doc. RNDr. Stanislav Krajči, PhD. consultant: RNDr. Peter.
Spatial Mining.
A Crowd-Enabled Approach for Efficient Processing of Nearest Neighbor Queries in Incomplete Databases Samia Kabir, Mehnaz Tabassum Mahin Department of.
A Novel Scheme for Video Similarity Detection Chu-Hong Hoi, Steven March 5, 2003.
SASH Spatial Approximation Sample Hierarchy
1 Oct 30, 2006 LogicSQL-based Enterprise Archive and Search System How to organize the information and make it accessible and useful ? Li-Yan Yuan.
Content Based Image Clustering and Image Retrieval Using Multiple Instance Learning Using Multiple Instance Learning Xin Chen Advisor: Chengcui Zhang Department.
Multiple-key indexes Index on one attribute provides pointer to an index on the other. If V is a value of the first attribute, then the index we reach.
An efficient and effective region-based image retrieval framework Reporter: Francis 2005/5/12.
Video Google: Text Retrieval Approach to Object Matching in Videos Authors: Josef Sivic and Andrew Zisserman ICCV 2003 Presented by: Indriyati Atmosukarto.
Modern Information Retrieval Chapter 2 Modeling. Can keywords be used to represent a document or a query? keywords as query and matching as query processing.
Spatial Information Systems (SIS) COMP Spatial access methods: Indexing.
Techniques and Data Structures for Efficient Multimedia Similarity Search.
1 An Empirical Study on Large-Scale Content-Based Image Retrieval Group Meeting Presented by Wyman
Spatial Indexing I Point Access Methods. Spatial Indexing Point Access Methods (PAMs) vs Spatial Access Methods (SAMs) PAM: index only point data Hierarchical.
Hybrid Bounding Volumes for Distance Queries Distance Query returns the minimum distance between two geometric models Major application is path planning.
Friends and Locations Recommendation with the use of LBSN
Text mining.
Spatial Data Management Chapter 28. Types of Spatial Data Point Data –Points in a multidimensional space E.g., Raster data such as satellite imagery,
Presented by Tienwei Tsai July, 2005
Mining Interesting Locations and Travel Sequences from GPS Trajectories IDB & IDS Lab. Seminar Summer 2009 강 민 석강 민 석 July 23 rd,
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
PMLAB Finding Similar Image Quickly Using Object Shapes Heng Tao Shen Dept. of Computer Science National University of Singapore Presented by Chin-Yi Tsai.
IMAGE DATABASES Prof. Hyoung-Joo Kim OOPSLA Lab. Computer Engineering Seoul National University.
FI-WARE Points of Interest (POI) Data Provider Short Introduction Nonprofit educational material. Fair use of copyrighted content, if any, is assumed.
Handling Spatial Data In P2P Systems Verena Kantere, Timos Sellis, Yannis Kouvaras.
Video Google: A Text Retrieval Approach to Object Matching in Videos Josef Sivic and Andrew Zisserman.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
Wei Feng , Jiawei Han, Jianyong Wang , Charu Aggarwal , Jianbin Huang
Beyond Sliding Windows: Object Localization by Efficient Subwindow Search The best paper prize at CVPR 2008.
Efficient Subwindow Search: A Branch and Bound Framework for Object Localization ‘PAMI09 Beyond Sliding Windows: Object Localization by Efficient Subwindow.
Query Suggestion Naama Kraus Slides are based on the papers: Baeza-Yates, Hurtado, Mendoza, Improving search engines by query clustering Boldi, Bonchi,
Enhancing Cluster Labeling Using Wikipedia David Carmel, Haggai Roitman, Naama Zwerdling IBM Research Lab (SIGIR’09) Date: 11/09/2009 Speaker: Cho, Chin.
Efficient Processing of Top-k Spatial Preference Queries
Non-Photorealistic Rendering and Content- Based Image Retrieval Yuan-Hao Lai Pacific Graphics (2003)
Group 8: Denial Hess, Yun Zhang Project presentation.
A more efficient Collaborative Filtering method Tam Ming Wai Dr. Nikos Mamoulis.
CVPR2013 Poster Detecting and Naming Actors in Movies using Generative Appearance Models.
So, what’s the “point” to all of this?….
Content Based Color Image Retrieval vi Wavelet Transformations Information Retrieval Class Presentation May 2, 2012 Author: Mrs. Y.M. Latha Presenter:
1 Complex Spatio-Temporal Pattern Queries Cahide Sen University of Minnesota.
Query by Image and Video Content: The QBIC System M. Flickner et al. IEEE Computer Special Issue on Content-Based Retrieval Vol. 28, No. 9, September 1995.
Presenters: Amool Gupta Amit Sharma. MOTIVATION Basic problem that it addresses?(Why) Other techniques to solve same problem and how this one is step.
Pete Bohman Adam Kunk.  Introduction  Related Work  System Overview  Indexing Scheme  Ranking  Evaluation  Conclusion.
Similarity Measurement and Detection of Video Sequences Chu-Hong HOI Supervisor: Prof. Michael R. LYU Marker: Prof. Yiu Sang MOON 25 April, 2003 Dept.
Jeremy Iverson & Zhang Yun 1.  Chapter 6 Key Concepts ◦ Structures and access methods ◦ R-Tree  R*-Tree  Mobile Object Indexing  Questions 2.
AP-Tree: Efficiently Support Continuous Spatial-Keyword Queries Over Stream lecturer : 秦靖雅.
Content-Based Image Retrieval Readings: Chapter 8:
Indexing Multidimensional Data
Fast Subsequence Matching in Time-Series Databases.
Spatial Data Management
Spatial Indexing I Point Access Methods.
Mining the Most Influential k-Location Set from Massive Trajectories
Multimedia Information Retrieval
K Nearest Neighbor Classification
Distributed Probabilistic Range-Aggregate Query on Uncertain Data
Strength of relation High Low Number of data Relationship Data
Presentation transcript:

Answering Similar Region Search Queries Chang Sheng, Yu Zheng

An Irrelevant ResultExpected Results A region specified by a user Objective : Given a query region on a map, return the top-k similar regions on this map

Motivation Possible applications – Location recommendation: recommending similar shopping malls, movie centers or travel spots Challenges – How to define the similarity between geo-regions – How to retrieve the similar region based on a user- specified region Different scales (as big as a shopping street or as small as a cinema) Different shapes (rectangles of different size)

What we do Devise a similarity measure between geo-regions – Content similarity: Representative categories located in a region – Spatial similarity: geo-spatial distribution of representative categories Design a fast K-NN search algorithm – Retrieve the top-k similar regions accords to user-specified query region – The algorithm can ensure the returned regions have similar shape and scale as the query (basic criteria); have the top-k similarity scores in terms of the defined similarity measure Fast enough for online search

Geometric properties – Scales and shapes Content properties – POI (point of interest) categories – Representative categories Spatial properties – Distribution of POIs of representative categories. – Reference points Similarity Measures (c) Shopping area A query region

Content similarity Detect the representative categories: CF-IRF – Category Frequency (CF) of the category C i in region R j, denoted as Cf ij, is the fraction of the number of PoIs with category C i occurring in region R j to the total number of PoIs in region R j – The Inverse Region Frequency (IRF) of category C i, denoted as IRF i, is the logarithm of the fraction of the total number of grids to the number of grids that contain PoIs with category C i. – The significance of a category C i in region R j, is

Spatial Similarity Two methods – Mutual distance – Reference distance: The average distance of all the points in P/Q to each of the reference points The distance of K categories to the reference point O i is a vector of K entries.

Fast Retrieval Algorithm Offline process – Quad-tree-based space partition – Detect the representative categories – Extract the feature vectors – Indexing features and feature bounds Online process – Detect representative categories – Category-based pruning – Spatial-based pruning – Expanding

Quadtree and inverted list Partition geo-spaces into grids based on quadtree Each quadtree node stores – the features bound of its four adjacent children – The feature bound is calculated in a bottom-up manner

System overview

Pruning Category-based Pruning – A candidate region must have some overlaps of representative categories with the query region – The cosine similarity should exceed a threshold Spatial feature-based pruning To speed up the pruning process

Expand Region Select the seed regions which do not be pruned Expand the seed regions