The Evolution of Spatial Outlier Detection Algorithms - An Analysis of Design CSci 8715 Spatial Databases Ryan Stello Kriti Mehra.

Slides:



Advertisements
Similar presentations
Sampling Design, Spatial Allocation, and Proposed Analyses Don Stevens Department of Statistics Oregon State University.
Advertisements

Complex Networks for Representation and Characterization of Images For CS790g Project Bingdong Li 9/23/2009.
By Venkata Sai Pulluri ( ) Narendra Muppavarapu ( )
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
Learning Trajectory Patterns by Clustering: Comparative Evaluation Group D.
A Framework for Clustering Evolving Data Streams Charu C. Aggarwal, Jiawei Han, Jianyong Wang, Philip S. Yu Presented by: Di Yang Charudatta Wad.
On Map-Matching Vehicle Tracking Data
Spatiotemporal Pattern Mining For Travel Behavior Prediction UIC IGERT Seminar 02/14/2007 Chad Williams.
Proportion Priors for Image Sequence Segmentation Claudia Nieuwenhuis, etc. ICCV 2013 Oral.
Data Mining Techniques: Clustering
Spatio-Temporal Outlier Detection in Precipitation Data
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
© University of Minnesota Data Mining for the Discovery of Ocean Climate Indices 1 CSci 8980: Data Mining (Fall 2002) Vipin Kumar Army High Performance.
Spatio-Temporal Databases
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Advancing Wireless Link Signatures for Location Distinction J. Zhang, M. H. Firooz, N. Patwari, S. K. Kasera MobiCom’ 08 Presenter: Yuan Song.
Segmentation Divide the image into segments. Each segment:
Spatial Outlier Detection and implementation in Weka Implemented by: Shan Huang Jisu Oh CSCI8715 Class Project, April Presented by Jisu.
Spatiotemporal GIS: Incorporating Time Group 7 Nathan Hunstad, Kyle Martin Csci 5980.
Game Mathematics & Game State The Complexity of Games Expectations of Players Efficiency Game Mathematics o Collision Detection & Response o Object Overlap.
An Intelligent & Incremental Approach to kNN using R-trees DJ Oneil & Esten Rye (G01)
Information retrieval Finding relevant data using irrelevant keys Example: database of photographic images sorted by number, date. DBMS: Well structured.
Supported in part by the National Science Foundation – ISS/Digital Science & Technology Analysis of the Open Source Software development community using.
Presented by Arun Qamra
Major Tasks in Data Preprocessing(Ref Chap 3) By Prof. Muhammad Amir Alam.
DVMM Lab, Columbia UniversityVideo Event Recognition Video Event Recognition: Multilevel Pyramid Matching Dong Xu and Shih-Fu Chang Digital Video and Multimedia.
Ch 5 Practical Point Pattern Analysis Spatial Stats & Data Analysis by Magdaléna Dohnalová.
LÊ QU Ố C HUY ID: QLU OUTLINE  What is data mining ?  Major issues in data mining 2.
GPS Trajectories Analysis in MOPSI Project Minjie Chen SIPU group Univ. of Eastern Finland.
Learning to classify the visual dynamics of a scene Nicoletta Noceti Università degli Studi di Genova Corso di Dottorato.
Data Mining Techniques
TEMPORAL VIDEO BOUNDARIES -PART ONE- SNUEE KIM KYUNGMIN.
Database and Data-Intensive Systems. Data-Intensive Systems From monolithic architectures to diverse systems Dedicated/specialized systems, column stores.
National Survey and Cadastre – Denmark Conceptual Modeling of Geographic Databases - Emphasis on Relationships among Geographic Databases Anders Friis-Christensen.
Conceptual design of web application families: the BWW approach OOPSLA th Workshop on Domain-Specific Modeling October 22, 2006 Oregon Convention.
Techniques for Analysis and Calibration of Multi- Agent Simulations Manuel Fehler Franziska Klügl Frank Puppe Universität Würzburg Lehrstuhl für Künstliche.
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Combined Central and Subspace Clustering for Computer Vision Applications Le Lu 1 René Vidal 2 1 Computer Science Department, Johns Hopkins University,
University of Texas at Austin CS384G - Computer Graphics Fall 2010 Don Fussell Image processing.
A Graph-based Friend Recommendation System Using Genetic Algorithm
INTERACTIVE ANALYSIS OF COMPUTER CRIMES PRESENTED FOR CS-689 ON 10/12/2000 BY NAGAKALYANA ESKALA.
Extending Spatial Hot Spot Detection Techniques to Temporal Dimensions Sungsoon Hwang Department of Geography State University of New York at Buffalo DMGIS.
HP-PURDUE-CONFIDENTIAL Final Exam May 16th 2008 Slide No.1 Outline Motivations Analytical Model of Skew Effect and its Compensation in Banding and MTF.
DB group seminar 2006/06/29The University of Hong Kong, Dept. of Computer Science Neighborhood based detection of anomalies in high dimensional spatio-temporal.
Automatic Image Annotation by Using Concept-Sensitive Salient Objects for Image Content Representation Jianping Fan, Yuli Gao, Hangzai Luo, Guangyou Xu.
Role of Spatial Database in Biodiversity Conservation Planning Sham Davande, GIS Expert Arid Communities Technologies, Bhuj 11 September, 2015.
Ground Truth Free Evaluation of Segment Based Maps Rolf Lakaemper Temple University, Philadelphia,PA,USA.
Data Mining – Intro. Course Overview Spatial Databases Temporal and Spatio-Temporal Databases Multimedia Databases Data Mining.
Performance evaluation on grid Zsolt Németh MTA SZTAKI Computer and Automation Research Institute.
Autonomous Robots Vision © Manfred Huber 2014.
Change Analysis in Spatial Datasets by Interestingness Comparison Vadeerat Rinsurongkawong, and Christoph F. Eick Department of Computer Science, University.
Unsupervised Mining of Statistical Temporal Structures in Video Liu ze yuan May 15,2011.
ANALYSIS TOOL TO PROCESS PASSIVELY- COLLECTED GPS DATA FOR COMMERCIAL VEHICLE DEMAND MODELLING APPLICATIONS Bryce Sharman & Matthew Roorda University of.
Finite State Machines (FSM) OR Finite State Automation (FSA) - are models of the behaviors of a system or a complex object, with a limited number of defined.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Model Based Event Detection in Sensor Networks Jayant Gupchup, Andreas Terzis, Randal Burns, Alex Szalay.
Eick: kNN kNN: A Non-parametric Classification and Prediction Technique Goals of this set of transparencies: 1.Introduce kNN---a popular non-parameric.
Introduction to Game Programming & Design III Lecture III.
5 1 Normalization of Database Tables. 5 2 Database Tables and Normalization Normalization –Process for evaluating and correcting table structures to minimize.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
A Self-organizing Semantic Map for Information Retrieval Xia Lin, Dagobert Soergel, Gary Marchionini presented by Yi-Ting.
The PLA Model: On the Combination of Product-Line Analyses 강태준.
Profiling: What is it? Notes and reflections on profiling and how it could be used in process mining.
Data Mining – Intro.
G10 Anuj Karpatne Vijay Borra
Context-Aware Modeling and Recognition of Activities in Video
Image Segmentation Techniques
A weight-incorporated similarity-based clustering ensemble method based on swarm intelligence Yue Ming NJIT#:
Data Warehousing and Data Mining
Similarity Search: A Matching Based Approach
Presentation transcript:

The Evolution of Spatial Outlier Detection Algorithms - An Analysis of Design CSci 8715 Spatial Databases Ryan Stello Kriti Mehra

Outline  Background of the project  Problem Statement  Review of issues in traditional outlier detection  Spatial Outlier Detection  Outlier Detection in Spatio-Temporal Data  Issues with modeling spatio-temporal data  Solutions proposed  Techniques to model spatio-temporal data  Issues with detecting outliers in these models  Summary  Suggestions for Future Work

Background of the project - Survey of papers - Focus is on Spatio-Temporal Outlier Detection - Classification is done with a view to understand outlier detection in the Spatio-Temporal domain

Problem Statement  Given: Techniques for outlier detection in traditional, spatial and spatio-temporal domain  Find: To provide a classification of spatial outlier detection algorithms and highlight shortcomings as problem complexity increases to the spatio-temporal domain  Objective: To inform the reader of the complexity of spatial outlier detection and motivate further efforts  Constraints: Non-exhaustive

Review of issues in traditional outlier techniques Low – Dimensional spatial outlier detection Low – Dimensional spatial outlier detection  Restricted so that imposing a grid is easy  Distributive:  Normalizes the whole data and pulls out an outlier  Average of values is a typical metric  Median could be used  Iterative:  Apply to each neighborhood High- Dimensional Spatial Outlier Detection High- Dimensional Spatial Outlier Detection  Statistical: Applied to the entire data and hence fails  Space Reduction

Spatial Outlier Detection Spatial Data: Define a region Define proximity relationship Only when region and proximity relationship are defined can the concept of spatial outlier be defined Various methods have been devised to define region and proximity relationship…. Which one should be applied to our application?

Spatio-Temporal Data Increased Complexity due to:  Issues with spatial outlier detection already exist  A new attribute has to be considered - Time  High dimensionality Scenarios: Car making a sharp turn Movement of a cluster of stars Pollution of a lake due to industrial dump Global warming Should the definition of region be the same? Should the definition of proximity relationship be the same? Would the data model used in these scenarios be the same?

Modeling Spatio-Temporal Data Issues and Solutions

Issues Based on the “snapshot” phenomena: GIS applications take photographs of a region periodically. Difficult to determine whether two mobile systems interacted between snapshots Drawback : Data-oriented, data can be recorded only at fixed intervals of time. F1, F2: Initial and final position of Flock of sheep I1, I2: Initial and final position of rain clouds Did the flock of sheep get wet?

Solution Proposed Transition from Data based modeling to Representation based modeling Representation of the data is required to incorporate spatial and temporal aspect

Spatio-Temporal Representations - Neighborhood-Based - Time Series Matching

Neighborhood-Based Determine the neighborhood of object Merge neighborhood sharing edges based on common concept Process: 1. Create Micro Neighborhood based on immediate spatial neighborhood to obtain a Voronoi Polygon Voronoi Diagram of a set of objects O is the subdivision of the plane into n polygons, with the property that a point q lies in the polygon corresponding to an object o i iff dist (q,o i )<=dist (q,o j ) for each o j belonging to O and j<>i; 2. Create Macro Neighborhood by merging micro neighborhoods that share an edge. 3. Detect outliers based on how different the value is from threshold.

Extension to the concept Is the temporal aspect embedded in the semantic process? More spatial than spatio – temporal representation Extension – E.g. Cars : neighborhood overlap X Outlier! Y

Outlier Detection in Mobile Objects  Data for mobile objects contains large number of outliers  Metric-based outlier detection is not effective  Non-metric distance based functions : Similarity Based  Time Series compared against a known/expected time series  This method has complexity due to difficulty in  Determining the expected time series  What is the acceptable tolerance for imprecise matches?  How much noise is acceptable?

Example of Spatio-temporal analysis of mobile object  Normal behavior : Representation of normal behavior of a car would require defining possibilities ( variations caused by taking an exit and lane changing)  Precision:  frequency matching  deviance from norm

Summary TraditionalSpatialSpatio-Temporal TransactionNeighborhoodEvents – where and when does the event occur Non-spatial attribute Representation with data e.g. Distance Abstracting with data to create a model

Future Work Past helps to analyze cause of events Food for thought Using spatio-temporal outlier detection to predict the future is more relevant than using it to analyze the past

Questions?