Spatial Data Mining. 2 Introduction Spatial data mining is the process of discovering interesting, useful, non-trivial patterns from large spatial datasets.

Slides:



Advertisements
Similar presentations
1 DATA STRUCTURES USED IN SPATIAL DATA MINING. 2 What is Spatial data ? broadly be defined as data which covers multidimensional points, lines, rectangles,
Advertisements

Chapter 4 Part C: Queries, Computations & Map Algebra.
The Role of Error Map and attribute data errors are the data producer's responsibility, GIS user must understand error. Accuracy and precision of map and.
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
PARTITIONAL CLUSTERING
1 Enviromatics Spatial database systems Spatial database systems Вонр. проф. д-р Александар Маркоски Технички факултет – Битола 2008 год.
You will learn to describe relationships among lines, parts of lines, and planes. In geometry, two lines in a plane that are always the same distance.
You will learn to describe relationships among lines, parts of lines, and planes. In geometry, two lines in a plane that are always the same distance.
Spatial Mining.
Border around project area Everything else is hardly noticeable… but it’s there Big circles… and semi- transparent Color distinction is clear.
Spatio-Temporal Databases
CS 128/ES Lecture 5b1 Vector Based Data. CS 128/ES Lecture 5b2 Spatial data models 1.Raster 2.Vector 3.Object-oriented Spatial data formats:
Spatial Information Systems (SIS) COMP Spatial relations.
Spatio-Temporal Databases. Outline Spatial Databases Temporal Databases Spatio-temporal Databases Multimedia Databases …..
Information Systems and GIS Chapter 2 Slides from James Pick, Geo-Business: GIS in the Digital Organization, John Wiley and Sons, Copyright © 2008.
Oracle spatial – Creating spatial tables Object Relational Model Creating Spatial Tables.
Geographic Information Systems
Week 7. Feature relationship and topology Oct. 17 th, 2005.
BASIC SPATIAL ANALYSIS TOOLS IN A GIS
GIS Introduction What is GIS?. Geographic Information Systems A database system in which the organizing principle is explicitly SPATIAL.
Basic Concepts of GIS January 29, What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial.
The Coordinate Plane coordinate plane In coordinate Geometry, grid paper is used to locate points. The plane of the grid is called the coordinate plane.
Intro. To GIS Lecture 6 Spatial Analysis April 8th, 2013
Prepared by Abzamiyeva Laura Candidate of the department of KKGU named after Al-Farabi Kizilorda, Kazakstan 2012.
Spatial data Visualization spatial data Ruslan Bobov
October 8, 2013Computer Vision Lecture 11: The Hough Transform 1 Fitting Curve Models to Edges Most contours can be well described by combining several.
JTS Topology Suite JTS Topology Suite An API for Processing Linear Geometry Martin Davis, Senior Technical Architect
Spatial Data Models. What is a Data Model? What is a model? (Dictionary meaning) A set of plans (blueprint drawing) for a building A miniature representation.
Preparing Data for Analysis and Analyzing Spatial Data/ Geoprocessing Class 11 GISG 110.
Week Aug-10 – Aug-15 Introduction to Spatial Computing CSE 5ISC Some slides adapted from Worboys and Duckham (2004) GIS: A Computing Perspective, Second.
United Nations Regional Seminar on Census Data Dissemination and Spatial Analysis Amman, Jordan, May, 2011 Spatial Analysis & Dissemination of Census.
CS654: Digital Image Analysis Lecture 3: Data Structure for Image Analysis.
6. Simple Features Specification Background information UML overview Simple features geometry.
Spatial Data Analysis Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What is spatial data and their special.
Time Series Data Analysis - I Yaji Sripada. Dept. of Computing Science, University of Aberdeen2 In this lecture you learn What are Time Series? How to.
Spatial Concepts and Data Models Reading: Shekhar & Chawla Chapter 2 November 22, 2005.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
GI Science Database Management Systems Nigel Trodd Coventry University.
1 Spatial Data Models and Structure. 2 Part 1: Basic Geographic Concepts Real world -> Digital Environment –GIS data represent a simplified view of physical.
GIS Data Structures How do we represent the world in a GIS database?
1 Spatio-Temporal Predicates Martin Erwig and Markus Schneider IEEE TRANSACTIONS ON KNOWLEDGE AND DATA ENGINEERING Presented by Mamadou Hassimiou Diallo.
Creating and Maintaining Geographic Databases. Outline Definitions Characteristics of DBMS Types of database Relational model SQL Spatial databases.
Spatial DBMS Spatial Database Management Systems.
Query and Reasoning. Types of Queries Most GIS queries will select spatial features Query by Attribute (Select by Attribute) –Structured Query Language.
NR 143 Study Overview: part 1 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Advanced Editing: Rules-Based Topology in ArcEditor
L1-Spatial Concepts NGEN06 & TEK230: Algorithms in Geographical Information Systems by: Irene Rangel, updated by Sadegh Jamali 1.
Presented by Ho Wai Shing
GIS Data Models III GEOG 370 Instructor: Christine Erlien.
What is GIS? “A powerful set of tools for collecting, storing, retrieving, transforming and displaying spatial data”
Intro. To GIS Pre-Lab Spatial Analysis April 1 st, 2013.
GTECH 361 Lecture 09 Features in the Geodatabase.
CENTENNIAL COLLEGE SCHOOL OF ENGINEERING & APPLIED SCIENCE VS 361 Introduction to GIS SPATIAL OPERATIONS COURSE NOTES 1.
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
We propose a method which can be used to reduce high dimensional data sets into simplicial complexes with far fewer points which can capture topological.
Introduction to Polygons
Physical Structure of GDB
Data Mining K-means Algorithm
Mean Shift Segmentation
3.1 Clustering Finding a good clustering of the points is a fundamental issue in computing a representative simplicial complex. Mapper does not place any.
Fitting Curve Models to Edges
Introdução to Geoinformatics: vector geometries
Nicholas A. Procopio, Ph.D, GISP
Content-Based Image Retrieval
Content-Based Image Retrieval
Parallel Lines and Planes
JTS Topology Suite An API for Processing Linear Geometry
I. The Problem of Molding
Introduction to Geoinformatics: Topology
Additive Relationship
Presentation transcript:

Spatial Data Mining

2 Introduction Spatial data mining is the process of discovering interesting, useful, non-trivial patterns from large spatial datasets –E.g. co-location patterns of water pumps and cholera –Determining hotspots: unusual locations Spatial Data Mining Tasks –Classification/Prediction –Co-location Mining –Clustering Recap of special properties of Spatial Data –Spatial autocorrelation –Spatial heterogeneity –Implicit Spatial Relations

3 Spatial Relations Spatial databases do not store spatial relations explicitly –Additional functionality required to compute them Three types of spatial relations specified by the OGC reference model –Distance relations Euclidean distance between two spatial features –Direction relations Ordering of spatial features in space –Topological relations Characterise the type of intersection between spatial features

4 Distance relations If dist is a distance function and c is some real number 1.dist(A,B)>c, 2.dist(A,B)<c and 3.dist(A,B)=c A B A B BA

5 Direction relations If directions of B and C are required with respect to A Define a representative point, rep(A) rep(A) defines the origin of a virtual coordinate system The quadrants and half planes define the direction relations B can have two values {northeast, east} Exact direction relation is northeast A C B rep(A) C north A B northeast A

6 Topological Relations Topological relations describe how geometries intersect spatially Simple geometry types –Point, 0-dimension –Line, 1-dimension –Polygon, 2-dimension Each geometry represented in terms of –boundary (B) – geometry of the lower dimension –interior (I) – points of the geometry when boundary is removed –exterior (E) – points not in the interior or boundary Examples for simple geometries –For a point, I = {point}, B={} and E={Points not in I and B} –For a line, I={points except boundary points}, B={two end points} and E={Points not in I and B} –For a polygon, I={points within the boundary}, B={the boundary} and E={points not in I and B}

7 DE-9IM Topological relations are defined using any one of the following models –4IM, four intersection model (only B and E considered) –9IM, nine intersection models (B, I, and E) –DE-9IM, dimensionally extended 9 intersection model DE-9IM is an OGC complaint model Dim is the dimension function

8 Example Consider two polygons –A - POLYGON ((10 10, 15 0, 25 0, 30 10, 25 20, 15 20, 10 10)) –B - POLYGON ((20 10, 30 0, 40 10, 30 20, 20 10))

9 I(B)B(B) E(B) I(A) B(A) E(A) 9-Intersection Matrix of example geometries

10 DE-9IM for the example geometries I(B)B(B)E(B) I(A)212 B(A)101 E(A)212

11 Relationships using DE-9IM Different geometries may give rise to different numbers in the DE-9IM For a specific type of relationship we are only interested in certain values in certain positions –That is, we are interested in patterns in the matrix than actual values Actual values are replaced by wild cards –T: value is "true" - non empty - any dimension >= 0 –F: value is "false" - empty - dimension < 0 –*: Don't care what the value is –0: value is exactly zero –1: value is exactly one –2: value is exactly two A over laps B I(B)B(B)E(B) I(A)T*T B(A)*** E(A)T**

12 Topological Relations x.Disjoint(y) –FF*FF**** x.Touches(y) –FT******* Area/Area, Line/Line, Line/Area, Point/Area –F**T***** Not Point/Point –F***T**** x.Crosses(y) –T*T****** Point/Line, Point/Area, Line/Area –0******** Line/Line x.Within(y) –TF*F***** x.Overlaps(y) –T*T***T** Point/Point, Area/Area –1*T***T** Line/Line DE-9IM string for example geometries was ‘ ’ (from earlier slide) –A crosses B –A overlaps B

13 Approaches to Spatial Data Mining Materialize spatial features and use Weka –Required features are added as additional attributes to the main feature –To create a flat file of data Use special data mining techniques that take spatial dependency into account

14 Materializing features- Example

15 Materializing features- Example (2)

16 Spatial Data Mining Architecture Retrieve data belonging to multiple themes Preprocess spatial data to materialize spatial features –Select the required features –Use the methods to compute spatial relations to create a flat file of data Use Weka like tool to perform data mining OGC Complaint Spatial DBMS Feature Selection & OGC complaint methods to compute relations Weka Flat File Multiple Themes

17 Spatial Clustering Also called spatial segmentation Input –a table of area names and their corresponding attributes such as population density, number of adult illiterates etc. –Information about the neighbourhood relationships among the areas –A list of categories/classes of the attributes Output –Grouped (segmented) areas where each group has areas with similar attribute values Census Website has plenty of examples – ps/index.htmlhttp:// ps/index.html

18 Similarity with image segmentation Spatial segmentation is performed in image processing –Identify regions (areas) of an image that have similar colour (or other image attributes). –Many image segmentation techniques are available E.g. region-growing technique

19 Region Growing Technique There are many flavours of this technique One of them is described below: –Assign seed areas to each of the segments (classes of the attribute) –Add neighbouring areas to these segments if the incoming areas have similar values of attributes –Repeat the above step until all the regions are allocated to one of the segments Functionality to compute spatial relations (neighbours) assumed

20 Summary Spatial data storage available as extensions of RDBMS Visualization of Spatial data available in GIS Spatial Data Mining requires functionality to compute spatial relations OGC specifications provide the standards for all the above resources MYSQL provides data spatial data storage –But only partially provides the functionality for computing relations Several OpenSource systems provide all the above resources for spatial data –OpenJump, GeoTools