Quality issues in Spatial Databases M. Mostafavi, G. Edwards, R. Jeansoulin CRG & GEOIDE & REVIGIS Victoria, May 2003.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

WP5 – Chapter 7. Harmonisation Harmonisation of geometry, data definitions, data models, naming ISSUES: MS deliveries are described in WP 4.1 in an enhanced.
The Process of Data Ingestion in ÆKOS Andrew Graham and Matt Schneider TERN Ecoinformatics Data Analysts Logos used with consent. Content of this presentation.
Office of SA to CNS GeoIntelligence Introduction Data Mining vs Image Mining Image Mining - Issues and Challenges CBIR Image Mining Process Ontology.
Center for Modeling & Simulation.  A Map is the most effective shorthand to show locations of objects with attributes, which can be physical or cultural.
Geographical Information Systems and Science Longley P A, Goodchild M F, Maguire D J, Rhind D W (2001) John Wiley and Sons Ltd 9. Geographic Data Modeling.
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Raster Based GIS Analysis
The Vuel Concept: Towards a new way to manage Multiple Representations in Spatial Databases ISPRS / ICA Workshop Multi-Scale Representations of Spatial.
Geodatabases by Shawn J. Dorsch Spatial Databases Part 2.
Managing Data Resources
Geog 458: Map Sources and Errors January Representing Geography.
West Hills College Farm of the Future. West Hills College Farm of the Future Where are you NOW?! Precision Agriculture – Lesson 3.
Systems Analysis Requirements structuring Process Modeling Logic Modeling Data Modeling  Represents the contents and structure of the DFD’s data flows.
Geographic Information Systems
Visual Querying By Color Perceptive Regions Alberto del Bimbo, M. Mugnaini, P. Pala, and F. Turco University of Florence, Italy Pattern Recognition, 1998.
NPS Introduction to GIS: Lecture 1
Introduction to ArcView ArcView_module_2 May 12, 10:40 AM.
Completeness February 27, 2006 Geog 458: Map Sources and Errors.
GI Systems and Science January 23, Points to Cover  What is spatial data modeling?  Entity definition  Topology  Spatial data models Raster.
© Pearson Education Limited, Chapter 12 Physical Database Design – Step 3 (Translate Logical Design) Transparencies.
Copyright 2002 Prentice-Hall, Inc. Modern Systems Analysis and Design Third Edition Jeffrey A. Hoffer Joey F. George Joseph S. Valacich Chapter 10 Structuring.
Computer System Analysis Chapter 10 Structuring System Requirements: Conceptual Data Modeling Dr. Sana’a Wafa Al-Sayegh 1 st quadmaster University of Palestine.
Lecture 2 The Relational Model. Objectives Terminology of relational model. How tables are used to represent data. Connection between mathematical relations.
Chapter 4 The Relational Model Pearson Education © 2014.
Relational Model Session 6 Course Name: Database System Year : 2012.
GIS 1110 Designing Geodatabases. Representation Q. How will we model our real world data? A. Typically: Features Continuous Surfaces and Imagery Map Graphics.
Business Process Management. Key Definitions Process model A formal way of representing how a business operates Illustrates the activities that are performed.
Map Scale, Resolution and Data Models. Components of a GIS Map Maps can be displayed at various scales –Scale - the relationship between the size of features.
TOURISM PLANNING OF ALMATY INFRASTRUCTURE IN GEOINFORMATION SYSTEMS Erkin H. KakymzhanovErkin H. Kakymzhanov.
Chapter 3 Sections 3.5 – 3.7. Vector Data Representation object-based “discrete objects”
Concepts and Terminology Introduction to Database.
Geographic Information System GIS This project is implemented through the CENTRAL EUROPE Programme co-financed by the ERDF GIS Geographic Inf o rmation.
Major parts of ArcGIS ArcView -Basic mapping, editing and Analysis tools ArcEditor -all of ArcView plus Adds ability to deal with topological and network.
Spatial Databases: Data Collection Spring, 2015 Ki-Joune Li.
1/26/2004TCSS545A Isabelle Bichindaritz1 Database Management Systems Design Methodology.
URBDP 422 Urban and Regional Geo-Spatial Analysis Lecture 2: Spatial Data Models and Structures Lab Exercise 2: Topology January 9, 2014.
University of L’Aquila, Department of Electrical and Information Engineering
Spatial Databases: Building Spatial DB Spring, 2015 Ki-Joune Li.
1 Chapter 1 Introduction. 2 Introduction n Definition A database management system (DBMS) is a general-purpose software system that facilitates the process.
KNOWLEDGE BASED TECHNIQUES INTRODUCTION many geographical problems are ill-structured an ill-structured problem "lacks a solution algorithm.
Rupa Tiwari, CSci5980 Fall  Course Material Classification  GIS Encyclopedia Articles  Classification Diagram  Course – Encyclopedia Mapping.
ISPRS Congress 2000 Multidimensional Representation of Geographic Features E. Lynn Usery Research Geographer U.S. Geological Survey.
GIS Data Structures How do we represent the world in a GIS database?
Spatial DBMS Spatial Database Management Systems.
NR 143 Study Overview: part 1 By Austin Troy University of Vermont Using GIS-- Introduction to GIS.
Central Arizona Phoenix LTER Center for Environmental Studies Arizona State University Database Design Peter McCartney (CAP) RDIFS Training Workshop Sevilleta.
Introduction to GIS. Introduction How to answer geographical questions such as follows: – What is the population of a particular city? – What are the.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
©2003 Prentice Hall Business Publishing, Accounting Information Systems, 9/e, Romney/Steinbart 4-1 Relational Databases.
Definition of Spatial Analysis
Towards Unifying Vector and Raster Data Models for Hybrid Spatial Regions Philip Dougherty.
©2003 Prentice Hall Business Publishing, Accounting Information Systems, 9/e, Romney/Steinbart 4-1 Relational Databases.
VERA AULIA ( ).  Oil palm is one of the major edible oil traded in the global market.  Oil palm tree will start to produce fruits within three.
1 The Relational Data Model David J. Stucki. Relational Model Concepts 2 Fundamental concept: the relation  The Relational Model represents an entire.
Introduction to Geodatabases
Lesson 3 GIS Fundamentals MEASURE Evaluation PHFI Training of Trainers May 2011.
WELLS AND TIME SERIES DATA. Framework Temporal Aquifers & Wells.
COP Introduction to Database Structures
Spatial Databases: Building Spatial DB
Physical Structure of GDB
Geographic Information Systems
Evaluating Land-Use Classification Methodology Using Landsat Imagery
Data Queries Raster & Vector Data Models
Spatial Databases: Building Spatial DB
URBDP 422 Urban and Regional Geo-Spatial Analysis
Spatial Databases: Building Spatial DB
Kalman filter, and Logical revision: comparing them on GI
NPS Introduction to GIS: Lecture 1 Based on NIMC and Other Sources.
Presentation transcript:

Quality issues in Spatial Databases M. Mostafavi, G. Edwards, R. Jeansoulin CRG & GEOIDE & REVIGIS Victoria, May 2003

Contents Introduction Problems Objective Methodology Results Discussion Conclusions and perspectives

Introduction Data fusion and Data Quality Multi sources spatial data Vector data : BNDT, BDTQ, … Raster data: satellites images, aerial images,… Need for better quality Logical consistency Completeness Semantic accuracy Temporal accuracy Positional accuracy and more … Decision making ( Effective crisis management (MSPQ))

A real case problem BNDT: good geometry Statistics Canada database, Canada election database: reach descriptive information but weak geometry How to reconcile these two data sets? BNDT SC, EC

Context Fusion SDB1 SDB3 SDB2 Information of greater quality User vision (fitness for use) Producer vision (Product ontology)

Logical consistency Logical consistency is an important element of data quality. It defines the degree of consistency of the data with respect to its specifications. Integrity constrains Explicit rules stated in the data specifications (e.g. connectivity between two objects) Implicit rules (e.g. a river always flows downstream) Ontology vs. specifications Ontology specifications

Project definition NTDB Ontology BDTQ Ontology NTDB data BDTQ data New data set Ontology consistency data consistency data Consistency vs. BNDT data consistency Step 1 Step 2 Step 3 Step 4 Step 5 Integrated ontology Mapping the ontologies Ontology fusion Data fusion Does this Help? Lack of explicit rules Yes ?......No

Consistency in NTDB NTDB Ontologies Delphi Interface Delphi Interface Studying the Logical consistency of the dataset Prolog Dataset Step 1Step 2

Formalizing the ontology Knowledge base Facts Rules Queries BNDT Ontology

Spatial relations in NTDB Spatial relations in NTDB are: 1.Connection relations 2.Sharing relations 3.adjacency relations 4.Superposition relations A B A B A B CDE A B C

Logical approach- facts For NTDB the facts consist of Taxonomy of NTDB Themes Entities Allowed Combinations Code (NTDB identity code) Geometric representations Spatial relations Connection Sharing Superposition/ adjacency Minimal values (e.g. distance constraints between objects)

Logical approach- facts Types of factsTotal groupsFacts Taxonomy Connection Sharing Adjacency/superposition Total There are about 350,000 facts describing the NTDB Remark: regrouping of objects for programming purposes has created some inconsistencies

Logical approach- rules RelationInconsistency rules Connection a. Object A is connected to object B and the inverse relation is not defined. b. Connection is illegal (C1=0-0) and for the same objects we have C1 ≠ 0-0. Sharing a. Object A shares with object B but the inverse relation is not defined. b. The same objects share with different values of C2. Superposition Adjacency a. Two objects are superposed and are adjacent at the same time. Several rules are defined to analyze the ontological consistency of the NTDB. Inconsistency rules

Results (1/2) Inconsistency (inverse connection) Data dictionary: (generic relation) between themes: Railway (L) Connected to Road (L) between themes : Road (L) Connected to Railway (L) Table of connection and cardinalities CodeEntityCombinationC1Code 3002RailwayStandard, Ground level, Operational, Multiple RoadSecondary, Ground level,Hard surface Not verified ?

Results (2/2) Inconsistency (Different Values for the cardinality one) Data dictionary: (Generic relation) Gas and oil facilities (P) is Connected to Building (P) Table of connection and cardinalities CodeEntityCombinationC1Code 788Gas and oil facilities Generic/ unknown Gas and oil facilities Generic/ unknown - (Not verified) 147 ?

Consistency in Data NTDB Ontologies Delphi Interface Studying the Logical consistency of the dataset Prolog VB Interface Dataset Step 1Step 2

Geomedia professional Meet Entirely Contained Entirely Contained by Contains and Contained by Spatially equal touch Meet Overlap Spatial operations

Mapping Polygon – Polygon Relations Relations DisjointMeetEqualInsideContains Covered by CoversOverlap Connection Sharing xxxx Superposition xxxxxx Adjacent x

Mapping problems Several problems Confusions in spatial relations Unique mapping is not possible Cardinalities cannot be considered

Step 2: BNDT Data vs its ontology

Data vs ontology File 21E05 Region: Sherbrooke 68 Entities 23,283 objects Analyzed binary relations: Contours vs. water bodies Buildings vs. roads Water bodies vs. buildings Liquid depot vs. Liquid depot Roads vs. water bodies …

Results Liquid depot vs. Liquid depot Spatial representations (Point, Area) Spatial relations Ontology/ specification (superposition is illegal) Data (superposition case is found)

Problem: Road crosses a water body Illegal relation with respect to semantics of the objects Incomplete ontology Results

Problem: Cut line crosses a water body Illegal relation with respect to semantic definition of the objects Incomplete ontology Results

Problem: Contour crosses water body Illegal relation with respect to the ontology Inconsistent data Results

Problem: Road crosses water body Illegal relation with respect to the ontology Inconsistent data

Problem: Road crosses Building Illegal relation with respect to the semantics of objects Incomplete ontology Results

Problem: Water body (L) superposed Vegetation (A) Illegal relation with respect to the ontology Inconsistent data Control system problem Results

Problem: Buildings (S) superposed to water body (A) Illegal relation with respect to the semantics of objects Inconsistent data Results

Problem: Building (A) Overlap Vegetation (A) Illegal relation with respect to the semantics of objects Inconsistent data Results

Suggestions, solutions Adding new rules Building (a) and vegetation (a) (illegal superposition) Road (l) and building (conditional superposition) A better control system is needed Find exceptions

Current situation Product ontology is analyzed Mapping of topological relations to binary relations Ontology translation in prolog (Delphi program) Consistency studding of spatial relations Connection (table C) Sharing (table D) Superposition and adjacency (table E) Consistency between different relations (fusion of facts) connection and sharing, connection and superposition / adjacency, sharing and superposition / adjacency Consistency of data vs. specifications are studied

Future work logical consistency of other available datasets Mapping of ontologies Fusion of ontologies Fusion of data Consistency of the newly created data set