Approximate Queries by Relaxing Structural Constraints in GIS Arianna D’Ulizia Fernando Ferri Patrizia Grifoni IRPPS-CNR, Rome, Italy First International.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

Dr. Leo Obrst MITRE Information Semantics Information Discovery & Understanding Command & Control Center February 6, 2014February 6, 2014February 6, 2014.
Answering Approximate Queries over Autonomous Web Databases Xiangfu Meng, Z. M. Ma, and Li Yan College of Information Science and Engineering, Northeastern.
Relational data objects 1 Lecture 6. Relational data objects 2 Answer to last lectures activity.
LAOS: Layered WWW AHS Authoring Model and their corresponding Algebraic Operators Dr. Alexandra Cristea
Traditional IR models Jian-Yun Nie.
XML DOCUMENTS AND DATABASES
DATA MODELS A collection of conceptual tools for describing data, data relationships, data semantics, and consistency constraints. Provide a way to describe.
Spatial Database Systems. Spatial Database Applications GIS applications (maps): Urban planning, route optimization, fire or pollution monitoring, utility.
Improved TF-IDF Ranker
Fast Algorithms For Hierarchical Range Histogram Constructions
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Evaluating semantic similarity using GML in Geographic Information Systems Fernando Ferri 1, Anna Formica 2, Patrizia Grifoni 1, and Maurizio Rafanelli.
The Management of Spatial and Temporal Constraints in GIS using Pictorial Interaction on the Web Fernando Ferri IRPPS-CNR-Italy Patrizia Grifoni IRPPS-CNR-Italy.
GIS for Environmental Science
From portions of Chapter 8, 9, 10, &11. Real world is complex. GIS is used model reality. The GIS models then enable us to ask questions of the data by.
A Framework for Ontology-Based Knowledge Management System
IR Models: Overview, Boolean, and Vector
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 7.1.
TERMS, CONCEPTS and DATA TYPES IN GIS Orhan Gündüz.
Ragionamento in condizioni di incertezza: Approccio fuzzy Paolo Radaelli Corso di Inelligenza Articifiale - Elementi.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
Answering Imprecise Queries over Autonomous Web Databases Ullas Nambiar Dept. of Computer Science University of California, Davis Subbarao Kambhampati.
What is Where? u Getting Started With Geographic Information Systems u Chapter 5.
Copyright © 2012 Pearson Education, Inc. Publishing as Prentice Hall 7.1.
Information Retrieval in Practice
Chapter One Overview of Database Objectives: -Introduction -DBMS architecture -Definitions -Data models -DB lifecycle.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Modeling (Chap. 2) Modern Information Retrieval Spring 2000.
Modern Information Retrieval Chap. 02: Modeling (Structured Text Models)
Timed UML State Machines Ognyana Hristova Tutor: Priv.-Doz. Dr. Thomas Noll June, 2007.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
An Approach to Task Modelling for User Interface Design Costin Pribeanu National Institute for Research and Development in Informatics, Bucureşti, Romania.
Logics for Data and Knowledge Representation Semantic Matching.
Number Sense Standards Measurement and Geometry Statistics, Data Analysis and Probability CST Math 6 Released Questions Algebra and Functions 0 Questions.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
Information Systems: Databases Define the role of general information systems Describe the elements of a database management system (DBMS) Describe the.
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
UOS 1 Ontology Based Personalized Search Zhang Tao The University of Seoul.
A view-based approach for semantic service descriptions Carsten Jacob, Heiko Pfeffer, Stephan Steglich, Li Yan, and Ma Qifeng
Querying Structured Text in an XML Database By Xuemei Luo.
RCDL Conference, Petrozavodsk, Russia Context-Based Retrieval in Digital Libraries: Approach and Technological Framework Kurt Sandkuhl, Alexander Smirnov,
September 5-7, Trento Deriving “sub-source” similarities from heterogeneous, semi-stuctured information sources D. Rosaci, G. Terracina, D. Ursino Dipartimento.
Web Usage Mining for Semantic Web Personalization جینی شیره شعاعی زهرا.
Friends and Locations Recommendation with the use of LBSN By EKUNDAYO OLUFEMI ADEOLA
1 Relational Databases and SQL. Learning Objectives Understand techniques to model complex accounting phenomena in an E-R diagram Develop E-R diagrams.
ICS 321 Fall 2011 The Relational Model of Data (i) Asst. Prof. Lipyeow Lim Information & Computer Science Department University of Hawaii at Manoa 8/29/20111Lipyeow.
Personalized Interaction With Semantic Information Portals Eric Schwarzkopf DFKI
Algorithmic Detection of Semantic Similarity WWW 2005.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
A Quick Introduction to GIS
DATA Spatial Data – where things are Non Spatial Data or Attribute Data – What things are Data in a computer database are managed and accessed through.
Lecture 10 Creating and Maintaining Geographic Databases Longley et al., Ch. 10, through section 10.4.
Predicting the Location and Time of Mobile Phone Users by Using Sequential Pattern Mining Techniques Mert Özer, Ilkcan Keles, Ismail Hakki Toroslu, Pinar.
Modeling Security-Relevant Data Semantics Xue Ying Chen Department of Computer Science.
Discriminative Training and Machine Learning Approaches Machine Learning Lab, Dept. of CSIE, NCKU Chih-Pin Liao.
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
WFM 6202: Remote Sensing and GIS in Water Management © Dr. Akm Saiful IslamDr. Akm Saiful Islam WFM 6202: Remote Sensing and GIS in Water Management Dr.
Of 24 lecture 11: ontology – mediation, merging & aligning.
A Methodology for automatic retrieval of similarly shaped machinable components Mark Ascher - Dept of ECE.
Why Is It There? Chapter 6. Review: Dueker’s (1979) Definition “a geographic information system is a special case of information systems where the database.
Ontology Engineering and Feature Construction for Predicting Friendship Links in the Live Journal Social Network Author:Vikas Bahirwani 、 Doina Caragea.
COP Introduction to Database Structures
Associative Query Answering via Query Feature Similarity
Database.
Instance Model Structure
The Relational Model Textbook /7/2018.
Data Model.
Database Systems Instructor Name: Lecture-3.
Ying Dai Faculty of software and information science,
Presentation transcript:

Approximate Queries by Relaxing Structural Constraints in GIS Arianna D’Ulizia Fernando Ferri Patrizia Grifoni IRPPS-CNR, Rome, Italy First International Workshop on Semantic and Conceptual Issues in Geographic Information Systems (SeCoGIS 2007)

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Motivation  The Geographical Pictorial Query Language (GeoPQL)  The similarity model Semantic similarity Structural similarity  Motivation

SeCoGIS 2007, Auckland, New Zealand, November 7-8, Query geographical database Null answer search condition Similar concepts Relaxing constraints Similarity model Most similar concepts: 1)Airline 2)Busline 3)Steamship EXAMPLE “Find all the Towns which are PASSED THROUGH by a Railway” Railway Town

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Spatial constraints: spatial relationships existing between geographical objects  Semantic constraints: concepts represented by the geographical objects Semantic similarity can be calculated by providing a weighted taxonomy of concepts represented by each geographical object and by evaluating their related degrees of similarity Structural similarity can be measured considering similarity of attributes, types and values of each pair of geographical objects  Structural constraints: internal features of geographical objects Spatial similarity can be measured by a topological similarity graph, that links all configurations between two geographical objects by the lowest topological distance.

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Similarity between concept names expressed in the query and concept names in the database.  Two strategies: to calculate the distance between nodes corresponding to the items being compared  items have to be represented as a network to evaluate the information content of items  items have to be represented as an Is-a taxonomy

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Similarity between attributes of the concept expressed in the query and concepts in the database.  Research fields: Information retrieval: evaluation of distances between tree structures in terms of the hierarchical relationships of their nodes  similar XML documents Ontology matching: measure of distances among concepts in terms of their structures (attributes, types, values)

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  To analyze the problem of matching a geographical query with imprecise or missing data.  To propose a relaxation model that considers with different weights: the semantic similarity of geographical concepts, that is evaluated by adopting the information content approach, and the structural similarity of the attributes and types of geographical objects that is inspired by the maximum weighted matching problem in bipartite graphs.

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Motivation  The Geographical Pictorial Query Language (GeoPQL)  The similarity model Semantic similarity Structural similarity  The Geographical Pictorial Query Language (GeoPQL)

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  GeoPQL allows to specify queries using Symbolic Graphical Objects (SGOs)  An SGO is a 4-tuple ψ  = where:  id is the SGO identifier;  objclass is the concept name iconized by ψ;  Σ represents the set of attributes of ψ;  Λ is the ordered set of coordinate pairs (h, v), which defines the spatial extent and position of the SGO with respect to the coordinate reference system of the working area. DEFINITION

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  GeoPQL algebra consists of 12 operators: Geo-union, Geo-difference, Geo-disjunction, Geo-touching, Geo-inclusion, Geo-crossing (cross between two polylines), Geo-pass-through (a polyline passes through a polygon), Geo-overlapping, Geo-equality, Geo-distance, Geo-any (any relationship is valid between two SGO) Geo-alias (OR operator). explicitly expressed automatically deduced by the query’s pictorial representatio n

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  GeoPQL interfaces with ESRI’s GIS ArcView®.  The geographical query drawn by the user is translated in an eXtended SQL language, called XSQL.  At the end of the drawing phase the XSQL query is translated into ArcView® and executed on ArcMap® (the geographical database of ArcView®).

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  “Find all the Towns which are PASSED THROUGH by a Railway AND which INCLUDE a Church”

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Railway(number:int,local:boolean,goods:boolean,passengers:i nt,dep_railway_station:string,dest_railway_station:string)  Airline(number:int,company:string,travelers:int,dep_air_term inal:string,dest_air_terminal:string)  Busline(number:int,local:boolean,passengers:int,dep_bus_ter minal:string,dest_bus_terminal:string)  Steamship(number:int,passengers:int,dep_port:string,dest_por t:string)  Urban_area (region:string,country:string)  City (name:string,region:string,country:string, people:int)  Town(name:string,region:string,country:string,inhabitants:int)  Market_town(name:string,region:string,country:string,market _day:string)

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Motivation  The Geographical Pictorial Query Language (GeoPQL)  The similarity model Semantic similarity Structural similarity  The similarity model

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  The total similarity between two SGOs ψ 1 and ψ 2 is a weighted sum of two kinds of similarity:  the semantic similarity SemSIM, that performs a conceptual comparison between the two SGOs in term of the concept names objclass 1 and objclass 2 they belong to, and  the structural similarity StrSIM, that performs a comparison between ψ 1 and ψ 2 in term of their attributes Σ 1 and Σ 2, the types of Σ 1 and Σ 2.

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Motivation  The Geographical Pictorial Query Language (GeoPQL)  The similarity model Semantic similarity Structural similarity

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Starting assumption: geographical concepts are organized according to a taxonomy  Each concept is associated with a weight, standing for the probability of encountering instances of that concepts along the hierarchy.  Formally: Given a concept c, the probability p(c) is defined as follows: p(c) = freq(c)/M where freq(c) is the frequency of the concept c estimated using noun frequencies from large text corpora, such as WordNet and M is the total number of concepts.

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Weighted taxonomy: association of probabilities with the concepts of the taxonomy  Information content of a concept c: - log p(c) p(Line)= 131/ Entity (1) Line ( ) Airline ( ) Railway ( ) Transit Line ( ) Bus line ( ) Railway_station ( ) Air_terminal ( ) Bus_terminal ( ) Steamship (0.0004) Facility (0.0004) Station ( ) Terminal ( ) Town ( ) Market town ( ) Urban area ( ) City ( ) Entity Line Airline Railway Transit Line Bus line Railway_station Air_terminal Bus_terminal Steamship Facility Station Terminal Town Market town Urban area City

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Information content similarity (ics): maximum information content shared by the concepts divided by the information content of the compared concepts. Consider the two SGO concept names City and Town: EXAMPLE  Semantic similarity between two SGOs ψ 1 and ψ 2 is: Town ( ) Market town ( ) Urban area ( ) City ( ) SemSIM( City,Town) = 2 log p(Urban_area) /(log p(City) + log p(Town)) = 2 * / ( ) = 0.92

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Motivation  The Geographical Pictorial Query Language (GeoPQL)  The similarity model Semantic similarity Structural similarity

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  The evaluation of structural similarity between two SGOs ψ 1 and ψ 2 requires the comparison between: Σ 1 and Σ 2, that are the sets of attributes of ψ 1 and ψ 2 respectively, the cardinalities of Σ 1 and Σ 2 and finally the types of Σ 1 and Σ 2 Attribute names similarity Attribute cardinality similarity Attribute types similarity

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Given two SGOs ψ 1 and ψ 2 and their related sets of attributes Σ 1 and Σ 2, we consider:  Then, for each set S a,b we consider the sum of the ics of the attributes of each pair of the set.  The maximal sum corresponds to the attribute names similarity Sim(Σ 1,Σ 2 ). S={S a,b }

SeCoGIS 2007, Auckland, New Zealand, November 7-8, Railway(number:int,local:boolean,goods:boolean,dep_railway_ station:string, dest_railway_station:string, passengers:int) Airline(number:int, company:string, travelers:int, dep_air_terminal:string, dest_air_terminal:string) {(number,number), (passengers,travelers), (dep_railway_station,dep_air_terminal), (dest_railway_station,dest_air_terminal)} Sim(Railway,Airline) = 1/6 * [ ] = 0.643

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  To evaluate the similarity between the cardinalities of Σ 1 and Σ 2, as the they are numeric values, we can consider the ratio between the minimal and maximal cardinality between Σ 1 and Σ 2 that we indicate as ||Σ min || and ||Σ max || respectively. Railway(number:int,local:boolean,goods:boolean,dep_railway_ station:string, dest_railway_station:string, passengers:int) Airline(number:int, company:string, travelers:int, dep_air_terminal:string, dest_air_terminal:string) EXAMPLE Sim(Railway,Airline) = 5/6 = 0.83

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  Given two SGOs ψ 1 and ψ 2, their related sets of attributes Σ 1 and Σ 2, and the sets of types T 1 and T 2 associated with Σ 1 and Σ 2, we consider:  Then, the set of pairs of attributes with maximal cardinality is considered.  The maximal cardinality divided by the maximal cardinality between Σ 1 and Σ 2, that we indicate as ||Σ max ||, corresponds to the similarity Sim(Σ 1 type,Σ 2 type)

SeCoGIS 2007, Auckland, New Zealand, November 7-8, Railway(number:int,local:boolean,goods:boolean,dep_railway_ station:string, dest_railway_station:string, passengers:int) Airline(number:int, company:string, travelers:int, dep_air_terminal:string, dest_air_terminal:string) Sim(Railway,Airline) = 4/6 = 0.66

SeCoGIS 2007, Auckland, New Zealand, November 7-8, StrSIM(Railway,Airline) = 0.33 * * * 0.66 = the structural similarity between Railway and Airline is: As we obtained: Sim A (Railway,Airline) = 1/6 * [ ] = Sim C (Railway,Airline) = 5/6 = 0.83 Sim T (Railway,Airline) = 4/6 = 0.66

SeCoGIS 2007, Auckland, New Zealand, November 7-8, TotSIM(Railway,Airline) = 0.5 * * = the total similarity between Railway and Airline is: As we obtained: StrSIM(Railway,Airline) = SemSIM(Railway,Airline) = 2*2.83 /( )= =5.66/6.96=0.81

SeCoGIS 2007, Auckland, New Zealand, November 7-8,  We proposed an approach for the evaluation of similarity among Symbolic Graphical Objects (SGOs) of GeoPQL based on the semantic similarity of SGO concept names and the structural similarity of SGO attributes and types;  The similarity degree is used to provide approximate queries in the case that the concept expressed in the user query is missing or has no instances in the database. The missing concept is replaced by the most similar one.

SeCoGIS 2007, Auckland, New Zealand, November 7-8,