Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.

Slides:



Advertisements
Similar presentations
Semi-automatic compound nouns annotation for data integration systems Tuesday, 23 June 2009 SEBD 2009 Sonia Bergamaschi Serena Sorrentino
Advertisements

Three-Step Database Design
Discovering Missing Background Knowledge in Ontology Matching Pavel Shvaiko 17th European Conference on Artificial Intelligence (ECAI06) 30 August 2006,
S-Match: an Algorithm and an Implementation of Semantic Matching Fausto Giunchiglia July 2004, Hannover, Germany work in collaboration with Pavel Shvaiko.
S-Match: an Algorithm and an Implementation of Semantic Matching Pavel Shvaiko 1 st European Semantic Web Symposium, 11 May 2004, Crete, Greece paper with.
Amit Shvarchenberg and Rafi Sayag. Based on a paper by: Robin Dhamankar, Yoonkyong Lee, AnHai Doan Department of Computer Science University of Illinois,
Distributed DBMS© M. T. Özsu & P. Valduriez Ch.4/1 Outline Introduction Background Distributed Database Design Database Integration ➡ Schema Matching ➡
An Extensible System for Merging Two Models Rachel Pottinger University of Washington Supervisors: Phil Bernstein and Alon Halevy.
Dynamic Ontology Matching Pavel Shvaiko OpenKnowledge meetings 9 February, 13 March, 2006 Trento, Italy.
Interactive Generation of Integrated Schemas Laura Chiticariu et al. Presented by: Meher Talat Shaikh.
NaLIX: A Generic Natural Language Search Environment for XML Data Presented by: Erik Mathisen 02/12/2008.
A Review of Ontology Mapping, Merging, and Integration Presenter: Yihong Ding.
Generic Schema Matching with Cupid Jayant Madhavan Philip A. Bernstein Erhard Raham Proceedings of the 27 th VLDB Conference.
Slides adapted from A. Silberschatz et al. Database System Concepts, 5th Ed. Entity-Relationship Model Database Management Systems I Alex Coman, Winter.
Page 1 Multidatabase Querying by Context Ramon Lawrence, Ken Barker Multidatabase Querying by Context.
QoM: Qualitative and Quantitative Measure of Schema Matching Naiyana Tansalarak and Kajal T. Claypool (Kajal Claypool - presenter) University of Massachusetts,
Sangam: A Transformation Modeling Framework Kajal T. Claypool (U Mass Lowell) and Elke A. Rundensteiner (WPI)
CIS607, Fall 2005 Semantic Information Integration Article Name: Clio Grows Up: From Research Prototype to Industrial Tool Name: DH(Dong Hwi) kwak Date:
Query Rewriting for Extracting Data Behind HTML Forms Xueqi Chen Department of Computer Science Brigham Young University March 31, 2004 Funded by National.
4.4 Naming And Directory Services Lakshmi Narayana Gupta Kollepara 09/20/2009 CSC-8320.
ONTOLOGY MATCHING Part III: Systems and evaluation.
Partners Using NLP Techniques for Meaning Negotiation Bernardo Magnini, Luciano Serafini and Manuela Speranza ITC-irst, via Sommarive 18, I Trento-Povo,
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Managing Large RDF Graphs (Infinite Graph) Vaibhav Khadilkar Department of Computer Science, The University of Texas at Dallas FEARLESS engineering.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
A survey of approaches to automatic schema matching Erhard Rahm, Universität für Informatik, Leipzig Philip A. Bernstein, Microsoft Research VLDB 2001.
Web Explanations for Semantic Heterogeneity Discovery Pavel Shvaiko 2 nd European Semantic Web Conference (ESWC), 1 June 2005, Crete, Greece work in collaboration.
XML-to-Relational Schema Mapping Algorithm ODTDMap Speaker: Artem Chebotko* Wayne State University Joint work with Mustafa Atay,
LDK R Logics for Data and Knowledge Representation Semantic Matching.
An Extension to XML Schema for Structured Data Processing Presented by: Jacky Ma Date: 10 April 2002.
Reasoning with context in the Semantic Web … or contextualizing ontologies Fausto Giunchiglia July 23, 2004.
SCHEMA-BASED SEMANTIC MATCHING Pavel Shvaiko joint work on “semantic matching” with Fausto Giunchiglia and Mikalai Yatskevich joint work on “ontology matching”
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Logics for Data and Knowledge Representation Semantic Matching.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
A Z Approach in Validating ORA-SS Data Models Scott Uk-Jin Lee Jing Sun Gillian Dobbie Yuan Fang Li.
PART IV: REPRESENTING, EXPLAINING, AND PROCESSING ALIGNMENTS & PART V: CONCLUSIONS Ontology Matching Jerome Euzenat and Pavel Shvaiko.
A SURVEY OF APPROACHES TO AUTOMATIC SCHEMA MATCHING Sushant Vemparala Gaurang Telang.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
September 5-7, Trento Deriving “sub-source” similarities from heterogeneous, semi-stuctured information sources D. Rosaci, G. Terracina, D. Ursino Dipartimento.
Quality Control for Wordnet Development in BalkaNet Pavel Smrž Faculty of Informatics, Masaryk University in Brno, Czech.
1 Relational Databases and SQL. Learning Objectives Understand techniques to model complex accounting phenomena in an E-R diagram Develop E-R diagrams.
CSE 636 Data Integration Schema Matching Cupid Fall 2006.
Interoperable Visualization Framework towards enhancing mapping and integration of official statistics Haitham Zeidan Palestinian Central.
Logics for Data and Knowledge Representation Applications of ClassL: Lightweight Ontologies.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
P2P Concept Search Fausto Giunchiglia Uladzimir Kharkevich S.R.H Noori April 21st, 2009, Madrid, Spain.
Object Oriented Multi-Database Systems An Overview of Chapters 4 and 5.
A Survey of Approaches to Automatic Schema Matching (VLDB Journal, 2001) November 7, 2008 IDB SNU Presented by Kangpyo Lee.
ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu,
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Slide #1 Boston, Jan 5 – 6, 2005XCON WG Interim draft-levin-xcon-cccp-01.txt By Orit Levin
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Ferdowsi University of Mashhad 1 Automatic Semantic Web Service Composition based on owl-s Research Proposal presented by : Toktam ghafarian.
Deriving Relation Keys from XML Keys by Qing Wang, Hongwei Wu, Jianchang Xiao, Aoying Zhou, Junmei Zhou Reviewed by Chris Ying Zhu, Cong Wang, Max Wang,
Presented by Kyumars Sheykh Esmaili Description Logics for Data Bases (DLHB,Chapter 16) Semantic Web Seminar.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Sheet 1MDAFA2004 Linköping, June 2004 A Language for Model Transformations in the MOF Architecture Ivan Kurtev, Klaas van den Berg University of Twente,
Of 24 lecture 11: ontology – mediation, merging & aligning.
SERVICE ANNOTATION WITH LEXICON-BASED ALIGNMENT Service Ontology Construction Ontology of a given web service, service ontology, is constructed from service.
Element Level Semantic Matching
Logics for Data and Knowledge Representation
Tantan Liu, Fan Wang, Gagan Agrawal The Ohio State University
Presentation transcript:

Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003

2 Outline Matching Syntactic Matching Semantic Matching On Implementing Semantic Matching Conclusions

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, MATCHING

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Application Domains Generic Model Management Schema integration Data warehouses E-commerce Semantic query processing Data Coordination in P2P systems

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Matching Problems 1. RDB Schemas 2. OODB Schemas 3. XML Schemas 4. Concept Hierarchies 5. Ontologies

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Example of Matching Arts Organizations Art History Music Baroque History Organizations Arts&Humanities Art History Design Art Baroque Architecture History S c =1.0 S r ={  } S c =1.0 S r ={  } S r ={  }

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Matching Match is an operator that takes two graph-like structures (e.g., database schemas or ontologies) and produces a mapping between elements of the two graphs that correspond semantically to each other

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Matching The problem of matching can be decomposed in two steps: Extract graphs from the data and conceptual models Match the resulting graphs (generic matching)

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Matching Mapping element is a 4-tuple, i=1...h; j=1..k; where m ID is a unique identifier of the given mapping element; N i 1 is the i-th node of the first graph, h is the number of nodes in the first graph; N j 2 is the j-th node of the second graph, k is the number of nodes in the second graph R specifies a similarity relation of the given nodes Mapping is a set of mapping elements Matching is the process of discovering mappings between two graphs through the application of a matching algorithm

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Matching: Syntactic AND Semantic Matching Semantic Matching Syntactic Matching R is computed between labels at nodes R = [0,1] R is computed between concepts at nodes R = {set-theoretic relations, e.g., =, , , ,  }

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, SYNTACTIC MATCHING

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Syntactic Matching Mapping element is a 4-tuple, where L i 1 is the label at the i-th node of the first graph; L j 2 is the label at the j-th node of the second graph; R specifies a similarity relation in the form of a coefficient, which measures the similarity between the labels of the given nodes Example: R is a similarity coefficient in [0,1] R =

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Example: Cupid (tentative links) Arts Organizations Art History Music Baroque History Organizations Arts&Humanities Art History Design Art Baroque Architecture History S c =1.0 S c =0.9 S c =1.0 S c =0.7 S c =1.0 S c =0.7 (final result)

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, The State of the Art Cupid … is a hybrid matching prototype. It exploits linguistic and structural schema matching heuristics, and computes similarity coefficients between nodes of the trees. Similarity Flooding … is a hybrid matching prototype. It uses fix-point computation to determine correspondences between nodes of the graphs. COMA …is a composite matching prototype. It provides an extensible library of different matchers which manipulate DAGs and supports various ways of combining final results. As far as we know, so far only syntactic matching…

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, SEMANTIC MATCHING

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Mapping element is a 4-tuple, where C i 1 is the concept of the i-th node of the first graph; C j 2 is the concept of the j-th node of the second graph; R specifies a similarity relation in the form of a semantic relation between the extensions of concepts at the given nodes Possible R’s: equality {=}, overlapping {  }, mismatch {  }, more general/specific { ,  } Example: R =

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Examples: Analysis of Siblings Suppose that we want to match nodes 5 1 and 2 2 Cupid: R = 0,8. This is because A 1 =A 2, C 1 =C 2 and we have the same structures on both sides (no importance of order of links) A semantic matching approach compares concepts A 1  C 1 with A 2  C 2 and produces C 5 1 = C 2 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Examples: Analysis of Ancestors. Case 1 Suppose that we want to match nodes 5 1 and 1 2 Cupid does not find a similarity coefficient between the nodes under consideration, due to the significant differences in structure of the given graphs Semantic matching: The concept denoted by the label at node 5 1 is C 1, while the concept at node 5 1 is C 5 1 = A 1  C 1. The concept at node 1 2 is C 1 2 = C 2. Thus, C 5 1  C 1 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Examples: Analysis of Ancestors. Case 2 Suppose that we want to match nodes 5 1 and 5 2 Cupid: R= 0,86. This is because of the identity of labels A 1 =A 2, C 1 =C 2 Semantic matching: The concept at node 5 1 is C 5 1 = A 1  C 1 ; while the concept at node 5 2 is C 5 2 = A 2  *  C 2. Since we have that A 1 =A 2 and C 1 =C 2, then C 5 2  C 5 1

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Examples: Enriched Analysis of Siblings Suppose that we want to match nodes 2 1 and 2 2 Cupid: R= 0,68. This is mainly because of the entry in the thesaurus specifying Belgium as a part of Benelux, and due to the fact that the nodes with labels Benelux 1 and Belgium 2 are leaves Semantic matching: We treat C 2 1 as Benelux 1  Netherlands 1  Luxembourg 1 = Belgium. Thus, C 2 1 = C 2 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, ON IMPLEMENTING SEMANTIC MATCHING

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, On Implementation Semantic Matching Structure - level Element - level Weak Semantics Techniques Strong Semantics Techniques

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Element-level Semantic Matching Weak Semantics Techniques Analysis of strings {=} Analysis of data types {=, , , ,  } Analysis of soundex {=} Strong Semantics Techniques Precompiled thesaurus syn key WordNet, where #1 … sense number 1 of the word Art according to WordNet

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Element-level Semantic Matching (cont.) Semantic Relations via WordNet Equality: one concept is equal to another if there is at least one sense of the first concept, which is a synonym of the second Overlapping: one concept is overlapped with the other if there are some senses in common Mismatch: two concepts are mismatched if they have no sense in common More general: one concept is more general then the other iff there exists at least one sense of the first concept that has a sense of the other as a hyponym or meronym Less general: one concept is less general than the other iff there exists at least one sense of the first concept that has a sense of the other concept as hypernym or as a holonym

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Structure-level Semantic Matching We translate the matching problem, namely the two graphs (in particular, the pair of nodes submitted to matching) into a propositional formula and then check for its validity We check for validity using SAT

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm 1. Extract the two graphs 2. Compute element-level semantic matching 3. Compute concepts at nodes 4. Construct the propositional formula 5. Run SAT 6. Perform iterations

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (1) Extract the two graphs In the case of RDB, XML and OODB schemas, it is necessary to extract useful semantic information, for instance in the form of ontologies

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (2) Element-level semantic matching. For each node, compute semantic relations holding among all the concepts denoted by labels at nodes under consideration A 1 = A 2 B 1 = B 2 C 1 = C 2 D 1 = D 2 E 1 = E 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (3) Compute concepts at nodes. Suppose, we want to find a semantic relation between nodes 5 1 and 1 2 ? C 1 1 = A 1 C 5 1 = A 1  C 2 C 1 2 = C 2 C51  C12C51  C12

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (4) Construct the propositional formula. We translate all the semantic relations computed in step 2 into propositional formulas under the following rules:  A 1  A 2  A 2  A 1  A 1  A 2  A 1  A 2  A 1 = A 2  A 1  A 2  A 1  A 2   (A 1  A 2 ) ? From step 2 we have: C 1  C 2 We want to prove that C 5 1  C 1 2 ( we guess relation between nodes at this stage) (A1  C1)  C2(A1  C1)  C2 (C 1  C 2 )  ((A 1  C 1 )  C 2 )

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (5) Run SAT In order to prove that (C 1  C 2 )  ((A 1  C 1 )  C 2 ) is valid, we prove that its negation is unsatisfiabile (C 1  C 2 )  ((A 1  C 1 )  C 2 ) SAT returns FALSE Thus, C 5 1  C 1 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (6.1) Iterations. Iterations are performed re-running SAT Suppose, that C 2 1  C 2 2   …an oracle tells us that A 1 = F 2  G 2 After this additional analysis we can infer C 2 1 = C 2 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Semantic Matching Algorithm: Example – (6.2) Iterations. …to use the result of a previous match Suppose, that F 1  B 2 Having found that C 4 1  C 4 2 We can automatically infer that C 5 1  C 5 2

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Example: Cupid vs. Semantic Matching Arts Organizations Art History Music Baroque History Organizations Arts&Humanities Art History Design Art Baroque Architecture History {}{} {}{} {}{} {}{} {}{} {}{}

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Conclusions We have made a rational reconstruction of the major matching problems and articulated them in terms of the more generic problem of matching graphs We have identified semantic matching as a new approach for performing generic matching We have proposed an implementation of semantic matching using SAT

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, Future Work Extend to a full graph matcher How to extract semantics from schemas Study how to take into account attributes and instances Develop an efficient implementation of the system Do a thorough testing of the system

The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, References Project website: F. Giunchiglia, P.Shvaiko “Semantic Matching”. Technical Report #DIT , Trento, Also to appear in Proc. of ODS at IJCAI – 03. F. Giunchiglia, I. Zaihrayeu “Making peer databases interact – a vision for an architecture supporting data coordination” In Proc. Of the Conference of Information Agents (CIA 2002), Madrid, 2002