Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy.

Slides:



Advertisements
Similar presentations
Schema Matching and Query Rewriting in Ontology-based Data Integration Zdeňka Linková ICS AS CR Advisor: Július Štuller.
Advertisements

Learning to Map between Ontologies on the Semantic Web AnHai Doan, Jayant Madhavan, Pedro Domingos, and Alon Halevy Databases and Data Mining group University.
Leveraging Data and Structure in Ontology Integration Octavian Udrea 1 Lise Getoor 1 Renée J. Miller 2 1 University of Maryland College Park 2 University.
Project 2 Ontology alignment. SIGNAL-ONTOLOGY (SigO) Immune Response i- Allergic Response i- Antigen Processing and Presentation i- B Cell Activation.
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
Leveraging Community-built Knowledge For Type Coercion In Question Answering Aditya Kalyanpur, J William Murdock, James Fan and Chris Welty Mehdi AllahyariSpring.
Data Intensive Techniques to Boost the Real-time Performance of Global Agricultural Data Infrastructures SEMAGROW U SING A POWDER T RIPLE S TORE FOR BOOSTING.
Ameet N Chitnis, Abir Qasem and Jeff Heflin 11 November 2007.
1 Learning to Detect Objects in Images via a Sparse, Part-Based Representation S. Agarwal, A. Awan and D. Roth IEEE Transactions on Pattern Analysis and.
Mapping Between Taxonomies Elena Eneva 27 Sep 2001 Advanced IR Seminar.
A web-based repository service for vocabularies and alignments in the Cultural Heritage domain Lourens van der Meij Antoine Isaac Claus Zinn.
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Mapping Between Taxonomies Elena Eneva 11 Dec 2001 Advanced IR Seminar.
Queensland University of Technology An Ontology-based Mining Approach for User Search Intent Discovery Yan Shen, Yuefeng Li, Yue Xu, Renato Iannella, Abdulmohsen.
ReQuest (Validating Semantic Searches) Norman Piedade de Noronha 16 th July, 2004.
Towards Semantic Web: An Attribute- Driven Algorithm to Identifying an Ontology Associated with a Given Web Page Dan Su Department of Computer Science.
Scalable Text Mining with Sparse Generative Models
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Language Identification of Search Engine Queries Hakan Ceylan Yookyung Kim Department of Computer Science Yahoo! Inc. University of North Texas 2821 Mission.
COMP423: Intelligent Agent Text Representation. Menu – Bag of words – Phrase – Semantics – Bag of concepts – Semantic distance between two words.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
RuleML-2007, Orlando, Florida1 Towards Knowledge Extraction from Weblogs and Rule-based Semantic Querying Xi Bai, Jigui Sun, Haiyan Che, Jin.
Processing of large document collections Part 2 (Text categorization) Helena Ahonen-Myka Spring 2006.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
Learning Object Metadata Mining Masoud Makrehchi Supervisor: Prof. Mohamed Kamel.
The Semantic Web Service Shuying Wang Outline Semantic Web vision Core technologies XML, RDF, Ontology, Agent… Web services DAML-S.
Quantifying the dynamics of Binary Search Trees under combined insertions and deletions BACKGROUND The complexity of many operations on Binary Search Trees.
Exploiting Ontologies for Automatic Image Annotation M. Srikanth, J. Varner, M. Bowden, D. Moldovan Language Computer Corporation
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
The MMI Tools Carlos Rueda Monterey Bay Aquarium Research Institute OOS Semantic Interoperability Workshop Marine Metadata Interoperability Project Boulder,
A Simple Unsupervised Query Categorizer for Web Search Engines Prashant Ullegaddi and Vasudeva Varma Search and Information Extraction Lab Language Technologies.
The KOS interoperability in aquatic science field through mapping processes Carmen Reverté Reverté Aquatic Ecosystems Documentation Center. IRTA. (Sant.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Collocations and Information Management Applications Gregor Erbach Saarland University Saarbrücken.
SSO: THE SYNDROMIC SURVEILLANCE ONTOLOGY Okhmatovskaia A, Chapman WW, Collier N, Espino J, Conway M, Buckeridge DL Ontology Description The SSO was developed.
Towards Distributed Information Retrieval in the Semantic Web: Query Reformulation Using the Framework Wednesday 14 th of June, 2006.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Date : 2013/03/18 Author : Jeffrey Pound, Alexander K. Hudek, Ihab F. Ilyas, Grant Weddell Source : CIKM’12 Speaker : Er-Gang Liu Advisor : Prof. Jia-Ling.
ISWC2007, Nov. 14. Discovering simple mappings between Relational database schemas and ontologies Wei Hu, Yuzhong Qu {whu,
Semi-Automatic Quality Assessment of Linked Data without Requiring Ontology Saemi Jang, Megawati, Jiyeon Choi, and Mun Yong Yi KIRD, KAIST NLP&DBPEDIA.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Number Sense Disambiguation Stuart Moore Supervised by: Anna Korhonen (Computer Lab)‏ Sabine Buchholz (Toshiba CRL)‏
KNN & Naïve Bayes Hongning Wang Today’s lecture Instance-based classifiers – k nearest neighbors – Non-parametric learning algorithm Model-based.
Class Imbalance in Text Classification
Text Based Similarity Metrics and Delta for Semantic Web Graphs Krishnamurthy Koduvayur Viswanathan Monday, June 28,
AIFB Ontology Mapping I3CON Workshop PerMIS August 24-26, 2004 Washington D.C., USA Marc Ehrig Institute AIFB, University of Karlsruhe.
Divided Pretreatment to Targets and Intentions for Query Recommendation Reporter: Yangyang Kang /23.
Multilingual Information Retrieval using GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of Kaohsiung.
ISO TC37/SC4 N435 Nov 12, 2007 Presented by Miran Choi/ETRI Written by Jae Sung Lee/Chungbuk National Univ.
A Multilingual Hierarchy Mapping Method Based on GHSOM Hsin-Chang Yang Associate Professor Department of Information Management National University of.
Toward Entity Retrieval over Structured and Text Data Mayssam Sayyadian, Azadeh Shakery, AnHai Doan, ChengXiang Zhai Department of Computer Science University.
KNN & Naïve Bayes Hongning Wang
The Semantic Web By: Maulik Parikh.
Guangbing Yang Presentation for Xerox Docushare Symposium in 2011
Introduction to Unified Modeling Language (UML)
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
NJVR: The NanJing Vocabulary Repository
Property consolidation for entity browsing
Searching and browsing through fragments of TED Talks
Information Networks: State of the Art
RDA Community and linked data
Chaitali Gupta, Madhusudhan Govindaraju
Presentation transcript:

Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy

Motivation Various SW repositories, using different vocabularies, distributed on the web Already large amounts of data out there Swoogle hits 10 7 – 10 9 unique Semantic Web documents (16/11/2006) Problem: How to search and retrieve information in such an environment? How to map the various vocabularies used ?

oMAP: Ontology Alignment Tool

oMAP: A Formal Framework Sources of inspiration: Formal work in data exchange [Fagin et al., 2003] GLUE: combining several specialized components for finding the best set of mappings [Doan et al., 2003] Notation: A mapping is a triple: M = (T, S, ∑ )  S and T are the source and target ontologies  S i is an OWL entity (class, datatype property, object property) of the ontology  ∑ is a set of mapping rules: α ij T j ← S i

oMAP: Combining Classifiers Weight of a mapping rule: α ij = w (S i,T j, ∑ ) Using different classifiers: w (S i,T j, CL k ) is the classifier's approximation of the rule T j ← S i Combining the approximations: Use of a priority list: CL 1 CL 2 … CL n Weighted average of the classifiers prediction

Terminological Classifiers Same entity names (or URI) Same entity name stems

Terminological Classifiers String distance name Iterative substring matching See [Stoilos et al., ISWC'05]

Terminological Classifiers WordNet distance name lcs is the longest common substring between S i and T j sim =

Machine Learning-Based Classifiers Collecting bag of words: label for the named individuals data value for the datatype properties type for the anonymous individuals and the range of object properties … Recursion on the OWL definition: depth parameter Use statistical methods on the collected bag of words

Machine Learning-Based Classifiers Example Individual (x 1 type (Workshop) value (label "DECOR") value (location x 2 ) ) Individual (x 2 type (Address) value (city "Namur") value (country "Belgium") ) u1 = (" DECOR ", "Address") u2 = ("Address", "Namur", "Belgium") Naïve Bayes text classifier kNN text classifier

Structural and Semantics-Based Classifier ∑ is a set of mapping rules: α ij T j ← S i ∑ sets are computed by taking the OWL definition of the entities to align recursively in the OWL structure... without looping thanks to cycles detection

Structural and Semantics-Based Classifier If S i and T j are property names: If S i and T j are concept names 1 : 1 Where D = D(S i ) * D(T j ) ; D(S i ) represents the set of concepts directly parent of S i

Structural and Semantics-Based Classifier Let C S =(QR.C) and D T =(Q’R’.D), then 1 : Let C S =(op C 1 …C m ) and D T =(op’ D 1 …D m ), then 2 : 1 Where Q,Q’ are quantifiers, R,R’ are property names and C,D concept expressions 2 Where op, op’ are concept constructors and n,m ≥ 1

Evaluation OAEI Contests (2004, 2005, 2006): Systematic benchmark tests on bibliographic data  Tests 2xx: aligning an ontology with variations of itself where each OWL constructs are discarded or modified one per one  Tests 3xx: four real bibliographic ontologies Web categories alignment

Benchmark Tests (2005) dublin Falcon0.91 FOAM0.90 oMAP0.85 CMS0.81 OLA0.80 ctxMatch0.72 edna0.45 Falcon0.89 OLA0.74 dublin FOAM0.69 oMAP0.68 edna0.61 ctxMatch0.20 CMS0.18 oMAP: 4 th with the global F-Measure 1 st on 3xx tests (real ontologies to align) Precision Recall

Aligning Web Categories (2005) Aligning Google, Loksmart and Yahoo web categories [Avesani et al., ISWC'05] Blind tests: only recall results are available ctxMatchFOAMCMSDublin20FalconOLAoMAP 9.4%11.9%14.1%26.5%31.2%32.0%34.4%

Conclusion oMAP: a formal framework for aligning automatically OWL ontologies Combining several specific classifiers Terminological classifiers Machine learning-based classifiers Structural and semantics-based classifier

Future Work oMAP Using additional classifiers:  KL-distance, other resources, background K, etc.  Straightforward theoretically but practically difficult! Finding complex alignment  name = firstName + lastName OWL and rule-based languages:  Take into account this additional expressivity Other KR languages: e.g. SKOS

Any questions ?

Structural and Semantics-Based Classifier Possible values for w op and w Q weights w op w Q ⊓⊔ ¬ ⊓ 11/40 ⊔ 10 ¬1   1  1  n  n  m11/3  m 1