Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.

Slides:



Advertisements
Similar presentations
Semi-automatic compound nouns annotation for data integration systems Tuesday, 23 June 2009 SEBD 2009 Sonia Bergamaschi Serena Sorrentino
Advertisements

Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
The 20th International Conference on Software Engineering and Knowledge Engineering (SEKE2008) Department of Electrical and Computer Engineering
Advanced Information Systems Laboratory Department of Computer Science and Systems Engineering GI-DAYS MÜNSTER A software tool.
Semantic Access to Data from the Web Raquel Trillo *, Laura Po +, Sergio Ilarri *, Sonia Bergamaschi + and E. Mena * 1st International Workshop on Interoperability.
A Linguistic Approach for Semantic Web Service Discovery International Symposium on Management Intelligent Systems 2012 (IS-MiS 2012) July 13, 2012 Jordy.
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Research topics Semantic Web - Spring 2007 Computer Engineering Department Sharif University of Technology.
Semantic Web and Web Mining: Networking with Industry and Academia İsmail Hakkı Toroslu IST EVENT 2006.
Exploiting the Semantic Web: Next Generation Semantic Web Applications in KMi Watson, PowerMagpie, PowerAqua, … Mathieu d’Aquin Laurian Gridinoc Vanessa.
Sentiment Lexicon Creation from Lexical Resources BIS 2011 Bas Heerschop Erasmus School of Economics Erasmus University Rotterdam
Gimme’ The Context: Context- driven Automatic Semantic Annotation with CPANKOW Philipp Cimiano et al.
Creating a Bilingual Ontology: A Corpus-Based Approach for Aligning WordNet and HowNet Marine Carpuat Grace Ngai Pascale Fung Kenneth W.Church.
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
Article by: Feiyu Xu, Daniela Kurz, Jakub Piskorski, Sven Schmeier Article Summary by Mark Vickers.
SemanTic Interoperability To access Cultural Heritage Frank van Harmelen Henk Matthezing Peter Wittenburg Marjolein van Gendt Antoine Isaac Lourens van.
Ontology-based Access Ontology-based Access to Digital Libraries Sonia Bergamaschi University of Modena and Reggio Emilia Modena Italy Fausto Rabitti.
Ontology Learning and Population from Text: Algorithms, Evaluation and Applications Chapters Presented by Sole.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Improving Data Discovery in Metadata Repositories through Semantic Search Chad Berkley 1, Shawn Bowers 2, Matt Jones 1, Mark Schildhauer 1, Josh Madin.
Word Sense Disambiguation for Automatic Taxonomy Construction from Text-Based Web Corpora 12th International Conference on Web Information System Engineering.
Mining and Summarizing Customer Reviews
OMAP: An Implemented Framework for Automatically Aligning OWL Ontologies SWAP, December, 2005 Raphaël Troncy, Umberto Straccia ISTI-CNR
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
Evaluating the Contribution of EuroWordNet and Word Sense Disambiguation to Cross-Language Information Retrieval Paul Clough 1 and Mark Stevenson 2 Department.
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Survey of Semantic Annotation Platforms
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
Name : Emad Zargoun Id number : EASTERN MEDITERRANEAN UNIVERSITY DEPARTMENT OF Computing and technology “ITEC547- text mining“ Prof.Dr. Nazife Dimiriler.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
PAUL ALEXANDRU CHIRITA STEFANIA COSTACHE SIEGFRIED HANDSCHUH WOLFGANG NEJDL 1* L3S RESEARCH CENTER 2* NATIONAL UNIVERSITY OF IRELAND PROCEEDINGS OF THE.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Ontology-Driven Automatic Entity Disambiguation in Unstructured Text Jed Hassell.
Jennie Ning Zheng Linda Melchor Ferhat Omur. Contents Introduction WordNet Application – WordNet Data Structure - WordNet FrameNet Application – FrameNet.
A view-based approach for semantic service descriptions Carsten Jacob, Heiko Pfeffer, Stephan Steglich, Li Yan, and Ma Qifeng
Semantic Learning Instructor: Professor Cercone Razieh Niazi.
Annotating Words using WordNet Semantic Glosses Julian Szymański Department of Computer Systems Architecture, Faculty of Electronics, Telecommunications.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
WORD SENSE DISAMBIGUATION STUDY ON WORD NET ONTOLOGY Akilan Velmurugan Computer Networks – CS 790G.
The Agricultural Ontology Service (AOS) A Tool for Facilitating Access to Knowledge AGRIS/CARIS and Documentation Group Library and Documentation Systems.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
Dimitrios Skoutas Alkis Simitsis
10/22/2015ACM WIDM'20051 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis Voutsakis.
Mining Topic-Specific Concepts and Definitions on the Web Bing Liu, etc KDD03 CS591CXZ CS591CXZ Web mining: Lexical relationship mining.
A Classification of Schema-based Matching Approaches Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan.
Evaluating Semantic Metadata without the Presence of a Gold Standard Yuangui Lei, Andriy Nikolov, Victoria Uren, Enrico Motta Knowledge Media Institute,
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
Aligner automatiquement des ontologies avec Tuesday 23 rd of January, 2007 Rapha ë l Troncy.
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Named Entity Disambiguation on an Ontology Enriched by Wikipedia Hien Thanh Nguyen 1, Tru Hoang Cao 2 1 Ton Duc Thang University, Vietnam 2 Ho Chi Minh.
AIFB Ontology Mapping I3CON Workshop PerMIS August 24-26, 2004 Washington D.C., USA Marc Ehrig Institute AIFB, University of Karlsruhe.
2/10/2016Semantic Similarity1 Semantic Similarity Methods in WordNet and Their Application to Information Retrieval on the Web Giannis Varelas Epimenidis.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Sentiment Analysis Using Common- Sense and Context Information Basant Agarwal 1,2, Namita Mittal 2, Pooja Bansal 2, and Sonal Garg 2 1 Department of Computer.
Food and Agriculture Organization of the UN GILW Library and Documentation Systems Division Food, Nutrition and Agriculture Ontology Portal.
Exploiting Large Scale Web Semantics
Text Analytics in ITS 2.0: Annotation of Named Entities
Social Knowledge Mining
Extracting Semantic Concept Relations
CS 620 Class Presentation Using WordNet to Improve User Modelling in a Web Document Recommender System Using WordNet to Improve User Modelling in a Web.
Block Matching for Ontologies
Giannis Varelas Epimenidis Voutsakis Paraskevi Raftopoulou
Presentation transcript:

Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy March 2010 Hue City Vietnam

The idea When we are dealing with data sources, we are dealing with structure information that are labeled by humans. Humans use lexical expressions to assign names. Natural language labels provide a rich connection between formal objects (e.g. classes and properties) and their intended meanings. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical knowledge inside sources It is necessary to address the problem of how the concepts are "labelled", i.e. understanding the meaning behind the names denoting ontology elements. In NLP (Natural Language Processing), Word Wense Disambiguation (WSD) is the process of identifying which sense of a word (i.e. meaning) is used in any given sentence, when the word has a number of distinct senses (polysemy). Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical knowledge inside sources It is necessary to address the problem of how the concepts are "labelled", i.e. understanding the meaning behind the names denoting ontology elements. sentence In NLP (Natural Language Processing), Word Wense Disambiguation (WSD) is the process of identifying which sense of a word (i.e. meaning) is used in any given sentence, when the word has a number of distinct senses (polysemy). Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Applications of WSD Information Retrieval Information Extraction Machine Translation Content Analysis Word Processing Lexicography The Semantic Web ◦ ontology learning: to build domain taxonomies and enrich large-scale semantic networks Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Roberto Navigli. Word Sense Disambiguation: A Survey, ACM Computing Surveys, 41(2), 2009Word Sense Disambiguation: A Survey

Lexical Annotation Lexical Annotation is a particular metadata annotation that refers to a semantic resource. Each lexical annotation has the property to own one or more lexical descriptions. Lexical Annotation ◦ assigns meanings to class and property names w.r.t. a semantic resource (WordNet) ◦ derives relationships among source elements Lexical Annotation can be an effective method to solve ambiguity problems! Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical Annotation – an example Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Hypernym of BookSYNVolume Book BTCatalog (Catalog  Book) lexical relationships extracted √ √√ √ BT

The ontology matching problem An ontology is an explicit specification of a conceptualization (Gruber, 1993). The ontology matching process, for two separate and autonomous ontologies, O1 and O2, consists of finding corresponding entities in ontologies O1 and O2 Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Ontology matchers Several ontology matchers have been proposed in litterature, altought the most do not take advantage of the linguistic aspect of the involved ontologies. different meanings In particular, ontology matchers do not discern elements with different meanings. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

The SCARLET matcher Scarlet* is a technique for discovering relationships between two concepts by making use of online available ontologies. Scarlet discovers semantic relationships between concepts by exploiting the entire Semantic Web as a source of background knowledge. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 * SCARLET has been developed by the Knowledge and Media Institute at Milton Keynes, UK.

The SCARLET matcher By using semantic search engines (Swoogle and WATSON), it finds online ontologies containing concepts with the same names as the candidate concepts and then it derives mappings from the relationships in the online ontologies. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1Ontology 2 Online Ontology BA A0A0 B0B0   Legenda anchoring relationship

The SCARLET matcher Scarlet is able to identify disjoint relations, subsumption relations, and correspondences. All relations are obtained by using derivation rules which explore direct relations and also relations deduced by applying subsumption reasoning. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1Ontology 2 Online Ontologies BA A0A0  B0B0  C C0C0  Legenda anchoring relationship

The evaluation of SCARLET On a large-scale, real life data sets SCARLET retrived a precision value of 70% More than half of incorrect anchonring were due to ambiguities. SCARLET is not able to take advantage of the ontological context in which a concept appears. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Lexical Annotation can used to solve the ambiguity problems!

SCARLET + lexical annotation By identifying a meaning (or a set of meanings) for each concept it is possible to, more accurately, compare the concept with the concepts that appear in online ontologies. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

How Lexical Annotation enhances the ontology matching performance Performing lexical annotation on the ontologies involved in the matching process allows: ◦ to detect false positive mappings ◦ to discover new mappings ◦ to identify synonyms and more general classes for a given concept Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Improving precision Improving recall

Lexical annotation improvements 1 - detection of false positive mappings Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1Ontology 2 BA A0A0 B0B0   SYNSET 4 SYNSET 3 SYNSET 2 SYNSET 1 X X If a concept and its anchoring concept have disregarding meanings (i.e. if they do not have the same list of meanings), the anchoring is detect as a false positive.

Lexical annotation improvements 1- detection of false positive mappings Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1 BA A0A0  B0B0  C C0C0  Ontology 2 SYNSET 4 SYNSET 3 SYNSET 2 SYNSET 1 X X

Lexical annotation improvements 2 - new mapping discovery Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1Ontology 2 BA  SYNSET 4 SYNSET 3 SYNSET 2 SYNSET 1 hyponym

Lexical annotation improvements 3 - identification of synonyms and more general concepts Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 Ontology 1Ontology 2 B home house B0B0   SYNSET 4 SYNSET 3 SYNSET 2 SYNSET 1 New anchoring

ALA tool We employ the Automatic Lexical Annotation tool to perform lexical annotation of the ontologies involved in the matching process (source ontologies, online ontologies). ALA combines the output of 4 WSD algorithms and 2 heuristic rules. The combination is a sequential composition: ◦ only the first algorithm is executed on the entire data source, the following algorithms are executed only on the set of concepts that were not disambiguated by the previous ones. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical Annotation Evaluation The application of lexical annotation techniques on the SCARLET results has been tested on two test cases: ◦ NALT+AGROVOC two real life thesauri: the United Nations Food and Agriculture Organization (FAO)’s AGROVOC thesaurus, the United States National Agricultural Library (NAL) Agricultural thesaurus NALT ◦ OAEI 2006 benchmark The benchmark is bibliographic domain, the bibliographic ontologies we took into account are the reference ontology and the Karlsruhe ontology. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical Annotation Evaluation: detection of incorrect anchoring The results of the automatic lexical annotation have been compared with the manual evaluation done by an expert on the entire set of matching. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010 the most are due to the presence of compound nouns

Lexical Annotation Evaluation: new mapping discovery After the execution of lexical annotation, we computed a mapping between two concepts, if we find a relationship between their corresponding meanings in WordNet. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Lexical Annotation Evaluation: comparison We compared our results with a multiontology disambiguation method that has been previously applied on SCARLET Unlike multiontology disambiguation method that retrieves similarity measures, our method offers a definite answer regarding the detection of synonym relationships. Comparing the results, we evaluated some possible thresholds on the similarity measures retrieved by the multiontolgy disambiguation method (0.19 – 0.22). Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Conclusion We proposed and experimentally investigated a method to solve ambiguity problems in the context of ontology matching by using automatic lexical annotation techniques (ALA tool). The method has been applied on SCARLET, a semantic web based matcher. Experimental results have proved that by performing lexical annotation of ontologies we are able to: ◦ to detect false positive mappings ◦ to discover new mappings Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Future work on lex ann + matchers Lexical Annotation is able to identify synonymous and generalization of concepts. Implementing this will give the matcher the possibility to widen the search among online ontologies, thus, improving matching results. In order to cope with more complex ontologies, our method needs to be extended by including the treatment of compound terms and abbreviations (published at ER2009). The method could be coped with any matcher. Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Future perspectives There are several scenarios where we applied lexical annotation ◦ Data integration ◦ Ontology matching ◦ Disambiguation/classification of Google hits New scenarios ◦ blogs ◦ social networks ◦ Mash up ?! Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po ACIIDS - 26/03/2010

Thanks for your attention! ACIIDS - 26/03/2010Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher - Laura Po