1 M APPING C OMPOSITION FOR M ATCHING L ARGE L IFE S CIENCE O NTOLOGIES A NIKA G ROSS, M ICHAEL H ARTUNG, T ORALF K IRSTEN, E RHARD R AHM 29 TH J ULY 2011,

Slides:



Advertisements
Similar presentations
Controlled Vocabularies in TELPlus Antoine ISAAC Vrije Universiteit Amsterdam EDLProject Workshop November 2007.
Advertisements

OAEI 2007: Library Track Results Antoine Isaac, Lourens van der Meij, Shenghui Wang, Henk Matthezing Claus Zinn, Stefan Schlobach, Frank van Harmelen Ontology.
CoopIS2001 Trento, Italy The Use of Machine-Generated Ontologies in Dynamic Information Seeking Giovanni Modica Avigdor Gal Hasan M. Jamil.
Logic Synthesis – 3 Optimization Ahmed Hemani Sources: Synopsys Documentation.
Maurice Hermans.  Ontologies  Ontology Mapping  Research Question  String Similarities  Winkler Extension  Proposed Extension  Evaluation  Results.
Optimizing Similarity Computations for Ontology Matching - Experiences from GOMMA Michael Hartung, Lars Kolb, Anika Groß, Erhard Rahm Database Research.
Matching Systems ● SAMBO ● Falcon ● DSSim ● RiMOM ● ASMOV ● Anchor-Flood ● AgreementMaker.
Ontology alignment Patrick Lambrix Linköpings universitet.
Reducing the Cost of Validating Mapping Compositions by Exploiting Semantic Relationships Eduard C. Dragut Ramon Lawrence Eduard C. Dragut Ramon Lawrence.
A N I NTERACTIVE C LUSTERING - BASED A PPROACH TO I NTEGRATING S OURCE Q UERY I NTERFACES ON THE D EEP W EB Wensheng Wu Clement Yu AnHai Doan Weiyi Meng.
A S EMANTIC A PPROACH TO D ISCOVERING S CHEMA M APPING Yuan An, Alex Borgida, Renee J. Miller, and John Mylopoulos Presented by: Kristine Monteith.
Merging Models Based on Given Correspondences Rachel A. Pottinger Philip A. Bernstein.
FMA: a domain reference ontology Comments on Cornelius Rosse’s talk Anita Burgun WG6 meeting, Rome 29 Apr- 2 May 2005.
Schema Mapping: Experiences and Lessons Learned Yihong Ding Data Extraction Group Brigham Young University Sponsored by NSF.
Flow Network Models for Sub-Sentential Alignment Ying Zhang (Joy) Advisor: Ralf Brown Dec 18 th, 2001.
Generating Application Ontologies from Reference Ontologies Marianne Shaw Todd Detwiler Jim Brinkley Dan Suciu University of Washington.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya Fridman Noy and Mark A. Musen.
Realizing the potential of reference ontologies for the semantic web Jim Brinkley June 29, 2007.
Annotating Documents for the Semantic Web Using Data-Extraction Ontologies Dissertation Proposal Yihong Ding.
PROMPT: Algorithm and Tool for Automated Ontology Merging and Alignment Natalya F. Noy and Mark A. Musen.
Multi-Concept Alignment and Evaluation Shenghui Wang, Antoine Isaac, Lourens van der Meij, Stefan Schlobach Ontology Matching Workshop Oct. 11 th, 2007.
1 A Tool to Support Ontology Creation Based on Incremental Mini-ontology Merging Zonghui Lian.
Xiaomeng Su & Jon Atle Gulla Dept. of Computer and Information Science Norwegian University of Science and Technology Trondheim Norway June 2004 Semantic.
Putting ontology alignment in context: Usage scenarios, deployment and evaluation in a library case Antoine Isaac Henk Matthezing Lourens van der Meij.
Objects What are Objects Observations
Copyright © 2010 Accenture All Rights Reserved. Accenture, its logo, and High Performance Delivered are trademarks of Accenture. Multiple Ontologies in.
Semantic Interoperability Jérôme Euzenat INRIA & LIG France Natasha Noy Stanford University USA.
Speeding Up Batch Alignment of Large Ontologies Using MapReduce Uthayasanker Thayasivam and Prashant Doshi Dept. of Computer Science University of Georgia.
Ontology Matching Basics Ontology Matching by Jerome Euzenat and Pavel Shvaiko Parts I and II 11/6/2012Ontology Matching Basics - PL, CS 6521.
Semantic Matching Pavel Shvaiko Stanford University, October 31, 2003 Paper with Fausto Giunchiglia Research group (alphabetically ordered): Fausto Giunchiglia,
An Integrated Approach to Extracting Ontological Structures from Folksonomies Huairen Lin, Joseph Davis, Ying Zhou ESWC 2009 Hyewon Lim October 9 th, 2009.
Healthcare Services as Collective Activity Susan Wakenshaw Xiao MA.
BACKGROUND KNOWLEDGE IN ONTOLOGY MATCHING Pavel Shvaiko joint work with Fausto Giunchiglia and Mikalai Yatskevich INFINT 2007 Bertinoro Workshop on Information.
Automatic Lexical Annotation Applied to the SCARLET Ontology Matcher Laura Po and Sonia Bergamaschi DII, University of Modena and Reggio Emilia, Italy.
Machine Learning Approach for Ontology Mapping using Multiple Concept Similarity Measures IEEE/ACIS International Conference on Computer and Information.
12th of October, 2006KEG seminar1 Combining Ontology Mapping Methods Using Bayesian Networks Ontology Alignment Evaluation Initiative 'Conference'
Ontology Views An Update A BISTI Collaborative RO1 with the National Center for Biomedical Ontology James F. Brinkley, PI Structural Informatics Group.
Page 1 Composing Mappings between Schemas using a Reference Ontology - ODBASE’04 - Eduard Dragut, Ramon Lawrence Composing Mappings between Schemas using.
ICS-FORTH January 11, Thesaurus Mapping Martin Doerr Foundation for Research and Technology - Hellas Institute of Computer Science Bath, UK, January.
Semantic Matching Fausto Giunchiglia work in collaboration with Pavel Shvaiko The Italian-Israeli Forum on Computer Science, Haifa, June 17-18, 2003.
Semantic Enrichment of Ontology Mappings: A Linguistic-based Approach Patrick Arnold, Erhard Rahm University of Leipzig, Germany 17th East-European Conference.
Towards an ecosystem of data and ontologies Mathieu d’Aquin and Enrico Motta Knowledge Media Institute The Open University.
Theory and Application of Database Systems A Hybrid Approach for Extending Ontology from Text He Wei.
10/18/20151 Business Process Management and Semantic Technologies B. Ramamurthy.
Combining terminology resources and statistical methods for entity recognition: an evaluation Angus Roberts, Robert Gaizauskas, Mark Hepple, Yikun Guo.
Minor Thesis A scalable schema matching framework for relational databases Student: Ahmed Saimon Adam ID: Award: MSc (Computer & Information.
SYMPOSIUM ON SEMANTICS IN SYSTEMS FOR TEXT PROCESSING September 22-24, Venice, Italy Combining Knowledge-based Methods and Supervised Learning for.
Ontology Alignment. Ontologies in biomedical research many biomedical ontologies e.g. GO, OBO, SNOMED-CT practical use of biomedical ontologies e.g. databases.
7-6 Function Operations Objective 2.01
Element Level Semantic Matching Pavel Shvaiko Meaning Coordination and Negotiation Workshop, ISWC 8 th November 2004, Hiroshima, Japan Paper by Fausto.
Collocations and Terminology Vasileios Hatzivassiloglou University of Texas at Dallas.
1 Aligning the Parasite Experiment Ontology and the Ontology for Biomedical Investigations Using AgreementMaker Valerie Cross, Cosmin Stroe Xueheng Hu,
Concept-based P2P Search How to find more relevant documents Ingmar Weber Max-Planck-Institute for Computer Science Joint work with Holger Bast Torino,
AIFB Ontology Mapping I3CON Workshop PerMIS August 24-26, 2004 Washington D.C., USA Marc Ehrig Institute AIFB, University of Karlsruhe.
BOOTSTRAPPING INFORMATION EXTRACTION FROM SEMI-STRUCTURED WEB PAGES Andrew Carson and Charles Schafer.
Intelligent Database Systems Lab 國立雲林科技大學 National Yunlin University of Science and Technology 1 Automatic Document Indexing in Large Medical Collections.
Instance Discovery and Schema Matching With Applications to Biological Deep Web Data Integration Tantan Liu, Fan Wang, Gagan Agrawal {liut, wangfa,
GoRelations: an Intuitive Query System for DBPedia Lushan Han and Tim Finin 15 November 2011
Of 24 lecture 11: ontology – mediation, merging & aligning.
A session-based approach for aligning large ontologies Patrick Lambrix, Rajaram Kaliyaperumal Linköping University.
2016/9/301 Exploiting Wikipedia as External Knowledge for Document Clustering Xiaohua Hu, Xiaodan Zhang, Caimei Lu, E. K. Park, and Xiaohua Zhou Proceeding.
Using Partial Reference Alignments to Align Ontologies
Terminology Quality Evaluation S60 Rashmie Abeysinghe Joint work with
Doron Goldfarb & Yann LE FRANC
CSc4730/6730 Scientific Visualization
Mapping Ontology classes to Wordnet synsets
[jws13] Evaluation of instance matching tools: The experience of OAEI
Actively Learning Ontology Matching via User Interaction
Chaitali Gupta, Madhusudhan Govindaraju
Business Process Management and Semantic Technologies
Presentation transcript:

1 M APPING C OMPOSITION FOR M ATCHING L ARGE L IFE S CIENCE O NTOLOGIES A NIKA G ROSS, M ICHAEL H ARTUNG, T ORALF K IRSTEN, E RHARD R AHM 29 TH J ULY 2011, ICBO, B UFFALO

2 MeSH GALEN O NTOLOGIES Multiple interrelated ontologies in a domain (e.g. anatomy) UMLS SNOMED NCI Thesaurus Mouse Anatomy FMA Identify overlapping information between ontologies Create mappings

3 O NTOLOGY M ATCHING Manual creation of mappings between very large ontologies is too labor-intensive Semi-automatic generation of semantic correspondences (linguistic, structural, instance-based ontology matching) →Interrelation of ontologies →Integration of heterogeneous information sources Matching Mapping sim(O 1.a, O 2.b) = 0.8 sim(O 1.a, O 2.c) = 0.5 sim(O 1.c, O 2.c) = 1.0 further input, e.g. instances, dictionary … O1O1 O2O2

4 ? Indirect composition-based matching Via intermediate ontology (IO): important hub ontology, synonym dictionary, … C OMPOSING MA_ UBERON: NCI_C32239 Synonym: Atlas Name: atlas Name: C1 Vertebra Name: cervical vertebra 1Synonym: cervical vertebra 1 Synonym: C1 vertebra →Find new correspondences via composition →Reuse existing mappings to →Increase match quality →Save computation time IO O1O1 O2O2

5 Composition-based ontology matching approach, reuse of previously determined mappings →composeMatch Optional match step to improve composition-based match quality → extendMatch Use of ontology and mapping operators: compose, match and extract Evaluation: indirect matching of MA and NCIT using large intermediate ontologies (UMLS, FMA, Uberon, RadLex) C ONTRIBUTIONS

6 Use mappings to intermediate ontologies IO 1, …, IO k to indirectly match O 1 and O 2 Reduce matching effort by reusing mappings to IO → very fast composition I NDIRECT M ATCHING... IO 1 IO 2 IO k O1O1 O2O2... O1O1 O2O2 OnOn HO O new →IO should have a significant overlap with O 1 and O 2 →IO 1, …, IO k may complement each other →Centralized hub HO →many mappings to other ontologies →O new aligned with any O i via HO

7 O1O1 IO 1 O2O2 occ = 1: CM O1,O2 = {(a,a),(b,b),(c,c)} occ = 2: CM O1,O2 = {(a,a)} Input: Two ontologies O 1 and O 2, list of intermediate ontologies IO 1 … IO k, occurrence count occ Output: Composed mapping CM O1,O2 (1) C OMPOSE M ATCH a bc de a b gh a bc d f a ic IO 2 MapList  empty for each IO i  IO do M O1,IOi  getMapping(O 1, IO i ) return merge (MapList, occ) MapList.add( compose (M O1,IOi, M IOi,O2 )) M IOi,O2  getMapping(IO i, O 2 ) end for MapList (c,c ), (a,a) (a,a), (b,b)

8 //Direct Mapping DM ΔO1ΔO2  match (ΔO 1, ΔO 2 ) EM O1,O2 (b,b) (a,a) (c,c) (d,d) EM O1,O2  merge ({CM O1,O2, DM ΔO1ΔO2 }, 1) return EM O1,O2 Input: Two ontologies O 1 and O 2, composed mapping CM O1,O2 Output: Extended Mapping EM O1,O2 (2) E XTEND M ATCH O1O1 O2O2 a bc de a bc d f ΔO 1  extract (O 1, CM O1,O2 ) ΔO 2  extract (O 2, inverse (CM O1,O2 )) CM O1,O2 (b,b) (a,a) (c,c) DM O1,O2 (d,d)

9 E VALUATION S ETUP Match problem Adult Mouse Anatomy (MA) NCI Thesaurus Anatomy part (NCIT) Preprocessing Normalization Linguistic Matcher (Name, synonyms, Trigram t = 0.8) Selection & Postprocessing Uberon UMLS MA NCIT RadLex FMA Gold standard ~1500 correspondences Precompute mappings using a match strategy ~5000 ~88,000 ~30,800 ~81,000 ~2,700~3,300 |concepts|

10 M APPING S TATISTICS 80% of MA48% of NCIT MA NCIT Uberon Is there a good coverage of MA and NCIT by intermediate ontologies? High overlap of covered MA and NCIT concepts → promising for composition-based match results

11 OAEI A NATOMY T RACK AgrMaker 87.7%

12 Direct match result compared to composeMatch via each hub Additional matching of unmatched parts (extendMatch) E VALUATION 88.2% 86% Uberon & UMLS → best evaluated intermediate ontologies Intermediate Ontology IO

13 Combination of the four composed mappings Correspondences have to occur in at least 1,…,4 mappings E VALUATION union(occ=1) F-Measure90.2 Precision92.7 Recall87.8 Higher occurrences → Recall ↓ extendMatch → Recall ↑

14 Composition-based approach for indirect matching of large life science ontologies (composeMatch, extendMatch) Reuse mappings for improved match efficiency and quality (>90%) Evaluated several intermediate ontologies →Uberon and UMLS: very effective, suited as hub ontologies in the anatomy domain C ONCLUSIONS

15 F UTURE W ORK Investigate composition-based ontology matching for further domains Study the impact of additional mappings Determined by structural matching Existing mappings from BioPortal

16 M APPING C OMPOSITION FOR M ATCHING L ARGE L IFE S CIENCE O NTOLOGIES ? F UNDING German Research Foundation Grant RA497/18-1 “E VOLUTION OF O NTOLOGIES AND M APPINGS ”

17 S. Zhang, O.Bodenreider "Alignment of multiple ontologies of anatomy: Deriving indirect mappings from direct mappings to a reference." AMIA Annual Symposium (2005) A. Tordai, A. Ghazvinian, J. van Ossenbruggen, M.A. Musen, N.F. Noy "Lost in Translation? Empirical Analysis of Mapping Compositions for Large Ontologies." Proc. Ontology Matching ISWC (2010) Our work: Use multiple complementing intermediate ontologies + an additional extendMatch to improve recall of compose R ELATED W ORK

18 NameSynonym Matcher B ACKUP Synonym1 Name Synonym1 Synonym2 Concept MA Concept NCIT sim = max(sim 1,sim 2,...)