Consistency between Metathesaurus and Semantic Network Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister.

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

EA 3888 – Conceptual Modeling of Biomedical Knowledge Faculty of Medicine - University of Rennes 1 Integrating and querying.
Experience with Using the UMLS Semantic Network to Coordinate Controlled Terminologies for a Large Clinical Data Repository James J. Cimino Department.
Ontological analysis of the semantic types Anand Kumar MBBS, PhD IFOMIS, University of Saarland, Germany. BIOMEDICALONTOLOGYBIOMEDICALONTOLOGY.
ECO R European Centre for Ontological Research Ontology-based Error Detection in SNOMED-CT ® Werner Ceusters European Centre for Ontological Research Universität.
Catalina Martínez-Costa, Stefan Schulz: Ontology-based reinterpretation of the SNOMED CT context model Ontology-based reinterpretation of the SNOMED CT.
Semantic indexing in PubMed CERN Workshop on Innovations in Scholarly Communication (OAI8) CERN Workshop on Innovations in Scholarly Communication (OAI8)
1 Natural Language Processing Group HUGs Geneva Start.
Summary Issues and Suggestions Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister Hill National Center for.
The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD.
Ontology Notes are from:
Biomedical Informatics Some Observations on Clinical Data Representation in EHRs Christopher G. Chute, MD DrPH, Mayo Clinic Chair, ICD11 Revision, World.
Multiscale Information Modelling for Heart Morphogenesis Tariq Abdulla 13 th IMEKO TC1-TC7 Joint Symposium 02/09/2010.
Ontologies for Data Integration: A Semantic Web Perspective Training Course Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
FMA: a domain reference ontology Comments on Cornelius Rosse’s talk Anita Burgun WG6 meeting, Rome 29 Apr- 2 May 2005.
Battling Scylla and Charybdis: The Search for Redundancy and Ambiguity in the 2001 UMLS Metathesuarus James J. Cimino Department of Medical Informatics.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entity- Relationship (EER) Model.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
VT. From Basic Formal Ontology to Medicine Barry Smith and Anand Kumar.
Biological Ontologies Neocles Leontis April 20, 2005.
Data and Knowledge Representation Lecture 6 Qing Zeng, Ph.D.
Son of SN Barry Smith. The Virtues of Single Inheritance (= True Hierarchy) better coding clearer instructions better automatic reasoning better definitions.
10 December, 2013 Katrin Heinze, Bundesbank CEN/WS XBRL CWA1: DPM Meta model CWA1Page 1.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2007 National Library of Medicine National Institutes of Health U.S. Dept. of Health.
1 The Refined Semantic Network James Geller Yehoshua Perl New Jersey Institute of Technology.
CASE*Method: Entity Relationship Modeling
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2005 May 16 & 17, 2005 Rachel Kleinsorge.
Stefan Schulz Medical Informatics Research Group
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Alignment of the UMLS Semantic Network with BioTop Methodology and Assessment Stefan Schulz University Medical Center, Freiburg, Germany Elena Beisswanger.
Copyright © 2011 Pearson Education, Inc. Publishing as Pearson Addison-Wesley Chapter 8 The Enhanced Entry-Relationship (EER) Model.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Nancy Lawler U.S. Department of Defense ISO/IEC Part 2: Classification Schemes Metadata Registries — Part 2: Classification Schemes The revision.
1 Enriching and Designing Metaschemas for the UMLS Semantic Network Department of Computer Science New Jersey Institute of Technology Yehoshua Perl James.
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
The UMLS Semantic Network Support for semantic integration and reasoning Workshop UMLS Semantic Network NLM, NIH, Bethesda, 7-8 Apr 2005 Anita Burgun.
updated CmpE 583 Fall 2008 Ontology Integration- 1 CmpE 583- Web Semantics: Theory and Practice ONTOLOGY INTEGRATION Atilla ELÇİ Computer.
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
American Medical Informatics Association Annual Symposium 2001 The Role of Definitions in Biomedical Concept Representation Joshua Michael, José L. V.
Knowledge-Based Semantic Interpretation for Summarizing Biomedical Text Thomas C. Rindflesch, Ph.D. Marcelo Fiszman, M.D., Ph.D. Halil Kilicoglu, M.S.
WHO – IHTSDO: SNOMED CT – ICD-11 coordination: Conditions vs. Situations Stefan Schulz IHTSDO conference Copenhagen, Apr 24, 2012.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
Use of the UMLS in Patient Care James J. Cimino, M.D. Center for Medical Informatics Columbia University.
Data Science for Joint Doctrine Dr. Brand Niemann Director and Senior Data Scientist/Data Journalist Semantic Community Data Science Data Science for Joint.
The ICPS: A taxonomy, a classification, an ontology or an information model? Stefan SCHULZ IMBI, University Medical Center, Freiburg, Germany.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Medinfo 2013 Copenhagen, Denmark
Enabling complex queries to drug information sources through functional composition Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
Shaping a Health Statistics Vision for the 21 st Century 2002 NCHS Data Users Conference 16 July 2002 Daniel J. Friedman, PhD Massachusetts Department.
Metadata Common Vocabulary a journey from a glossary to an ontology of statistical metadata, and back Sérgio Bacelar
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
The UMLS Semantic Network Alexa T. McCray Center for Clinical Computing Beth Israel Deaconess Medical Center Harvard Medical School
Chapter 16 UML Class Diagrams 1CS6359 Fall 2012 John Cole.
Experience with Using the UMLS Semantic Network to Coordinate Controlled Terminologies for a Large Clinical Data Repository James J. Cimino Department.
Oncologic Pathology in Biomedical Terminologies Challenges for Data Integration Olivier Bodenreider National Library of Medicine Bethesda, Maryland -
A Rule Driven Bi-Directional Translation System for Remapping Queries and Result Sets Between a Mediated Schema and Heterogeneous Data Sources R. Shaker.
International Workshop 28 Jan – 2 Feb 2011 Phoenix, AZ, USA Ontology in Model-Based Systems Engineering Henson Graves 29 January 2011.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U New York State Center of Excellence in Bioinformatics & Life Sciences R T U.
Oncology in SNOMED CT NCI Workshop The Role of Ontology in Big Cancer Data Session 3: Cancer big data and the Ontology of Disease Bethesda, Maryland May.
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
The “Common Ontology” between ICD 11 and SNOMED CT
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
Terminology Quality Evaluation S60 Rashmie Abeysinghe Joint work with
The Unified Medical Language System Overview
Ontological analysis of the semantic types
DyKOSMap : a tool to manage ontology alignment dynamics
Stefan SCHULZ IMBI, University Medical Center, Freiburg, Germany
Towards a Unified Ontology for Biomedical Modeling and Simulation
Presentation transcript:

Consistency between Metathesaurus and Semantic Network Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA

2 Overview u Defining consistency u What does inconsistency mean? u Testing consistency l Comparing Metathesaurus relations to SN relations l Aligning Metathesaurus concepts and semantic types l Semantic type distribution of sets of descendants of Metathesaurus concepts u Suggestions l Enforcement mechanism l Ontology of relationships l CVF

Two levels in the UMLS

4 The UMLS: a two-level structure Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Semantic Type c Concept 2

Heart Concepts Metathesaurus Esophagus Left Phrenic Nerve Heart Valves Fetal Heart Medias- tinum Saccular Viscus Angina Pectoris Cardiotonic Agents Tissue Donors Anatomical Structure Fully Formed Anatomical Structure Embryonic Structure Body Part, Organ or Organ Component Pharmacologic Substance Disease or Syndrome Population Group Semantic Types Semantic Network

6 Relationships can inherit semantics Semantic Network Metathesaurus Adrenal Cortex Adrenal Cortical hypofunction Disease or Syndrome Body Part, Organ, or Organ Component Pathologic Function isa Biologic Function isa Fully Formed Anatomical Structure isa location of

Defining consistency

8 The consistency “square” Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2

9 The categorization link Semantic Network Professional Society Metathesaurus Salmonella American Medical Association Organism Bacterium isa is an instance of

10 Semantic Network relations u 54 types of relationships u 558 asserted relations (SRSTR) u 6703 fully expanded relations (SRSTRE*) Semantic Network Disease or Syndrome Body Part, Organ, or Organ Component Pathologic Function isa Biologic Function isa Fully Formed Anatomical Structure isa location of

11 Metathesaurus relations u REL vs. RELA u Not always labeled u 106 additional types of relationships u ~7 M symbolic relations Heart Concepts Metathesaurus Esophagus Left Phrenic Nerve Heart Valves Fetal Heart Medias- tinum Saccular Viscus Angina Pectoris Cardiotonic Agents Tissue Donors

12 Metathesaurus relations u Recorded l at the term level: from source vocabularies l at the concept level: from Metathesaurus editors u Aggregated at the concept level Oat cell carcinoma of lung Carcinoma, Small Cell SCLC Lung structure Lung Pulmonary has_finding_site Oat cell carcinoma of lung Carcinoma, Small Cell SCLC Lung structure Lung Pulmonary has_finding_site

13 Not all relationships in hierarchies are isa (1) Autoimmune Diseases Addison’s disease due to autoimmunity Tuberculous Addison’s disease is generally a

14 Not all relationships in hierarchies are isa (2) Environment and Public Health [G03] Public Health [G03.850] Accidents [G ] Accident Prevention [G ] + Accidental Falls [G ] Accidents, Aviation [G ] […] Drowning [G ] +

15 Defining consistency u SN rel. and Meta rel. must have the same direction u SN rel. and Meta rel. must be of the same type (both hierarchical or associative) u Meta rel. must be the same as SN rel. or one of its descendants Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2

16 Examples of consistent relations Lung Body Part, Organ, or Organ Component Disease or Syndrome Pneumonia has_location

17 Examples of consistent relations Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2 affects treats

What does inconsistency mean?

19 The consistency “square” revisited Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2Concept 1 Metathesaurus Semantic Network Semantic Type a Semantic Type b Concept 2 ? ? ? ?

20 What does inconsistency mean? u Inaccurate/missing Semantic Network relation u Inaccurate (/missing?) categorization u Inaccurate Metathesaurus relation

Testing consistency

22 (A) Consistency of associative relations [McCray & Bodenreider, 2002]

23 Results u 6894 pairs of related concepts l 4496 (65%): a SN relation can be inferred unambiguously n Validity confirmed in 1981 cases n 2515 not labeled in the Metathesaurus l 1491 (22%): multiple possible SN relationships n multiple possible Metathesaurus relationships l 907 (13%): inconsistency SN/Meta relationships n 372: no SN relationship between the STs n 415: inconsistent SN/Meta relationship type (REL) n 120: inconsistent SN/Meta relationship attribute (RELA)

24 (B) Consistency of hierarchical relations u Relations used l SN: isa l Categorization: isa l Metathesaurus: PAR/CHD + RB/RN u Hypothesis l For a pair of (ST, C), the concepts categorized by ST (and its descendants) correspond to the descendants of the concept C l In the set of descendants of C, expected STs are the ST of C (and its descendants) ➊ ➋

25 ST-based classes vs. descendants u Semantic type l List of all concepts having this semantic type u Concept l List of all descendants u Comparing the 2 sets l Intersection of the 2 sets ➊ [Bodenreider & Burgun, 2004]

26 Analyzing inconsistencies AmphibianAmphibia 1126 descendants 1135 concepts 1124 in common Tadpole Invertebrate Toad licking Pharmacologic Substance Miscategor- ization (?) Wrong hierarchical relation Missing hierarchical relation Miscategor- ization Rana unclassified Class Reptilia Amphibians and Reptiles

27 Semantic types of descendants u Concept l Set of all descendants u Distribution of semantic types in the set l Allowable STs: ST of C and its descendants (strict) or ST from the same semantic group (loose) [Mougin & Bodenreider, 2005] ➋

28 Analyzing inconsistencies u 26,584 concepts studied u 59% of their descendants have a semantic type incompatible with that of the original concept Reaction belligerent Finding Hostility Mental Process

29 # # C Neoplasm of placenta (disorder) (neop) # * B: 190 C |ST|acab| 5.50|incp C |ST|anab| 1.50|incp C |ST|cgab| 76.50|incp C |ST|dsyn| 27.50|incp C |ST|inpo| 1.00|incp C |ST|neop| 76.50|comp C |ST|patf| 1.50|incp C |SG|DISO| |comp # Analyzing inconsistencies

Suggestions

31 Aligning SN and Meta relationships u 54 types of SN relationships u 106 additional types of Metathesaurus relationships u Some are simply synonymous (caused_by / due_to; follows / temporally_follows) u Some are specialized relationships (manifestation_of / definitional_manifestation_of) u Many types of mapping relationships, not in SN

32 Add classification information to SN u Explicit classificatory principles (in addition to textual definition and examples) u Abandon economy principle and return to JEPD (jointly exhaustive/pairwise disjoint) approach

33 Metathesaurus editing environment u Use SN/Meta relation consistency as a constraint for assigning semantic types u Use SN relations to suggest labels for unspecified Meta relations u Use SN/Meta relation consistency to guide the review by the Metathesaurus editors l Inaccurate categorization? l Inaccurate Metathesaurus relation?

Conclusions

35 Conclusions u Simultaneously l Improve SN l Improve categorization u ST assignment can be automated in part

36 Some references u McCray AT, Bodenreider O. A conceptual framework for the biomedical domain. In: Green R, Bean CA, Myaeng SH, editors. The semantics of relationships: an interdisciplinary perspective. Boston: Kluwer Academic Publishers; p u Bodenreider O, Burgun A. Aligning knowledge sources in the UMLS: Methods, quantitative results, and applications. Medinfo 2004:

37 Some references u Burgun A, Bodenreider O. Aspects of the taxonomic relation in the biomedical domain. In: Welty C, Smith B, editors. Collected papers from the Second International Conference "Formal Ontology in Information Systems": ACM Press; p u Mougin F, Bodenreider O. Approaches to eliminating cycles in the UMLS Metathesaurus: Naive vs. formal. Proceedings of AMIA Annual Symposium 2005:(submitted).

Medical Ontology Research Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA