Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.

Slides:



Advertisements
Similar presentations
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
Advertisements

Experience with Using the UMLS Semantic Network to Coordinate Controlled Terminologies for a Large Clinical Data Repository James J. Cimino Department.
Knowledge Graph: Connecting Big Data Semantics
Semantic indexing in PubMed CERN Workshop on Innovations in Scholarly Communication (OAI8) CERN Workshop on Innovations in Scholarly Communication (OAI8)
Indexing the Biomedical Literature in a Time of Increased Demand and Limited Resources BioASQ Workshop September 27, 2013 Alan R. Aronson Lister Hill Center,
1 Natural Language Processing Group HUGs Geneva Start.
Summary Issues and Suggestions Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister Hill National Center for.
The Role of the UMLS in Vocabulary Control CENDI Conference “Controlled Vocabulary and the Internet” Stuart J. Nelson, MD.
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Ontology Notes are from:
Overview of Biomedical Informatics Rakesh Nagarajan.
Battling Scylla and Charybdis: The Search for Redundancy and Ambiguity in the 2001 UMLS Metathesuarus James J. Cimino Department of Medical Informatics.
Brian A. Carlsen Apelon, Inc. Tools For Classification Integration Networked Knowledge Organization Systems/Services Workshop June 28, 2001.
1 CIS607, Fall 2006 Semantic Information Integration Instructor: Dejing Dou Week 10 (Nov. 29)
Social Pharmacy and Pharmacoepidemiology Lister Hill National Center for Biomedical Communications Text-based Discovery in Biomedicine The Architecture.
CSE 730 Information Retrieval of Biomedical Data The use of medical lexicon in biomedical IR.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2007 National Library of Medicine National Institutes of Health U.S. Dept. of Health.
Integrating bio-ontologies with a workflow/Petri Net model to qualitatively represent and simulate biological systems Mor Peleg, Irene Gbashvili, and Russ.
1 Betsy L. Humphreys, MLS Betsy L. Humphreys, MLS National Library of Medicine National Library of Medicine National Institutes of Health National Institutes.
Indexing 1/2 BDK12-3 Information Retrieval William Hersh, MD Department of Medical Informatics & Clinical Epidemiology Oregon Health & Science University.
“Semantic Web” Applications in Bioinformatics Amr AL-Hossary M.B.B.Ch.
Unified Medical Language System® (UMLS®) NLM Presentation Theater MLA 2005 May 16 & 17, 2005 Rachel Kleinsorge.
Linking Diseases and Genes through Informatics Knowledge Bases and Ontologies Joyce A. Mitchell, Ph.D. National Library of Medicine University of Missouri.
Grant Number: IIS Institution of PI: Arizona State University PIs: Zoé Lacroix Title: Collaborative Research: Semantic Map of Biological Data.
Betsy L. Humphreys Betsy L. Humphreys Associate Director for Library Operations NLM, NIH, HHS NLM, NIH, HHS National Library.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Annual reports and feedback from UMLS licensees Kin Wah Fung MD, MSc, MA The UMLS Team National Library of Medicine Workshop on the Future of the UMLS.
Integrating Clinical Systems by Integrating Controlled Vocabularies James J. Cimino, MD Center for Medical Informatics Columbia University New York, New.
IProLINK – A Literature Mining Resource at PIR (integrated Protein Literature INformation and Knowledge ) Hu ZZ 1, Liu H 2, Vijay-Shanker K 3, Mani I 4,
Knowledge Representation and Indexing Using the Unified Medical Language System Kenneth Baclawski* Joseph “Jay” Cigna* Mieczyslaw M. Kokar* Peter Major.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
1 st June 2006 St. George’s University of LondonSlide 1 Using UMLS to map from a Library to a Clinical Classification: Improving the Functionality of a.
1 Enriching and Designing Metaschemas for the UMLS Semantic Network Department of Computer Science New Jersey Institute of Technology Yehoshua Perl James.
March 24, Integrating genomic knowledge sources through an anatomy ontology Gennari JH, Silberfein A, and Wiley JC Pac Symp Biocomputing 2005:
Survey of Medical Informatics CS 493 – Fall 2004 September 27, 2004.
Lexical Tools Briefing The Lexical Systems Group NLMNLM. LHNCBC. CGSBLHNCBCCGSB June, 2006.
LexBIG Release Overview Aug 21, LexBIG Context Project Goals for Sept –Incremental point release of LexBIG infrastructure to support EVS activities.
Value Set Resolution: Build generalizable data normalization pipeline using LexEVS infrastructure resources Explore UIMA framework for implementing semantic.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
UMLS Unified Medical Language System. What is UMLS? A Unified knowledge representation system Project of NLM Large scale Distributed First launched in.
This material was developed by Duke University, funded by the Department of Health and Human Services, Office of the National Coordinator for Health Information.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Use of the UMLS in Patient Care James J. Cimino, M.D. Center for Medical Informatics Columbia University.
Consistency between Metathesaurus and Semantic Network Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister.
The Gene Ontology and its insertion into UMLS Jane Lomax.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Medinfo 2013 Copenhagen, Denmark
Enabling complex queries to drug information sources through functional composition Olivier Bodenreider Lister Hill National Center for Biomedical Communications.
Using Domain Ontologies to Improve Information Retrieval in Scientific Publications Engineering Informatics Lab at Stanford.
A LexWiki-based Representation and Harmonization Framework for caDSR Common Data Elements Guoqian Jiang, Ph.D. Robert Freimuth, Ph.D. Harold Solbrig Mayo.
- EVS Overview - Biomedical Terminology and Ontology Resources Frank Hartel, Ph.D. Director, Enterprise Vocabulary Services NCI Center for Bioinformatics.
NLM Value Set Authority Center Curation and delivery of value sets for eMeasures eMeasures Issues Group (eMIG) May 24, 2012 NLM.
The UMLS Semantic Network Alexa T. McCray Center for Clinical Computing Beth Israel Deaconess Medical Center Harvard Medical School
Automatically Identifying Candidate Treatments from Existing Medical Literature Catherine Blake Information & Computer Science University.
Protégé 3.4 Plug-in for Editing and Maintaining the NCI Thesaurus Protégé Conference June 23, 2009 Amsterdam Sherri de Coronado, Gilberto Fragoso.
Oncologic Pathology in Biomedical Terminologies Challenges for Data Integration Olivier Bodenreider National Library of Medicine Bethesda, Maryland -
Health IT Workforce Curriculum Version 1.0 Fall Networking and Health Information Exchange Unit 4a Basic Health Data Standards Component 9/Unit.
School of Mechanical, Materials and Manufacturing Engineering About this course Biomedical industries provide a rich diversity.
Joined up ontologies: incorporating the Gene Ontology into the UMLS.
Methods We employ the UMLS Metathesaurus to annotate ICD-9 codes to MedDRA preferred terms (PTs) using the three-step process below. The mapping was applied.
Oncology in SNOMED CT NCI Workshop The Role of Ontology in Big Cancer Data Session 3: Cancer big data and the Ontology of Disease Bethesda, Maryland May.
Semantic Graph Mining for Biomedical Network Analysis: A Case Study in Traditional Chinese Medicine Tong Yu HCLS
Introduction to PubChem BioAssay
UNIFIED MEDICAL LANGUAGE SYSTEMS (UMLS)
The UMLS and the Semantic Web
Kenneth Baclawski et. al. PSB /11/7 Sa-Im Shin
The Unified Medical Language System Overview
Overview Gene Ontology Introduction Biological network data
Knowledge Representation, Structuring and Ontology Development
Presentation transcript:

Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical ontologies and knowledge bases ISMB 2002 Fifth Annual Bio-Ontologies Meeting August 8, 2002

Lister Hill National Center for Biomedical Communications 2 Introduction 1 u Biomedical knowledge l Terminologies(names) l Ontologies(objects) l Knowledge bases(facts) u Common features l Terms / Concepts l Inter-concept relationships n Hierarchical n Associative

Lister Hill National Center for Biomedical Communications 3 Introduction 2 u Challenges l Volume of information n concepts n relationships l Orientation n Mapping to concepts n Visualizing concept spaces n Navigating concept spaces term knowledge

Lister Hill National Center for Biomedical Communications 4 Introduction 3 u SemNav l UMLS browser l Entry point: biomedical term l Display related concepts l Display properties of interconcept relationships l Allow navigation among concepts u GenNav l GO browser l Entry point: GO term or gene product name/symbol l Display related GO terms and gene products l Display properties of term/term and term/gene product relationships l Allow navigation between GO terms and gene products

Lister Hill National Center for Biomedical Communications 5 Outline u Background l Unified Medical Language System (UMLS) l Gene Ontology u Overview of the browsers l SemNav l GenNav u Common features u Differences

UMLS and GO

Lister Hill National Center for Biomedical Communications 7 Unified Medical Language System u Developed at NLM since 1990 u 13 th edition in 2002 u Integrates some 60 terminological resources l Clinical vocabularies (including specialties) l Core terminologies (anatomy, drugs, med. devices) l Administrative terminologies, standards u Integration l Synonymous terms are clustered in a concept l Hierarchies (trees) are combined in a graph structure

Lister Hill National Center for Biomedical Communications 8 Terminology integration Terms Duchenne muscular dystrophy MeSH, SNOMED CTV3, Jablonski, CRISP, DxPlain, MedDRA, LOINC pseudohypertrophic muscular dystrophy MeSH, CTV3 SNOMED X-liked recessive muscular dystrophy Jablonski Duchenne de Boulogne muscular dystrophy Jablonski Duchenne’s muscular dystrophy COSTAR severe generalized familial muscular dystrophy SNOMED Duchenne type progressive muscular dystrophy SNOMED

Lister Hill National Center for Biomedical Communications 9 Terminology integration Relationships UMLS Adrenal Cortex Diseases Hypoadrenalism Adrenal Gland Hypofunction Adrenal cortical hypofunction Adrenal Gland Diseases Addison’s Disease SNOMED MeSH AOD Read Codes

Lister Hill National Center for Biomedical Communications 10 UMLS u Two-level structure l Semantic Network n 134 Semantic Types (STs) n 54 types of relationships among STs l Metathesaurus n 800,000 concepts n ~10 M inter-concept relationships l Link = categorization Concept Metathesaurus Semantic Type Semantic Network categorization

Heart Concepts Metathesaurus Esophagus Left Phrenic Nerve Heart Valves Fetal Heart Medias- tinum Saccular Viscus Angina Pectoris Cardiotonic Agents Tissue Donors Anatomical Structure Fully Formed Anatomical Structure Embryonic Structure Body Part, Organ or Organ Component Pharmacologic Substance Disease or Syndrome Population Group Semantic Types Semantic Network

Lister Hill National Center for Biomedical Communications 12 Gene Ontology u Developed by the GO Consortium u Several components l Ontology (~11,000 concepts) n Molecular functions n Cellular components n Biological processes l Gene products (~125,000) l Associations between Gene products and GO concepts (~357,000)

SemNav

MeSH Browser

Lister Hill National Center for Biomedical Communications 18 SemNav Visualization options

Lister Hill National Center for Biomedical Communications 23 SemNav Relationships Dystrophin Concepts Semantic Types Muscular Dystrophy, Duchenne 55 Amino Acid, Peptide or Protein Disease or Syndrome Biologically Active Substance

GenNav

Material and Methods

Common features and differences

Lister Hill National Center for Biomedical Communications 31 Mapping query terms u Mapping terms to concepts l Matching criteria (exact, approximate) l Normalization techniques n work well on clinical terms n less applicable to gene names u Query disambiguation With semantic type in SemNav With semantic type in SemNav With species in GenNav With species in GenNav

Lister Hill National Center for Biomedical Communications 32 Visualization u Graph vs. Trees (Forest) l Multiple inheritance is better visualized by graphs than by trees l Off-the-shelf, freely available graph visualization packages are available (GraphViz) u Need to reduce complexity l Transitive reduction on complex graphs l Feature selection e.g., a given vocabulary in SemNav e.g., a given vocabulary in SemNav e.g., a given species in GenNav e.g., a given species in GenNav

Lister Hill National Center for Biomedical Communications 33 Navigation u Tool for exploration Navigation among concepts ( SemNav and GenNav ) Navigation among concepts ( SemNav and GenNav ) Navigation between two poles (Gene products and GO concepts in GenNav ) Navigation between two poles (Gene products and GO concepts in GenNav )  Self-contained ( SemNav ) or opened to external resources ( GenNav )

Conclusions

Lister Hill National Center for Biomedical Communications 35 Conclusions  Most of the lessons learned while developing SemNav (for browsing general biomedical knowledge) were applicable to GenNav (for browsing molecular biology knowledge) u The lexical techniques suitable for mapping text to clinical terminologies require adaptation to the specificity of molecular biology terminologies

Contact: Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA SemNav * ► Resources ► Semantic Navigator ( * free UMLS registration required) GenNav