EA 3888 – Conceptual Modeling of Biomedical Knowledge Faculty of Medicine - University of Rennes 1 Integrating and querying.

Slides:



Advertisements
Similar presentations
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Advertisements

AT THE CELLULAR LEVEL: CHEMISTRY BECOMES BIOLOGY
Discovering Disease Associations using a Biomedical Semantic Web: Integration and Ranking One of the principal goals of biomedical research is to elucidate.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Introduction to Metabolism
Consistent and standardized common model to support large-scale vocabulary use and adoption Robust, scalable, and common API to reduce variation in clinical.
RDB2RDF: Incorporating Domain Semantics in Structured Data Satya S. Sahoo Kno.e.sis CenterKno.e.sis Center, Computer Science and Engineering Department,
FP7 meeting - Gent - Carlos Rodríguez - April 18 WP4: Conceptual Mining from Text for Knowledge Engineering State of the Art WP Coordinators: Alfonso Valencia.
CELL COMMUNICATION. YOU MUST KNOW… THE 3 STAGES OF CELL COMMUNICATION: RECEPTION, TRANSDUCTION, AND RESPONSE HOW G-PROTEIN-COUPLED RECEPTORS RECEIVE CELL.
Www. GeneOntology.org Gene Ontology Collaboration.
Catalina Martínez-Costa, Stefan Schulz: Ontology-based reinterpretation of the SNOMED CT context model Ontology-based reinterpretation of the SNOMED CT.
Representing laboratory information and other findings in clinical ontologies Anita Burgun, MD, PhD EA IFR140 Faculté de Médecine – Université de.
Part IV Cellular Communication Cellular signal transduction The cell cycle and programmed cell death Germ cells and fertilization Cell interactions in.
Gene Ontology John Pinney
Developing a protein-interactions ontology Esther Ratsch European Media Laboratory.
Gene Ontology Luis Tari. Gene Ontology (GO) URL: Gene Ontology is A hierarchy of roles of genes.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
COG and GO tutorial.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Figure S1. ID#1 Lipid Metabolism, Small Molecule Biochemistry, Drug Metabolism.
Ontological Model for Colon Carcinoma: A Case Study for Knowledge Representation in Clinical Bioinformatics Kumar A 1,2, Yip L 3, Jaremek M 2, Scheib H.
1 Cui Tao PhD Dissertation Defense Ontology Generation, Information Harvesting and Semantic Annotation For Machine-Generated Web Pages.
Cell Biology Course Info and Introduction. What is Cell Biology? Investigation of Biological Systems –Biochemistry –Molecular Biology –Genetics/Molecular.
Ontologies: Making Computers Smarter to Deal with Data Kei Cheung, PhD Yale Center for Medical Informatics CBB752, February 9, 2015, Yale University.
Amarnath Gupta Univ. of California San Diego. An Abstract Question There is no concrete answer …but …
Integrating bio-ontologies with a workflow/Petri Net model to qualitatively represent and simulate biological systems Mor Peleg, Irene Gbashvili, and Russ.
BASIC BIOCHEMISTRY MLAB Introduction. INTRODUCTION TO BASIC BIOCHEMISTRY Biochemistry can be defined as the science concerned with the chemical.
Developing an OWL-DL Ontology for Research and Care of Intracranial Aneurysms – Challenges and Limitations Holger Stenzhorn, Martin Boeker, Stefan Schulz,
Gene Ontology (GO) Project
Ranking-Aware Integration and Explorative Search of Distributed Bio-Data Dipartimento di Elettronica e Informazione NETTAB 2012 Integrated Bio-Search November.
Olivier Bodenreider Lister Hill National Center for Biomedical Communications Bethesda, Maryland - USA Experiences in visualizing and navigating biomedical.
Expression Modules Brian S. Yandell (with slides from Steve Horvath, UCLA, and Mark Keller, UW-Madison)
Chapter 11: Cell Communication. Essential Knowledge 2.e.2 – Timing and coordination of physiological events are regulated by multiple mechanisms (11.1).
DONNA MAGLOTT, PH.D. PRO AND MEDICAL GENETICS RESOURCES AT NCBI.
Weighted models for insulin Detected by scanone Detected by Ping’s multiQTL model tissue# transcripts Islet1984 Adipose605 Liver485 Gastroc404 # transcripts.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Consistency between Metathesaurus and Semantic Network Workshop on The Future of the UMLS Semantic Network NLM, April 8, 2005 Olivier Bodenreider Lister.
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Respective contributions of MIAME, GeneOntology and UMLS for transcriptome analysis Fouzia Moussouni, Anita Burgun, Franck Le Duff, Emilie Guérin, Olivier.
Cellular macromolecule catabolism cellular macromolecule metabolism cytoplasm organization and biogenesis establishment of cellular localization intracellular.
Cytoskeleton Ribosome Protein folding mRNA processing Lipid transport Transcription Cell growth Cytoskeleton Carbohydrate metabolism Protein catabolism.
Chapter 11: cell signals Without cell signaling, no multicellular organisms could exist. Cells would use their genomes equivalently. Cell signals allow.
DAML+OIL: an Ontology Language for the Semantic Web.
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Mining the Biomedical Research Literature Ken Baclawski.
Japan Consortium for Glycobiology and Glycotechnology DataBase 日本糖鎖科学統合データベース GDGDB - Glyco-Disease Genes Database The complexity of glycan metabolic pathways.
Relevant KEGG pathways Additional file 2 Te Pas et al. The file shows more KEGG pathways as found by the PERL script search of the database.
1 Semantic Relations for Interpreting DNA Microarray Data and for Novel Hypotheses Generation Dimitar Hristovski, 1 PhD, Andrej Kastrin, 2 Borut Peterlin,
GO-Slim term Cluster frequency cytoplasm 1944 out of 2727 genes, 71.3% 70 out of 97 genes, 72.2% out of 72 genes, 86.1% out.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
© The ATHENA Consortium. Susan Thomas SAP AG, Research Department How do you do semantics? Semantic Web Drawings by Sebastian Cremers Unit 3:
Ontologies for Terminologies, Knowledge Representation & Software: Benefits & Gaps (“Don’t make the tea”) (Only a part of Knowledge Representation) Alan.
Oncologic Pathology in Biomedical Terminologies Challenges for Data Integration Olivier Bodenreider National Library of Medicine Bethesda, Maryland -
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
OWL Web Ontology Language Summary IHan HSIAO (Sharon)
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
Protein. Protein and Roles 1: biological process unknown 1.1 Structural categories 1.2 organism categories 1.3 cellular component o unlocalized.
Effect of Alcohol on Brain Development NormalFetal Alcohol Syndrome.
RDF based on Integration of Pathway Database and Gene Ontology SNU OOPSLA LAB DongHyuk Im.
Cross-Ontological Relationships
Can SNOMED CT be harmonized with an upper-level ontology?
Temporal Gene Expression Profiles after Focal Cerebral Ischemia in Mice Zhang Chengjie 1, 2, 3 ;Zhu Yanbing 2 ;Wang Song 2, 3 ;Zachory Wei Zheng 1, 2,
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
CCO: concept & current status
Introduction to Biochemistry. Objectives  Know what biochemistry is and its principle.  Know the components of a cell and its major types of bio- molecules.
Fouzia Moussouni, Anita Burgun, Franck Le Duff,
Schematic of cellular role categories of theoretical (open bars) and identified proteins on a 2-D electrophoresis gel, pH 4–7 (black bars), in L. casei.
Milk-associated proteomes.
Presentation transcript:

EA 3888 – Conceptual Modeling of Biomedical Knowledge Faculty of Medicine - University of Rennes 1 Integrating and querying disease and pathway ontologies: building an OWL model and using RDFS queries Julie Chabalier, Olivier Dameron, Anita Burgun

EA 3888 – University of Rennes 1 Introduction Disease description in current medical ontologies Clinical features Etiology Location Morphology Example: SNOMED Clinical Terms® (SNOMED CT®) Disease Definitional manifestation causative agent finding site associated morphology

EA 3888 – University of Rennes 1 Introduction Characterization of diseases : biological knowledge required Genes ­ A gene mutation may result in a disease Metabolic pathways -A pathway may be shared by different phenotypes Biological processes -Different processes may explain different grades of a disease Biological knowledge Absent from medical ontologies

EA 3888 – University of Rennes 1 Objectives Integration of disease and pathway ontologies Ontology integration ­ Identify candidate ontologies ­ Get candidate ontologies in an adequate formalism ­ Integrate formalized ontologies Querying the resulting ontology ­ Consistency checking ­ Exploiting biomedical knowledge

EA 3888 – University of Rennes 1 Candidate ontologies KEGG Orthology (KO) hierarchy Organization of metabolic pathway and disease maps in the KEGG knowledge base DAG of four levels

EA 3888 – University of Rennes 1 Candidate ontologies ~ terms organized according to 3 hierarchies : - Molecular Function - Cellular Component - Biological Process Used to enrich the KO pathway definitions Gene Ontology (GO) the Gene Ontology

EA 3888 – University of Rennes 1 Candidate ontologies SNOMED-CT: clinical description of diseases Alzheimer's disease findingSite Intracranial glioma Brain structure Disorder of brain Dementia Cerebral structure Used to enrich the KO disease definitions findingSite Organic mental disorder Neoplasm of brain

EA 3888 – University of Rennes 1 Formalism OWL as a common formalism Unambiguous combination of several ontologies (URI, namespaces) Defined semantics Expressiveness (e.g disjointness) Getting candidate ontologies in OWL-DL KO: conversion of the 3 upper levels (available in text) GO: extraction of Biological Process hierarchy (available in OWL) SNOMED: extraction and conversion of the relevant concepts and relations (from UMLS)

EA 3888 – University of Rennes 1 Ontology integration Setting up relationships between ontologies Aligning: defining relationships between terms (is-a, part-of, etc.) Mapping: defining equivalence relationships between terms

EA 3888 – University of Rennes 1 Integration framework BioMed Ontology GO Biological Processes Disease and Pathway descriptions KO Pathways Diseases SNOMED Diseases Pathway descriptions Disease descriptions

EA 3888 – University of Rennes 1 Mapping GO processes – KO pathways GO biological processes KO Pathways Diseases Metamap program*: lexical mapping (labels and synonyms) KO: Metabolism KO: Carbohydrate metabolism GO: Metabolism GO: Macromolecule metabolism GO: Carbohydrate metabolism KO: Fructose and mannose metabolism SNOMED Diseases *Aronson, A.R. (2001) Effective mapping of biomedical text to the UMLS Metathesaurus: the MetaMap program, Proceedings of the AMIA Symp., 17-21

EA 3888 – University of Rennes 1 Aligning GO processes – KO pathways GO: Carbohydrate metabolism GO: Cellular carbohydrate metabolism GO: Monosaccharide metabolism GO: Hexose metabolism GO: Fructose metabolism GO: Mannose metabolism KO: Carbohydrate metabolism KO: Fructose and mannose metabolism GO: atomic conceptsKO: composite concepts Patterns to segment and recompose KO terms before the mapping KO: Fructose and mannose metabolism Fructose mannosemetabolism

EA 3888 – University of Rennes 1 Mapping & aligning GO processes – KO pathways GO: Carbohydrate metabolism GO: Cellular carbohydrate metabolism GO: Monosaccharide metabolism GO: Hexose metabolism GO: Fructose metabolism GO: Mannose metabolism KO: Carbohydrate metabolism KO: Fructose and mannose metabolism

EA 3888 – University of Rennes 1 Mapping of KO diseases and SNOMED diseases GO biological processes KO Pathways Diseases SNOMED Diseases Metamap program SN: Alzheimer's disease SN: Organic mental disorder SN: Dementia KO: Human diseases KO: Neurodegenerative disorders KO: Alzheimer's disease SN: Disorder of brain

EA 3888 – University of Rennes 1 Alignment of pathways and diseases GO Biological Processes KO Pathways Diseases Condition of alignment : if, at least, one gene is involved in both a disease D and a pathway P : 1 2 SNOMED Diseases Alignment: inferring relationships between : 1 - GO processes and KO diseases 2 - KO pathways and KO diseases D P hasPathway

EA 3888 – University of Rennes 1 1 Alignment of GO processes and KO diseases GO Biological Processes KO Pathways Diseases 2 SNOMED Diseases KEGG mapping (KEGG geneId - Uniprot id) GOA Genes Uniprot id GO id 1 hasPathway

EA 3888 – University of Rennes 1 GO Biological Processes KO Pathways Diseases 1 2 SNOMED Diseases Alignment of KO pathways and KO diseases KO: Metabolism KO: Carbohydrate metabolism KO: Glycolysis/Gluconeogenesis KO: gene1 KO: gene3 KO: Metabolism KO: Carbohydrate metabolism KO: Glycolysis/Gluconeogenesis KO: gene1 KO: gene3 KO: Human diseases KO: Neurodegenerative disorders KO: Alzheimer's disease KO: gene1 KO: gene2 KO: Human diseases KO: Neurodegenerative disorders KO: Alzheimer's disease KO: gene1 KO: gene2 hasPathway

EA 3888 – University of Rennes 1 Integration result BioMed Ontology classes: classes from GO 281 classes from KO pathways classes - 19 disease classes 146 classes from SNOMED

EA 3888 – University of Rennes 1 Integration results 144 KO pathways associated with GO processes (57%) 15 KO diseases associated with SNOMED Diseases (94%) 15 KO diseases associated with 836 distinct pathways (GO & KO) 3144 hasPathway relationships BioMed Ontology

EA 3888 – University of Rennes 1 Querying the BioMed Ontology Exploiting knowledge and checking consistency Taking into account the explicit relationships RDFS is sufficient RDF query language : SeRQL Implementation of SeRQL in Sesame is able to exploit RDFS semantics Exploitation of explicit relationships

EA 3888 – University of Rennes 1 SeRQL queries Example of an exploiting query Which pathways are shared by 2 neurological disorders : glioma & Alzheimers disease? SELECT DISTINCT Pathway, label(PathwayName) FROM {kpath:ko05010} rdfs:subClassOf {SuperClass}, {SuperClass} rdf:type {owl:Restriction}, {SuperClass} owl:onProperty {ea3888hp:hasPathway}, {SuperClass} owl:someValuesFrom {Pathway}, {Pathway} rdfs:label {PathwayName} INTERSECT SELECT DISTINCT Pathway, label(PathwayName) FROM {kpath:ko05214} rdfs:subClassOf {SuperClass}, {SuperClass} rdf:type {owl:Restriction}, {SuperClass} owl:onProperty {ea3888hp:hasPathway}, {SuperClass} owl:someValuesFrom {Pathway}, {Pathway} rdfs:label {PathwayName} SELECT DISTINCT Pathway, label(PathwayName) FROM {kpath:ko05010} rdfs:subClassOf {SuperClass}, {SuperClass} rdf:type {owl:Restriction}, {SuperClass} owl:onProperty {ea3888hp:hasPathway}, {SuperClass} owl:someValuesFrom {Pathway}, {Pathway} rdfs:label {PathwayName} INTERSECT SELECT DISTINCT Pathway, label(PathwayName) FROM {kpath:ko05214} rdfs:subClassOf {SuperClass}, {SuperClass} rdf:type {owl:Restriction}, {SuperClass} owl:onProperty {ea3888hp:hasPathway}, {SuperClass} owl:someValuesFrom {Pathway}, {Pathway} rdfs:label {PathwayName}

EA 3888 – University of Rennes 1 Query results Which pathways are shared by 2 neurological disorders : glioma & Alzheimers disease? 37 pathways: MAPK signaling pathway Focal adhesion Insulin signaling pathway Melanogenesis B cell receptor signaling pathway heart development central nervous system development axon guidance peptidyl-serine phosphorylation protein amino acid phosphorylation cell cycle cell-cell signaling cell cycle arrest lipid catabolic process lipid metabolic process ubiquitin cycle transport ErbB signaling pathway Wnt signaling pathway protein tetramerization intracellular signaling cascade protein modification process glycogen metabolic process anagen induction of apoptosis negative regulation of apoptosis apoptosis anti-apoptosis Natural killer cell mediated cytotoxicity cell proliferation DNA replication chromosome organization and biogenesis calcium ion homeostasis signal transduction response to UV negative regulation of cell growth cytoskeleton organization and biogenesis

EA 3888 – University of Rennes 1 hasPathway Query results By leveraging the pathway hierarchy: 66 pathways ( ) Alzheimers disease Intracellular protein transport Protein transport into nucleus, translocation Glioma hasPathway

EA 3888 – University of Rennes 1 Query results Example of a consistency query: Detect if a specific pathway and a more general one are associated with a same disease Disease1Pathway1 Pathway2 hasPathway Removal of redundant relationships

EA 3888 – University of Rennes 1 Conclusion Biomed Ontology project Integration Automatic method of integration of biomedical ontologies ­ Deals with the huge quantity of biomedical data ­ Takes into account the frequent updates of biomedical sources BioMed ontology ­ Integrates 3 biomedical ontologies (KO, GO, SNOMED) ­ Takes into account the formal evolution of the biomedical ontologies (OWL) Querying RDFS queries are enough: ­ to detect some basic inconsistencies of the BioMed ontology ­ to exploit the BioMed ontology

EA 3888 – University of Rennes 1 Perspectives Biological evaluation: study of glioma Increase the number of integrated biomedical sources (e.g. OMIM, BioPax) Improve the mapping/alignment techniques by taking into account the semantics in the patterns Associate a degree of confidence to the Disease/Pathway relationships (based for example on the GO evidence code)

EA 3888 – University of Rennes 1 BioMed ontology project :