What is an ontology and Why should you care? Barry Smith 1.

Slides:



Advertisements
Similar presentations
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
Advertisements

On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
Ontology in Buffalo Barry Smith. 2 Ontology (phil.) The science of being Ontologies (tech.) Standardized classification systems which enable data from.
The Environment Ontology Barry Smith 1.
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
The Future of Health Information Barry Smith Ontology Research Group Center of Excellence in Bioinformatics and Life Sciences University at Buffalo ontology.buffalo.edu/smith.
1 The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
Introduction to Ontologies for Environmental Biology Barry Smith
What is an ontology and Why should you care? Barry Smith 1.
1 Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith
1 Workshop 7.00 Welcoming Remarks 7.15 Barry Smith (Buffalo, NY) 7.40 Lindsay Cowell (Duke University, NC) 8.05 Nigam Shah (Stanford University, CA) 8.30.
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
1 The OBO Foundry Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
1 Ontology in 15 Minutes Barry Smith. 2 Main obstacle to integrating genetic and EHR data No facility for dealing with time and instances (particulars)
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Underlying Ontologies for Biomedical work - The Relation Ontology (RO) and Basic Formal Ontology (BFO) Thomas Bittner SUNY Buffalo
Using Ontologies to Represent Immunological Networks Lindsay G. Cowell, Anne Lieberman, Anna Maria Masci Duke University Center for Computational Immunology.
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
1 Introduction to Ontology Barry Smith
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
CTO - Clinical Trials/Research in the Ontology of Biomedical Investigation Richard H. Scheuermann U.T. Southwestern Medical Center.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
1 The OBO Foundry Barry Smith Center of Excellence in Bioinformatics & Life Sciences, University at Buffalo IFOMIS, Saarland University
Anatomical Information Science Barry Smith
CoE Ontology Research Group (ORG) Barry Smith Center of Excellence in Bioinformatics and Life Sciences Ontology Research Group Department of Philosophy.
1 The OBO Relation Ontology Genome Biology 2005, 6:R46 based on the fundamental distinction between instances and universals takes instances and time into.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
1 What an Ontology is For Barry Smith University at Buffalo Common Anatomy Reference Ontology Workshop.
Introduction to Ontologies for Environmental Biology Barry Smith
The OBO Foundry Chris Mungall Lawrence Berkeley Laboratory NCBO GO Consortium May 2007.
1 How Ontologies Create Research Communities Barry Smith
1 The Canonical Life Barry Smith
1 Ontology (Science) Barry Smith University at Buffalo
1 The Future of Clinical Bioinformatics: Overcoming Obstacles to Information Integration Barry Smith Brussells, Eurorec Ontology Workshop, 25 November.
Limning the CTS Ontology Landscape Barry Smith 1.
Gene Ontology (GO) Project
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Intelligence Ontology A Strategy for the Future Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
1 The Canonical Life Barry Smith
Gene Ontology Project
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Biomedical Ontologies: The State of the Art Barry Smith and Werner Ceusters MIE, Sarajevo, August 30 1.
1 Introduction to Bio-Ontologies Barry Smith
How to integrate data Barry Smith. The problem: many, many silos DoD spends more than $6B annually developing a portfolio of more than 2,000 business.
2 3 where in the body ? where in the cell ?
Ontology and the Semantic Web Barry Smith August 26,
Need for common standard upper ontology
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 Ontology (Science) vs. Ontology (Engineering) Barry Smith University at Buffalo
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
1 Using Ontologies for Annotation of Genomic Data Barry Smith University at Buffalo
What is an ontology and Why should you care? Barry Smith 1.
What is an ontology and Why should you care?
An Introduction to Ontology for Evolutionary Biology
Intelligence Ontology: A Strategy for the Future
Ontology in 15 Minutes Barry Smith.
Ontology in 15 Minutes Barry Smith.
OBO Foundry Update: April 2010
Presentation transcript:

What is an ontology and Why should you care? Barry Smith 1

What I do Gene Ontology (NIHGR) (Scientific Advisor) National Center for Biomedical Ontology (NIHGR) Protein Ontology (NIGMS) Infectious Disease Ontology (NIAID) Biometrics Ontology (US Army) Ontology for Biomedical Investigations (MGED and others) 2

Uses of ‘ontology’ in PubMed abstracts 3

By far the most successful: GO (Gene Ontology) 4

You’re interested in which genes control heart muscle development 17,536 results 5

attacked time control Puparial adhesion Molting cycle hemocyanin Defense response Immune response Response to stimulus Toll regulated genes JAK-STAT regulated genes Immune response Toll regulated genes Amino acid catabolism Lipid metobolism Peptidase activity Protein catabloism Immune response Microarray data shows changed expression of thousands of genes. How will you spot the patterns? 6

You’re interested in which of your hospital’s patient data is relevant to understanding how genes control heart muscle development 7

Lab / pathology data EHR data Clinical trial data Family history data Medical imaging Microarray data Model organism data Flow cytometry Mass spec Genotype / SNP data How will you spot the patterns? How will you find the data you need? 8

How does the Gene Ontology work? with thanks to Jane Lomax, Gene Ontology Consortium 9

1. GO provides a controlled system of representations for use in annotating data multi-species, multi-disciplinary, open source contributing to the cumulativity of scientific results obtained by distinct research communities compare use of kilograms, meters, seconds … in formulating experimental results 10

11

Definitions 12

Gene products involved in cardiac muscle development in humans 13

14

Questions for annotation where is a particular gene product involved in what type of cell or cell part? in what part of the normal body? in what anatomical abnormality? when is a particular gene product involved in the course of normal development? in the process leading to abnormality with what functions is the gene product associated in other biological processes? 15

2. GO provides a tool for algorithmic reasoning 16

Hierarchical view representing relations between represented types 17

GO now introducing also regulates relations into its ontologies 18

3. GO allows a new kind of biological research, based on analysis and comparison of the massive quantities of annotations linking GO terms to gene products 19

Uses of GO in studies of − role of regulation of gene expression in axon guidance during development in Drosophila (PMID ) − prevention of ischemic damage to the retina in rats (PMID ) − immune system involvement in abdominal aortic aneurisms in humans (PMID ) − how the white spot syndrome virus affects cell function in shrimp (PMID ) − relationships between protein interaction networks involving the ash1 and ash2 genes in flies and in humans (PMID ) 20

GO is amazingly successful – but it covers only generic biological entities of three sorts: –cellular components –molecular functions –biological processes and it does not provide representations of disease-related phenomena 21

Extending the GO methodology to other domains of biology 22

RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) The Open Biomedical Ontologies (OBO) Foundry 23

OntologyScopeURLCustodians Cell Ontology (CL) cell types from prokaryotes to mammals obo.sourceforge.net/cgi- bin/detail.cgi?cell Jonathan Bard, Michael Ashburner, Oliver Hofman Chemical Entities of Bio- logical Interest (ChEBI) molecular entitiesebi.ac.uk/chebi Paula Dematos, Rafael Alcantara Common Anatomy Refer- ence Ontology (CARO) anatomical structures in human and model organisms (under development) Melissa Haendel, Terry Hayamizu, Cornelius Rosse, David Sutherland, Foundational Model of Anatomy (FMA) structure of the human body fma.biostr.washington. edu JLV Mejino Jr., Cornelius Rosse Functional Genomics Investigation Ontology (FuGO) design, protocol, data instrumentation, and analysis fugo.sf.netFuGO Working Group Gene Ontology (GO) cellular components, molecular functions, biological processes Ontology Consortium Phenotypic Quality Ontology (PaTO) qualities of anatomical structures obo.sourceforge.net/cgi -bin/ detail.cgi? attribute_and_value Michael Ashburner, Suzanna Lewis, Georgios Gkoutos Protein Ontology (PrO) protein types and modifications (under development)Protein Ontology Consortium Relation Ontology (RO) relationsobo.sf.net/relationshipBarry Smith, Chris Mungall RNA Ontology (RnaO) three-dimensional RNA structures (under development)RNA Ontology Consortium Sequence Ontology (SO) properties and features of nucleic sequences song.sf.netKaren Eilbeck 24

Foundational Model of Anatomy 25

Definitions Cell =Def. an anatomical structure which consists of cytoplasm surrounded by a plasma membrane Anatomical structure =Def. a material anatomical entity which is generated by coordinated expression of the organism’s own genes An A =Def. a B which Cs 26

Pleural Cavity Pleural Cavity Interlobar recess Interlobar recess Mesothelium of Pleura Mesothelium of Pleura Pleura(Wall of Sac) Pleura(Wall of Sac) Visceral Pleura Visceral Pleura Pleural Sac Parietal Pleura Parietal Pleura Anatomical Space Organ Cavity Organ Cavity Serous Sac Cavity Serous Sac Cavity Anatomical Structure Anatomical Structure Organ Serous Sac Mediastinal Pleura Mediastinal Pleura Tissue Organ Part Organ Subdivision Organ Subdivision Organ Component Organ Component Organ Cavity Subdivision Organ Cavity Subdivision Serous Sac Cavity Subdivision Serous Sac Cavity Subdivision part_of is_a 27

OBO Foundry recognized by NIH as framework to address mandates for re-usability of data collected through Federally funded research see NIH PAR : Data Ontologies for Biomedical Research (R01) 28

OBO Foundry provides tested guidelines enabling new groups to develop the ontologies they need in ways which counteract forking and dispersion of effort an incremental bottoms-up approach to evidence-based terminology practices in medicine that is rooted in basic biology automatic web-based linkage between biological knowledge resources (massive integration of databases across species and biological system) 29

An ontology is not a database New databases for each new kind of data New databases for each new project Ontologies like the GO are a solution to the silo problems databases cause 30

A good solution to these silo problems must be: modular incremental bottom-up based on consistent, intuitive structure evidence-based and thus revisable incorporate a strategy for motivating potential developers and users 31

An ontology is not a terminology Existing term lists built to serve specific data-processing in ad hoc ways Ontologies designed from the start to ensure integratability and reusability of data by incorporating a common logical structure 32

OBO Foundry principle of modularity one ontology for each domain no need for ‘mappings’ (which are in any case too expensive, too fragile, too difficult to keep up-to-date as mapped ontologies change) everyone knows where to look to find out how to annotate each kind of data division of labor 33

RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) The Open Biomedical Ontologies (OBO) Foundry 34

Extending the OBO Foundry to evolutionary biology GO Reference Genome Project PATO – Phenotypic Quality Ontology e.g. as basis for comparative studies of human and model organisms CARO – Common Anatomy Reference Ontology PRO – Protein Ontology (ProEVO) RNA Ontology 35

which of these terms already exist in OBO Foundry ontologies? gene allele allelic variation gene pool genotype population speciation homology mutation inheritance organism extinction 36

RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT POPULATION family, tribe, species, … population phenotype epidemic, speciation, … ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) Adding population-level granularity to OBO Foundry 37

Foundational is_a part_of Spatial located_in contained_in adjacent_to Temporal transformation_of derives_from preceded_by Participation has_participant has_agent OBO Relation Ontology 1.0 “Relations in Biomedical Ontologies”, Genome Biology, April

GO graph-theoretic hierarchy allows logical reasoning 39

Relation Ontology A is_a B =def. Every instance of A is an instance of B A part_of B =def. Every instance of A is a part of some instance of B 40

C c at t C 1 c 1 at t 1 C' c' at t time instances zygote derives_from ovum sperm derives_from 41

transformation_of c at t 1 C c at t C 1 time same instance pre-RNA  mature RNA child  adult pupa  larva 42

C c at t c at t 1 C 1 embryological development 43

two continuants fuse to form a new continuant C c at t C 1 c 1 at t 1 C' c' at t fusion 44

one initial continuant is replaced by two successor continuants C c at t C 1 c 1 at t 1 C 2 c 2 at t 1 fission 45

one continuant detaches itself from an initial continuant, which itself continues to exist C c at t c at t 1 C 1 c 1 at t budding 46

one continuant is absorbed by a second continuant C c at t C 1 c 1 at t 1 C' c' at t capture 47

Relations proposed for RO 2.0 regulates (GO) inheres_in has_input has_function has_quality realization_of directly_descends_from (CARO) homologous_to (CARO) 48