About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,

Slides:



Advertisements
Similar presentations
More than one way to dissect an animal Melissa Haendel ZFIN Scientific Curator.
Advertisements

Species-Neutral vs. Multi-Species Ontologies Barry Smith.
What is Ontology? Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of the kinds.
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
Ontology Notes are from:
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
1 The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
1 Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Abstract The Cell Ontology (CL) is a candidate OBO Foundry 1 ontology for the representation of in vivo cell types. As part of our work in redeveloping.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith.
Underlying Ontologies for Biomedical work - The Relation Ontology (RO) and Basic Formal Ontology (BFO) Thomas Bittner SUNY Buffalo
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
Internet tools for genomic analysis: part 2
CoE Ontology Research Group (ORG) Barry Smith Center of Excellence in Bioinformatics and Life Sciences Ontology Research Group Department of Philosophy.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
What is an Ontology? AmphibiaTree 2006 Workshop Saturday 8:45–9:15 A. Maglia.
Building the Ontology Landscape for Cancer Big Data Research Barry Smith May 12, 2015.
Ontologies 101 Melissa Haendel and Jim Balhoff INCF taskforce cross program meeting Stockholm, Aug. 30, 2013.
An Introduction to Ontologies Contributors: Melissa Haendel, Chris Mungall, David Osumi-Sutherland.
Automatic methods for functional annotation of sequences Petri Törönen.
Computational Biology and Informatics Laboratory Development of an Application Ontology for Beta Cell Genomics Based On the Ontology for Biomedical Investigations.
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
An (OBO) ontology is NOT a model of language, it is a model of reality. Words are ambiguous – especially in isolation. Take the word 'wing' what type of.
Intelligence Ontology A Strategy for the Future Barry Smith University at Buffalo
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Terry Meehan Scientific Curator Mouse Genome Informatics The Jackson Laboratory Logical Definitions for Hematopoietic Cell Terms.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Towards a Top-Level Ontology for Molecular Biology Stefan Schulz 1, Elena Beisswanger 2, Udo Hahn 2, Joachim Wermter 2, 1 Freiburg University Hospital,
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
Monday, November 8, 2:30:07 PM  Ontology is the philosophical study of the nature of being, existence or reality as such, as well as the basic categories.
The “über-ontology” (Uberon) Melissa Häendel, Chris Müngall, George Gkoütos Cell Ontology Workshop May, 2010.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
DAVID R. SMITH DR. MARY DOLAN DR. JUDITH BLAKE Integrating the Cell Cycle Ontology with the Mouse Genome Database.
The Plant Ontology: Development of a Reference Ontology for all Plants Plant Ontology Consortium Members and Curators*: Laurel D.
How to integrate data Barry Smith. The problem: many, many silos DoD spends more than $6B annually developing a portfolio of more than 2,000 business.
2 3 where in the body ? where in the cell ?
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
Ontology and the Semantic Web Barry Smith August 26,
What is an ontology and Why should you care? Barry Smith 1.
Expanding species-specific anatomy ontologies to include the cell ontology Melissa Haendel (1), Ceri Van Slyke (1), Chris Mungall (2), Peiran Song (1),
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Need for common standard upper ontology
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Introduction to Ontology Introductions Alan Ruttenberg Science Commons.
Mental Functioning and the Gene Ontology
Why do we need upper ontologies? What are their purported benefits?
What is Ontology? s Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of.
OBO Foundry Update: April 2010
Presentation transcript:

About ontologies Melissa Haendel

And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist, curator, ontologist Most ontologists become ontologists because they have a need to classify and compute on a specific domain

Information retrieval from text-based resources is hard: OMIM Query# of records “large bone”785 "enlarged bone" "big bones"16 "huge bones"4 "massive bones"28 "hyperplastic bones"12 "hyperplastic bone"40 "bone hyperplasia"134 "increased bone growth"612

MouseEcotope GlyProt DiabetInGene GluChem sphingolipid transporter activity Use of the same defined term indicates the same meaning under different annotation circumstances Common vocabularies allow grouping of disparate data

The controlled vocabulary spectrum Bottom line: you get what you pay for. Ontologies are expensive but powerful. OBO

Why build an ontology? A simple example Number of genes annotated to each of the following brain parts in an ontology: brain 20 part_of hindbrain 15 part_of rhombomere 10 Query brain without ontology 20 Query brain with ontology 45 Ontologies can facilitate grouping and retrieval of data

An ontology is a classification vocabulary Philosophers: Ontology = The study of being as a branch of philosophy Bioinformaticists: Domain ontology = representing a specific knowledge base Features of ontologies: 1.Terms are defined 2.Terms are arranged in a hierarchy 3.Relationships between the terms are defined 4. Expressed in a knowledge representation language Some well known ontologies are: SnoMED, Foundational Model of Anatomy, Gene Ontology, Linnean Taxonomy of species

Defining ontology classes, a beginner’s guide An X =Y that has one or more differentiating characteristics. where Y is the is_a parent of X. Definition: Blue cylinder = Cylinder that is blue. Definition: cylinder = Surface formed by the set of lines perpendicular to a plane, which pass through a given circle in that plane. is_a These types are Disjoint: A cylinder can be either blue or red by definition, but not both. Definitions specify necessary and sufficient conditions. is_a Definition: Red cylinder = Cylinder that is red. Y X

Subsumption reasoning: The simplest ontology reasoning is_a entity organism cat mammal animal is_a human is_a instance_of PeanutChris Shaffer

Hierarchy types Simple taxonomy Directed acyclic graph

Subject —— Property —— Object brain (class)has_partneuron (class) PloS 2009 (instance)has_author Melissa (instance) Chris (instance) ownsred tie (instance) Ontology classes, instances, and relations are stored as sentences

The True Path Rule cuticle synthesis --[i] chitin metabolism cell wall biosynthesis --[i] chitin metabolism ----[i] chitin biosynthesis ----[i] chitin catabolism chitin metabolism --[i] chitin biosynthesis --[i] chitin catabolism --[i] cuticle chitin metabolism ----[i] cuticle chitin biosynthesis ----[i] cuticle chitin catabolism --[i] cell wall chitin metabolism ----[i] cell wall chitin biosynthesis ----[i] cell wall chitin catabolism GO Before:GO After: BUT: A fly chitin synthase could be annotated to chitin biosynthesis, and appear in a query for genes annotated to cell wall biosynthesis (and its children), which makes no sense because flies don't have cell walls. NOW: all the child terms can be followed up to chitin metabolism, but cuticle chitin metabolism terms do not trace back to cell wall terms, so all the paths are true. The pathway from a child term all the way up to its top level parent(s) must be universally true.

Where does the True Path Rule come from? Transitivity. Some relations are transitive, and apply across all levels of the hierarchy. For example, a cat is_a mammal, and a mammal is_a vertebrate SO a cat is_a vertebrate => This is the true path rule and is because the is_a relation is transitive. Some properties are not transitive. For example, head has_quality round. and, head part_of organism. So is the organism round? Of course not! BUT, eyes are part_of head, and head part_of organism, SO eye part_of organism is true, because part_of is a tranistive relation. Relations are logically defined in a common relation ontology or within each ontology that uses them. ≠ >

Domain ontologies are organized according to upper ontologies that specify the general types of things that exist RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) => Classification according to these higher level types helps ensure the True Path Rule holds

Universality Often, the order of the terms in an assertion will matter: We can assert adult transformation_of child but not child transformation_of adult More about using ontology classes and relations Univocity Terms should have the same meanings on every occasion of use. (= They should refer to the same universal types) Basic ontological relations such as is_a and part_of should be used in the same way by all ontologies

Common ontology languages OWL is a recognized standard, more tools available, less restricted expressivity OBO simple, more easily readable, used by many biologists, restricted expressivity DNA sequencing DNA sequencing OBI Branch derived DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument. Philippe Rocca-Serra Genomic deletions of OFD1 account for 23% of oral-facial-digital type 1 syndrome after negative DNA sequencing. Thauvin-Robinet C, Franco B, Saugier-Veber P, Aral B, Gigot N, Donzel A, Van Maldergem L, Bieth E, Layet V, Mathieu M, Teebi A, Lespinasse J, Callier P, Mugneret F, Masurel-Paulet A, Gautier E, Huet F, Teyssier JR, Tosi M, Frébourg T, Faivre L. Hum Mutat Nov 19. PMID: immune response assay immune response assay Is an assay with the objective to determine information about an immune response [Term] id: OBI: name: DNA sequencing def:"DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument." [OBI:sourced "OBI Branch derived"] is_a: OBI: ! sequencing assay [Term] id: OBI: name: DNA sequencing def:"DNA sequencing is a sequencing process which uses deoxyribonucleic acid as input and results in a the creation of DNA sequence information artifact using a DNA sequencer instrument." [OBI:sourced "OBI Branch derived"] is_a: OBI: ! sequencing assay

Ontology Lookup Service at EBI

Bioportal at the NCBO

Ontology Editors (and viewers) OBOEdit- OBO ontology editor and viewer Protégé - OWL ontology editor and viewer

Ontologies can help reconcile annotation inconsistencies

Developmental Biology, Scott Gilbert, 6 th ed. Using ontologies for error checking Text match mapping Fruit fly ‘tibia’Human ‘tibia’ UBERON: tibia UBERON: bone is_a Vertebrata Drosophila melanogaster part_of Homo sapiens is_a only_in_taxon part_of NOT is_a

Using ontologies for alignment of similar vocabularies

Information Content (IC) is based on depth within the ontology and annotation frequency Using ontologies for computation

Why do we need all these rules and standards for ontology use and construction? Automatic reasoning to infer related classes Annotation consistency Error checking Alignment with other ontologies Computation Ontologies must be intelligible to both: HumansMachines