An (OBO) ontology is NOT a model of language, it is a model of reality. Words are ambiguous – especially in isolation. Take the word 'wing' what type of.

Slides:



Advertisements
Similar presentations
Correct your own answers!
Advertisements

Homology.
More than one way to dissect an animal Melissa Haendel ZFIN Scientific Curator.
+ OWL for annotators David Osumi-Sutherland. + What is OWL? Web Ontology Language Can express everything in OBO and more. Certified web standard Fast.
Linking ontologies to one another and to the Cell Ontology with the COBrA ontology editor Jonathan Bard & Stuart Aitken Biomedical Science & Informatics.
Homology Review Human arm Lobed-fin fish fin Bat wing Bird wing Insect wing Homologous forelimbs not homologous as forelimbs or wings Definition: Structures.
Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry.
What is Ontology? Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of the kinds.
Classification of developing structures Developing structures are typically classified by developmental biologists according to their fate. If such terms.
Representing Part Relationships Between Developing Structures.
Unit 7: Evolution.
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
Linking Animal Models to Human Diseases Supported by NIH P41 HG and U54 HG the University of Oregon, Eugene, OR
Automated tools to help construction of Trait Ontologies Chris Mungall Monarch Initiative Gene.
The cell ontology covers multiple taxonomic contexts OntologyContext# of links to cl MAadult mouse1 FMAadult human658 XAOfrog63 AAOamphibian ZFAzebrafish391.
1 An Ontology of Relations for Biomedical Informatics Barry Smith 10 January 2005.
1 Ontology in 15 Minutes Barry Smith. 2 Main obstacle to integrating genetic and EHR data No facility for dealing with time and instances (particulars)
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
Phenotype annotation using ontologies Chris Mungall (+ BS) Berkeley Bioinformatics and Ontologies Project (BBOP) National Center for Biomedical Ontology.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
What is an Ontology? AmphibiaTree 2006 Workshop Saturday 8:45–9:15 A. Maglia.
1 The Future of Clinical Bioinformatics: Overcoming Obstacles to Information Integration Barry Smith Brussells, Eurorec Ontology Workshop, 25 November.
Year 10 Biology 2014 NATURAL SELECTION & EVOLUTION.
Representing Development David Osumi-Sutherland – FlyBase Fabian Neuhaus - National Institute of Standards and Technology.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Principles and Practice of Ontology Development: Making Definitions Computable Chris Mungall LBL.
Cell Ontology 2.0 Elimination of multiple is_a inheritance through instantiation of relationships to terms in outside ontologies, such as the GO cellular.
EVOLUTION EVIDENCE OF EVOLUTION AKHILESH CHITRANSHI PGT (BIOL.) KV NO. 1 JRC.
+ CARO 2.0 & FUNCARO David Osumi-Sutherland. + Review of CARO (v1) Many definitions are complicated and opaque: ‘anatomical group: “[An] anatomical structure.
American Medical Informatics Association Annual Symposium 2001 The Role of Definitions in Biomedical Concept Representation Joshua Michael, José L. V.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Ontological Foundations of Biological Continuants Stefan Schulz, Udo Hahn Text Knowledge Engineering Lab University of Jena (Germany) Department of Medical.
The “über-ontology” (Uberon) Melissa Häendel, Chris Müngall, George Gkoütos Cell Ontology Workshop May, 2010.
Linking Animal Models and Human Diseases Supported by NIH P41 HG002659, U54 HG004028, & R01 HG Cambridge University & the University of Oregon.
The Plant Ontology: Development of a Reference Ontology for all Plants Plant Ontology Consortium Members and Curators*: Laurel D.
Phenote Mark Gibson Berkeley Bioinformatics and Ontology Project (BBOP) National Center for Biomedical Ontologies(NCBO) Lawrence Berkeley National Lab.
Gene Ontology Project
About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,
What is an ontology and Why should you care? Barry Smith 1.
Expanding species-specific anatomy ontologies to include the cell ontology Melissa Haendel (1), Ceri Van Slyke (1), Chris Mungall (2), Peiran Song (1),
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Phenote Mark Gibson Berkeley Bioinformatics and Ontology Project (BBOP) National Center for Biomedical Ontologies(NCBO) Lawrence Berkeley National Lab.
OBO Foundry Workshop 2009 Cell Ontology (CL) Preliminary review.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Genes and Body plans How does an organism become a zygote
Anatomy Ontologies & Potential Users: Bridging the Gap Ravensara Travillian European Bioinformatics Institute
Basic Formal Ontology Barry Smith August 26, 2013.
Evidence for Evolution by Natural Selection.
1 The Relational Data Model David J. Stucki. Relational Model Concepts 2 Fundamental concept: the relation  The Relational Model represents an entire.
The Vertebrate Bridging Ontology (VBO) Ravensara Travillian, James Malone, Chao Pang, John Hancock, Peter W.H. Holland, Paul Schofield, and Helen Parkinson.
Names, Ranks, Clades, and Taxonomy Ontologies
The Teleost Anatomy Ontology: computable evolutionary morphology for teleost fishes Wasila Dahdul University of South Dakota & National Evolutionary Synthesis.
Towards a Computational Paradigm for Biological Structure
Outline Motivation: data mining Ontologies and all-some relationships
The Common Anatomy Reference Ontology (CARO) and queries across species Melissa Haendel ZFIN.
Development of the Amphibian Anatomical Ontology
Evolution Evidence of Evolution
Ontology in 15 Minutes Barry Smith.
Evidence for Evolution
The Gene Ontology: an evolution
15-3: Darwin Presents His Case
Ontology in 15 Minutes Barry Smith.
What is Ontology? s Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of.
OBO Foundry Update: April 2010
Evolution Biology Mrs. Johnson.
Presentation transcript:

An (OBO) ontology is NOT a model of language, it is a model of reality. Words are ambiguous – especially in isolation. Take the word 'wing' what type of things have wings? birds, bats, flies, sphenoid bones*, planes, cars... An (OBO) ontology uses words from natural language but constrains their meaning to some subset of their possible meanings in language. Constraints are written into definitions and relationships between terms and refer to the scientific literature. * the lateral process of a sphenoid bone is referred to as a wing (Henderson's dictionary biological terms)‏ OBO foundry approach to ontology - 1

OBO foundry approach to ontology- 2 Problem with relying on the 'self-evident' meaning of words: With no definitions for the nouns, these two statements both have obvious readings which are true: 1. eye part of craniofacial tissue 2. ommatidium part of eye But, if we combine them: ommatidium part of eye part of craniofacial tissue... we can't get a true statement about biology, no matter what meaning we choose for the nouns. An ontology with thousands of terms and relations makes an enormous number of statements. Only by defining terms can we hope to avoid problems like this.

OBO 101 Type – a classification of things (instances) in the real world, based on some biologically significant similarity (specified by definition + relationships)‏

OBO 101 Type – a classification of things (instances) in the real world, based on some biologically significant similarity. Relationships between instances: Straightforward - each instance is just one thing: my left index finger is part_of my left hand

OBO 101 Type – a classification of things (instances) in the real world, based on some biologically significant similarity. Relationships between instances: Straightforward - each instance is just one thing: my left index finger is part_of my left hand Relationships between types: For any type, there can be many instances in the world. Does a relationship between types apply to all or just some of these instances?

Basic relations

Why we build anatomy ontologies Main use case: annotation of gene expression and phenotypes (MODs and others)‏ This needs: 1. Terms with written definitions => consistent annotation despite variant usage of terminology in literature. 2. Careful use of logically defined relations => allowing annotations with related terms to be grouped in biologically meaningful ways. Use case 2: A controlled system for recording character matrices for use in homology calls (cTOL/Nescent).

Grouping annotations using an anatomy ontology QUERY: list all axon tracts which are part of the adult midbrain List all alleles causing phenotypes in these structures. List all genes expressed in these structures.

Grouping annotations using an anatomy ontology QUERY: list all neurons which develop from neuroblast NB 7-1 List all alleles causing phenotypes in these structures. List all genes expressed in these structures.

Why standardise the way we build anatomy ontologies? So we don't all need to re-invent the wheel. Use of common relations allows the development of common tools. Best practice allows development of scalable and maintainable ontologies: Written definitions make the reasons for graph structure clear. => New editors can understand and build on the work of previous editors Reducing multiple inheritance makes ontologies makes avoiding true-path violations practical.

CARO is a standardisation exercise CARO provides an upper-ontology template to use for constructing anatomy ontologies. CARO provides a bridge between anatomy ontologies and the basic formal ontology (BFO). This is useful for putting constraints on relations. anatomical structure maps to BFO: object and as such is defined as 'maximally connected' this distinction makes part relationships easier to define and avoids confusion with is_a relations disconnected anatomical systems (e.g., immune system) are classified under 'anatomical group'

CARO is a purely structural classification. CARO classifies structures from the bottom up – on the basis of granularity. cell component cell portion of tissue multi-tissue structure... CARO classifies structure from the top down – as subdivisions of a whole organism organ system appendage CARO makes a clear distinction between cellular and acellular structures.

CARO associated standardisation 1. A system for defining structures by their developmental fate. 2. A system for recording developmental timings. 3. A system for recording part relationships which change over developmental time.

Why build multi-species anatomy ontologies ? We need systems for cross species inference of gene function. What do we need for inference? The development, differentiation of functioning of homolgous structures is likely to involve conserved genetic pathways – especially if the species are closely related.

Lee Niswander Nature Reviews Genetics 4, (February 2003)‏ Conservation of gene expression and role in limb development

Why do we care about homology when building an anatomy ontology? Consider an anatomy ontology of vertebrates: skeletal system cranial skeletal system parietal bone (in_organism human)‏ parietal bone (in_organism zebrafish) frontal bone (in_organism human)‏ frontal bone (in_organism zebrafish)‏ Homologous : frontal bone (zebrafish) and parietal bone (human)‏ f pa Different genes and developmental processes may underlie the development of the zebrafish frontal and the human frontal, even though they have the same name and are similarly located

How and where should homology information be captured? Use case: Query for phenotypes affecting the human frontal bone and its homologous structure in other species. Homologs = Synonyms

How should we not build multi- species anatomy ontologies ? It is not practical to write structural definitions which apply across many species and which apply to every structure in a homology group. e.g.- try to define parietal bone for all descendants of the common ancestor of zebrafish and human.

How should we not build multi- species anatomy ontologies ? Structural defs across many species may sometimes be impossible: Structure Z def: Anatomical entity with structural attributes, A, B, C and D. Structure X def: Anatomical entity with structural attributes C, D, E & F. evolved_from structure Z Structure Y def: Anatomical entity with structural attributes A, B, G & H. evolved_from structure Z

f pa A definition of homology X Y Where: All instances of structure X part_of some instance of species A All instances of structure Y part_of some instance of species B All instances of structure Z part_of some instance of species C X is homologous_to Y IFF All X derived_from some Z AND All Y derived_from some Z, AND C is the most recent common ancestor of A and B Z derived_from

Some term definitions forewing (MRCA Insecta)‏ def: Membranous dorsal appendage of adult mesothoracic segment of MRCA Insecta. hindwing (MRCA Insecta)‏ def: Membranous dorsal appendage of the adult metathoracic segment of MRCA Insecta. elytra (MRCA Coleoptera)‏ def: Hard, chitinous dorsal appendage of adult mesothoracic segment of MRCA Coleoptera. haltere (MRCA Diptera)‏ def: club shaped dorsal appendage of the metathoracic segment of MRCA Diptera. MRCA = most recent common ancestor species

appendage (CARO)‏.is_a wing of MRCA Insecta..is_a forewing of MRCA Insecta...derived_from wing of MRCA Diptera...derived_from elytra of MRCA Coleoptera..is_a hindwing of MRCA insecta...derived_from haltere of MRCA Diptera...derived_from wing (alae) of MRCA Coleoptera.is_a limb of MRCA tetrapoda..is_a forelimb of MRCA tetrapoda...derived_from wing of MRCA Aves...derived_from wing of MRCA Chiroptera* MRCA = most recent common ancestor species (*? perhaps should be derived_from autopod?)‏ A homology based multi-species anatomy ontology

Relations within species specific anatomy ontologies:.wing (Drosophila melanogaster)‏.derived from wing of Diptera MRCA.elytra (Tribolium casteneum)‏.derived_from elytra of coleoptera MRCA combined with:.forewing of MRCA Insecta...derived_from wing of MRCA Diptera...derived_from elytra of MRCA Coleoptera is sufficient to deduce that: elytra of Tribolium cateneum homologous_to wing of Drosophila melanogaster Reasoning homology

Acknowledgements Michael Ashburner CARO  Fabian Neuhaus (NIST)‏  Paula MaBee (U S.Dakota /cTOL)‏  Jose Mejino (U Washington, FMA)‏  Chris Mungall (Berkeley)‏  Melissa Haendel (U Oregon, ZFIN)‏  Barry Smith (Buffalo)‏ Funding  FlyBase; The Newton Trust

Terms in the Danio AO refer to many evolutionarily conserved structures.

Key point The names of terms in this anatomy ontology refer, in normal usage, to many conserved anatomical structures. But the terms themselves refer to whatever their definitions say they refer to - this is considerably more restrictive than normal usage.

Fabian’s idea wingforelimb limb-like thing derives_from  bat wing is_homologous to dog forelimb bats dogs Homology grouping limb-like thing --d-wing-bats --d-forelimb-dogs Question, what type of thing is a limb-like thing? forelimb – MRCA tetrapoda def: The forelimb of the most recent common ancestor of the tetrapoda. (attributes....)‏

Approaches to recording homology - relations 1. Capture pairwise X in species A homologous_to Y in species B Disadvantage – homology of X and Y to structures in species C, D, E F.... must be captured separately => v. many homology statements. 2. Capture with reference to some central multi-species ontology: X is_a fore-limb tetrapoda X derives_from fore-limb MRCA tetrapoda

Examples of term ambiguity: Early stages in spermatogenesis perineurial layer

David’s idea wingforelimb limb is_a  bat wing is_homologous to dog forelimb bats dogs Homology group ontology forelimb-tetrapoda --is_a wing-bat --is_a forelimb-dog CARO/other structural ontology Cardinal organism part --is_a appendage tetrapoda forelimb – tetrapoda def: An anatomical structure derived from the forelimb of the most recent common ancestor of the tetrapoda (attributes...)‏

Fish group idea (may not be current)‏ wingforelimb limb bat wing is_homologous to dog forelimb as limb bat wing is_homologous to bird wing as limb bat wing NOT homologous_to bird wing as wing bats dogs bird wing homologous_to CARO/multi-species AO appendage --i-wing-bats --i-wing-birds --i-forelimb-dogs homologous_to at level of tetrapoda limb + evidence + citation tetrapoda

Approaches to capturing homology – data format. OBO 1.2 format does not allow references and evidence codes to be captured for relationships, although OBO 1.3 will. Therefore – there is a good case for capturing this information separately in a flat file (tsv) for now. This is what the nescent/cTOL project is doing. The same argument applies, no matter what the relation.