Intelligence Ontology: A Strategy for the Future

Slides:



Advertisements
Similar presentations
Upper Ontology Summit Tuesday March 14 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center.
Advertisements

Upper Ontology Summit Wednesday March 15 The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National.
Species-Neutral vs. Multi-Species Ontologies Barry Smith.
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
Goal and Status of the OBO Foundry Barry Smith. 2 Semantic Web, Moby, wikis, crowd sourcing, NLP, etc.  let a million flowers (and weeds) bloom  to.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
The Environment Ontology Barry Smith 1.
1 Pax Terminologica Barry Smith Institute for Formal Ontology and Medical Information Science, Saarland University, Saarbrücken.
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
The Future of Health Information Barry Smith Ontology Research Group Center of Excellence in Bioinformatics and Life Sciences University at Buffalo ontology.buffalo.edu/smith.
1 The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
1 Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith
1 Workshop 7.00 Welcoming Remarks 7.15 Barry Smith (Buffalo, NY) 7.40 Lindsay Cowell (Duke University, NC) 8.05 Nigam Shah (Stanford University, CA) 8.30.
1 The OBO Foundry Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
National center for ontological research. Part One: The History of NCOR and ECOR Part Two: How to Establish JCOR: The Japanese Consortium.
Underlying Ontologies for Biomedical work - The Relation Ontology (RO) and Basic Formal Ontology (BFO) Thomas Bittner SUNY Buffalo
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
1 Introduction to Ontology Barry Smith
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
Why a Credit Card Number is Not a Number Barry Smith 1.
OBO-Foundry. OBO was conceived and announced in in october 2001 Michael Ashburner and Suzanna Lewis with acknowledgements of others in the GO.
CTO - Clinical Trials/Research in the Ontology of Biomedical Investigation Richard H. Scheuermann U.T. Southwestern Medical Center.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
1 The OBO Foundry Barry Smith Center of Excellence in Bioinformatics & Life Sciences, University at Buffalo IFOMIS, Saarland University
CoE Ontology Research Group (ORG) Barry Smith Center of Excellence in Bioinformatics and Life Sciences Ontology Research Group Department of Philosophy.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
Introduction to Ontologies for Environmental Biology Barry Smith
The OBO Foundry Chris Mungall Lawrence Berkeley Laboratory NCBO GO Consortium May 2007.
1 How Ontologies Create Research Communities Barry Smith
1 The Canonical Life Barry Smith
1 Ontology (Science) Barry Smith University at Buffalo
The OBO Foundry approach to ontologies and standards with special reference to cytokines Barry Smith ImmPort Science Talk / Discussion June 17, 2014.
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Intelligence Ontology A Strategy for the Future Barry Smith University at Buffalo
Why we need the OBO Core Michael Ashburner, Suzanna Lewis and Barry Smith.
Amo amos amot amomus amotis amont. Happy birthday Swiss-Prot Fortaleza August 2006.
Imports, MIREOT Contributors: Carlo Torniai, Melanie Courtot, Chris Mungall, Allen Xiang.
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
2 3 where in the body ? where in the cell ?
What is an ontology and Why should you care? Barry Smith 1.
Need for common standard upper ontology
Towards a Top-Domain Ontology for Linking Biomedical Ontologies Holger Stenzhorn a,b Elena Beißwanger c Stefan Schulz a a Department of Medical Informatics,
New Foundations for Social Ontology Barry Smith 1.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
1 Ontology (Science) vs. Ontology (Engineering) Barry Smith University at Buffalo
Big Data that might benefit from ontology technology, but why this usually fails Barry Smith National Center for Ontological Research 1.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Upper Ontology Summit The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center for Ontological Research National.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
1 Standards and Ontology Barry Smith
What is an ontology and Why should you care?
An Introduction to Ontology for Evolutionary Biology
Development of the Amphibian Anatomical Ontology
Ontology in 15 Minutes Barry Smith.
Why do we need upper ontologies? What are their purported benefits?
OBO Foundry Principles
IDEAS Core Model Concept
Ontology in 15 Minutes Barry Smith.
OBO Foundry Update: April 2010
Presentation transcript:

Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo http://ontology.buffalo.edu/smith

Semantic Web, wikis, statistical textmining, etc. let a million flowers bloom how create broad-coverage semantic annotation systems which will enable sharing of gigantic bodies of heterogeneous data?

let a million flowers (weeds) bloom how create broad-coverage semantic annotation systems which will enable sharing of gigantic bodies of heterogeneous data?

let a million microtheories bloom what about Cyc?

why does Cyc not do the job? #$Configuration   A specialization of both #$StaticSituation and #$SpatialThing-Localized. Each instance of #$Configuration is a static situation consisting of two or more #$PartiallyTangible things of certain types standing in a certain type of spatial relationship (or set of relationships). This (set of) spatial relationship(s) characterizes the #$Configuration's _type_ in the sense that any group of objects of the appropriate types standing in that relationship (or those relationships) correspond to a #$Configuration of that type; and each of these objects, in turn, is said to be configured (see #$objectConfigured) in the (individual) #$Configuration. why does Cyc not do the job?

why does Cyc not do the job? (speculations) Cyc doesn’t care about consistency between microtheories so no progressive cumulation from an established core too little concern for consistency with basic science (common sense should not wear the trousers) no perspicuous policies for updating built by outsiders why does Cyc not do the job?

Unified Medical Language System (National Library of Medicine) built by trained experts massively useful for information retrieval and information integration good versioning and term-ID policies creates out of literature a semantically searchable space Unified Medical Language System (National Library of Medicine)

for UMLS local usage respected regimentation frowned upon mappings between ‘synonyms’ full of noise is_synonymous_with is not transitive no cross-framework consistency no concern to establish consistency with basic science different grades of formal rigor, different degrees of completeness, different update policies for UMLS

with UMLS-based annotations we can know what data we have (via term searches) we can map between data at single granularities (via ‘synonyms’) how do we combine data across granularities? how do we resolve logical conflicts ? how do we know what data we don’t have ? how do we reason with data ? with UMLS-based annotations

no evolutionary path towards improvement with UMLS, Cyc, Web 2.0, ...

We will be able to use ontologies to help us share data only if the ontologies represent the world correctly are humanly intelligible and computationally tractable

a new approach prospective standardization based on objective measures of what works bring together selected influential groups to agree on good terminology / annotation habits preemptively

for science requirements ensure legacy annotation efforts not wasted create an evolutionary path towards improvement, of the sort we find in science must be a collaborative, community effort to ensure buy-in ensure future-proofing requirements

create a consensus core of interoperable domain ontologies for science create a consensus core of interoperable domain ontologies starting with low hanging fruit and working outwards from there built and validated by trained experts backed by persons of influence in different communities

for science geospatial transport religion weather bacteria chemicals politics law use common rules drawing on best practices for creating ontologies ... and for linking ontologies

for science geospatial transport religion weather bacteria chemicals politics law ... exploiting the division of labor ... relying on champions in dispersed communities to spread the words

ontology of documents ontology of provenance ontology of names ontology of numbers (IDs) ontology of signatures ontology of identity ...

that people should use the core to annotate their data

and set up feedback mechanisms annotators discover they need more terms more relations between terms to correct existing relations the ontology gets better as it is used

This process leads to improvements and extensions of the ontology which in turn leads to better annotations  a virtuous cycle of improvement in the quality and reach of both future annotations and the ontology itself This process

a growing computer-interpretable map of reality within which major databases are automatically integrated in semantically searchable form The result

This solution is already being implemented in the domain of biomedicine

create a shared portal (low regimentation) First step (2003) create a shared portal (low regimentation) http://obo.sourceforge.net

Second step (2004) reform efforts initiated to link OBO ontologies together and to ensure orthogonality

The OBO Foundry http://obofoundry.org/ Third step (2006) The OBO Foundry http://obofoundry.org/ Why suddenly the switch from deep-hell black background to heavenly white ? Is that an intended connotation or just a copy and paste coincidence?

some groups commit to working together and to following common rules

a family of interoperable gold standard biomedical reference ontologies to serve the annotation of scientific literature model organism databases clinical data experimental results The OBO Foundry

A subset of OBO ontologies, whose developers have agreed in advance to accept a common set of principles designed to ensure tight connection to the biomedical basic sciences compatibility interoperability formal robustness support for logic-based reasoning

A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast to: post hoc mapping) established March 2006 12 initial candidate OBO ontologies – focused primarily on basic science domains several being constructed ab initio A prospective standard

Ontology Scope URL Custodians Cell Ontology (CL) cell types from prokaryotes to mammals obo.sourceforge.net/cgi- bin/detail.cgi?cell Jonathan Bard, Michael Ashburner, Oliver Hofman Chemical Entities of Bio- logical Interest (ChEBI) molecular entities ebi.ac.uk/chebi Paula Dematos, Rafael Alcantara Common Anatomy Refer- ence Ontology (CARO) anatomical structures in human and model organisms (under development) Melissa Haendel, Terry Hayamizu, Cornelius Rosse, David Sutherland, Foundational Model of Anatomy (FMA) structure of the human body fma.biostr.washington. edu JLV Mejino Jr., Cornelius Rosse Functional Genomics Investigation Ontology (FuGO) design, protocol, data instrumentation, and analysis fugo.sf.net FuGO Working Group Gene Ontology (GO) cellular components, molecular functions, biological processes www.geneontology.org Gene Ontology Consortium Phenotypic Quality (PaTO) qualities of anatomical structures obo.sourceforge.net/cgi -bin/ detail.cgi? attribute_and_value Michael Ashburner, Suzanna Lewis, Georgios Gkoutos Protein Ontology (PrO) protein types and modifications Protein Ontology Consortium Relation Ontology (RO) relations obo.sf.net/relationship Barry Smith, Chris Mungall RNA Ontology (RnaO) three-dimensional RNA RNA Ontology Consortium Sequence Ontology (SO) properties and features of nucleic sequences song.sf.net Karen Eilbeck

Foundry communities include Transcriptomics (MIAME Working Group) Proteomics (Proteomics Standards Initiative) Metabolomics (Metabolomics Standards Initiative) Genomics and Metagenomics (Genomic Standards Consortium) In Situ Hybridization and Immunohistochemistry (MISFISHIE Working Group) Phylogenetics (Phylogenetics Community) RNA Interference (RNAi Community) Toxicogenomics (Toxicogenomics WG) Environmental Genomics (Environmental Genomics WG) Nutrigenomics (Nutrigenomics WG) Flow Cytometry (Flow Cytometry Community) Foundry communities include

Organism-Level Process CONTINUANT OCCURRENT INDEPENDENT DEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Organism-Level Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function Cellular Process MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function Molecular Process RELATION TO TIME GRANULARITY OBO Foundry coverage (canonical ontologies)

CRITERIA The ontology is in, or can be instantiated in, a common formal language. The developers of the ontology agree in advance to collaborate with developers of other OBO Foundry ontology where domains overlap. The ontology should be useful (have a plurality of user communities). CRITERIA

UPDATE: The developers of each ontology commit to its maintenance in light of scientific advance, and to soliciting community feedback for its improvement. ORTHOGONALITY: They commit to working with other Foundry members to ensure that, for any particular domain, there is community convergence on a single controlled vocabulary. CRITERIA

for science ORTHOGONALITY when communities work together to ensure consistency  orthogonality  additivity of annotation frameworks ADDITIVITY: if we annotate a database or body of literature with terms from one high-quality ontology, we should be able to add annotations from a second such ontology without conflicts ORTHOGONALITY

CRITERIA IDENTIFIERS: The ontology possesses a unique identifier space within OBO. VERSIONING: The ontology provider has procedures for identifying distinct successive versions. The ontology includes textual definitions for all terms. CRITERIA

COMMON ARCHITECTURE: The ontology uses relations which are unambiguously defined following the pattern of definitions laid down in the OBO Relation Ontology.* * Smith et al., Genome Biology 2005, 6:R46 CRITERIA

OBO Relation Ontology Foundational Spatial Temporal Participation is_a part_of Spatial located_in contained_in adjacent_to Temporal transformation_of derives_from preceded_by Participation has_participant has_agent

Create an OIC portal: a list of those ontologies and related artifacts which already exist 2. Find out which groups of ontology developers are willing to commit to working towards interoperability

3. Work with these groups in open, on-line and face-to-face, discussions, the records of which are made available on the OIC portal 4. Move towards a suite of authoritative ontologies, one for each domain – stable attractors 5. Make funding depend on use of authoritative ontologies – because these have been shown to work

what is a question? a representation of reality with a hole in it

They will provide a representation of the context of reality within which the holes can appear

The OIC Foundry ontologies will be stable, maximally open resources They will be authoritative in light of NCOR’s evaluation measures Will not reveal sensitive capabilities and interests. They will deal with types (of mobile telephone, of spores, of soil ...) Not with instances

(from Jen Williams, OWI) Some problems (from Jen Williams, OWI)

Do we have an instruction manual ?

create a simple top level framework http://ontologist.com Ontology: An Introduction http://www.ifomis.org/bfo

Too few knowledgeable folks, and fewer cleared. Computer scientists are teaching people ontology tools and ... Mohammed is_a string Amount of money is_a integer

what we need Training events (summer school ...) to teach people to CREATE ONTOLOGY CONTENT to teach people to USE ONTOLOGY CONTENT

Joining up will diminish your fiefdom   If you give everyone the keys to you kingdom, how will you justify your budget? We can do this already, as a start, with open source resources, plus a few brave champions of good annotation habits Later we humiliate those who do not join in

People use different tools format/language of the ontology is not easy to understand the OIC Foundry should use ontologies which are maximally format- and language-neutral

ncor@buffalo.edu we are to enable sharing of gigantic bodies of heterogeneous data we need all the help we can get our computers need all the help we can get