The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith.

Slides:



Advertisements
Similar presentations
Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry.
Advertisements

Species-Neutral vs. Multi-Species Ontologies Barry Smith.
On the Future of the NeuroBehavior Ontology and Its Relation to the Mental Functioning Ontology Barry Smith
Goal and Status of the OBO Foundry Barry Smith. 2 Semantic Web, Moby, wikis, crowd sourcing, NLP, etc.  let a million flowers (and weeds) bloom  to.
1 Introduction to Biomedical Ontology Barry Smith University at Buffalo
The Future of Health Information Barry Smith Ontology Research Group Center of Excellence in Bioinformatics and Life Sciences University at Buffalo ontology.buffalo.edu/smith.
1 The OBO Foundry Towards Gold Standard Terminology Resources in the Biomedical Domain Thomas Bittner (based on a presentation by Barry Smith)
1 Intelligence Ontology: A Strategy for the Future Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith
1 The OBO Foundry Barry Smith University at Buffalo
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
1 The OBO Foundry 2 A prospective standard designed to guarantee interoperability of ontologies from the very start (contrast.
Thomas Bittner and Barry Smith IFOMIS (Saarbrücken) Normalizing Medical Ontologies Using Basic Formal Ontology.
What is an ontology and Why should you care? Barry Smith with thanks to Jane Lomax, Gene Ontology Consortium 1.
Underlying Ontologies for Biomedical work - The Relation Ontology (RO) and Basic Formal Ontology (BFO) Thomas Bittner SUNY Buffalo
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
1 Introduction to Ontology Barry Smith
Room for Lunch: Arlington Room Room for Evening Reception: Grand Prairie Room.
Why a Credit Card Number is Not a Number Barry Smith 1.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
1 BIOLOGICAL DOMAIN ONTOLOGIES & BASIC FORMAL ONTOLOGY Barry Smith.
1 The OBO Foundry Barry Smith Center of Excellence in Bioinformatics & Life Sciences, University at Buffalo IFOMIS, Saarland University
CoE Ontology Research Group (ORG) Barry Smith Center of Excellence in Bioinformatics and Life Sciences Ontology Research Group Department of Philosophy.
How to Organize the World of Ontologies Barry Smith 1.
New York State Center of Excellence in Bioinformatics & Life Sciences Biomedical Ontology in Buffalo Part I: The Gene Ontology Barry Smith and Werner Ceusters.
The Core Infectious Disease Ontology. Purpose: To make infectious disease-relevant data deriving from different sources comparable and computable Across.
1 Part III.The OBO Foundry Project: Towards Scientific Standards and Principles-Based Coordination in Biomedical Ontology Development.
1 How Ontologies Create Research Communities Barry Smith
BFO and Disease Barry Smith Milan, September 4,
Medical Informatics Basics
Building the Ontology Landscape for Cancer Big Data Research Barry Smith May 12, 2015.
Limning the CTS Ontology Landscape Barry Smith 1.
2007 CDISC International Interchange Ontologies in Clinical Research: Representation of clinical research data in the framework of formal biomedical ontologies.
Stefan Schulz Medical Informatics Research Group
Ontology of Sensors: Some Examples from Biology
Ontological realism as a strategy for integrating ontologies Ontology Summit February 7, 2013 Barry Smith 1.
Intelligence Ontology A Strategy for the Future Barry Smith University at Buffalo
Resurrecting SOWG BS, Baltimore, CTS Ontology Workshop April
1 How Ontologies Create Research Communities Barry Smith University at Buffalo
Ontological Engineering Barry Smith Computers and Information in Engineering Conference, Buffalo August 19,
Towards a Top-Level Ontology for Molecular Biology Stefan Schulz 1, Elena Beisswanger 2, Udo Hahn 2, Joachim Wermter 2, 1 Freiburg University Hospital,
1 HL7 RIM Barry Smith
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
What is an ontology? Barry Smith 1.
Ontology of Disease and the OBO Foundry Chris Mungall NCBO GO Nov 2006.
Alan Ruttenberg PONS R&D Task force Alan Ruttenberg Science Commons.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
How to integrate data Barry Smith. The problem: many, many silos DoD spends more than $6B annually developing a portfolio of more than 2,000 business.
2 3 where in the body ? where in the cell ?
About ontologies Melissa Haendel. And who am I that I am giving you this talk? Melissa Haendel Anatomist, developmental neuroscientist, molecular biologist,
Ontology and the Semantic Web Barry Smith August 26,
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Need for common standard upper ontology
New Foundations for Social Ontology Barry Smith 1.
Introduction to Biomedical Ontology for Imaging Informatics Barry Smith, PhD, FACMI University at Buffalo May 11, 2015.
1 An Introduction to Ontology for Scientists Barry Smith University at Buffalo
Immunology Ontology Rho Meeting October 10, 2013.
OBO Foundry Principles BFO RO Barry Smith 1. OBO Foundry Principles  open  common formal language (OBO Format, OWL DL, CL)  commitment to collaboration.
Big Data that might benefit from ontology technology, but why this usually fails Barry Smith National Center for Ontological Research 1.
Basic Formal Ontology Barry Smith August 26, 2013.
Building Ontologies with Basic Formal Ontology Barry Smith May 27, 2015.
Upper Ontology Summit The BFO perspective Barry Smith Department of Philosophy, University at Buffalo National Center for Ontological Research National.
New York State Center of Excellence in Bioinformatics & Life Sciences R T U Ontological Realism and the Open Biomedical Ontologies Foundry Februari 25,
1 Standards and Ontology Barry Smith
Why do we need upper ontologies? What are their purported benefits?
Upper Ontologies for Specifying Context
OBO Foundry Update: April 2010
Presentation transcript:

The Problem of Reusability of Biomedical Data OBO Foundry & HL7 RIM Barry Smith

2 DCRI Project Goal to facilitate research in CV and TB by increasing the re-usability of data collected in the healthcare setting.

3 Knowledge Environments for Biomedical Research (KEBR) NIH Conference, December 11-12, 2006 Knowledge environments must be characterized by:  sustainability  adaptability  interoperability  evolvability

4 sustainability biologists have huge amounts of data, which they need to manage and make accessible have worked out a sustainable way of achieving this result

5 adaptability best achieved through modularity, each portion of the knowledge environment controlled by appropriate domain experts

6 interoperability the modules should use a common (simple) logic and a common, thoroughly- tested (simple) ontology - unification with a light touch

7 evolvability = change in light of scientific advance  knowledge environments must be tied to biological and clinical research must be able to evolve incrementally must must ensure backwards compatibility with legacy annotations

8 how do we make different sorts of data combinable in ways useful to the human beings who carry out research?

9 how was this problem solved in the years before computers? how did clinical researchers from different disciplines communicate? how did they learn to communicate?

10 Ontology-based methodology for clinical and translational research

11 through the basic biomedical sciences anatomy, physiology, biochemistry, histology,...

12 what is “metadata” ?? data models (HL7 RIM, BRIDG,...  UML) vs. ontologies How should we represent data? vs. How should we represent reality?

13

14 what cellular component? what molecular function? what biological process?

15 Gene Ontology

16 checking the ontology: everything can be traced back to instances in reality serotonin is_a biogenic amine every instance of serotonin is an instance of biogenic amine

17 Heparin therapy is_a written or spoken designation of a concept mouse is_a common name for the species mus musculus virus is_a environment ontology unclassified Lulworthiales is_a environmental samples

18 Logical power of the ontology Example: Ontologies facilitate grouping of annotations brain 20 hindbrain 15 rhombomere 10 Query brain without ontology 20 Query brain with ontology 45

19 CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy) Anatomical Entity (FMA, CARO) Organ Function (FMP, CPRO) Phenotypic Quality (PaTO) Organism-Level Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) Cellular Process (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) OBO Foundry coverage GRANULARITY RELATION TO TIME

20 Biorepository Ontology Chemical Entities of Biological Interest (ChEBI) Clinical Investigation Ontology (CIO) Common Anatomy Reference Ontology (CARO) Disease Ontology (DO) Foundational Model of Anatomy (FMA) Cell Ontology (CL) Gene Ontology (GO) Mosquito Anatomy Ontology (MAO) Ontology for Biomedical Investigations (OBI) Phenotypic Quality Ontology (PaTO) Plant Ontology (PO) Protein Ontology (PRO) Relation Ontology (RO) RNA Ontology (RnaO) Sequence Ontology (SO) Xenopus Anatomy Ontology (XAO) Zebrafish Anatomical Ontology (ZAO)

21 Chemical Entities of Biological Interest (ChEBI) Clinical Investigation Ontology (CIO) Common Anatomy Reference Ontology (CARO) Disease Ontology (DO) Foundational Model of Anatomy (FMA) Cell Ontology (CL) Gene Ontology (GO) Mosquito Anatomy Ontology (MAO) Ontology for Biomedical Investigations (OBI) Phenotypic Quality Ontology (PaTO) Plant Ontology (PO) Protein Ontology (PRO) Relation Ontology (RO) RNA Ontology (RnaO) Sequence Ontology (SO) Xenopus Anatomy Ontology (XAO) Zebrafish Anatomical Ontology (ZAO)

22 Biorepository Ontology Chemical Entities of Biological Interest (ChEBI) Clinical Investigation Ontology (CIO) Common Anatomy Reference Ontology (CARO) Disease Ontology (DO)  interoperation with SNOMED CT Foundational Model of Anatomy (FMA) Cell Ontology (CL) Gene Ontology (GO) Mosquito Anatomy Ontology (MAO) Ontology for Biomedical Investigations (OBI) Phenotypic Quality Ontology (PaTO)  signs and symptoms Plant Ontology (PO) Protein Ontology (PRO) Relation Ontology (RO) RNA Ontology (RnaO) Sequence Ontology (SO) Xenopus Anatomy Ontology (XAO) Zebrafish Anatomical Ontology (ZAO)

23 RELATION TO TIME GRANULARITY CONTINUANTOCCURRENT INDEPENDENTDEPENDENT ORGAN AND ORGANISM Organism (NCBI Taxonomy / placeholder) Anatomical Entity (FMA, CARO) Organ Function (placeholder) Phenotypic Quality (PaTO) Biological Process (GO) CELL AND CELLULAR COMPONENT Cell (CL) Cellular Component (FMA, GO) Cellular Function (GO) MOLECULE Molecule (ChEBI, SO, RnaO, PrO) Molecular Function (GO) Molecular Process (GO) Building out from the original GO

24 BFO Top-Level Ontology Continuant Occurrent (always dependent on one or more independent continuants) Independent Continuant Dependent Continuant

25 = A representation of top-level types Continuant Occurrent Independent Continuant Dependent Continuant cell component biological process molecular function biological process

26 Top-Level Ontology Continuant Occurrent Independent Continuant Dependent Continuant QualityFunction instances (in space and time)

27 BFO as organising structure

clinical medicine rooted in the basic biological sciences via high-quality controlled vocabularies

29 INDEPENDENT CONTINUANTS organism system organ organ part tissue cell acellular anatomical structure biological molecule genome DEPENDENT CONTINUANTS physiology (functions) pathology acute stage progressive stage resolution stage next step: create a repertoire of disease ontologies built out of OBO Foundry elements

30 Ontology for Acute Respiratory Distress Syndrome

31 Draft Ontology for Multiple Sclerosis what data do we have? what data do the others have? what data do we not have?

32 HL7

33 Schadow The RIM ‘defines the grammar of a language for information in healthcare’. ‘All data is in a form in which Entities (people, places, things: NOUNS) are related in Roles (RELATORS) to other Entities, and through their participations (PREPOSITIONS) interact in Acts (VERBS).’

34 Problems of scope Act = intentional action No processes (verb items) outside Act How can the RIM deal with disease processes, drug interactions, traffic accidents, adverse events?

35 Problems of scope Entity = persons, places, organizations, material No things (noun items) outside Entity How can the RIM deal with wounds, fractures,? How can the RIM deal with diseases?

36 Mayo on ‘Act’ as “intentional action” Is a snake bite or bee sting an intentional action? Is a knife stabbing an intentional action? Is a car accident an intentional action? When a child swallows the contents of a bottle of poison is that an intentional action? of_Observations

37 Diseases in the RIM... are not Acts... are not Entities... are not Roles, Participations, Role-Links... So what are they?

38 Correct Answer Diseases fall outside the scope of the RIM. The RIM is concerned to standardize the way in which data is represented in messages, etc. It is not concerned to standardize the way in which diseases, alleles, drug interactions, etc., are represented.

39 The RIM’s answer Diseases are Acts of Observation A case of pneumonia is an Act of Observation of a case of pneumonia A diagnosis is an Act of Observation of an Act of Observation

40 HL7’s Clinical Genomics Standard Specifications an individual allele as an Act of Observation a phenotype is an Act of Observation of an Act of Observation

41 Why is Act of Observation used for biomolecules but not for pharmaceutical products? Schadow: because biomolecules ‘are in the process of being discovered.’

42 HL7 Items in reality about which we have information that is known a priori are identified as Entities Items in reality about which we are still discovering information are identified as Acts of Observation. Gunther: ‘ What seems like an arbitrary split... is in fact the practical thing to do’ – it helps us to avoid ‘frequent modifications’

43 A disease is an “Act of concern” Dan Russler. 'In HL7, we assert that Peter has a UML symbol with an object instance identifier assigned to "his disease" that includes an instance identifier assigned to his "type of disease". Once consensus builds around the subject of concern, the patient can be said to "have" a specific disease. It can be said that "over time" the UML symbol instance identifier for the "concern" becomes the UML symbol instance identifier for the "disease."'

44 What should be done? Create a clinical ontology which allows adequate treatment of all the types of entity relevant to information exchange in biomedicine, including: non-intentional processes, diseases, infections, biomolecules, etc.

45 BFO top-Level Ontology Continuant Occurrent Independent Continuant Dependent Continuant Quality Function Act Physical Event...

46 RIM Ontology Continuant Occurrent Entity(Intentional) Act Person Physical Thing Organ- ization Bio- molecule Disease Drug interaction...

47 BFO normalized RIM Continuant Occurrent Independent Continuant Dependent Continuant Condition Act Physical Event Disease Request Observation Everything made of molecules Temperature Drug interaction

48 What is new Continuant Occurrent Independent Continuant Dependent Continuant Condition Act Physical Event Disease Request Observation Everything made of molecules Temperature Drug interaction

49 Coherent interoperation with ChEBI, PATO, SNOMED, MedDRA, etc. Continuant Occurrent Independent Continuant Dependent Continuant Condition Act Physical Event Disease Request Observation Everything made of molecules Temperature Drug interaction

50 ? ? ? ? ? ?

51 Draft Ontology for Multiple Sclerosis what data do we have? what data do the others have? what data do we not have?

52 Methodology of cross-products compound terms and definitions should be built out of constituent terms drawn from ontologies. E.g. PaTO increased concentration’ FMA ‘blood’ CheBI term ‘glucose’  blood glucose phenotypes. Foundry provides rigor for post-coordination Contributions to solving the silo problem

53 Open questions relations to generating forms controlled vocabulary for clinical care common data elements clinical trials treatments signs and symptoms (clinical and pre-clinical manifestations)

54 Open questions role of stakeholders professional society support champions who will test role of rare disease researcher communities mandates