Presentation is loading. Please wait.

Presentation is loading. Please wait.

Ontology and the Future of Biomedical Research Barry Smith

Similar presentations


Presentation on theme: "Ontology and the Future of Biomedical Research Barry Smith"— Presentation transcript:

1 Ontology and the Future of Biomedical Research Barry Smith http://ifomis.org

2 Institute for Formal Ontology and Medical Information Science Saarland University

3 From chromosome to disease

4 Problem: how to reason with data deriving from different sources, each of which uses its own system of classification ?

5 Solution: Ontology !

6 Examples of current needs for ontologies in biomedicine –to enforce semantic consistency within a database –to enable data sharing and re-use –to enable data integration (bridging across data at multiple granularities) –to allow querying

7 What is needed strong general purpose classification hierarchies created by domain specialists clear, rigorous definitions thoroughly tested in real use cases updated in light of scientific advance

8 The actuality (too often) myriad special purpose ‘light’ ontologies, prepared by ontology engineers and deposited in internet ‘repositories’ or ‘registries’

9 ontologies for ‘agent’

10

11 General trend on the part of NIH, FDA and other bodies to consolidate ontology-based standards for the communication and processing of biomedical data.

12 Responses to this trend Old: UMLS (Unified Medical Language System) – rooted in the faithfulness to the ways language is used by different medical communities

13 SNOMED DEMONS U M L S

14 –congenital absent nipple is_a nipple –cancer documentation is_a cancer –disease prevention is_a disease – repair and maintenance of wheelchair is_a disease – water is_a nursing phenomenon – part-whole =def. a nursing phenomenon with topology part-whole U M L S

15 MeSH MeSH Descriptors Index Medicus Descriptor Anthropology, Education, Sociology and Social Phenomena (MeSH Category) Social Sciences Political Systems National Socialism

16 MeSH National Socialism is_a Political Systems National Socialism is_a Anthropology... National Socialism is_a Social Sciences National Socialism is_a MeSH Descriptors

17 New: Semantic Web deposits Pet Profile Ontology Review Vocabulary Band Description Vocabulary Musical Baton Vocabulary MusicBrainz Metadata Vocabulary Kissology

18 http://www.w3.org/ Beer Ontology  all instances of hops that have ever existed are necessarily ingredients of beer.

19 some nice computational resources, but low expressivity and few genuinely scientific demonstration cases OWL-based ontologies …

20 OWL’s syntactic regimentation is not enough to ensure high-quality ontologies – the use of a common syntax and logical machinery and the careful separating out of ontologies into namespaces does not solve the problem of ontology integration

21 Both UMLS- and OWL-type responses involve ad hoc creation of new terminologies by each community Many of these terminologies remain as torsos, gather dust, poison the wells,...

22 How to do better? How to create the conditions for a step-by-step evolution towards high quality ontologies in the biomedical domain which will serve as stable attractors for clinical and biomedical researchers in the future?

23 A basic distinction type vs. instance science text vs. clinical document dog vs. Fido

24 Instances are not represented in an ontology built for scientific purposes It is the generalizations that are important (but instances must still be taken into account)

25 A515287DC3300 Dust Collector Fan B521683Gilmer Belt C521682Motor Drive Belt Catalog vs. inventory

26 Ontology Types Instances

27 Ontology = A Representation of Types

28 Each node of an ontology consists of: preferred term (aka term) term identifier (TUI, aka CUI) synonyms definition, glosses, comments

29 Each term in an ontology represents exactly one type hence ontology terms should be singular nouns National Socialism is_a Political Systems

30 An ontology is a representation of types We learn about types in reality from looking at the results of scientific experiments in the form of scientific theories – which describe not what is particular in reality but rather what is general Ontologies need to exploit the evolutionary path to convergence created by science

31 High quality shared ontologies build communities NIH, FDA trend to consolidate ontology- based standards for the communication and processing of biomedical data. caBIG / NECTAR / BIRN / BRIDG...

32 http://obo.sourceforge.net

33 http://www.geneontology.org/

34

35

36 The Methodology of Annotations GO employs scientific curators, who use experimental observations reported in the biomedical literature to link gene products with GO terms in annotations. This gene product exercises this function, in this part of the cell, leading to these biological processes

37 The Methodology of Annotations This process of annotating literature leads to improvements and extensions of the ontology, which in turn leads to better annotations This institutes a virtuous cycle of improvement in the quality and reach of both future annotations and the ontology itself. Annotations + ontology taken together yield a slowly growing computer-interpretable map of biological reality.

38 The OBO Foundry

39 A subset of OBO ontologies, whose developers have agreed in advance to accept a common set of principles designed to ensure –intelligibility to biologists (curators, annotators, users) –formal robustness –stability –compatibility –interoperability –support for logic-based reasoning The OBO Foundry

40 Custodians Michael Ashburner (Cambridge) Suzanna Lewis (Berkeley) Barry Smith (Buffalo/Saarbrücken) The OBO Foundry

41 A collaborative experiment participants have agreed in advance to a growing set of principles specifying best practices in ontology development designed to guarantee interoperability of ontologies from the very start The OBO Foundry

42 The developers of each ontology commit to its maintenance in light of scientific advance, and to soliciting community feedback for its improvement. They commit to working with other Foundry members to ensure that, for any particular domain, there is community convergence on a single reference ontology. The OBO Foundry

43 Initial Candidate Members of the OBO Foundry –GO Gene Ontology –CL Cell Ontology –SO Sequence Ontology –ChEBI Chemical Ontology –PATO Phenotype Ontology –FuGO Functional Genomics Investigation Ontology –FMA Foundational Model of Anatomy –RO Relation Ontology The OBO Foundry

44 Under development – Disease Ontology –NCI Thesaurus –Mammalian Phenotype Ontology –OBO-UBO / Ontology of Biomedical Reality –Organism (Species) Ontology –Plant Trait Ontology –Protein Ontology –RnaO RNA Ontology The OBO Foundry

45 Considered for development –Environment Ontology –Behavior Ontology –Biomedical Image Ontology –Clinical Trial Ontology The OBO Foundry

46 CRITERIA The OBO Foundry The ontology is open and available to be used by all. The developers of the ontology agree in advance to collaborate with developers of other OBO Foundry ontology where domains overlap. The ontology is in, or can be instantiated in, a common formal language.

47 The ontology possesses a unique identifier space within OBO. The ontology provider has procedures for identifying distinct successive versions. The ontology includes textual definitions for all terms. CRITERIA The OBO Foundry

48 The ontology has a clearly specified and clearly delineated content. The ontology is well-documented. The ontology has a plurality of independent users. CRITERIA The OBO Foundry

49 The ontology uses relations which are unambiguously defined following the pattern of definitions laid down in the OBO Relation Ontology.* *Genome Biology 2005, 6:R46 CRITERIA The OBO Foundry

50 CRITERIA Further criteria will be added over time in order to bring about a gradual improvement in the quality of the ontologies in the Foundry The OBO Foundry

51 A reference ontology is analogous to a scientific theory; it seeks to optimize representational adequacy to its subject matter to the maximal degree that is compatible with the constraints of computational usefulness.

52 An application ontology is comparable to an engineering artifact such as a software tool. It is constructed for a specific practical purpose. Examples: National Cancer Institute Thesaurus FuGO Functional Genomics Investigation Ontology

53 Reference Ontology vs. Application Ontology Currently, application ontologies are often built afresh for each new task; commonly introducing not only idiosyncrasies of format or logic, but also simplifications or distortions of their subject- matters. To solve this problem application ontology development should take place always against the background of a formally robust reference ontology framework

54 Advantages of the methodology of shared coherently defined ontologies promotes quality assurance (better coding) guarantees automatic reasoning across ontologies and across data at different granularities yields direct connection to temporally indexed instance data

55 Advantages of the methodology of shared coherently defined ontologies We know that high-quality ontologies can help in creating better mappings e.g. between human and model organism phenotypes S Zhang, O Bodenreider, “Alignment of Multiple Ontologies of Anatomy: Deriving Indirect Mappings from Direct Mappings to a Reference Ontology”, AMIA 2005

56 Advantages of the methodology of shared coherently defined ontologies once the interoperable gold standard reference ontologies are there, it will make sense to reformulate parts of existing incompatible terminologies (e.g. in UMLS) in terms of the standard ontologies in order to achieve greater domain coverage and alignment of different but veridical views. Thus not everything that was done in the past turns out to be a waste.

57 Goal: to create a family of gold standard reference ontologies upon which terminologies developed for specific applications can draw The OBO Foundry

58 Goal: to introduce the scientific method into ontology development: –all Foundry ontologies must be constantly updated in light of scientific advance –all Foundry ontology developers must work with all other Foundry ontology developers in a spirit of scientific collaboration The OBO Foundry

59 Goal: to replace the current policy of ad hoc creation of new database schemas by each clinical research group by providing reference ontologies in terms of which database schemas can be defined The OBO Foundry

60 Goal: to introduce some of the features of scientific peer review into biomedical ontology development The OBO Foundry

61 Goal: to create controlled vocabularies for use by clinical trial banks, clinical guidelines bodies, scientific journals,... The OBO Foundry

62 Goal: to create controlled vocabularies for use by clinical trial banks, clinical guidelines bodies, scientific journals,... The OBO Foundry

63 Goal: to create an evolving map-like representation of the entire domain of biological reality The OBO Foundry

64 GO’s three ontologies molecular function cellular component biological process

65 cell (types) molecular function (GO) species molecular process cellular anatomy anatomy (fly, fish, human...) cellular physiology organism-level physiology ChEBI, Sequence, RNA...

66 cell (types) molecula r function (GO) species molecula r process cellular anatomy anatomy (fly, fish, human...) cellular physiology organism-level physiology ChEBI, Sequence, RNA... normal (functionings)

67 pathophysiology (disease) pathoanatomy (fly, fish, human...) pathological (malfunctionings)

68 cell (types) molecula r function (GO) species molecula r process cellular anatomy (GO) anatomy (fly, fish, human...) cellular physiology organism-level physiology ChEBI, Sequence, RNA... pathophysiology (disease) pathoanatomy (fly, fish, human...)

69 cell (types) molecula r function (GO) species molecula r process cellular anatomy anatomy (fly, fish, human...) cellular physiology organism-level physiology ChEBI, Sequence, RNA... pathophysiology (disease) pathoanatomy (fly, fish, human...) phenotype

70 cell (types) molecula r function (GO) species molecula r process cellular anatomy anatomy (fly, fish, human...) cellular physiology organism-level physiology ChEBI, Sequence, RNA... pathophysiology (disease) pathoanatomy (fly, fish, human...) phenotype investigation (FuGO)

71 Ende

72 First step Alignment of OBO Foundry ontologies through a common system of formally defined relations in the OBO Relation Ontology See “Relations in Biomedical Ontologies”, Genome Biology Apr. 2005

73 Judith Blake: “The use of bio-ontologies … ensures consistency of data curation, supports extensive data integration, and enables robust exchange of information between heterogeneous informatics systems... ontologies … formally define relationships between the concepts.”

74 "Gene Ontology: Tool for the Unification of Biology" an ontology "comprises a set of well-defined terms with well-defined relationships" (Ashburner et al., 2000, p. 27)

75 is_a (sensu UMLS) A is_a B =def ‘A ’ is narrower in meaning than ‘B ’ grows out of the heritage of dictionaries (which ignore the basic distinction between types and instances)

76 is_a congenital absent nipple is_a nipple cancer documentation is_a cancer disease prevention is_a disease Nazism is_a social science

77 is_a (sensu logic) A is_a B =def For all x, if x instance_of A then x instance_of B cell division is_a biological process adult is_a child ???

78 Two kinds of entities occurrents (processes, events, happenings) cell division, ovulation, death continuants (objects, qualities,...) cell, ovum, organism, temperature of organism,...

79 is_a (for occurrents) A is_a B =def For all x, if x instance_of A then x instance_of B cell division is_a biological process

80 is_a (for continuants) A is_a B =def For all x, t if x instance_of A at t then x instance_of B at t abnormal cell is_a cell adult human is_a human but not: adult is_a child

81 Part_of as a relation between types is more problematic than is standardly supposed heart part_of human being ? human heart part_of human being ? human being has_part human testis ? human testis part_of human being ?

82 two kinds of parthood 1.between instances: Mary’s heart part_of Mary this nucleus part_of this cell 2.between types human heart part_of human cell nucleus part_of cell

83 Definition of part_of as a relation between types A part_of B =Def all instances of A are instance-level parts of some instance of B ALL–SOME STRUCTURE

84 part_of (for occurrents) A part_of B =Def For all x, if x instance_of A then there is some y, y instance_of B and x part_of y where ‘part_of’ is the instance-level part relation

85 part_of (for continuants) A part_of B =def. For all x, t if x instance_of A at t then there is some y, y instance_of B at t and x part_of y where ‘part_of’ is the instance-level part relation ALL-SOME STRUCTURE

86 How to use the OBO Relation Ontology Ontologies are representations of types and of the relations between types The definitions of these relations involve reference to times and instances, but these references are washed out when we get to the assertions (edges) in the ontology But curators should still be aware of the underlying definitions when formulating such assertions

87 part_of (for occurrents) A part_of B =Def For all x, if x instance_of A then there is some y, y instance_of B and x part_of y where ‘part_of’ is the instance-level part relation

88 A part_of B, B part_of C... The all-some structure of such definitions allows cascading of inferences (true path rule) (i) within ontologies (ii) between ontologies (iii) between ontologies and repositories of instance-data

89 Strengthened true path rule Whichever A you choose, the instance of B of which it is a part will be included in some C, which will include as part also the A with which you began The same principle applies to the other relations in the OBO-RO: located_at, transformation_of, derived_from, adjacent_to, etc.

90 Kinds of relations Between types: –is_a, part_of,... Between an instance and a type –this explosion instance_of the type explosion Between instances: –Mary’s heart part_of Mary

91 In every ontology some terms and some relations are primitive = they cannot be defined (on pain of infinite regress) Examples of primitive relations: –identity –instantiation –(instance-level) part_of –(instance-level) continuous_with

92 Fiat and bona fide boundaries

93 Continuity Attachment Adjacency

94 everything here is an independent continuant

95 structures vs. formations = bona fide vs. fiat boundaries

96 Modes of Connection The body is a highly connected entity. Exceptions: cells floating free in blood.

97 Modes of Connection Modes of connection: attached_to (muscle to bone) synapsed_with (nerve to nerve, nerve to muscle) continuous_with (= share a fiat boundary)

98 articular eminencearticular (glenoid)fossa ANTERIOR Attachment, location, containment

99 Containment involves relation to a hole or cavity 1: cavity 2: tunnel, conduit (artery) 3: mouth; a snail’s shell

100 Fiat vs. Bona Fide Boundaries

101 Double Hole Structure Medium (filling the environing hole) Tenant (occupying the central hole) Retainer (a boundary of some surrounding structure)

102 head of condyle neck of condyle fossa fiat boundary THE TEMPOROMANDIBULAR JOINT

103 continuous_with (a relation between instances which share a fiat boundary) is always symmetric: if x continuous_with y, then y continuous_with x

104 continuous_with (relation between types) A continuous_with B =Def. for all x, if x instance-of A then there is some y such that y instance_of B and x continuous_with y

105 continuous_with is not always symmetric Consider lymph node and lymphatic vessel: Each lymph node is continuous with some lymphatic vessel, but there are lymphatic vessels (e.g. lymphs and lymphatic trunks) which are not continuous with any lymph nodes

106 Adjacent_to as a relation between types is not symmetric Consider seminal vesicle adjacent_to urinary bladder Not: urinary bladder adjacent_to seminal vesicle

107 instance level this nucleus is adjacent to this cytoplasm implies: this cytoplasm is adjacent to this nucleus type level nucleus adjacent_to cytoplasm Not: cytoplasm adjacent_to nucleus

108 Applications Expectations of symmetry e.g. for protein- protein interactions may hold only at the instance level if A interacts with B, it does not follow that B interacts with A if A is expressed simultaneously with B, it does not follow that B is expressed simultaneously with A

109 c at t 1 C c at t C 1 time same instance transformation_of pre-RNAmature RNA adultchild

110 transformation_of A transformation_of B =Def. Every instance of A was at some earlier time an instance of B adult transformation_of child

111 C c at t c at t 1 C 1 tumor development

112 C c at t C 1 c 1 at t 1 C' c' at t time instances zygote derives_from ovum sperm derives_from

113 two continuants fuse to form a new continuant C c at t C 1 c 1 at t 1 C' c' at t fusion

114 one initial continuant is replaced by two successor continuants C c at t C 1 c 1 at t 1 C 2 c 1 at t 1 fission

115 one continuant detaches itself from an initial continuant, which itself continues to exist C c at t c at t 1 C 1 c 1 at t budding

116 one continuant absorbs a second continuant while itself continuing to exist C c at t c at t 1 C' c' at t capture

117 A suite of defined relations between types Foundationalis_a part_of Spatiallocated_in contained_in adjacent_to Temporaltransformation_of derives_from preceded_by Participationhas_participant has_agent

118 To be added to the Relation Ontology lacks (between an instance and a type, e.g. this fly lacks wings) dependent_on (between a dependent entity and its carrier or bearer) quality_of (between a dependent and an independent continuant) functioning_of (between a process and an independent continuant)

119 Low Hanging Fruit Ontologies should include only those relational assertions which hold universally (= have the ALL-SOME form) Often, order will matter here: We can include adult transformation_of child but not child transforms_into adult

120 The Gene Ontology

121 GO’s three ontologies molecular functions cellular components biological processes

122 When a gene is identified three types of questions need to be addressed: 1. Where is it located in the cell? 2. What functions does it have on the molecular level? 3. To what biological processes do these functions contribute?

123 Three granularities: Cellular (for components) Molecular (for functions) Organ + organism (for processes)

124 GO has cells but it does not include terms for molecules or organisms within any of its three ontologies except e.g. GO:0018995 host =Def. Any organism in which another organism spends part or all of its life cycle

125 Are the relations between functions and processes a matter of granularity? Molecular activities are the ‘building blocks’ of biological processes ? But they are not allowed to be represented in GO as parts of biological processes

126 GO’s three ontologies molecular functions cellular components biological processes

127 What does “function” mean? an entity has a biological function if and only if it is part of an organism and has a disposition to act reliably in such a way as to contribute to the organism’s survival the function is this disposition

128 Improved version an entity has a biological function if and only if it is part of an organism and has a disposition to act reliably in such a way as to contribute to the organism’s realization of the canonical life plan for an organism of that type

129 This canonical life plan might include canonical embryological development canonical growth canonical reproduction canonical aging canonical death

130 The function of the heart is to pump blood Not every activity (process) in an organism is the exercise of a function – there are – mal functionings – side-effects (heart beating) –accidents (external interference) – background stochastic activity

131 Kidney

132 Nephron

133 Functional Segments

134 Functions

135 This is a screwdriver This is a good screwdriver This is a broken screwdriver This is a heart This is a healthy heart This is an unhealthy heart

136 Functions are associated with certain characteristic process shapes Screwdriver: rotates and simultaneously moves forward simultaneously transferring torque from hand and arm to screw Heart: performs a contracting movement inwards and an expanding movement outwards

137 Not functioning at all leads to death, modulo internal factors: plasticity redundancy (2 kidneys) criticality of the system involved external factors: prosthesis (dialysis machines, oxygen tent) special environments assistance from other organisms

138 What clinical medicine is for to eliminate malfunctioning by fixing broken body parts (or to prevent the appearance of malfunctioning by intervening e.g. at the molecular level)

139 Hypothesis: there are no ‘bad’ functions It is not the function of an oncogene to cause cancer Oncogenes were in every case proto- oncogenes with functions of their own They become oncogenes because of bad (non-prototypical) environments

140 Is there an exception for molecular functions? Does this apply only to functions on biological levels of granularity (= levels of granularity coarser than the molecule) ? If pathology is the deviation from (normal) functioning, does it make sense to talk of a pathological molecule? (Pathologically functioning molecule vs. pathologically structured molecule)

141 Is there an exception for molecular functions? A molecular function is a propensity of a gene product instance to perform actions on the molecular level of granularity. Hypothesis 1: these actions must be reliably such as to contribute to biological processes. Hypothesis 2: these actions must be reliably such as to contribute to the organism’s realization of the canonical life plan for an organism of that type.

142 The Gene Ontology is a canonical ontology – it represents only what is normal in the realm of molecular functioning

143 The GO is a canonical representation “The Gene Ontology is a computational representation of the ways in which gene products normally function in the biological realm” Nucl. Acids Res. 2006: 34.

144 The FMA is a canonical representation It is a computational representation of types and relations between types deduced from the qualitative observations of the normal human body, which have been refined and sanctioned by successive generations of anatomists and presented in textbooks and atlases of structural anatomy.

145 The importance of pathways (successive causality) Each stage in the history of a disease presupposes the earlier stages Therefore need to reason across time, tracking the order of events in time, using relations such as derives_from, transformation_of... Need pathway ontologies on every level of granularity

146 The importance of granularity (simultaneous causality) Networks are continuants At any given time there are networks existing in the organism at different levels of granularity Changes in one cause simultaneous changes in all the others (Compare Boyle’s law: a rise in temperature causes a simultaneous increase in pressure)

147 The Granularity Gulf most existing data-sources are of fixed, single granularity many (all?) clinical phenomena cross granularities Therefore need to reason across time, tracking the order of events in time

148 Good ontologies require: consistent use of terms, supported by logically coherent (non-circular) definitions, in equivalent human-readable and computable formats coherent shared treatment of relations to allow cascading inference both within and between ontologies

149 Three fundamental dichotomies continuants vs. occurrents dependent vs. independent types vs. instances

150 ONTOLOGIES ARE REPRESENTATIONS OF TYPES aka kinds, universals, categories, species, genera,...

151 Continuants (aka endurants) –have continuous existence in time –preserve their identity through change –exist in toto whenever they exist at all Occurrents (aka processes) –have temporal parts –unfold themselves in successive phases –exist only in their phases

152 You are a continuant Your life is an occurrent You are 3-dimensional Your life is 4-dimensional

153 Dependent entities require independent continuants as their bearers There is no run without a runner There is no grin without a cat

154 Dependent vs. independent continuants Independent continuants (organisms, cells, molecules, environments) Dependent continuants (qualities, shapes, roles, propensities, functions)

155 All occurrents are dependent entities They are dependent on those independent continuants which are their participants (agents, patients, media...)

156 Top-Level Ontology Continuant Occurrent (always dependent on one or more independent continuants) Independent Continuant Dependent Continuant

157 = A representation of top-level types Continuant Occurrent Independent Continuant Dependent Continuant cell component biological process molecular function

158 Top-Level Ontology Continuant Occurrent Independent Continuant Dependent Continuant Functioning Side-Effect, Stochastic Process,... Function

159 Top-Level Ontology Continuant Occurrent Independent Continuant Dependent Continuant Functioning Side-Effect, Stochastic Process,... Function

160 Top-Level Ontology Continuant Occurrent Independent Continuant Dependent Continuant Quality Function Spatial Region Functioning Side-Effect, Stochastic Process,... instances (in space and time)

161 Smith B, Ceusters W, Kumar A, Rosse C. On Carcinomas and Other Pathological Entities, Comp Functional Genomics, Apr. 2006

162 everything here is an independent continuant

163 Functions, etc. Some dependent continuants are realizable expression of a gene application of a therapy course of a disease execution of an algorithm realization of a protocol

164 Functions vs Functionings the function of your heart = to pump blood in your body this function is realized in processes of pumping blood not all functions are realized (consider the function of this sperm...)

165 Concepts Biomedical ontology integration will never be achieved through integration of meanings or concepts The problem is precisely that different user communities use different concepts Concepts are in your head and will change as your understanding changes

166 Concepts Ontologies represent types: not concepts, meanings, ideas... Types exist, with their instances, in objective reality – including types of image, of imaging process, of brain region, of clinical procedure, etc.

167 Rules on types Don’t confuse types with words Don’t confuse types with concepts Don’t confuse types with ways of getting to know types Don’t confuse types with ways of talking about types Don’t confuses types with data about types

168 Some other simple rules for high quality ontologies

169 Univocity Terms should have the same meanings on every occasion of use. They should refer to the same kinds of entities in reality Basic ontological relations such as is_a and part_of should be used in the same way by all ontologies

170 Positivity Complements of types are not themselves types. Hence terms such as non-mammal non-membrane other metalworker in New Zealand do not designate types in reality

171 Ontology of types  logic of terms There are no conjunctive and disjunctive types: anatomic structure, system, or substance musculoskeletal and connective tissue disorder rheumatism, excluding the back

172 Objectivity Which types exist in reality is not a function of our knowledge. Terms such as unknown unclassified unlocalized arthropathies not otherwise specified do not designate types in reality.

173 Keep Epistemology Separate from Ontology If you want to say that We do not know where A’s are located do not invent a new class of A’s with unknown locations (A well-constructed ontology should grow linearly; it should not need to delete classes or relations because of increases in knowledge)

174 Syntactic Separateness Do not confuse sentences with terms If you want to say I surmise that this is a case of pneumonia do not invent a new class of surmised pneumonias

175 Single Inheritance No kind in a classificatory hierarchy should have more than one is_a parent on the immediate higher level

176 Multiple Inheritance thing car blue thing blue car is_a

177 Multiple Inheritance is a source of errors encourages laziness serves as obstacle to integration with neighboring ontologies hampers use of Aristotelian methodology for defining terms

178 Multiple Inheritance thing car blue thing blue car is_a 1 is_a 2

179 is_a Overloading The success of ontology alignment demands that ontological relations (is_a, part_of,...) have the same meanings in the different ontologies to be aligned.

180 Example: is_a is pressed into service by the GO to express location is-located-at and similar relations are expressed by creating special compound terms using: site of … … within … … in … extrinsic to … yielding associated errors

181 e.g. errors with ‘within’ lytic vacuole within a protein storage vacuole lytic vacuole within a protein storage vacuole is- a protein storage vacuole Compare: embryo within a uterus is-a uterus

182 similar problems with part_of extrinsic to membrane part_of membrane

183 Compositionality The meanings of compound terms should be determined 1. by the meanings of component terms together with 2. the rules governing syntax

184 Why do we need rules/standards for good ontology? Ontologies must be intelligible both to humans (for annotation and curation) and to machines (for reasoning and error-checking): the lack of rules for classification leads to human error and blocks automatic reasoning and error-checking Intuitive rules facilitate training of curators and annotators Common rules allow alignment with other ontologies

185 When we annotate the record of an experiment we use terms representing types to capture what we learn about: –this experiment (instance), performed here and now, in this laboratory –the instances experimented upon These instances are typical = they are representatives of types –of experiment (described in FuGO) –of gene product molecules, molecular functions, cellular components, biological processes (described in GO)

186 Experimental records document a variety of instances (particular real- world examples or cases), ranging from instances of gene products (including individual molecules) to instances of biochemical processes, molecular functions, and cellular locations

187 Experimental records provide evidence that gene products of given types have molecular functions of given types by documenting occurrences in the real world that involve corresponding instances of functioning. They document the existence of real-world molecules that have the potential to execute (carry out, realize, perform) the types of molecular functions that are involved in these occurrences.

188 Motivation: To capture reality Inferences and decisions we make are based upon what we know of reality. An ontology is a computable representation of biological reality, which is designed to enable a computer to reason over the data we collect about this reality in (some of) the ways that we do.


Download ppt "Ontology and the Future of Biomedical Research Barry Smith"

Similar presentations


Ads by Google