Ontology The science of the kinds and structures of objects, and their properties and relations. Defined by a scientific field's vocabulary and by the.

Slides:



Advertisements
Similar presentations
Three-Step Database Design
Advertisements

BIOLOGY- SEMESTER 1.
Knowledge Representation
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
Ontology From Wikipedia, the free encyclopedia In philosophy, ontology (from the Greek oν, genitive oντος: of being (part. of εiναι: to be) and –λογία:
What is Ontology? Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of the kinds.
Intelligent systems Lecture 6 Rules, Semantic nets.
ISBN Chapter 3 Describing Syntax and Semantics.
Application of OBO Foundry Principles in GO Chris Mungall Lawrence Berkeley Labs NCBO GO Consortium.
Using the Semantic Web to Construct an Ontology- Based Repository for Software Patterns Scott Henninger Computer Science and Engineering University of.
Gene Ontology John Pinney
OASIS Reference Model for Service Oriented Architecture 1.0
© Janice Regan, CMPT 102, Sept CMPT 102 Introduction to Scientific Computer Programming The software development method algorithms.
Human Evolution Session I Matter-Universe A multidisciplinary anthropic focus.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Biological Ontologies Neocles Leontis April 20, 2005.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
Internet tools for genomic analysis: part 2
What is an Ontology? AmphibiaTree 2006 Workshop Saturday 8:45–9:15 A. Maglia.
Describing Syntax and Semantics
PHY 042: Electricity and Magnetism
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Ontology Alignment/Matching Prafulla Palwe. Agenda ► Introduction  Being serious about the semantic web  Living with heterogeneity  Heterogeneity problem.
Building Biomedical Ontologies Jennifer Clark, GO Editorial Office.
Principles for Building Biomedical Ontologies Suzanna Lewis National Center Biomedical Ontology 22 October 2005 Advanced Bioinformatics, Cold Spring Harbor.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
Are We Ready to Implement the Common Core Standards in Mathematics ? Katy White based on a presentation by DR. WESLEY BIRD
Protege OWL Plugin Short Tutorial. OWL Usage The world wide web is a natural application area of ontologies, because ontologies could be used to describe.
GeoUML a conceptual data model for geographical data conformant to ISO TC 211 Main GeoUML constructs Alberto BelussiNovembre 2004.
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
6. Simple Features Specification Background information UML overview Simple features geometry.
Open Biomedical Ontologies. Open Biomedical Ontologies (OBO) An umbrella project for grouping different ontologies in biological/medical field –a repository.
Class 4: Part 1 Common Core State Standards (CCSS) Class April 4, 2011.
Biology and You Chapter 1. Objectives Relate the seven properties of life to a living organism Relate the seven properties of life to a living organism.
LOGIC AND ONTOLOGY Both logic and ontology are important areas of philosophy covering large, diverse, and active research projects. These two areas overlap.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
WHAT IS THE NATURE OF SCIENCE?. SCIENTIFIC WORLD VIEW 1.The Universe Is Understandable. 2.The Universe Is a Vast Single System In Which the Basic Rules.
1 What is OO Design? OO Design is a process of invention, where developers create the abstractions necessary to meet the system’s requirements OO Design.
Sharing Ontologies in the Biomedical Domain Alexa T. McCray National Library of Medicine National Institutes of Health Department of Health & Human Services.
Machine Learning Chapter 5. Artificial IntelligenceChapter 52 Learning 1. Rote learning rote( โรท ) n. วิถีทาง, ทางเดิน, วิธีการตามปกติ, (by rote จากความทรงจำ.
Copyright OpenHelix. No use or reproduction without express written consent1.
Anatomy Ontology Community Melissa Haendel. The OBO Foundry More than just a website, it’s a community of ontology developers.
ESDI Workshop on Conceptual Schema Languages and Tools
Natural Science 06 - Science is embedded in the scientific community, Scientific ideas lead to ongoing research.
L1-Spatial Concepts NGEN06 & TEK230: Algorithms in Geographical Information Systems by: Irene Rangel, updated by Sadegh Jamali 1.
Organization Challenges for Building Biomedical Ontologies Talk by Jennifer Clark, Slides by Michael Ashburner and Suzanna Lewis
THEORY OF COMPUTATION Komate AMPHAWAN 1. 2.
Chapter 10. The Explorer System in Cognitive Systems, Christensen et al. Course: Robots Learning from Humans On, Kyoung-Woon Biointelligence Laboratory.
Some Thoughts to Consider 8 How difficult is it to get a group of people, or a group of companies, or a group of nations to agree on a particular ontology?
1 Principles for Building Biomedical Ontologies Barry Smith.
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
Artificial Intelligence: Research and Collaborative Possibilities a presentation by: Dr. Ernest L. McDuffie, Assistant Professor Department of Computer.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
1 SWE Introduction to Software Engineering Lecture 14 – System Modeling.
Copyright © The McGraw-Hill Companies, Inc. Permission required for reproduction or display. Chapter 1 Lecture Slides.
MDD-Kurs / MDA Cortex Brainware Consulting & Training GmbH Copyright © 2007 Cortex Brainware GmbH Bild 1Ver.: 1.0 How does intelligent functionality implemented.
Knowledge Representation Part I Ontology Jan Pettersen Nytun Knowledge Representation Part I, JPN, UiA1.
Knowledge Representation
The Basic of Measurement
Principles for Building Biomedical Ontologies
Principles for Building Biomedical Ontologies
Principles for Building Biomedical Ontologies: A GO Perspective
OBO Foundry Principles
Principles for Building Biomedical Ontologies:A GO Perspective
Ch. 1 Review A test which determines the effect of a single variable by changing it while keeping all other variables the same. Controlled Experiment.
What is Ontology? s Dictionary:A branch of metaphysics concerned with the nature and relations of being. Barry Smith:The science of what is, of.
Generalized Diagnostics with the Non-Axiomatic Reasoning System (NARS)
Presentation transcript:

Ontology The science of the kinds and structures of objects, and their properties and relations. Defined by a scientific field's vocabulary and by the canonical formulations of its theories.

Information management view of “ontology” Different groups of data-gatherers develop their own idiosyncratic terms, and relationships between them, to represent information. To put this information together, methods must be found to resolve incompatibilities. Again, and again, and again… Ontology: A shared, common, backbone taxonomy of relevant entities, and the relationships between them, within an application domain

Which means… Instances are not included! It is the abstractions that are important (…but always with instances in mind)

And it means ontology is not: A common syntax for data exchange –These will change over time, e.g. XML was the syntax du jour.

Motivation Inferences and decisions we make are based upon what we know of the biological reality. An ontology is a computable representation of this underlying biological reality. Enables a computer to reason over the data in (some of) the ways that we do –particularly to locate relevant data.

Ontologies must be shared Communities form scientific theories –that seek to explain all of the existing evidence –and can be used for prediction These communities are all directed to the same biological reality, but have their own perspective The computable representation must be shared Ontology development is inherently collaborative

WhySurvey Improve Domain covered ? Public ? Active ? Applied ? Communit y? Develop Salvage Collaborate & Learn yes no SCOR, mmCIF,… yes

Pragmatic assessment of an ontology Is there access to help, e.g.: ? Does a warm body answer help mail within a ‘reasonable’ time—say 2 working days ?

WhySurvey Improve Domain covered ? Public ? Active ? Applied ? Communit y? Develop Salvage Collaborate & Learn yes no SCOR, mmCIF,… yes

Where the rubber meets the road Every ontology improves when it is applied to actual instances of data It improves even more when these data are used to answer research questions There will be fewer problems in the ontology and more commitment to fixing remaining problems when important research data is involved that scientists depend upon Be very wary of ontologies that have never been applied

A little sociology Experience from building the GO

Design for purpose Who will use it? –If no one is interested, then go back to bed What will they use it for? –Define the domain Who will maintain it? –Be pragmatic and modest Pragmatic example that worked: Linnaean classification (and it is independent of technology) Need to aim for progress between every meeting. What does the ROC want to have completed before you meet again?

The character of the principals With a shared commitment and vision. With broad domain knowledge. Who will engage in vigorous debate without engaging their egos (or, at least not too much). Who will do concrete work and attend frequent working sessions (quarterly), phone conferences (weekly), correspondence (daily). Who have a stake in seeing it work.

Establish a mechanism for change. Use CVS or Subversion. Limit the number of editors with write permission. Seriously implement upon real instances and feed what is learned back to the editors (mail and tracking systems).

Involve the community Release ontology to community. Release the products of its instantiation. Invite broad community input and establish a mechanism for this (e.g. SourceForge). Publish Actively court contributors Emphasize openness

Improvements come in two forms Getting it right –It is impossible to get it right the 1st (or 2nd, or 3rd, …) time. What we know about reality is continually growing A different kind of “standard” that requires versioning. Improve Collaborate and Learn

On relationships and terms Relationships must also be defined. (does ‘R’ signify relationships?)

The Rules 1.Univocity: Terms should have the same meanings on every occasion of use 2.Positivity: Terms such as ‘non-mammal’ or ‘non- membrane’ do not designate genuine classes. 3.Objectivity: Terms such as ‘unknown’ or ‘unclassified’ or ‘unlocalized’ do not designate biological natural kinds. 4.Single Inheritance: No class in a classification hierarchy should have more than one is_a parent on the immediate higher level 5.Intelligibility of Definitions: The terms used in a definition should be simpler (more intelligible) than the term to be defined 6.Basis in Reality: When building or maintaining an ontology, always think carefully at how classes relate to instances in reality 7.Distinguish Universals and Instances

Tactile sense Taction Tactition ? The Challenge of Univocity: People call the same thing by different names

Tactile sense Taction Tactition perception of touch ; GO: Univocity: GO uses 1 term and many characterized synonyms

= bud initiation The Challenge of Univocity: People use the same words to describe different things

Positivity Note the logical difference between –“non-membrane-bound organelle” and –“not a membrane-bound organelle” The latter includes everything that is not a membrane bound organelle!

Objectivity How can we use GO to annotate gene products when we know that we don’t have any information about them? –Currently GO has terms in each ontology to describe unknown (wrong!) –An alternative is to annotate genes to root nodes and use an evidence code to describe that we have no data. Similar strategies could be used for things like receptors where the ligand is unknown.

True path violation What is it?..”the pathway from a child term all the way up to its top-level parent(s) must always be true". chromosome Mitochondrial chromosome Is_a relationship Part_of relationship nucleus

True path violation What is it?..”the pathway from a child term all the way up to its top-level parent(s) must always be true". nucleuschromosome Nuclear chromosome Mitochondrial chromosome Is_a relationshipsPart_of relationship

Relationships and definitions The set of necessary conditions is determined by the graph –This can be considered a partial definition Important considerations: –Placement in the graph—selecting parents –Appropriate relationships to different parents –True path violation

Structured definitions contain both genus and differentiae Essence = Genus + Differentiae neuron cell differentiation = Genus: differentiation (processes whereby a relatively unspecialized cell acquires the specialized features of..) Differentiae: acquires features of a neuron

Alignment of the Two Ontologies will permit the generation of consistent and complete definitions id: CL: name: osteoblast def: "A bone-forming cell which secretes an extracellular matrix. Hydroxyapatite crystals are then deposited into the matrix to form bone." [MESH:A ] is_a: CL: relationship: develops_from CL: relationship: develops_from CL: GO Cell type New Definition + = Osteoblast differentiation: Processes whereby an osteoprogenitor cell or a cranial neural crest cell acquires the specialized features of an osteoblast, a bone-forming cell which secretes extracellular matrix.

Alignment of the Two Ontologies will permit the generation of consistent and complete definitions id: GO: name: osteoblast differentiation synonym: osteoblast cell differentiation genus: differentiation GO: (differentiation) differentium: acquires_features_of CL: (osteoblast) definition (text): Processes whereby a relatively unspecialized cell acquires the specialized features of an osteoblast, the mesodermal cell that gives rise to bone Formal definitions with necessary and sufficient conditions, in both human readable and computer readable forms

Relations to describe topology of nucleic sequence features Based on the formal relationships between pairs of intervals in a 1-dimensional space. Uses the coincidence of edges and interiors Enables questions regarding the equality, overlap, disjointedness, containment and coverage of genomic features. Conventional operations in genomics are simplified Software no longer needs to know what kind of feature particular instances are

For features A & BAn end of A intersects an end of B Interior of A intersects interior of B An end of A intersects interior of B Interior of A intersects an end of B A is disjoint from B False A meets BTrueFalse A overlaps BFalseTrue A is inside BFalseTrue False A contains BFalseTrueFalseTrue A covers BTrue FalseTrue A is covered_by BTrue False A equals BTrue False

Possible relationships of the RO Spatial –Distances, Angles, Orientation,… Chemical –Hydrogen bonding, Van der Waal forces,… Conformational It is the relationships that enable computational reasoning. Can RO use knowledge from geo-spatial ontology work?

Have fun!