Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at www.biopax.org)www.biopax.org.

Slides:



Advertisements
Similar presentations
Database Design: ER Modelling (Continued)
Advertisements

BioPAX The Birth of A Data Exchange Language for Biological Pathways Joanne Luciano BioPAX Core Group 7 th International Annual Bio-Ontologies.
Using DAML format for representation and integration of complex gene networks: implications in novel drug discovery K. Baclawski Northeastern University.
CH-4 Ontologies, Querying and Data Integration. Introduction to RDF(S) RDF stands for Resource Description Framework. RDF is a standard for describing.
Look for other standards: - BioPAX (composition) - CellML (aggregation) - Other SBML L3 extensions (modules - rule-based modeling)
Database Systems: Design, Implementation, and Management Tenth Edition
So What Does it All Mean? Geospatial Semantics and Ontologies Dr Kristin Stock.
©Ian Sommerville 2004Software Engineering, 7th edition. Chapter 8 Slide 1 System modeling 2.
Ontology annotation: mapping genomic regions biological function Paul D Thomas, Huaiyu Mi and Suzanna Lewis.
Gene Ontology John Pinney
BiNoM Tutorial Andrei Zinovyev, Laurence Calzone UMR U900 INSERM/Institut Curie/Ecole des Mines de Paris Wednesday, April 30th.
Contents of this Talk [Used as intro to Genome Databases Seminar, 2002] Overview of bioinformatics Motivations for genome databases Analogy of virus reverse-eng.
Developing a protein-interactions ontology Esther Ratsch European Media Laboratory.
Interoperation of Molecular Biology Databases Peter D. Karp, Ph.D. Bioinformatics Research Group SRI International Menlo Park, CA
Systems Biology Existing and future genome sequencing projects and the follow-on structural and functional analysis of complete genomes will produce an.
10-Jun-15Anatoly Sorokin Existing Standards in Systems Biology Anatoly Sorokin Computation Systems Biology Group University of Edinburgh.
©Ian Sommerville 2006Software Engineering, 8th edition. Chapter 8 Slide 1 System models.
How can Computer Science contribute to Research Publishing?
Lecture Fourteen Methodology - Conceptual Database Design
Modified from Sommerville’s originalsSoftware Engineering, 7th edition. Chapter 8 Slide 1 System models.
4/1/2008 OWL-ED 2008, Gaithersburg, MD 1 OWL: PAX of Mind or the AX? Experiences of Using OWL in the Development of BioPAX Joanne Luciano 1 & Robert Stevens.
The bioinformatics of biological processes The challenge of temporal data Per J. Kraulis CMCM, Tartu University.
The Cell Cycle Ontology Erick Antezana Dept. of Plant Systems Biology Flanders Institute for Biotechnology (VIB) / Ghent University Ghent - BELGIUM
Pathways Database System: An Integrated System For Biological Pathways L. Krishnamurthy, J. Nadeau, G. Ozsoyoglu, M. Ozsoyoglu, G. Schaeffer, M. Tasan.
Building and Using Ontologies Dr. Robert Stevens Department of Computer Science University of Manchester
Ch10. Intermolecular Interactions and Biological Pathways
Review of Ondex Bernice Rogowitz G2P Visualization and Visual Analytics Team March 18, 2010.
©Ian Sommerville 2000 Software Engineering, 6th edition. Chapter 7 Slide 1 System models l Abstract descriptions of systems whose requirements are being.
Chapter 4 System Models A description of the various models that can be used to specify software systems.
System models Abstract descriptions of systems whose requirements are being analysed Abstract descriptions of systems whose requirements are being analysed.
Bioinformatics Dr. Víctor Treviño BT4007
Of 39 lecture 2: ontology - basics. of 39 ontology a branch of metaphysics relating to the nature and relations of being a particular theory about the.
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
How Ontologies Add Value BioPAX: Biological Pathway Data Exchange Ontology Joanne Luciano BioPAX Workgroup (biopax.org)biopax.org BioPathways Consortium.
Computer Science 101 Database Concepts. Database Collection of related data Models real world “universe” Reflects changes Specific purposes and audience.
BioPAX - Biological Pathway Data Exchange Format Tutorial Nadia Anwar BioPAX Workshop November 2009.
Semantic Web for Life Sciences Workshop Session VII: Semantic Aggregation, Integration, and Inference Moderator: Joanne Luciano October, Cambridge,
Metadata. Generally speaking, metadata are data and information that describe and model data and information For example, a database schema is the metadata.
Chapter 7 System models.
System models l Abstract descriptions of systems whose requirements are being analysed.
Pertemuan 19 PEMODELAN SISTEM Matakuliah: D0174/ Pemodelan Sistem dan Simulasi Tahun: Tahun 2009.
What is an Ontology? An ontology is a specification of a conceptualization that is designed for reuse across multiple applications and implementations.
Software Engineering, 8th edition Chapter 8 1 Courtesy: ©Ian Somerville 2006 April 06 th, 2009 Lecture # 13 System models.
Sommerville 2004,Mejia-Alvarez 2009Software Engineering, 7th edition. Chapter 8 Slide 1 System models.
Cell Signaling Ontology Takako Takai-Igarashi and Toshihisa Takagi Human Genome Center, Institute of Medical Science, University of Tokyo.
Database Systems: Enhanced Entity-Relationship Modeling Dr. Taysir Hassan Abdel Hamid.
Modeling System Requirements: Events and Things. Objectives Explain the many reasons for creating information system models Describe three types of models.
What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared.
Computations using pathways and networks Nigam Shah
SWEN 5231 FORMAL METHODS Slide 1 System models u Abstract presentations of systems whose requirements are being analyzed.
Ontologies Working Group Agenda MGED3 1.Goals for working group. 2.Primer on ontologies 3.Working group progress 4.Example sample descriptions from different.
Mining the Biomedical Research Literature Ken Baclawski.
Issues in Ontology-based Information integration By Zhan Cui, Dean Jones and Paul O’Brien.
Introduction to biological molecular networks
Metamodeling and Modeling language for Systems Biology SB-UML Magali ROUX-ROUQUIE CNRS, Paris.
A database of biological pathways and processes (borrowed from a presentation created by Steve Jupe)
Knowledge Representation. Keywordsquick way for agents to locate potentially useful information Thesaurimore structured approach than keywords, arranging.
BioPAX The Birth of A Data Exchange Language for Biological Pathways Joanne Luciano BioPAX Group 7 th International Annual Bio-Ontologies.
 To explain why the context of a system should be modelled as part of the RE process  To describe behavioural modelling, data modelling and object modelling.
Semantic Interoperability in GIS N. L. Sarda Suman Somavarapu.
High throughput biology data management and data intensive computing drivers George Michaels.
RDF based on Integration of Pathway Database and Gene Ontology SNU OOPSLA LAB DongHyuk Im.
Engineering, 7th edition. Chapter 8 Slide 1 System models.
ece 627 intelligent web: ontology and beyond
DATA MODELS.
Abstract descriptions of systems whose requirements are being analysed
Developing a protein-interactions ontology
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
CCO: concept & current status
Computational Biology
Presentation transcript:

Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at

Definition of an Ontology Conceptualization of a domain of interest Concepts, relations, attributes, constraints, objects, values … An ontology is a specification of a conceptualization Formal notation Documentation A variety of forms, but includes: A vocabulary of terms Some specification of the meaning of the terms

Ontologies – Key Aspects Focus on semantics! Accurately model a complex domain Capture semantic nuances Rigorously define what each field means Adhere to those definitions!

Ontologies – Key Aspects Ontologies are for people and computers: People browse the ontology to learn it It encodes the definition of a concept so that the computer “ understands ” it “understands” = automated reasoning with concept definitions Is concept A more general than concept B? Is X an instance of concept A?

Components of an Ontology Concepts (Class, Set, Type, Predicate) ex: Gene, Reaction, Macromolecule Taxonomy of concepts (generalization/specialization hierarchies) ex: a physical interaction is an interaction Relations and Attributes Domains –values allowed for an attribute- ex: a feature location consists of a sequence location Constraints and other meta-information about relations ex: a pathway has at least one interaction

Ontologies in Bioinformatics Biological DBs need to have a good ontology AND a good mapping – implementation- of it: this prevents errors on data entry and interpretation Provide a common framework for multidatabase queries Provide a controlled vocabulary, such as for genome annotation For information extraction

BioPAX Biological PAthway eXchange A data exchange ontology and format for biological pathway integration, aggregation and inference Open source, ongoing

BioPAX Goals Include support for these pathway types: Metabolic pathways Signaling pathways Protein-protein interactions Genetic regulatory pathways Note: representing pathways is nothing new

The problem pathway databases of different kinds ( Rich data, different ontologies Nightmare for integration and data exchange

Biological pathways Metabolic Pathways Molecular Interaction Networks Signaling Pathways

Ontologies reflect “real life” A typical pathway would be decomposed into: A single pathway instance, which would contain several pathway steps, which would each contain one or more interactions occurring between physical entity participants, which each point to one physical entity.

BioPAX vs other ontologies Conceptual framework based upon existing DB schemas, allowing wide range of detail, multiple levels of abstraction Uses (refers to) existing ontologies to provide supplemental annotations where appropriate Cellular location  GO Component Cell type  Cell.obo Organism  NCBI taxon DB Incorporates other standards where appropriate Interoperates with existing standards

BioPax & other Exchange Formats BioPAX PSI-MI 2 SBML, CellML Genetic Interactions Molecular Interactions Pro:Pro All:All Interaction Networks Molecular Non-molecular Pro:Pro TF:Gene Genetic Regulatory Pathways Low Detail High Detail DB Exchange Formats Simulation Model Exchange Formats Rate Formulas Biochemical Reactions Small Molecules Low Detail High Detail Metabolic Pathways Low Detail High Detail BioPAX level 1

Capturing data at different resolutions Metabolic pathway data has a high level of detail Molecular interaction have less Ex: no causal or temporal aspects of interactions BioPAX Level 2 captures molecular binding interactions at a relatively high level in the ontology class hierarchy This reflects the fact that any given binding interaction may be a low-resolution (or more abstract) view of a more specific type of interaction.

Example A signaling database would likely capture the interaction between MEK1 and ERK1 as a catalysis event (MEK1 catalyzes the phosphorylation of ERK1). A molecular interaction database would likely store the interaction using a simpler abstraction, such as a protein-protein interaction. BioPAX Level 2 supports both of these representations.

Aggregation, Integration, Inference with BioPax 1. Aggregation: represent multiple kinds of pathway databases metabolic molecular interactions signal transduction gene regulatory 2. Integration: special constructs designed for integration DB References XRefs (Publication, Unification, Relationship) Synonyms 3. OWL DL – to enable reasoning

BioPAX Ontology: Top Level Pathway A set of interactions E.g. Glycolysis, MAPK, Apoptosis Interaction A set of entities and some relationship between them E.g. Reaction, Molecular Association, Catalysis Physical Entity A building block of simple interactions E.g. Small molecule, Protein, DNA, RNA Entity Pathway Interaction Physical Entity Subclass (is a) Contains (has a)

BioPAX Ontology: Physical Entities PhysicalEntity ComplexRNA ProteinSmall Molecule This class serves as the super-class for all physical entities, although its current set of subclasses is limited to molecules. This list may be expanded to include photon, environment, cell and cellular component in later levels of BioPAX.

Interaction Class Structure

Relational implementation