Karen Eilbeck 7/22/08 Ontological relations and computable definitions for sequences at DNA, RNA and protein levels Karen Eilbeck Neocles Leontis Thomas.

Slides:



Advertisements
Similar presentations
Functions Reading: Epp Chp 7.1, 7.2, 7.4
Advertisements

Using Ontology Reasoning to Classify Protein Phosphatases K.Wolstencroft, P.Lord, L.tabernero, A.brass, R.stevens University of Manchester.
PENN S TATE Compatible text, visual and mathematical representations for biological process ontologies Nigam Shah Penn State University.
Everything must be logical Terms should be in the right place with good definition and the correct relationships I need very specific terms to describe.
Chronos: A Tool for Handling Temporal Ontologies in Protégé
April 15, 2004SPIE1 Association in Level 2 Fusion Mieczyslaw M. Kokar Christopher J. Matheus Jerzy A. Letkowski Kenneth Baclawski Paul Kogut.
Knowledge Representation
Confessions/Disclaimers Ontologies and REDfly CARO SO OBO Foundry.
Meanings: some key terms used in the Core Skills Test.
Of 27 lecture 7: owl - introduction. of 27 ece 627, winter ‘132 OWL a glimpse OWL – Web Ontology Language describes classes, properties and relations.
1 Semantic Description of Programming languages. 2 Static versus Dynamic Semantics n Static Semantics represents legal forms of programs that cannot be.
Gene Ontology John Pinney
The design, construction and use of software tools to generate, store, annotate, access and analyse data and information relating to Molecular Biology.
Ontology Notes are from:
Component 4: Introduction to Information and Computer Science Unit 9: Components and Development of Large Scale Systems Lecture 5 This material was developed.
PR-OWL: A Framework for Probabilistic Ontologies by Paulo C. G. COSTA, Kathryn B. LASKEY George Mason University presented by Thomas Packer 1PR-OWL.
Chapter 8: Web Ontology Language (OWL) Service-Oriented Computing: Semantics, Processes, Agents – Munindar P. Singh and Michael N. Huhns, Wiley, 2005.
Extending to the GO model OBO open biology ontologies aka - extended go - (ego)
RnaO.owl Colin Batchelor Royal Society of Chemistry, UK
Computational Molecular Biology (Spring’03) Chitta Baral Professor of Computer Science & Engg.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
1 Logical Tools and Theories in Contemporary Bioinformatics Barry Smith
Biological Ontologies Neocles Leontis April 20, 2005.
1 Department of Computer Science and Engineering, University of South Carolina Issues for Discussion and Work Jan 2007  Choose meeting time.
The RNA Ontology RNAO Colin Batchelor Neocles Leontis May 2009 Eckart, Colin and Jane In Cambridge.
From SHIQ and RDF to OWL: The Making of a Web Ontology Language
1 Unity of Invention: Biotech Examples TC1600 Special Program Examiner Julie Burke (571)
BTN323: INTRODUCTION TO BIOLOGICAL DATABASES Day2: Specialized Databases Lecturer: Junaid Gamieldien, PhD
Regulatory factors 1) Gene copy number 2) Transcriptional control 2-1) Promoters 2-2) Terminators, attenuators and anti-terminators 2-3) Induction and.
Ontology Development Kenneth Baclawski Northeastern University Harvard Medical School.
Integration of PRO and UniProtKB Amherst, NY May 16, 2013 Cathy H. Wu, Ph.D. PRO-PO-GO Meeting.
ON REASONING WITH IMAGES: THE USE OF IMAGES IN CLINICAL RESEARCH Thomas Bittner Louis Goldberg University at Buffalo.
Genome Annotation using MAKER-P at iPlant Collaboration with Mark Yandell Lab (University of Utah) iPlant: Josh Stein (CSHL) Matt Vaughn.
Stefan Schulz Medical Informatics Research Group
INF 384 C, Spring 2009 Ontologies Knowledge representation to support computer reasoning.
U.S. Patent and Trademark Office Technology Center 1600 Michael P. Woodward Unity of Invention: Biotech Examples.
Amo amos amot amomus amotis amont. Happy birthday Swiss-Prot Fortaleza August 2006.
Biology 224 Instructor: Tom Peavy Feb 21 & 26, Protein Structure & Analysis.
RNA Secondary Structure Prediction Spring Objectives  Can we predict the structure of an RNA?  Can we predict the structure of a protein?
Sequence analysis: Macromolecular motif recognition Sylvia Nagl.
Categories and Computer Science
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
Logic. What is logic? Logic (from the Ancient Greek: λογική, logike) is the use and study of valid reasoning. The study of logic features most prominently.
GRAPHS THEROY. 2 –Graphs Graph basics and definitions Vertices/nodes, edges, adjacency, incidence Degree, in-degree, out-degree Subgraphs, unions, isomorphism.
Predicting protein degradation rates Karen Page. The central dogma DNA RNA protein Transcription Translation The expression of genetic information stored.
Protein and RNA Families
Programming Languages and Design Lecture 3 Semantic Specifications of Programming Languages Instructor: Li Ma Department of Computer Science Texas Southern.
What is an Ontology? A representation of knowledge in a domain In theory Thomas Gruber (1993) “An ontology is a formal, explicit specification of a shared.
November 18, 2000ICTCM 2000 Introductory Biological Sequence Analysis Through Spreadsheets Stephen J. Merrill Sandra E. Merrill Marquette University Milwaukee,
Mining the Biomedical Research Literature Ken Baclawski.
ChEBI, text mining and ontological best practice Colin Batchelor Royal Society of Chemistry
EBI is an Outstation of the European Molecular Biology Laboratory. Gautier Koscielny VectorBase Meeting 08 Feburary 2012, EBI VectorBase Text Search Engine.
Primary vs. Secondary Databases Primary databases are repositories of “raw” data. These are also referred to as archival databases. -This is one of the.
+ From OBO to OWL and back again – a tutorial David Osumi-Sutherland, Virtual Fly Brain/FlyBase Chris Mungall – GO/LBL.
CFE Higher Biology DNA and the Genome Transcription.
Using OWL 2 For Product Modeling David Leal Caesar Systems April 2009 Henson Graves Lockheed Martin Aeronautics.
David Goldberg CS 1950 Directed Study. RNA are the building blocks for life RNA is read by ribosome's which then create a functional product.(i.e. protein.
Analyzing Promoter Sequences with Multilayer Perceptrons Glenn Walker ECE 539.
Ccs.  Ontologies are used to capture knowledge about some domain of interest. ◦ An ontology describes the concepts in the domain and also the relationships.
Editing Pathway/Genome Databases
Subject Name: Software Testing Subject Code: 10CS842 Prepared By:
Ontology.
Semantic Objects.
Genome organization and Bioinformatics
Introduction: Themes in the Study of Life
CCO: concept & current status
Ontology.
Evolution and Natural Selection
Unity, Coherence, and Completeness
Presentation transcript:

Karen Eilbeck 7/22/08 Ontological relations and computable definitions for sequences at DNA, RNA and protein levels Karen Eilbeck Neocles Leontis Thomas Bittner Colin Batchelor

Karen Eilbeck 7/22/08 Two sections 1.A report on the joint RNAO and SO meeting held in SLC in April 2008 (Eilbeck, Leontis and Bittner) 2.Computable definitions for 1D and 2D structures (Batchelor)

Karen Eilbeck 7/22/08 Ontological Relations for Sequences at DNA, RNA, and protein levels A report on the joint RNAO and SO meeting held in SLC in April Karen Eilbeck Neocles Leontis Thomas Bittner

Karen Eilbeck 7/22/08 Aim of meeting Coordinate the development of relationships between SO and RNAO

Karen Eilbeck 7/22/08 Universals and instances Universal: repeatable or recurrent entities that can be instantiated or exemplified by many particular things. Instance: A universal may have instances, known as its particulars. They identify single objects such as “ that chromosome under that microscope ”.

Karen Eilbeck 7/22/08 What is a sequence? Sequence is a universal. A sequence can be located in places at the same time. Manifestation of the sequence happens at the molecular level.

Karen Eilbeck 7/22/08 Same sequence different molecule.

Karen Eilbeck 7/22/08 Identifying regions and relations between regions Category theory. –Morphism: relationship between some posited domain and codomain. –Isomorphism between dna and RNA (both directions) –Morphism between rna and protein (information loss from protein to rna.) –Morphism between DNA and protein.

Karen Eilbeck 7/22/08 Next step 1: core terms and relations

Karen Eilbeck 7/22/08 Next step 2: even more relationships Homology and similarity relationships Topological relationships Supportive evidence relationships

Karen Eilbeck 7/22/08 Next step 3: Description logic Conversion of core types and relations to formal logic. A sound foundation to build upon for the features and other types in RNAO and SO

Karen Eilbeck 7/22/08 People: Karen Eilbeck - SO University of Utah Neocles Leontis - RNAO BGSU Thomas Bittner - OBO Buffalo Colin Batchelor - relations in SO RSC

Karen Eilbeck 7/22/08 Computable definitions Colin Batchelor

Karen Eilbeck 7/22/08 Computable definitions These consist of necessary and sufficient conditions. Generally written in OBO or OWL format. Example from SO: any primary transcript that is adjacent to a cap must be a capped_primary_transcript, and conversely all capped_primary_transcripts are primary transcripts that are adjacent to caps. id: SO: name: capped_primary_transcript def: "A primary transcript that is capped." [SO:xp] intersection_of: SO: ! primary_transcript intersection_of: adjacent_to SO: ! cap

Karen Eilbeck 7/22/08 What does this buy us? It makes ontology maintenance easier for the curators. But most importantly: With computable definitions, reasoners can in principle annotate automatically…

Karen Eilbeck 7/22/08 Loops (1) Consider an example 1D sequence: ……(((((….((….))..))).))… The definition of a tetraloop could look like this: tetraloop =”.…” that (adjacent_to “(“) and (adjacent to “)”) Much like the capture group in the regex \((\.{4})\)

Karen Eilbeck 7/22/08 Loops (2): (includes cardinality) loop = “.+” that adjacent_to “(“ and adjacent_to “)” diloop = loop that has_part “.” cardinality exactly 2 triloop = loop that has_part “.” cardinality exactly 3

Karen Eilbeck 7/22/08 Loops (3): stem-loops Assume no kinks or bulges or pseudoknots. Take a simple example: ((((..)))) “424 stemloop” = sequence that has_part (“(“ cardinality exactly 4 that adjacent_to diloop) and has_part (diloop adjacent_to “(“) and has_part (diloop adjacent_to “)”) and has_part (“)” cardinality exactly 4 that adjacent_to diloop) But what about the general case?

Karen Eilbeck 7/22/08 Loops (4): stem-loops and formal grammar This: \({n}\.+\){n} is not a valid regular expression. It reduces to a n b n, which is well-known to be non-regular. Likewise in OWL you cannot say cardinality exactly n. So what do we do?

Karen Eilbeck 7/22/08 A way out Write the necessary and sufficient conditions in terms of the 2D structure. Hence: stem-loop = structure that has_part (base-pair that bound_to base-pair) and has_part (base-pair that bound_to loop) and has_part loop

Karen Eilbeck 7/22/08 What next? Write necessary and sufficient conditions for some example motifs. Take 2D structures in RNAML that contain known example motifs. Convert RNAML to OWL. Run reasoner.