GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester

Slides:



Advertisements
Similar presentations
Ontology Assessment – Proposed Framework and Methodology.
Advertisements

Semantic Similarity Measures Across The Gene Ontology. Relating Sequence to Annotation. P.W. Lord, R.D. Stevens, A.Brass, and C. Goble Department of Computer.
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY Matthew Williams
Who am I Gianluca Correndo PhD student (end of PhD) Work in the group of medical informatics (Paolo Terenziani) PhD thesis on contextualization techniques.
Contents of this Talk [Used as intro to Genome Databases Seminar, 2002] Overview of bioinformatics Motivations for genome databases Analogy of virus reverse-eng.
OntoBlog: Informal Knowledge Management by Semantic Blogging Aman Shakya 1, Vilas Wuwongse 2, Hideaki Takeda 1, Ikki Ohmukai 1 1 National Institute of.
Fungal Semantic Web Stephen Scott, Scott Henninger, Leen-Kiat Soh (CSE) Etsuko Moriyama, Ken Nickerson, Audrey Atkin (Biological Sciences) Steve Harris.
Storing and Retrieving Biological Instances with the Instance Store Daniele Turi, Phillip Lord, Michael Bada, Robert Stevens.
Sensemaking and Ground Truth Ontology Development Chinua Umoja William M. Pottenger Jason Perry Christopher Janneck.
PLANSERVE Knowledge acquisition & Ontological engineering for AI Planning applications.
Use of Ontologies in the Life Sciences: BioPax Graciela Gonzalez, PhD (some slides adapted from presentations available at
Semantics For the Semantic Web: The Implicit, the Formal and The Powerful Amit Sheth, Cartic Ramakrishnan, Christopher Thomas CS751 Spring 2005 Presenter:
TAMBIS Transparent Access to Multiple Biological Information Sources.
POC tutorial #1: Introduction This tutorial will run automatically in Quicktime. To run the tutorial at your own pace use the internal controllers within.
The Cell Cycle Ontology Erick Antezana Dept. of Plant Systems Biology Flanders Institute for Biotechnology (VIB) / Ghent University Ghent - BELGIUM
Ayorinde I. T., Akinkunmi B. O. and Alao Y. M. Computer Science Department, University of Ibadan, Ibadan, Nigeria 1 AUGUST, International Conference.
DAML+OIL Ontology Tutorial Chris Wroe, Robert Stevens (Sean Bechhofer, Carole Goble, Alan Rector, Ian Horrocks….) University of Manchester.
ANSWERING CONTROLLED NATURAL LANGUAGE QUERIES USING ANSWER SET PROGRAMMING Syeed Ibn Faiz.
Domain Modelling the upper levels of the eframework Yvonne Howard Hilary Dexter David Millard Learning Societies LabDistributed Learning, University of.
Semantic Web Technologies Lecture # 2 Faculty of Computer Science, IBA.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Building and Using Ontologies Dr. Robert Stevens Department of Computer Science University of Manchester
Integrating bio-ontologies with a workflow/Petri Net model to qualitatively represent and simulate biological systems Mor Peleg, Irene Gbashvili, and Russ.
SOUPA: Standard Ontology for Ubiquitous and Pervasive Applications Harry Chen, Filip Perich, Tim Finin, Anupam Joshi Department of Computer Science & Electrical.
University of Dublin Trinity College Localisation and Personalisation: Dynamic Retrieval & Adaptation of Multi-lingual Multimedia Content Prof Vincent.
Integrated Development Environment for Policies Anjali B Shah Department of Computer Science and Electrical Engineering University of Maryland Baltimore.
A service-oriented middleware for building context-aware services Center for E-Business Technology Seoul National University Seoul, Korea Tao Gu, Hung.
Domain Modeling In FREMA David Millard Yvonne Howard Hugh Davis Gary Wills Lester Gilbert Learning Societies Lab University of Southampton, UK.
Dimitrios Skoutas Alkis Simitsis
A Knowledge Model for Analysis and Simulation of Signal Transduction Networks.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Taken from Schulze-Kremer Steffen Ontologies - What, why and how? Cartic Ramakrishnan LSDIS lab University of Georgia.
UNCERTML - DESCRIBING AND COMMUNICATING UNCERTAINTY WITHIN THE (SEMANTIC) WEB Matthew Williams
Knowledge Representation of Statistic Domain For CBR Application Supervisor : Dr. Aslina Saad Dr. Mashitoh Hashim PM Dr. Nor Hasbiah Ubaidullah.
A Context Model based on Ontological Languages: a Proposal for Information Visualization School of Informatics Castilla-La Mancha University Ramón Hervás.
Interoperability & Knowledge Sharing Advisor: Dr. Sudha Ram Dr. Jinsoo Park Kangsuk Kim (former MS Student) Yousub Hwang (Ph.D. Student)
Using Several Ontologies for Describing Audio-Visual Documents: A Case Study in the Medical Domain Sunday 29 th of May, 2005 Antoine Isaac 1 & Raphaël.
Ontology-Based Computing Kenneth Baclawski Northeastern University and Jarg.
Christoph F. Eick University of Houston Organization 1. What are Ontologies? 2. What are they good for? 3. Ontologies and.
SKOS. Ontologies Metadata –Resources marked-up with descriptions of their content. No good unless everyone speaks the same language; Terminologies –Provide.
WP2: ONTOLOGY ENRICHMENT METHODOLOGIES Carole Goble (IMG) Robert Stevens (BHIG) Mikel Egaña Aranguren (BHIG) Manchester University Computer Science: IMG:
Domain Modeling In FREMA Yvonne Howard David Millard Hugh Davis Gary Wills Lester Gilbert Learning Societies Lab University of Southampton, UK.
12/7/2015Page 1 Service-enabling Biomedical Research Enterprise Chapter 5 B. Ramamurthy.
Of 33 lecture 1: introduction. of 33 the semantic web vision today’s web (1) web content – for human consumption (no structural information) people search.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
1 MedAT: Medical Resources Annotation Tool Monika Žáková *, Olga Štěpánková *, Taťána Maříková * Department of Cybernetics, CTU Prague Institute of Biology.
Semantic Similarity Computation and Concept Mapping in Earth and Environmental Science Jin Guang Zheng Xiaogang Ma Stephan.
Using DAML+OIL Ontologies for Service Discovery in myGrid Chris Wroe, Robert Stevens, Carole Goble, Angus Roberts, Mark Greenwood
Investigating semantic similarity measures across the Gene Ontology: the relationship between sequence and annotation Bioinformatics, July 2003 P.W.Load,
Approach to building ontologies A high-level view Chris Wroe.
The Semantic Web. What is the Semantic Web? The Semantic Web is an extension of the current Web in which information is given well-defined meaning, enabling.
A Portrait of the Semantic Web in Action Jeff Heflin and James Hendler IEEE Intelligent Systems December 6, 2010 Hyewon Lim.
Be.wi-ol.de User-friendly ontology design Nikolai Dahlem Universität Oldenburg.
Screen Readers Cannot See (Ontology Based Semantic Annotation for Visually impaired Web users) Yeliz Yesilada, Simon Harper, Carole Goble and Robert Stevens.
Ontology Driven Data Collection for EuPathDB Jie Zheng, Omar Harb, Chris Stoeckert Center for Bioinformatics, University of Pennsylvania.
Big Data that might benefit from ontology technology, but why this usually fails Barry Smith National Center for Ontological Research 1.
High Risk 1. Ensure productive use of GRID computing through participation of biologists to shape the development of the GRID. 2. Develop user-friendly.
Informatics for Scientific Data Bio-informatics and Medical Informatics Week 9 Lecture notes INF 380E: Perspectives on Information.
` Comparison of Gene Ontology Term Annotations Between E.coli K12 Databases REDDYSAILAJA MARPURI WESTERN KENTUCKY UNIVERSITY.
Universal descriptions of gene and gene product attributes
TDM in the Life Sciences Application to Drug Repositioning *
Department of Genetics • Stanford University School of Medicine
Functional Annotation of the Horse Genome
Ontology-Based Information Integration Using INDUS System
Overview Gene Ontology Introduction Biological network data
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
CCO: concept & current status
A User’s Guide to GO: Structural and Functional Annotation
Semantic Markup for Semantic Web Tools:
Presentation transcript:

GOAT: The Gene Ontology Annotation Tool Dr. Mike Bada Department of Computer Science University of Manchester

Outline Semantic annotation and ontologies The Gene Ontology Annotation with terms from the Gene Ontology GOAT Conclusions

The Need for Semantic Annotation Bioinformatics has a large number of distributed, autonomous, heterogeneous resources The experimenter and computer need access to the knowledge within these resources Data need to be in a common, computationally amenable form

Ontologies An ontology is a set of terms, relationships, and definitions that capture a community’s understanding of a domain Terms represent the concepts of a domain, which are linked by relationships; these constitute a controlled vocabulary These terms augment natural-language annotation and can be more easily processed computationally

The Gene Ontology (GO) Is currently comprised of over 15,000 terms representing molecular functions, biological processes, and cellular components Is arranged as three orthogonal, structurally unlinked subontologies Enables a common understanding between databases and between model organisms Has been used extensively in a number of prominent biological databases

A Fragment of GO

Annotation with GO Terms Annotator must rely upon his/her domain expertise and the usability of the annotation tool S/he must find terms among the 15,000+ terms Unconstrained entry of terms may lead to inconsistent or nonsensical descriptions of gene products GOAT dynamically determines which terms are most relevant and offers these subsets as options

GONG and GOAT GOAT relies upon GONG Goal of GONG is to convert GO into a more formal, Description Logic representation and enrich its content Description Logics provide unambiguous semantics of a model and come with reasoning support Specifically, DAML+OIL is being used

Adding Associations to GO All GO annotations have been gathered into one database named GOA GOA was mined for associations between GO terms GO-associated databases were also manually examined for associations between GO terms and gene-product types These GO-term-to-GO-term and GO-term-to-gene-product-type associations were added to DAML+OIL GO

Guiding the Annotator GOAT seeks to reduce the potential for annotation errors and reduce the overhead of GO’s size Our approach is to use a formal DAML+OIL GO, augmented with these associations, and a reasoner to guide the user in the annotation process GOAT presents subsets of the appropriate subontologies that are most likely relevant in that they have been associated with information already entered by the user

GOAT

An Annotation Scenario Say the user has a specific snRNA to annotate.

An Annotation Scenario

Conclusions GO-term annotation can be tedious and error-prone Representing GO as a formal DAML+OIL ontology allows for complex and efficient reasoning over the ontology Reasoning over GO augmented with mined term associations can help guide users in the annotation process by narrowing down term choices to those that are most likely relevant

Acknowledgments University of Manchester Robert Stevens Chris Wroe Kevin Garwood Phil Lord Daniele Turi Carole Goble GlaxoSmithKline Robin McIntire

Thanks!