GO : the Gene Ontology & Functional enrichment analysis

Slides:



Advertisements
Similar presentations
Annotation of Gene Function …and how thats useful to you.
Advertisements

24th Feb 2006 Jane Lomax Gene Ontology tutorial Talk:Using the Gene Ontology (GO) for Expression Analysis Practical:Onto-Express analysis tool Talk: GO.
25th June 2007 Jane Lomax Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI.
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
Gene Ontology John Pinney
Introduction to Functional Analysis J.L. Mosquera and Alex Sanchez.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
COG and GO tutorial.
BI class 2010 Gene Ontology Overview and Perspective.
1 Gene Ontology and Semantic Similarity Measures.
Analysis of GO annotation at cluster level by H. Bjørn Nielsen Slides from Agnieszka S. Juncker.
An introduction to using the AmiGO Gene Ontology tool.
MN-B-C 2 Analysis of High Dimensional (-omics) Data Kay Hofmann – Protein Evolution Group Week 5: Proteomics.
Viewing & Getting GO COST Functional Modeling Workshop April, Helsinki.
Gene Ontology (GO) Project
GO : the Gene Ontology “because you know sometimes words have two meanings” Amelia Ireland GO Curator EBI, Cambridge, UK.
Gene Set Enrichment Analysis (GSEA)
GO and OBO: an introduction. Jane Lomax EMBL-EBI What is the Gene Ontology? What is OBO? OBO-Edit demo & practical What is the Gene Ontology? What is.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
The aims of the Gene Ontology project are threefold: - to compile vocabularies to describe components, functions and processes - to produce tools to query.
Intralab Workshop - Reactome CMAP Chang-Feng Quo June 29 th, 2006.
GENE ONTOLOGY FOR THE NEWBIES Suparna Mundodi, PhD The Arabidopsis Information Resources, Stanford, CA.
Gene Ontology Consortium
Cell Ontology 2.0 Elimination of multiple is_a inheritance through instantiation of relationships to terms in outside ontologies, such as the GO cellular.
The Gene Ontology: a real-life ontology, progress and future. Jane Lomax EMBL-EBI.
The Gene Ontology project Jane Lomax. Ontology (for our purposes) “an explicit specification of some topic” – Stanford Knowledge Systems Lab Includes:
Gene Ontology Project
Grup.bio.unipd.it CRIBI Genomics group Erika Feltrin PhD student in Biotechnology 6 months at EBI.
Gene Ontology TM (GO) Consortium Jennifer I Clark EMBL Outstation - European Bioinformatics Institute (EBI), Hinxton, Cambridge CB10 1SD, UK Objectives:
Ontologies GO Workshop 3-6 August Ontologies  What are ontologies?  Why use ontologies?  Open Biological Ontologies (OBO), National Center for.
Gene Onotology Part 1: what is the GO? Harold J Drabkin Senior Scientific Curator The Jackson Laboratory Mouse Genome Informatics.
From Functional Genomics to Physiological Model: Using the Gene Ontology Fiona McCarthy, Shane Burgess, Susan Bridges The AgBase Databases, Institute of.
Introduction to the GO: a user’s guide Iowa State Workshop 11 June 2009.
Getting Started: a user’s guide to the GO GO Workshop 3-6 August 2010.
1 Gene function annotation. 2 Outline  Functional annotation  Controlled vocabularies  Functional annotation at TAIR  Resources and tools at TAIR.
Top X interactions of PIN Network A interactions Coverage of Network A Figure S1 - Network A interactions are distributed evenly across the top 60,000.
Analysis of GO annotation at cluster level by Agnieszka S. Juncker.
Other biological databases and ontologies. Biological systems Taxonomic data Literature Protein folding and 3D structure Small molecules Pathways and.
Getting Started: a user’s guide to the GO TAMU GO Workshop 17 May 2010.
To Boldly GO… Amelia Ireland GO Curator EBI, Hinxton, UK.
CACAO Training Fall Community Assessment of Community Annotation with Ontologies (CACAO)
Statistical Testing with Genes Saurabh Sinha CS 466.
Scope of the Gene Ontology Vocabularies. Compile structured vocabularies describing aspects of molecular biology Describe gene products using vocabulary.
Computer Science Ph. D. Seminar Gene Ontology (GO) Based Search for Protein Structure Similarity Clustering Metrics Ph.D. Candidate Steve Johnson Committee.
Discovering functional interaction patterns in Protein-Protein Interactions Networks   Authors: Mehmet E Turnalp Tolga Can Presented By: Sandeep Kumar.
InterPro Sandra Orchard.
Tools in Bioinformatics Ontologies and pathways. Why are ontologies needed? A free text is the best way to describe what a protein does to a human reader.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Gene Ontology TM (GO) Consortium
2/3/2005 Gene Ontology (GO) The Gene Ontology (GO) project is a collaborative effort to address the need for consistent descriptions.
Gene Annotation & Gene Ontology May 24, Gene lists from RNAseq analysis What do you do with a list of 100s of genes that contain only the following.
Gene Annotation & Gene Ontology
a Cytoscape plugin to assess enrichment of
CACAO Training ASM-JGI 2012.
Clustering Manpreet S. Katari.
Pathway Analysis June 13, 2017.
Mental Functioning and the Gene Ontology
Statistical Testing with Genes
Workshop Aims TAMU GO Workshop 17 May 2010.
Department of Genetics • Stanford University School of Medicine
Functional Annotation of the Horse Genome
Modified from slides from Jim Hu and Suzi Aleksander Spring 2016
Using the Gene Ontology (GO) for analysis of expression data Jane Lomax EMBL-EBI 25th June 2007 Jane Lomax.
Analysis of GO annotation at cluster level by Agnieszka S. Juncker
Overview Gene Ontology Introduction Biological network data
What is an Ontology An ontology is a set of terms, relationships and definitions that capture the knowledge of a certain domain. (common ontology ≠ common.
Gene expression analysis
Information Networks: State of the Art
Statistical Testing with Genes
Presentation transcript:

GO : the Gene Ontology & Functional enrichment analysis F. Burdet

Brief presentation of GO Slides about GO used with the kind permission of: Amelia Ireland GO Curator EBI, Cambridge, UK

What is the Gene Ontology? Definition: The Gene Ontology: “a controlled vocabulary that can be applied to all organisms even as knowledge of gene and protein roles in cells is accumulating and changing” A controlled vocabulary to describe gene products - proteins and RNA - in any organism.

What is GO? One of the Open Biological Ontologies Standard, species-neutral way of representing biology Three structured networks of defined terms to describe gene product attributes More like a phrase book than a biology text book

How does GO work? What does the gene product do? Where and when does it act? Why does it perform these activities? What information might we want to capture about a gene product?

Cellular Component where a gene product acts

Cellular Component

Cellular Component

Molecular Function activities or “jobs” of a gene product glucose-6-phosphate isomerase activity

insulin receptor activity Molecular Function insulin binding insulin receptor activity

Biological Process a commonly recognized series of events cell division

Biological Process transcription

Anatomy of a GO term unique GO ID id: GO:0006094 name: gluconeogenesis namespace: process def: The formation of glucose from noncarbohydrate precursors, such as pyruvate, amino acids and glycerol. [http://cancerweb.ncl.ac.uk/omd/index.html] exact_synonym: glucose biosynthesis xref_analog: MetaCyc:GLUCONEO-PWY is_a: GO:0006006 is_a: GO:0006092 term name ontology definition 17800 terms in three ontologies 94% of terms defined synonym database ref parentage

Ontology Structure Ontologies are structured as a hierarchical directed acyclic graph Terms can have more than one parent and zero, one or more children Terms are linked by two relationships is-a  part-of 

mitochondrial chloroplast Ontology Structure cell membrane chloroplast mitochondrial chloroplast membrane membrane is-a part-of

GO Annotation Using GO terms to represent the activities and localizations of a gene product Annotations contributed by members of the GO Consortium model organism databases cross-species databases, eg. UniProt Annotations freely available from GO website

GO Annotation Database object GO term ID Reference for annotation gene or gene product GO term ID e.g. GO:0003677 Reference for annotation e.g. PubMed paper, BLAST results Evidence code (source of the annotation) from evidence code ontology

Functional enrichment analysis

An other way to look at the gene modules I have a set of genes (from a module). On top of the correlations with the traits, can I discover more about it? One way is to look if some known pathways or functionally related genes are enriched in this module How can one assess the enrichment?

Example of the marbles (hypergeometric test) Take an urn with 5 green and 45 red marbles Draw 10 marbles randomly What is the probability that 4 of the 10 are green? => use an hypergeometric test to calculate the probability ! And do the same with sets of genes (modules) found in a pathway, in comparison of a random draw of the same size across the whole set of genes.

From marbles to genes Say I have 100 genes in my orange module 25 out of the 30 genes involved in lipid biosynthesis are found in my orange module Given the fact that there are 10’000 genes in the dataset… What was the probability of this distribution to happen, compared to a random draw?

Biological databases available Gene Ontology (GO) KEGG (Kyoto Encyclopedia of Genes and Genomes) Reactome BioCarta MSigDB (Molecular Signatures Database) DAVID (Database for Annotation, Visualization and Integrated Discovery) …

topGO Bioconductor package Has an algorithm that will only consider the significant “leaves” instead of all the levels (weight) Usually, the gene universe is defined as all feasible genes measured by the microarray.

BP Cellular process Cell division