::: Schedule. Biological (Functional) Databases

Slides:



Advertisements
Similar presentations
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Advertisements

Relating Gene Expression to a Phenotype and External Biological Information Richard Simon, D.Sc. Chief, Biometric Research Branch, NCI
Gene Set Enrichment Analysis Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Gene Set Enrichment Analysis (GSEA)
Using Gene Ontology Models and Tests Mark Reimers, NCI.
Gene ontology & hypergeometric test Simon Rasmussen CBS - DTU.
Gene Set Analysis 09/24/07. From individual gene to gene sets Finding a list of differentially expressed genes is only the starting point. Suppose we.
Babelomics Functional interpretation of genome-scale experiments Barcelona, 28 November de 2007 Ignacio Medina David Montaner
Gene Ontology and Functional Enrichment Genome 559: Introduction to Statistical and Computational Genomics Elhanan Borenstein.
Working with enriched gene sets in R Peter Svensson Micheline Giphart-Gassler Harry Vrieling.
Daniel Rico, PhD. Daniel Rico, PhD. ::: Introduction to Functional Analysis Course on Functional Analysis Bioinformatics Unit.
1 Identifying differentially expressed sets of genes in microarray experiments Lecture 23, Statistics 246, April 15, 2004.
Gene Set Enrichment Analysis (GSEA)
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics Lab v1 | Saurabh Sinha1 Powerpoint by Casey Hanson.
EADGENE and SABRE Post-Analyses Workshop 12-14th November 2008, Lelystad, Netherlands 1 François Moreews SIGENAE, INRA, Rennes Cytoscape.
UBio Training Courses Micro-RNA web tools Gonzalo
From genes to functional blocks in the study of biological systems Fátima Al-Shahrour, Joaquín Dopazo National Institute of Bioinformatics, Functional.
Summarizing Differential Expression Using Mann-Whitney U-tests.
Regulatory Genomics Lab Saurabh Sinha Regulatory Genomics | Saurabh Sinha | PowerPoint by Casey Hanson.
Statistical Testing with Genes Saurabh Sinha CS 466.
Gene set analyses of genomic datasets Andreas Schlicker Jelle ten Hoeve Lodewyk Wessels.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
GO enrichment and GOrilla
No reference available
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Gene Set Analysis using R and Bioconductor Daniel Gusenleitner
Canadian Bioinformatics Workshops
Microarray Data Analysis Roy Williams PhD; Burnham Institute for Medical Research.
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
1 A Discussion of False Discovery Rate and the Identification of Differentially Expressed Gene Categories in Microarray Studies Ames, Iowa August 8, 2007.
Canadian Bioinformatics Workshops
David Amar, Tom Hait, and Ron Shamir
Module 2: Analyzing gene lists: over-representation analysis
Networks and Interactions
NGS Analysis Using Galaxy
Regulatory Genomics Lab
GSEA-Pro Tutorial Anne de Jong University of Groningen.
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Statistical Testing with Genes
Canadian Bioinformatics Workshops
Proteins enriched in HCs are strongly associated with genetic hearing/balance impairment. Proteins enriched in HCs are strongly associated with genetic.
Global comparison of MinSpan pathways with databases of human‐defined pathways The pairwise connection specificity index (CSI) was calculated for all pathway.
Behaviorally dependent allele-specific expression.
The mRNA stem cell signature.
Genome-Wide Analysis of PLT2 Binding to Target Genes.
Meta-analysis of the Listeriomics transcriptomic data sets.
Performance of FDR methods on filtered microbiome data.
Regulatory Genomics Lab
Functional enrichment of differentially expressed genes.
Boosting Signal-to-Noise in Complex Biology: Prior Knowledge Is Power
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
The Omics Dashboard.
Comparison of proteomics and RNA‐Seq data.
Varying Intolerance of Gene Pathways to Mutational Classes Explain Genetic Convergence across Neuropsychiatric Disorders  Shahar Shohat, Eyal Ben-David,
Metabolite predictability across metabolic categories (A) and disease states (B) in the vaginal microbiome. Metabolite predictability across metabolic.
Statistical Testing with Genes
Regulatory Genomics Lab
Identification of aging-related genes and affected biological processes. Identification of aging-related genes and affected biological processes. (A) Experimental.
Statistical chart of significantly differentially expressed genes
Genetic maintenance of histone acetylation prevents gene expression changes in the promoters of genes responding to acute mtDNA depletion. Genetic maintenance.
Reanalysis of an existing experiment using ADAGE
Gene set enrichment analysis reveals MafA regulation of many key β-cell activities. Gene set enrichment analysis reveals MafA regulation of many key β-cell.
Identification of acetylated peptides and proteins by LC-MS/MS.
Stage-specific expression modules of preimplantation development.
Enrichment of KEGG pathways in microbial genes in different samples.
Differentially regulated biological pathways in response to HFrD and protective effects of grape PP supplementation on fructose-induced alterations of.
CREBBP loss-of-function results in gene expression repression signature. CREBBP loss-of-function results in gene expression repression signature. A–D,
A, Stacked Venn plots of the significantly associated (FDR < 25%) KEGG pathways for 3 imaging subtypes with GSEA. B, Pathway activity scores for 3 imaging.
Fig. 2 Tissue-specific transcriptomic alterations in response to acute sleep loss in healthy humans. Tissue-specific transcriptomic alterations in response.
Differential Expression of RNA-Seq Data
Presentation transcript:

::: Schedule. Biological (Functional) Databases Threshold-based and threshold free methods Threshold-based example: FatiGO. Threshold free example 1: FatisScan.

Two-steps approach reproduces pre-genomics paradigms annotation pass test experiments no experiments test annotation .... : Context and cooperation between genes is ignored

A previous step of gene selection causes loss of information and makes the test insensitive A B GO1 GO2 - Significantly over-expressed in B Functional Classes expressed as blocks in A and B Very few genes selected to arrive to a significant conclussion on GO1 and GO2 If a threshold based on the experimental values is applied, and the resulting selection of genes compared for enrichment of a functional term, this might not be found t-test with two tails. p<0.05 statistic Significantly over-expressed in A +

Threshold-free approach Including information in the procedure of gene selection Functional label A Functional label B Functional label C A B C Genes are ranked by any biological criteria (expression value, p- value, positive selection, etc). FatiScan searchs for the distribution of the blocks of functionally related genes across the list. To detect significant terms a segmentation test is performed. Properties of groups of genes are considered. + List of genes Blocks of genes with significant functional term A annotated Homogeneously distributed Functional term C Blocks of genes with significant functional term B annotated -

Gene Set Enrichment Just ONE list (ranked)

::: Exercise 3: FatiScan Selected data Mootha et al. Nat Genet 2003

-RunT-rex for differential expression analysis- Files: fatiscan_diabetes_mootha_array.txt fatiscan_diabetes_mootha_classes.txt Results T-Rex Run T-Rex here http://www.gepas.org/

-Examine Results and send to FatiScan- Send to FatiScan here Check the FDR adjusted p-value

-Run Fatiscan with statistic-ranked list In FatiScan input form: select the organism Homo sapiens, Gene Ontology: biological process with the default filtering, 30 partitions, Two tailed Fisher's exact test, and press Run. As the sorted list comes from GEPAS it will automatically detect the sorting and the labels. This could be true or not depending on the relative abundance of this term. If you look to the rest of genes not activated in the experiment and the proportion of them related to metabolism is, let's say 10%, then you are right. Contrarily, if the proportion is, let's say 61%, then the experiment has probably nothing to do with metabolism. The comparison is compulsory to support such assertions.

Significantly altered KEGG Pathways (MGT y DM2/IGT):