Functional Genomics in Evolutionary Research

Slides:



Advertisements
Similar presentations
From the homework: Distribution of DNA fragments generated by Micrococcal nuclease digestion mean(nucs) = bp median(nucs) = 110 bp sd(nucs+ = 17.3.
Advertisements

Microarray technology and analysis of gene expression data Hillevi Lindroos.
Detecting Differentially Expressed Genes Pengyu Hong 09/13/2005.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
Microarrays Dr Peter Smooker,
Microarray Data Preprocessing and Clustering Analysis
Differentially expressed genes
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Statistical Analysis of Microarray Data
Analysis of Differential Expression T-test ANOVA Non-parametric methods Correlation Regression.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Introduce to Microarray
Analysis of High-throughput Gene Expression Profiling
Exploring the Metabolic and Genetic Control of Gene Expression on a Genomic Scale Joseph L. DeRisi, Vishwanath R. Iyer, Patrick O. Brown Science Vol. 278.
with an emphasis on DNA microarrays
Affymetrix vs. glass slide based arrays
Data Type 1: Microarrays
Applying statistical tests to microarray data. Introduction to filtering Recall- Filtering is the process of deciding which genes in a microarray experiment.
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Gene Expression and Evolution. Why are Evolutionists Interested in Gene Expression? Divergence in gene expression can underlie differences between taxa.
Statistical Principles of Experimental Design Chris Holmes Thanks to Dov Stekel.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Example of a Functional Genomics Study Molecular Ecology ,
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Marshall University School of Medicine Department of Biochemistry and Microbiology BMS 617 Lecture 6 –Multiple hypothesis testing Marshall University Genomics.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
NCode TM miRNA Analysis Platform Identifies Differentially Expressed Novel miRNAs in Adenocarcinoma Using Clinical Human Samples Provided By BioServe.
Microarray: An Introduction
Estimating the False Discovery Rate in Genome-wide Studies BMI/CS 576 Colin Dewey Fall 2008.
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
DNA Microarray. Microarray Printing 96-well-plate (PCR Products) 384-well print-plate Microarray.
Transcriptomics History and practice.
Using Web-Based Tools for Microarray Analysis
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Microarray - Leukemia vs. normal GeneChip System.
Microarray Experiment Design and Data Interpretation
Statistical Testing with Genes
The Basics of cDNA Microarray Technology
Genome Wide Association Studies using SNP
Functional Genomics in Evolutionary Research
Microarray Technology and Applications
 The human genome contains approximately genes.  At any given moment, each of our cells has some combination of these genes turned on & others.
Lecture 11 By Shumaila Azam
Introduction to cDNA Microarray Technology
Overview Expression data basics Introduction Biological network data
Microarrays 1/31/2018.
Loyola Marymount University
Transcriptomics History and practice.
Statistical Analysis and Design of Experiments for Large Data Sets
Gene Expression Analysis
Microarray Data Analysis
Loyola Marymount University
Statistical Testing with Genes
Loyola Marymount University
Topic Cloning and analyzing oxalate degrading enzymes to see if they dissolve kidney stones with Dr. VanWert.
Loyola Marymount University
Data Type 1: Microarrays
Presentation transcript:

Functional Genomics in Evolutionary Research

What Is Microarray Technology? High throughput method for measuring simultaneously, mRNA abundances for thousands of genes. Thousands of probes or features adhered to a solid substrate at known x,y coordinates. Probes: Spotted cDNA ~ 200 bp Oligo = 25 to 60 bp

Why Is Microarray Technology Important? From NSF Program Announcement: Environmental Genomics

How Do Microarrays Work? Hybridization Technique - RNA targets isolated from a cell line or tissue of interest are labeled and hybridized to the probes. - Label intensity at a given location on the substrate correlates with the amount of target for a given mRNA (gene) present in the sample. Differentially expressed Genes: Identified statistically (e.g. t-test) by comparing control vs experimental.

The Burden of Multiple Testing A given microarray may have over 40,000 probes!!! This means that you may run as many as 40,000 statistical tests. If you reject a null hypothesis when P < 0.05, then 5% of the time you are rejecting true null hypotheses. If you run 40,000 tests, then by chance alone you will reject ~ 40,000 x 0.05 = 2000 true null hypotheses (i.e., you will have ~ 2000 false positives)

Sources of Variation in Microarray Experiments Biological (1) Experimental Treatments (2) Individual variance... may or may not be good (3) Nonspecific hybridization Paralogs of gene families Technical (Bad) (1) RNA quality (2) Dye biases (3) Stochasticity during scanning, image processing (5) Errors during probe synthesis or deposition (6) Stochasticity in labeling targets

What gene expression changes are associated with the evolution of paedomorphosis? Example Design Larval Adult Metamorphosis R x x x x x x x x x x x x x x (1) Sample tissue from 15 time points (x), including an early reference (R) time point. (2) Compare expression for each time point and the reference on a DNA chip. Metamorphosis? Metamorphic Life Cycle Paedomorphic Life Cycle (3) Quantify relative expression of each gene across all DNA chips. (2 life cycles x 3 tissues x 14 timepoints) (4) Model gene expression to determine how genes are expressed temporally within life cycle cycles for each tissue. (6) Compare gene expression profiles among life cycles and tissues to identify differentially expressed genes. (7)Verify results by rt-PCR and analyze candidates in thyroid hormone-induced paedomorphs.

Visualization & Categorization Can be done for genes and/or arrays... Options Include a variety of multivariate and pattern matching techniques including the methodologies listed below Quadratic Regression Principal Component Analysis ClusteringHeat maps Liu et al. 2005... From the Stromberg Group here at UK

Gene Ontology & Biological Relevance Microarray datasets can be overwhelming because they contain A LOT of information Even experts on a system can be overwhelmed by the number of genes that are differentially regulated in some experiments Having a standardized nomenclature that places a gene into one or more biological contexts can be invaluable for functional grouping (previous grouping techniques were irrespective of biological information) Gene Ontology is a standardized hierarchical nomenclature that classifies genes under three broad categories

Example of a Functional Genomics Study Molecular Ecology 2006 15, 4635-4643

Drosophila Most species are very poor ecological model organisms. D. mojavensis is cactophilic: it uses 4 different kinds of cactus host in the Sonoran Desert. Oviposits in necrotic tissues, exposing larvae to varied toxic chemicals.

Objective Identify gene expression differences of 3rd instar larvae reared between two chemically distinct cactus hosts: Agria (Stenocereus gummosus), native host Organpipe (Stenocereus thurberi), alternative host Used a custom microarray (6520 anonymous cDNA fragments that were pinned robotically to glass slides)

Organpipe vs Agria Cacti Differ in lipids, triterpenes, and glycosides. Differ in alcohol content. Adh is duplicated in D. mojavensis and The paralogs are known to play different roles in host adaptation.

Mixed Model Anova Approach Relative hybridization Intensity 1) = Random Technical and Residual Variation Yij = µ + ARRAYi + DYEj + ARRAY × DYEij + Residualij Residual Variation Per Gene 2) = Random Technical and Fixed Technical and Biological Variation Residualijkl = µ + ARRAYi + DYEj + CACTUS + ARRAY x Spotil + Errorijkl

Correcting for Multiple Tests Bonferroni correction: More conservative test where the significance threshold is divided by the total number of tests. False Discovery Rate (FDR): Less conservative test that calculates the number of false positives within a set of significant values (P<0.05) and then calculates a new significance threshold , q.

Identifying Differentially Expressed Genes Greater Expression Organpipe Greater Expression Agria P value for Each Gene Specific Anova -log(P) Bonferroni (173) False Discovery Rate (1034) Fold Difference Log2

Representation of Up-regulated Genes Among Gene Ontology Categories.

Conclusions (i) Cactus host usage affects patterns of gene transcription. (ii) Loci whose function involve detoxification were differentially regulated in response to a cactus host shift. (iii) A subset of the differentially expressed loci may have arisen de novo in the D. mojavensis lineage.