CDNA-Project cDNA project Julia Brettschneider (UCB Statistics)

Slides:



Advertisements
Similar presentations
Bias, Variance, and Fit for Three Measures of Expression: AvDiff, Li &Wong’s, and AvLog(PM-BG) Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Advertisements

Bias, Variance, and Fit for Three Measures of Expression: AvDiff, Li &Wong’s, and AvLog(PM-BG) Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
Gene Expression Index Stat Outline Gene expression index –MAS4, average –MAS5, Tukey Biweight –dChip, model based, multi-array –RMA, model.
Microarray Normalization
Zhongxue Chen, Monnie McGee, Qingzhong Liu and Richard Scheuermann
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Normalization of Microarray Data - how to do it! Henrik Bengtsson Terry Speed
Development, Implementation and Testing of a DNA Microarray Test Suite Ehsanul Haque Mentors: Dr. Cecilie Boysen Dr. Jim Breaux ViaLogy Corp.
Statistical Methods in Microarray Data Analysis Mark Reimers, Genomics and Bioinformatics, Karolinska Institute.
Sandrine Dudoit1 Microarray Experimental Design and Analysis Sandrine Dudoit jointly with Yee Hwa Yang Division of Biostatistics, UC Berkeley
Getting the numbers comparable
Probe Level Analysis of AffymetrixTM Data
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Normalization Class web site: Statistics for Microarrays.
Summarizing and comparing GeneChip  data Terry Speed, UC Berkeley & WEHI, Melbourne Affymetrix Users Meeting, Friday June 7, 2002 Redwood City, CA.
Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Introduce to Microarray
Gene Expression Microarrays Microarray Normalization Stat
Summaries of Affymetrix GeneChip probe level data By Rafael A. Irizarry PH 296 Project, Fall 2003 Group: Kelly Moore, Amanda Shieh, Xin Zhao.
Microarray Preprocessing
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
Image Quantitation in Microarray Analysis More tomorrow...
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Affymetrix GeneChips Oligonucleotide.
Affymetrix vs. glass slide based arrays
Statistical Analyses of Microarray Data Rafael A. Irizarry Department of Biostatistics
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Bioconductor Packages for Pre-processing DNA Microarray Data affy and marray Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Microarray - Leukemia vs. normal GeneChip System.
Lo w -Level Analysis of Affymetrix Data Mark Reimers National Cancer Institute Bethesda Maryland.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Model-based analysis of oligonucleotide arrays, dChip software Statistics and Genomics – Lecture 4 Department of Biostatistics Harvard School of Public.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Pre-processing in DNA microarray experiments Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
Figure SOM1. Functional roles of the genes affected in zmet2-m1 mutants. Although the genes localized on the intracellular membranes were slightly over-represented.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Introduction to Microarrays. The Central Dogma.
Pre-processing DNA Microarray Data Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor Short Course Winter 2002 © Copyright.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
Statistical Analyses of High Density Oligonucleotide Arrays Rafael A. Irizarry Department of Biostatistics, JHU (joint work with Bridget Hobbs and Terry.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data Rafael A. Irizarry Department of Biostatistics, JHU (joint.
AN INTRODUCTION TO GENE EXPRESSION ANALYSIS BY MICROARRAY TECHNIQUE (PART II) DR. AYAT B. AL-GHAFARI MONDAY 10 TH OF MUHARAM 1436.
Microarray Data Analysis Xuming He Department of Statistics University of Illinois at Urbana-Champaign.
Introduction to Affymetrix GeneChip data
Microarray - Leukemia vs. normal GeneChip System.
Microarray Technology and Applications
Normalization Methods for Two-Color Microarray Data
Getting the numbers comparable
Normalization for cDNA Microarray Data
Data Type 1: Microarrays
Pre-processing AFFY data
Presentation transcript:

cDNA-Project cDNA project Julia Brettschneider (UCB Statistics)

cDNA project

Biological question What are the underlying molecular cascades that are activated following brain injury and lead the cells into and through cell death?

Experiments Platform: Incyte mouse cDNA arrays Twelve cDNA dye swap pairs 9024 genes total 4 print-tip groups (48 x 47 spots each) Controls: The last 24 spots in each print-tip group

Find differentially expressed genes… injured (i) vs. sham (s) hypocamp. subfield CA3 (C) vs. upper blade Dentate (D) between different marker status (+,>,-) would also be interesting…

The 24 comparisons #01 iC+ sC- #02 iC- sC- #03 iD+ sD- #04 iD- sD- #05 iC+ iC- #06 iD+ iD- #07 sC- iC+ #08 iC- iC+ #09 sC- iC- #10 sD- iD+ #11 iD- iD+ #12 sD- iD- #13 iC+ iC> #14 iC> iC- #15 iC> sC- #16 iD+ iD> #17 iD> iD- #18 iD> sD- #19 iC> iC+ #20 iC- iC> #21 sC- iC> #22 iD> iD+ #2C iD> iD- #24 sD- iD>

Dye swap pair (injured +) vs (sham -)

Dye swap pairs (injured >) vs (sham -)

Incyte’s Flags Between 22% and 91% of the genes flagged, in 15 chips more than 50%, in 8 chips more than 75%. Incyte’s criterion for flagging: - signal to bg ratio smaller than 2.5, or - spot area smaller than 40% in both dyes Flagged genes are not used in normalization

Problems Design (regions CA3 and dentate never hybridized together) “Flagged genes” Low quality chips Inconsistent dye swap pairs Normalization

Image analysis and normalization Image analysis using “Spot” (Yang, Buckley, and Speed 1999, Program written by CSIRO Mathematics and Information Science) Lowess normalization (Dudoit et al. 2000) for print-tip groups

Summary cDNA project Image analysis redone Normalization Gene rank lists

Affymetrix Technoloy Statistics Drosophila project

Technology of oligonucleotide arrays Each gene is interrogated by a number (16 - 20) of probe pairs Each probe pair consists of two oligonucleotides (typically 25 bp long): Perfect Match (PM) fits the target exactly Missmatch (MM) has a middle base flipped

Chip Probes PM MM

PM MM …TGCTGGGTTACGCAGTCTTACTGACCATACACGCACTGCT… PM: AATGCGTCAGAATGACTGGTATGTGCGTCACGA MM: AATGCGTCAGAAAGACTGGTATGTGCGTCACGA ^ missmatch position

Steps in the analysis Converting .cel file into probe pair data Normalization Calculation of an expression value for each gene based on the intensities of its 20 probe pairs Comparing treatment/control, WT/mutant etc. Hypothesis testing Clustering etc.

Gene expression estimation Genechip (Affymetrix): Average difference between the probe pairs Average log ratio Now: robust/resistent version

Gene expression estimation Li and Wong (2000) fit a model expression in chip i (j probe pair index)

Gene expression estimation Irizarry, Speed et al. (2002): Average log (PM - BG), where BG global background Now: robust version using median polish

Example Scatters

Mixture Model fits

Fits for significant genes

Significance of regression

Residual Plots

Axon guidance in the Drosophila CNS How do neurons find their correct targets, make appropriate synaptic connections, and set and adjust their size and strength? Find genes which regulate these mechanisms by comparing different mutants Loss Of Function (LOF) and Gain Of Functions (GOF) mutants for a number of proteins

Robo, Slit & Comm Robo: transmembrane protein, receptor for slit, negatively controlled by Comm Slit: extracellular protein, expressed by midline glia, ligand for Robo receptor Comm: surface protein, expressd on midline cells, transferred to commissural neurons

Phenotypes

Experiments 12 (plus…) different mutants, plus WT 4 replicates for each Affymetrix Drosophila chips about 13 thousand genes in total

Comparisons Formulate particular questions/hypothesis for subgroups of comparisons, - Slit and Robo, Robo2 mutants have similar phenotypes, compare their gene expression - Compare Robo1, Robo2, and Robo1,2 - etc.

Data Analysis Using Affy R-package (Bolstad, Irizarry) Quantile normalization (Bolstad 2001) Currently implementing an additiv model on the probe pair level: - simultanously for mutant/WT - incorporates replicates

Acknowledgement Ben Bolstad Sandrine Dudoit Phil Spector Terry Speed Jean Yee Hwa Yang (UCB Statistics) Tiago Magelhaes (Goodman lab, UCB) Paolo Marciano (Pennsylvania) Francois Collin (Genlogic) Rafael Irizarry (Johns Hopkins) www.stat.berkeley.edu/users/terry/zarray/Html/index.html