Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.

Slides:



Advertisements
Similar presentations
NASC Normalisation and Analysis of the Affymetrix Data David J Craigon.
Advertisements

Lecture 9 Microarray experiments MA plots
Quality control of Affymetrix arrays. What can go wrong? RNA degradation (before hyb) –3’/5’ Dirty samples –background, % present calls Uneven hybridizations.
Bias, Variance, and Fit for Three Measures of Expression: AvDiff, Li &Wong’s, and AvLog(PM-BG) Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Bias, Variance, and Fit for Three Measures of Expression: AvDiff, Li &Wong’s, and AvLog(PM-BG) Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Microarray Quality Assessment Issues in High-Throughput Data Analysis BIOS Spring 2010 Dr Mark Reimers.
Gene Expression Index Stat Outline Gene expression index –MAS4, average –MAS5, Tukey Biweight –dChip, model based, multi-array –RMA, model.
Microarray Normalization
Zhongxue Chen, Monnie McGee, Qingzhong Liu and Richard Scheuermann
Analysis and Interpretation of Microarray Data Michael F. Miles, M.D., Ph.D. Depts. of Pharmacology/Toxicology and Neurology and the Center for Study of.
Introduction to Affymetrix Microarrays
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Development, Implementation and Testing of a DNA Microarray Test Suite Ehsanul Haque Mentors: Dr. Cecilie Boysen Dr. Jim Breaux ViaLogy Corp.
Statistical Methods in Microarray Data Analysis Mark Reimers, Genomics and Bioinformatics, Karolinska Institute.
Getting the numbers comparable
Probe Level Analysis of AffymetrixTM Data
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Normalization Class web site: Statistics for Microarrays.
Low-Level Analysis and QC Regional Biases Mark Reimers, NCI.
Summarizing and comparing GeneChip  data Terry Speed, UC Berkeley & WEHI, Melbourne Affymetrix Users Meeting, Friday June 7, 2002 Redwood City, CA.
Identification of spatial biases in Affymetrix oligonucleotide microarrays Jose Manuel Arteaga-Salas, Graham J. G. Upton, William B. Langdon and Andrew.
1 Preprocessing for Affymetrix GeneChip Data 1/18/2011 Copyright © 2011 Dan Nettleton.
Low Level Statistics and Quality Control Javier Cabrera.
SNP chips Advanced Microarray Analysis Mark Reimers, Dept Biostatistics, VCU, Fall 2008.
Normalization of 2 color arrays Alex Sánchez. Dept. Estadística Universitat de Barcelona.
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Felix Naef & Marcelo Magnasco, GL meeting, Nov Outline Background subtraction Probeset statistics Excursions into.
1 Models and methods for summarizing GeneChip probe set data.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Affymetrix GeneChip Data Analysis Chip concepts and array design Improving intensity estimation from probe pairs level Clustering Motif discovering and.
Theoretical and experimental comparisons of gene expression indexes for oligonucleotide microarrays Division of Human Cancer Genetics Ohio State University.
Gene expression array and SNP array
Different Expression Multiple Hypothesis Testing STAT115 Spring 2012.
Summaries of Affymetrix GeneChip probe level data By Rafael A. Irizarry PH 296 Project, Fall 2003 Group: Kelly Moore, Amanda Shieh, Xin Zhao.
Microarray Preprocessing
Lecture 10. Microarray and RNA-seq
Microarray Data Analysis Illumina Gene Expression Data Analysis Yun Lian.
(4) Within-Array Normalization PNAS, vol. 101, no. 5, Feb Jianqing Fan, Paul Tam, George Vande Woude, and Yi Ren.
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
Lecture 22 Introduction to Microarray
CDNA Microarrays MB206.
Bioconductor Packages for Pre-processing DNA Microarray Data affy and marray Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Agenda Introduction to microarrays
Assessing expression data quality in high-density oligonucliotide arrays.
A A R H U S U N I V E R S I T E T Faculty of Agricultural Sciences Introduction to analysis of microarray data David Edwards.
Lo w -Level Analysis of Affymetrix Data Mark Reimers National Cancer Institute Bethesda Maryland.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Lecture Topic 5 Pre-processing AFFY data. Probe Level Analysis The Purpose –Calculate an expression value for each probe set (gene) from the PM.
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Model-based analysis of oligonucleotide arrays, dChip software Statistics and Genomics – Lecture 4 Department of Biostatistics Harvard School of Public.
Pre-processing in DNA microarray experiments Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor short course Summer 2002.
ABC D EF GH I JKL. Supplementary figure S1: Exemplary overview of the quality assessment plots generated by Robin. All plots have been generated using.
Microarray Data Pre-Processing
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Pre-processing DNA Microarray Data Sandrine Dudoit, Robert Gentleman, Rafael Irizarry, and Yee Hwa Yang Bioconductor Short Course Winter 2002 © Copyright.
Statistical Analyses of High Density Oligonucleotide Arrays Rafael A. Irizarry Department of Biostatistics, JHU (joint work with Bridget Hobbs and Terry.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
The microarray data analysis Ana Deckmann Carla Judice Jorge Lepikson Jorge Mondego Leandra Scarpari Marcelo Falsarella Carazzolle Michelle Servais Tais.
Gene expression  Introduction to gene expression arrays Microarray Data pre-processing  Introduction to RNA-seq Deep sequencing applications RNA-seq.
Exploration, Normalization, and Summaries of High Density Oligonucleotide Array Probe Level Data Rafael A. Irizarry Department of Biostatistics, JHU (joint.
Using ArrayStar with a public dataset
Introduction to Affymetrix GeneChip data
Copy-number estimation using Robust Multichip Analysis - Supplementary materials for the aroma.affymetrix lab session Henrik Bengtsson & Terry Speed Dept.
CDNA-Project cDNA project Julia Brettschneider (UCB Statistics)
Getting the numbers comparable
Pre-processing AFFY data
Presentation transcript:

Microarray Data Analysis Data quality assessment and normalization for affymetrix chips

2 Outline Visualization and diagnostic plots Normalization

3 Microarray studies life cycle Here we are

Looking at microarray data Diagnostic Plots Was the experiment a success?

5 Image plots for affymetrix chips

6 MA-plot for GeneChip arrays (1 color) MT WT RMA MT intensity for each probe set RMA WT intensity for each probe set aRNA M Log 2 (MT) - Log 2 (WT ) A Log 2 (MT*WT) / 2 (signal strength)

7 Box plots

8 Density plots

9 Digestion plots

Preprocessing affy chips

11 Preprocessing steps Computing expression values for each probe set requires 3-steps  Background correction  Normalization  Probe set summaries

12 Most popular approaches Affymetrix’s own MAS 5 or GCOS 1.0 algorithms RMA (Robust Multichip Analysis)  Irizarry, Bolstad, Collin, Cope, Hobbs, Speed (2003), Summaries of Affymetrix GeneChip probe level data. NAR 31(4):e15 dChip Li and Wong (2001). Model-based analysis of oligonucleotide arrays: expression index computation and outlier detection. PNAS 98, 31-

13 MAS Background correction  E j = PM j - MM j * where MM j * is chosen so that E j is non-negative Normalization  Scale so that mean E j is same for each chip Probe Set Summary  log(Signal Intensity) = TukeyBiweight(log Ej)

14 RMA- Background correction Ignore MM, fit model to PM  PM = Background (N(0,  2 ) + Signal (Exp(  ))

15 RMA-Normalization Force the empirical distribution of probe intensities to be the same for every chip in an experiment The common distribution is obtained by averaging each quantile across chips: Quantile Normalization

16 One distribution for all arrays: the black curve

17 RMA: Probe set summary Robustly fit a two-way model yielding an estimate of log2(signal) for each probe set Fit may be by  median polish (quick) or by  Mestimation (slower but yields standard errors and good quality RMA reduces variability without loosing the ability to detect differential expression

18 MA plots before and after RMA

19 Summary Microarray experiments have many “hot spots” where errors or systematic biases can apper Visual and numerical quality control should be performed Usually intensities will require normalisation  At least global or intensity dependent normalisation should be performed  More sophisticated procedures rely on stronger assumptions  Must look for a balance