Low Level Statistics and Quality Control Javier Cabrera.

Slides:



Advertisements
Similar presentations
Applications of one-class classification
Advertisements

MicroArray Image Analysis Robin Liechti
High-dimensional data analysis: Microarrays and multiple testing Mark van de Wiel 1,2 1. Dep. of Mathematics, VU University Amsterdam 2. Dep. of Biostatistics.
Microarray Quality Assessment Issues in High-Throughput Data Analysis BIOS Spring 2010 Dr Mark Reimers.
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Automatic Identification of Bacterial Types using Statistical Image Modeling Sigal Trattner, Dr. Hayit Greenspan, Prof. Shimon Abboud Department of Biomedical.
Microarray Normalization
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
TIGR Spotfinder: a tool for microarray image processing
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Getting the numbers comparable
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Preprocessing Methods for Two-Color Microarray Data
Low-Level Analysis and QC Regional Biases Mark Reimers, NCI.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
An Approach to Korean License Plate Recognition Based on Vertical Edge Matching Mei Yu and Yong Deak Kim Ajou University Suwon, , Korea 指導教授 張元翔.
Gene Expression Data Analyses (2)
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
1 Test of significance for small samples Javier Cabrera.
Exploring Microarray data Javier Cabrera. Outline 1.Exploratory Analysis Steps. 2.Microarray Data as Multivariate Data. 3.Dimension Reduction 4.Correlation.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
Microarray Data Analysis Data quality assessment and normalization for affymetrix chips.
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Microarray Preprocessing
1 Normalization Methods for Two-Color Microarray Data 1/13/2009 Copyright © 2009 Dan Nettleton.
Microarray Basics Part 1: Choosing a platform, setting up, data preprocessing.
Image Quantitation in Microarray Analysis More tomorrow...
Tal Mor  Create an automatic system that given an image of a room and a color, will color the room walls  Maintaining the original texture.
(2) Ratio statistics of gene expression levels and applications to microarray data analysis Bioinformatics, Vol. 18, no. 9, 2002 Yidong Chen, Vishnu Kamat,
Copyright 2000, Media Cybernetics, L.P. Array-Pro ® Analyzer Software.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Randomization issues Two-sample t-test vs paired t-test I made a mistake in creating the dataset, so previous analyses will not be comparable.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
CS 6825: Binary Image Processing – binary blob metrics
Agenda Introduction to microarrays
Microarray - Leukemia vs. normal GeneChip System.
Scenario 6 Distinguishing different types of leukemia to target treatment.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Levels of Image Data Representation 4.2. Traditional Image Data Structures 4.3. Hierarchical Data Structures Chapter 4 – Data structures for.
Extracting binary signals from microarray time-course data Debashis Sahoo 1, David L. Dill 2, Rob Tibshirani 3 and Sylvia K. Plevritis 4 1 Department of.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb GP3xCLI GenePix Post-Processing.
Analyzing Expression Data: Clustering and Stats Chapter 16.
Microarray Data Analysis The Bioinformatics side of the bench.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Course 3 Binary Image Binary Images have only two gray levels: “1” and “0”, i.e., black / white. —— save memory —— fast processing —— many features of.
©2003/04 Alessandro Bogliolo Analysis of gene expression by means of Microarrays.
From: Automated voxel classification used with atlas-guided diffuse optical tomography for assessment of functional brain networks in young and older adults.
Microarray - Leukemia vs. normal GeneChip System.
Image Processing for cDNA Microarray Data
The Basics of Microarray Image Processing
Volume 6, Issue 5, Pages e5 (May 2018)
Getting the numbers comparable
CSCI N317 Computation for Scientific Applications Unit Weka
Presentation transcript:

Low Level Statistics and Quality Control Javier Cabrera

Outline 1.Processing Steps. 2.Spotting the Raw Image 3.Panel quality 4.Array quality 5.Area problems

Scanned Image Spotted Image Preprocessing Gene Exp. Mat. Statistical Analysis Data processing steps

Preprocessing steps -Microarrays are scanned -The image is converted into spot intensities for analysis. -Spot intensity: Reflects the amount of labeled probe that hybridized to it. -Run a series of quality checks on the data. -The spot intensity is adjusted for background effects.

(A) (B) (C) speckles Light Background Shape Raw array

u Gridding: where are the spots? u Edge detection: Cho, Meer, Cabrera (1997) u Segmentation: which pixels correspond to the spot (signal) and which to background? u Measurement: what is the intensity at each spot? Spot intensity = average pixel intensity within the spot Background intensity = average pixel intensity immediately around the spot Spotting the Raw image

Segmentation -Fixed circle segmentation: by fitting a circle with a constant diameter to all the spots. -Adaptive circle segmentation: fitting circles with different diameters to different spots -Seeded region growing algorithm (this works better) (i)Seed specification. (ii)Region Growing. (iii)Stopping Rule.

Segmentation -Fixed circle segmentation: by fitting a circle with a constant diameter to all the spots. -Adaptive circle segmentation: fitting circles with different diameters to different spots -Seeded region growing algorithm (this works better) (i)Seed specification. (ii)Region Growing. (iii)Stopping Rule.

Spot quality: Assess the quality of the spots: uSpot Intensity  Background Intensity uSpot and background Std. Dev. Or CV uSpot Circularity uSignal to noise ratio Use Scatter plot and scatter matrices to look for patterns, outliers. Data example: Preprocessing the spot intensity values

Exploring Channel 1,2 Intensities, Background

Spot quality -Spot Intensity -Background I -Circularity -Uniformity -Signal to Noise The graphs show some outliers but there are not very dramatic.

Panel and Array quality: Assess the quality of the panels and the entire arrays - Image plot - Boxplots of panels - Quality diagnostic plot from Bioconductor. Preprocessing the spot intensity values

Target Info Probe Info = mrrayLayout : Structure, construction marrayInfo: Probe Anotation Intensity Measures : maRf, maGf, maRb, maGb, maW Preprocessing the spot intensity values

Systematic Effects Changes in measurement scale due to: Time: Day effect or other order Operator Batch

Example of day Effect Day Two Day One

Day effect Day effect removed via linear model + normalized Removal of Systematic Effects

Array quality: Assess the quality of the array - area problems Area Effects

Array Quality Image plot of a good array SignalBackground

Image plot of a defective array SignalBackground

Area Problems (i)Large spots covering a good part of the area of the background image. These spots show higher or lower intensities than the rest of the image. (ii)Vertical or horizontal strips on the background image that show higher or lower intensities. (iii) Diagonal strips again showing higher or lower background intensities. (iv) A ramp in the background intensities going across the array. (v)Bleeding in the spotted image showing sequences of consecutive.

Algorithm: Step 1. Split the image into high intensity and low intensity spots: Y rc =1 for high intensity I rc > c Y rc =0 for low intensity I rc  c c = (I I 0.95 )/2 Step 2. Fit a quadratic discriminant function to the binary response Y rc using the spot coordinates (r,c) on the microarray as predictors. Step 3. Let q=proportion of correct classifications. Step 4. Generate M=300 images by randomly permute the rows and columns of {I rc } and for each image calculate the corresponding q 1,…,q 300. Step 5 Use prop{q i >q } as a quality measure of the array or p-value.

Low High Histogram of intensities

Microarray with an area problem

Microarray with a ramp

Good Microarray

Image Quality Graph

Microarray Graph