Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical.

Slides:



Advertisements
Similar presentations
MicroArray Image Analysis Robin Liechti
Advertisements

Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Measuring Gene Expression Part 3
A Review of Image Analysis Software for Spotted Microarrays Jess Mar Department of Mathematics University of Queensland CBiS Microarray/Chip.
LimmaGUI A Point-and-Click Interface for cDNA Microarray Analysis James Wettenhall and Gordon Smyth Division of Genetics and Bioinformatics Walter and.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Introduction to Microarray Analysis and Technology Dave Lin - November 5, 2001.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
TIGR Spotfinder: a tool for microarray image processing
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Getting the numbers comparable
The second-simplest cDNA microarray data analysis problem Terry Speed, UC Berkeley Fred Hutchinson Cancer Research Center March 9, 2001.
Normalization for cDNA Microarray Data Yee Hwa Yang, Sandrine Dudoit, Percy Luu and Terry Speed. SPIE BIOS 2001, San Jose, CA January 22, 2001.
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Microarray Data Preprocessing and Clustering Analysis
Normalization Class web site: Statistics for Microarrays.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Gene Expression Data Analyses (2)
Low Level Statistics and Quality Control Javier Cabrera.
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
Image Analysis Class web site: Statistics for Microarrays.
HWW Gene Expression Experiments: How? Why? What’s the problem?
Introduction to genome biology and microarrays experiment
Corrections and Normalization in microarrays data analysis
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
SPH 247 Statistical Analysis of Laboratory Data. Two-Color Arrays Two-color arrays are designed to account for variability in slides and spots by using.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Preprocessing of cDNA microarray data Lecture 19, Statistics 246, April 1, 2004.
Image Quantitation in Microarray Analysis More tomorrow...
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Statistical Analyses of Microarray Data Rafael A. Irizarry Department of Biostatistics
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
Microarray - Leukemia vs. normal GeneChip System.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
Measuring Gene Expression Part 3 David Wishart Bioinformatics 301
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University, Sweden Plate Effects in cDNA Microarray Data.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb GP3xCLI GenePix Post-Processing.
The second-simplest cDNA microarray data analysis problem Terry Speed, UC Berkeley Bioinformatic Strategies For Application of Genomic Tools to Environmental.
Genomic Signal Processing Dr. C.Q. Chang Dept. of EEE.
Henrik Bengtsson Mathematical Statistics Centre for Mathematical Sciences Lund University Plate Effects in cDNA Microarray Data.
Variability & Statistical Analysis of Microarray Data GCAT – Georgetown July 2004 Jo Hardin Pomona College
Statistical Analyses of High Density Oligonucleotide Arrays Rafael A. Irizarry Department of Biostatistics, JHU (joint work with Bridget Hobbs and Terry.
The microarray data analysis Ana Deckmann Carla Judice Jorge Lepikson Jorge Mondego Leandra Scarpari Marcelo Falsarella Carazzolle Michelle Servais Tais.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
©2003/04 Alessandro Bogliolo Analysis of gene expression by means of Microarrays.
CDNA-Project cDNA project Julia Brettschneider (UCB Statistics)
Image Processing for cDNA Microarray Data
Getting the numbers comparable
Normalization for cDNA Microarray Data
Presentation transcript:

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Image Analysis on cDNA Microarray Data Demo of Spot Jean Yang October 24, 2000 Genetics & Bioinformatics Meetings

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, cDNA clones (probes) PCR product amplification purification printing microarray Hybridise target to microarray mRNA target) excitation laser 1laser 2 emission scanning analysis overlay images and normalise 0.1nl/spot

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research,Scanner Laser PMT Dye Glass Slide Objective Lens Detector lens Pinhole Beam-splitter

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Scanner Process Dye Photons ElectronsSignal LaserPMT A/D Convertor excitation amplification Filtering Time-space averaging

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, How to adjust for PMT? Cy3Cy saturated Very weak

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, After normalisation In addition, the ranking of the genes stays pretty much the same.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Practical Problems 1 Comet Tails Likely caused by insufficiently rapid immersion of the slides in the succinic anhydride blocking solution.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Practical Problems 2

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Practical Problems 3 High Background 2 likely causes: –Insufficient blocking. –Precipitation of the labeled probe. Weak Signals

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Practical Problems 4 Spot overlap: Likely cause: too much rehydration during post - processing.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Steps in Images Processing 1. Addressing: locate centers 2. Segmentation: classification of pixels either as signal or background. using seeded region growing). 3. Information extraction: for each spot of the array, calculates signal intensity pairs, background and quality measures.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Addressing This is the process of assigning coordinates to each of the spots. Automating this part of the procedure permits high throughput analysis. 4 by 4 grids 19 by 21 spots per grid

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Addressing 4 by 4 grids Within the same batch of print runs. Estimate the translation of grids Other problems: -- Mis-registration -- Rotation -- Skew in the array

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Segmentation methods Fixed circles Adaptive Circle Adaptive Shape –Edge detection. –Seeded Region Growing. (R. Adams and L. Bishof (1994) :Regions grow outwards from the seed points preferentially according to the difference between a pixel’s value and the running mean of values in an adjoining region. Histogram Methods –Adaptive threshold.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Seeds

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Limitation of circular segmentation —Small spot —Not circular Results from SRG

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Information Extraction —Spot Intensities —mean (pixel intensities). —median (pixel intensities). —Background values —Local —Morphological opening —Constant (global) —None —Quality Information Take the average

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Local Backgrounds

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Statistical Software - R

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Who are we comparing? Spot (SRG) –valley –morph ScanAlzye (fixed circle) GenePix (adaptive circle) QuantArray –Fixed circle –Adaptive (Chen’s method) –Histogram

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, How are we comparing? Foreground and Background Intensities M vs A plot Within slide variability Between slide variability Ability to differentiate important genes from noise

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Foreground and Background comparison

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Does the image analysis matter? Spot.nbg Spot.morph Spot.valley ScanAlyze

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Background makes a difference Background methodSegmentation methodExp1 Exp2 S.nbg66 Gp.nbg76 SA.nbg66 No backgroundQA.fix.nbg76 QA.hist.nbg76 QA.adp.nbg1414 S.valley1721 GP1111 Local surroundingSA1214 QA.fix1823 QA.hist98 QA.adp2726 OthersS.morph99 S.const1414 Medians of the SD of log 2 (R/G) for 8 replicated spots multiplied by 100 and rounded to the nearest integer.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Between slide variability

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, T

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Adjusted p-values

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Acknowledgments Terry Speed Michael Buckley Sandrine Dudoit Natalie Roberts Ben Bolstad CSIRO Image Analysis Group Ryan Lagerstorm Richard Beare Hugues Talbot Kevin Cheong Matt Callow (LBL) Percy Luu (USB) Dave Lin (USB) Vivian Pang (USB) Elva Diaz (USB) WEHI Bioinformatics group

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Steps in Images Processing 1. Addressing: locate centers 2. Segmentation: classification of pixels either as signal or background. using seeded region growing). 3. Information extraction: for each spot of the array, calculates signal intensity pairs, background and quality measures.

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Steps in Image Processing Spot Intensities –mean (pixel intensities). –median (pixel intensities). –Pixel variation ( IQR of log (pixel intensities ). Background values –Local –Morphological opening –Constant (global) –None Quality Information Signal Background 3. Information Extraction

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Addressing Registration

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, Quality Measurements Array –Correlation between spot intensities. –Percentage of spots with no signals. –Distribution of spot signal area. Spot –Signal / Noise ratio. –Variation in pixel intensities. –Identification of “bad spot” (spots with no signal). Ratio (2 spots combined) –Circularity

Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, The Walter and Eliza Hall Institute of Medical Research, T