TIGR Spotfinder: a tool for microarray image processing

Slides:



Advertisements
Similar presentations
AFGC Damares C Monte Carnegie Institution of Washington,
Advertisements

MicroArray Image Analysis Robin Liechti
Digital Image Processing Lecture 3: Image Display & Enhancement
The Rice Functional Genomics Program of China cDNA microarray database (RIFGP-CDMD) consists of complete datasets, including the probe sequences, microarray.
Pre-processing in DNA microarray experiments Sandrine Dudoit PH 296, Section 33 13/09/2001.
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
A Review of Image Analysis Software for Spotted Microarrays Jess Mar Department of Mathematics University of Queensland CBiS Microarray/Chip.
AIIA Lab, Department of Informatics Aristotle University of Thessaloniki Z.Theodosiou, F.Raimondo, M.E.Garefalaki, G.Karayannopoulou, K.Lyroudia, I.Pitas,
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
Mathematical Statistics, Centre for Mathematical Sciences
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Image Quantitation in Microarray Analysis More tomorrow...
Genomic Arrays: Tools for cancer gene discovery Ian Roberts MRC Cancer Cell Unit Hutchison MRC Research Centre
Microarray Data Preprocessing and Clustering Analysis
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
09/05/2005 סמינריון במתמטיקה ביולוגית Dimension Reduction - PCA Principle Component Analysis.
Gene Expression Data Analyses (2)
Low Level Statistics and Quality Control Javier Cabrera.
Summer at ViaLogy Ronald J. Perez. ViaLogy Developers of computational products for increased performance of molecular detection systems ViaAmp Gene expression.
Sample preparation 1. Design experiment Question? Replicates? Test? 2. Perform experiment 4. Label RNA Amplification? Direct or indirect? Label? wild.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Scanning and image analysis Scanning -Dyes -Confocal scanner -CCD scanner Image File Formats Image analysis -Locating the spots -Segmentation -Evaluating.
lecture 2, linear imaging systems Linear Imaging Systems Example: The Pinhole camera Outline  General goals, definitions  Linear Imaging Systems.
B IOINFORMATICS Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 8 Analyzing Microarray Data Aleppo University Faculty of technical.
Filtering and Normalization of Microarray Gene Expression Data Waclaw Kusnierczyk Norwegian University of Science and Technology Trondheim, Norway.
Mark Hartnett Software Support Engineer December 7, 2001 Introduction to Analysis and Feature Extraction Software.
Image Quantitation in Microarray Analysis More tomorrow...
Scanning and Image Processing -by Steve Clough. GSI Lumonics cDNA microarrays use two dyes with well separated emission spectra such as Cy3 and Cy5 to.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Digital Images Chapter 8 Exploring the Digital Domain.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Analysis of Microarray Data 1.Scan the images 2.Quantify intensity of spots 3.Normalization 4.Analysis of data 5.Identification of genes of interest 6.Validation.
Hybridization and data acquisition –Hybridization –Scanning –Image analysis –Background correction and filtering –Data transformation Methods for normalization.
IMAGE INFORMATICS SOLUTIONS Extracting Information From Images Array-Pro 4.5 Training, May 2003.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
Test1 April 2004 Microarray Data Management Jianwei (Jerry) Li.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Panu Somervuo, March 19, cDNA microarrays.
1 Two Color Microarrays EPP 245/298 Statistical Analysis of Laboratory Data.
WORKSHOP SPOTTED 2-channel ARRAYS DATA PROCESSING AND QUALITY CONTROL Eugenia Migliavacca and Mauro Delorenzi, ISREC, December 11, 2003.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Analysis of Microarray Data Analysis of images Preprocessing of gene expression data Normalization of data –Subtraction of Background Noise –Global/local.
A New Oklahoma Bioinformatics Company. Microarray and Bioinformatics.
BMME 560 & BME 590I Medical Imaging: X-ray, CT, and Nuclear Methods Introductory Topics Part 2.
Microarray - Leukemia vs. normal GeneChip System.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
ImArray - An Automated High-Performance Microarray Scanner Software for Microarray Image Analysis, Data Management and Knowledge Mining Wei-Bang Chen and.
Statistical Methods for Identifying Differentially Expressed Genes in Replicated cDNA Microarray Experiments Presented by Nan Lin 13 October 2002.
Computational Laboratory: aCGH Data Analysis Feb. 4, 2011 Per Chia-Chin Wu.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
MICROARRAYS D’EXPRESSIÓ ESTUDI DE REGULADORS DE LA TRANSCRIPCIÓ DE LA FAMILIA trxG M. Corominas:
(1) Normalization of cDNA microarray data Methods, Vol. 31, no. 4, December 2003 Gordon K. Smyth and Terry Speed.
Disease Diagnosis by DNAC MEC seminar 25 May 04. DNA chip Blood Biopsy Sample rRNA/mRNA/ tRNA RNA RNA with cDNA Hybridization Mixture of cell-lines Reference.
CGH Data BIOS Chromosome Re-arrangements.
The microarray data analysis Ana Deckmann Carla Judice Jorge Lepikson Jorge Mondego Leandra Scarpari Marcelo Falsarella Carazzolle Michelle Servais Tais.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Microarray: An Introduction
©2003/04 Alessandro Bogliolo Analysis of gene expression by means of Microarrays.
2.2 - Microarray Data Modeling Lab Sébastien Lemieux Elitra Canada Ltd.
Microarray - Leukemia vs. normal GeneChip System.
The Basics of Microarray Image Processing
Presentation transcript:

TIGR Spotfinder: a tool for microarray image processing Developer: Vasily Sharov Presenter: John Braisted The Institute for Genomic Research THE INSTITUTE FOR GENOMIC RESEARCH TIGR

Microarray Data Flow Image Analysis Normalization / Filtering Printer Scanner .tiff Image File Image Analysis Raw Gene Expression Data Gene Annotation Normalization / Filtering Database AGED Others… MAD Normalized Data with Gene Annotation Expression Analysis Interpretation of Analysis Results

Process Overview Sample1 mRNA Sample2 mRNA Cy3 intensity Cy3 RT Cy5-cDNA Cy3-cDNA RT RT cDNA array Cy5 intensity Sample2 mRNA

Basic Steps from Image to File 1.) Image File Loading 2.) Construct or Apply an Overlay Grid 3.) Computations Find Spot Boundary and Area Intensity Calculation Background Calculation and Correction 4.) Quality Control 5.) Text File Output

Basic Demonstration Exploring the Interface (Using An Existing Grid File)

Microarray Image Parameters MA Scanner generates two 16 bit gray scale TIFF images: one image for each labeling probe (Cy3 and Cy5) 16 bit schema provides signal dynamic range from 0 to 216=65536 Each image size varies from 20 to 30 MB for scanning resolution 10 mm/pixel

Typical layout of microarray image Image size 22 MB Image size 28 MB (images scanned at 10mm/pix resolution)

Processing Overview Apply the Grid Determine Spot Boundary Determine Background and Correct Intensity Calculate Spot Intensity

Applying an Overlay Grid What does it accomplish? The grid cells set a boundary for the spot finding algorithms. The grid cells also define an area for background correction.

Gridding Dimension Parameters pin X pin X pin Y pin Y

Spot Spacing Parameter

Spot Finding Spot finding requires an estimated spot size. The spot can be drawn as an irregular contour, as an ellipse, or as unconnected pixels. Area inside contour is used for spot intensity calculation Area outside contour is used for local background calculation

Processing Overview Apply the Grid Determine Spot Boundary Calculate Spot Intensity Determine Background and Correct Intensity

Background Calculation Background intensity is calculated as the median pixel intensity from the area within the square and outside the spot. A separate local background is calculated for each spot using the non-spot pixels from it’s square. local background area

Spot Definition and Calculations Spot Area, A = number of pixels within the defined spot boundary BKG = median pixel value within the cell (excluding the spot pixels) Integral = Sum of all spot pixels excluding saturated pixels Reported “Intensity”=Integral-BKG*A

Spot Integration with Background Correction

Intensity Calculation Formula = pixel intensity for pixel i = number of pixels in the spot = median background pixel value = number of non-saturated pixels in spot

Quality Control Issues Two measures of spot quality are reported by SpotFinder: Saturation Factor QC Score; (Reports shape and signal to noise ratio.)

Saturation Examples Partially saturated spots can look like this: saturated area non-saturated area Completely saturated spots can look like this: fully saturated spot

Saturation, Pixel Value Limit Output: pixel value 216=65536 Input: fluorescence dye light signal

(total number of spot pixels) Saturation Factor Saturation Factor = (# good pixels in spot) (total number of spot pixels) -Partially saturated spots can be handled in SpotFinder by excluding the saturated pixels from spot area and intensity calculations. -Fully saturated spots can not be recovered in SpotFinder. In this case rescanning with lower excitation power or PMT gain could be considered. *Faint spots may possibly be lost.

Saturation, RI Plot RI plot: log(IB/IA) vs 1/2log(IA*IB) clearly displays the saturation limits

Quality Control, QC Score A QC Score is generated for each spot and is based on the spot shape and a measure of signal to noise ratio. QC Score QCA QCB shape signal/noise shape signal/noise

Shape Factor = (Spot Area/Perimeter) Spot Shape Parameter Shape Factor = (Spot Area/Perimeter) Spots with large perimeters relative to spot area will have a low shape factor.

*med(BKG) + * SD(BKG) Signal to Noise Ratio 216 S/N factor = fraction of spot pixels exceeding: *med(BKG) + * SD(BKG) Pixel Values med(BKG)

Quality Control Calculation QC Score = (QCA+QCB)/2 QCA=sqrt(QC shape*QC S/N) for channel A QCB=sqrt(QC shape*QC S/N) for channel B

Quality Control, RI Plot log(IB/IA) vs1/2log(IA*IB) plotted for means shows clearly low intensity distortion due to background overestimation. Data from earlier slide processed without QC filter

Quality Control (data provided by E. Snesrud)

Quality Control (data provided by E. Snesrud)

SpotFinder Flag Descriptions A - The number of non-saturated pixels in spot is 0 B - Number of non-saturated pixels in spot is between 0 and 50 C - Number of non saturated pixels in spot is more then 50 X - Spot was detected and rejected based on spot shape and spot intensity relative to surrounding background Y- Spot background is higher than spot intensity Z - Spot was not detected by the program

Demonstration Grid Construction