Microarray - Leukemia vs. normal GeneChip System.

Slides:



Advertisements
Similar presentations
Microarray Technique, Analysis, and Applications in Dermatology Jennifer Villaseñor-Park 1 and Alex G Ortega-Loayza 2 1 Department of Dermatology, University.
Advertisements

Biology and Cells All living organisms consist of cells. Humans have trillions of cells. Yeast - one cell. Cells are of many different types (blood, skin,
MicroArray Image Analysis
MicroArray Image Analysis Robin Liechti
Microarray technology and analysis of gene expression data Hillevi Lindroos.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Image Quantitation in Microarray Analysis More tomorrow...
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Introduction to DNA Microarray Technology Steen Knudsen, April 2005.
Extraction and comparison of gene expression patterns from 2D RNA in situ hybridization images BIOINFORMATICS Gene expression Vol. 26, no. 6, 2010, pages.
DNA microarray and array data analysis
Microarrays Dr Peter Smooker,
Microarray analysis Golan Yona ( original version by David Lin )
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
The Human Genome Project and ~ 100 other genome projects:
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Robots for Printing Microarrays Jianping Zhou Robot 2003 Spring.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What are microarrays? Microarrays consist of thousands of oligonucleotides or cDNAs that have been synthesized or spotted onto a solid substrate (nylon,
Introduce to Microarray
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
and analysis of gene transcription
Analysis of microarray data
Microarray Preprocessing
with an emphasis on DNA microarrays
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
Image Quantitation in Microarray Analysis More tomorrow...
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
Clustering of DNA Microarray Data Michael Slifker CIS 526.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
Agenda Introduction to microarrays
Scenario 6 Distinguishing different types of leukemia to target treatment.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
A Short Overview of Microarrays Tex Thompson Spring 2005.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
What Is Microarray A new powerful technology for biological exploration Parallel High-throughput Large-scale Genomic scale.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Genomics I: The Transcriptome
Computational Biology and Bioinformatics Lab. Songhwan Hwang Functional Genomics DNA Microarray Technology.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Microarray hybridization Usually comparative – Ratio between two samples Examples – Tumor vs. normal tissue – Drug treatment vs. no treatment – Embryo.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
High-throughput omic datasets and clustering
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Computational Biology Clustering Parts taken from Introduction to Data Mining by Tan, Steinbach, Kumar Lecture Slides Week 9.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
EE150a – Genomic Signal and Information Processing On DNA Microarrays Technology October 12, 2004.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
MICROARRAY. Microarray  A multiplex lab-on-a-chip  A 2D array on a solid substrate (Usually a glass slide or silicon thin-film cell) that assays large.
Microarray - Leukemia vs. normal GeneChip System.
The Basics of cDNA Microarray Technology
Microarray Technology and Applications
Introduction to cDNA Microarray Technology
The Basics of Microarray Image Processing
Microarray Data Analysis
Data Type 1: Microarrays
Presentation transcript:

Microarray - Leukemia vs. normal

GeneChip System

Affymetrix Gene Expression Machine

Spotter device

Robot arm

Robot for printing

Print head

Print-head of robot arm

Hybridization equipment

Part 3 Raw Microarray Data

Raw data from microarrays Microarray data comprise images from hybridized arrays representing hybridization signal intensities for individual spots These may be generated by single fluorescent, dual fluorescent, radioactive or colorimetric labels and the recording methods differ in each case

Microarray preparation They are miniature devices comprising a large number of DNA sequences immobilized on a substrate such as a glass microscope slide The sequences, known as features, are arranged as a grid Arrays are hybridized with a complex probe (a population of labeled DNA or RNA molecules, representing a particular cell type or tissue) The intensity of hybridization signal for each feature corresponds to the amount of that particular molecule in the probe, and this is directly proportional to the level of gene expression in the cell type or tissue from which the probe was prepared

They consist of images from hybridized arrays The exact nature of image depends on the array platform (the type of array used)

First generation microarrays They were made by spotting DNA molecules onto nylon membranes and hybridized with a radioactive probe. The signals were detected and quantified using a phosphorimager spatial resolution of radioactive signals is low, so the features on the array cannot be packed very tightly Hence, nylon arrays tend to be large (in order of 10 cm2) and are sometimes called macroarrays for this reason Feature density can be increased by using a colorimetric label instead of a radioactive label, but the sensitivity is lower

Second generation microarrays Spotted cDNA microarrays or high-density oligonucleotide chips are used in most array experiments these days In both cases, the substrate has minimal autofluorescence so a fluorescent probe can be used Data are acquired by confocal laser scanning of hybridized array at appropriate excitation wavelength and recording at appropriate emission wavelength (or channel)

Part 4 Data Quality

Labeling and hybridization A single label is used for oligonucleotide chips, so differential gene expression is detected hybridizing different probes to duplicate arrays However, in the case of spotted arrays, two probes can be labeled with different fluorophores and hybridized simultaneously to the same array allowing differential gene expression to be monitored directly

Data Normalization Issues Normalization of data from different chips MGED normalization standards -- www.mged.org natural biological variation is large technical variation is small ~ 98% auto-correlation MIT approach -- raw gene expression values Stanford approach -- ratios

Data Preparation Thresholding: usually min 20, max 16,000 For older Affy chips (new Affy chips do not have negative values) Filtering - remove genes with insufficient variation e.g. MaxVal - MinVal < 500 and MaxVal/MinVal < 5 biological reasons feature reduction for algorithmic For clustering, normalize each gene separately to Mean = 0, Std. Dev = 1

Data quality It is essential to record signal intensities from individual spots accurately as errors in data recording cannot be detected or corrected at a later stage Software for reading microarrays is generally provided with the recording equipment (scanner or phosphorimager) but manual adjustment is necessary to compensate for variations in array manufacture The signal must be corrected for background (nonspecific hybridization, autofluorescence, contamination) and hybridization controls must be used when comparing results across different arrays

Process automation DNA arrays may contain many thousands of features and hence data acquisition and analysis must be automated Software for initial image processing is normally provided with the scanner (or phosphoimager), which allows boundaries of individual spots to be determined and the total signal intensity to be measured over the whole spot (called signal volume) Locating spots precisely can be problem, particularly if there is distortion on array surface, and hence often necessary to align the grid manually. This is essential since signal intensities can vary across individual spots and the shape an size of different spots may not be uniform

Noise suppression Signal intensity has to be corrected for background noise, which may creep in by non-specific hybridization, autofluorescence, dust and other contaminants or poor hybridization technique (e.g. partial dehydration) Noise can vary over the array surface, so signal intensities must be normalized for local background values Correction for background noise is difficult when the signal intensity for a particular spot is itself very low

Control features They should be included on array to measure non-specific hybridization and variable hybridization across arrays For instance, Affymetrix GeneChips incorporate a set of mismatching oligonucleotides for each perfect match set to determine non-specific hybridization Controls are important where duplicate arrays are being used to study differential gene expression, since variation in array manufacture or experimental protocol can influence signal intensities on different arrays Bottom line is that errors and artifacts introduced before or during data acquisition cannot be detected or corrected later

Gene expression matrices The raw data from microarray experiments are converted into tables known as gene expression matrices The rows represent genes and the columns represent experimental conditions The data in the table are signal intensities, representing relative levels of gene expression

Grouping expression data Each gene in a gene expression matrix has an expression profile, that is, the expression measurements over a range of conditions The analysis of microarray data involves grouping these data on the basis of similar expression profiles If a predefined classification system is used to group the genes, the analysis is described as supervised If there is no predefined classification, the analysis is described as unsupervised and is known as clustering

Clustering methods Clustering first involves converting the gene expression matrix into a distance matrix, so genes with similar expression profiles can be grouped together This generally involves calculating the Euclidean distance, the correlation measure based distance or the Pearson linear correlation based distance for each pair of values Several clustering methods can then be used including hierarchical clustering, k-means clustering and the derivation of self-organizing maps