Plotting the path from RNA to microarray: the importance of experimental planning and methods Glenn Short Microarray Core Facility/Lipid Metabolism Unit.

Slides:



Advertisements
Similar presentations
Experiment Design for Affymetrix Microarray.
Advertisements

Introduction to Microarray Gene Expression
Comparative genomic hybridization (CGH) is a technique for studying chromosomal changes in cancer. As cancerous cells multiply, they can undergo dramatic.
Application of available statistical tools Development of specific, more appropriate statistical tools for use with microarrays Functional annotation of.
M. Kathleen Kerr “Design Considerations for Efficient and Effective Microarray Studies” Biometrics 59, ; December 2003 Biostatistics Article Oncology.
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Introduction to the design of cDNA microarray experiments Statistics 246, Spring 2002 Week 9, Lecture 1 Yee Hwa Yang.
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Sandrine Dudoit1 Microarray Experimental Design and Analysis Sandrine Dudoit jointly with Yee Hwa Yang Division of Biostatistics, UC Berkeley
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Microarray analysis Golan Yona ( original version by David Lin )
DNA Arrays …DNA systematically arrayed at high density, –virtual genomes for expression studies, RNA hybridization to DNA for expression studies, –comparative.
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
A snapshot that captures the activity
Introduce to Microarray
Gene Expression Data Analyses (1) Trupti Joshi Computer Science Department 317 Engineering Building North (O)
Applied Biosystems 7900HT Fast Real-Time PCR System I. Real-time RT-PCR analysis of siRNA-induced knockdown in mammalian cells (Amit Berson, Mor Hanan.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Microarrays: Basic Principle AGCCTAGCCT ACCGAACCGA GCGGAGCGGA CCGGACCGGA TCGGATCGGA Probe Targets Highly parallel molecular search and sort process based.
and analysis of gene transcription
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
with an emphasis on DNA microarrays
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Chapter 5 Nucleic Acid Hybridization Assays A. Preparation of nucleic acid probes: 1. Labeling DNA & RNA - Nick Translation - Random primed DNA labeling.
Affymetrix vs. glass slide based arrays
This Week: Mon—Omics Wed—Alternate sequencing Technologies and Viromics paper Next Week No class Mon or Wed Fri– Presentations by Colleen D and Vaughn.
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
Experimental Design and Setup. Experimental Design What is the question? Which experiments will give the answer? How many replicates do we need?
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
Swiss Institute of Bioinformatics Institut Suisse de Bioinformatique LF Probe selection for Microarrays Considerations and pitfalls.
Lecture 22 Introduction to Microarray
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Agenda Introduction to microarrays
Amplification of Genomic DNA Fragments OrR. Amplification To get particular DNA in large amount Fragment size shouldn’t be too long The nucleotide sequence.
Expression of the Genome The transcriptome. Decoding the Genetic Information  Information encoded in nucleotide sequences contained in discrete units.
Microarray - Leukemia vs. normal GeneChip System.
1 The Biology, Technology and Statistical Modeling of High- throughput Genomics Data Naomi Altman Dept. of Statistics Penn State U. May 25, 2010.
ARK-Genomics: Centre for Comparative and Functional Genomics in Farm Animals Richard Talbot Roslin Institute and R(D)SVS University of Edinburgh Microarrays.
Design of microarray gene expression profiling experiments Peter-Bram ’t Hoen.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
DNA Microarrays: An Introduction Jochen Mueller
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
CSIRO Insert presentation title, do not remove CSIRO from start of footer Experimental Design Why design? removal of technical variance Optimizing your.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Lecture 23 – Functional Genomics I Based on chapter 8 Functional and Comparative Genomics Copyright © 2010 Pearson Education Inc.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Experimental Design Reaching a balance between statistical power and available finances.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
Microarray - Leukemia vs. normal GeneChip System.
Expression of the Genome
Functional Genomics in Evolutionary Research
Microarray Technology and Applications
Expression of the Genome
Polymerase Chain Reaction (PCR)
Microarray Data Analysis
Design Issues Lecture Topic 6.
Presentation transcript:

Plotting the path from RNA to microarray: the importance of experimental planning and methods Glenn Short Microarray Core Facility/Lipid Metabolism Unit Massachusetts General Hospital

Talk Outline Why perform a microarray experiment? Why perform a microarray experiment? Choosing a microarray platform Choosing a microarray platform Sources of variability that lend to experimental considerations Sources of variability that lend to experimental considerations Overcoming experimental variability Overcoming experimental variability

Why perform a microarray experiment? Genomic vantage point Genomic vantage point –Detect gene expression –Compare gene expression levels Over timeOver time Over treatment courseOver treatment course –Map genes to phenotypes –Map deleted or duplicated regions –Identify genes that modulate other genes Binary decision-making Binary decision-making

When not to perform a Microarray Experiment Interested in a small number of specific genes QRT-PCR, Northern blots Interested in a small number of specific genes QRT-PCR, Northern blots Desire quantitative results Desire quantitative results Low tolerance of variability Low tolerance of variability Cannot afford to perform experiment with adequate replication Cannot afford to perform experiment with adequate replication

Asking a Specific Question The most fundamental; the MOST IMPORTANT The most fundamental; the MOST IMPORTANT Simplifies experimental design Simplifies experimental design Empowers interpretation of data Empowers interpretation of data Simplicity, simplicity, simplicity! I say let your affairs be as one, two, three and to a hundred or a thousand… We are happy in proportion to the things we can do without.--Henry David Thoreau

Considerations of Microarray Experimental Design Which microarray platform will be used? Which microarray platform will be used? What is the end goal of the experiment? What is the end goal of the experiment? What is the specific question being asked? What is the specific question being asked? What are the most pertinent comparisons? What are the most pertinent comparisons? What controls will be applied to the experiments? What controls will be applied to the experiments? Which statistical methods will be used during data analysis? Which statistical methods will be used during data analysis? What methods will be used to verify results from the microarrays? What methods will be used to verify results from the microarrays?

Choosing a Microarray Platform Are genes of interest included on the array? Are genes of interest included on the array? Are genes replicated? Are genes replicated? Tiling of genes that undergo splicing Tiling of genes that undergo splicing Controls on array Controls on array Quantity of RNA needed for testing Quantity of RNA needed for testing Are the arrays adequately QC’d? Are the arrays adequately QC’d? Cost Cost

Affymetrix Platform

Pro’s Pro’s –standardized production –gene replication –probe tiling across gene –Reproducible –Affymetrix custom database user-friendly Con’s Con’s –Expensive –Annotation differences –single sample per chip

cDNA Platform cDNA clones (probes) 1. PCR product amplification amplification 2. Purification 3. Printing Pro’s Pro’s –Genome sequence independent –High stringency hybridization –Little need for signal amplification Con’s Con’s –Clone handling –Clone authentication –cDNA resources difficult to access and often cross- contaminated PCR products used as probes

Spotted oligonucleotide Platform Pro’s Pro’s –Complete control over oligo sequences –Absence of contamination –Additional probes may be added when needed –Flexibility of design, probe replication, and tiling –Inexpensive, enabling experimental replication Con’s Con’s –Sequence data required for probe design –No consensus set of probe design algorithms –Must have arraying instrumentation Synthesized oligonucleotides in 384 well plates 1.Purification 2.QC 3.Printing Oligonucleotides used as probes

Spotted Oligonucleotide vs Affymetrix Arrays probe set Probe design and synthesis Oligonulceotide Affymetrix

ParaBioSys Platform Long Oligonucleotides, 70mer Long Oligonucleotides, 70mer Designed and synthesized in-house Designed and synthesized in-house 5’-amine modified 5’-amine modified Extensively QC’d Extensively QC’d Probes designed to the 5’-orf Probes designed to the 5’-orf Set is updated as known orf list grows Set is updated as known orf list grows –Currently 20,000 probes

ParaBioSys probe design and synthesis Probe design using OligoPicker Probe design using OligoPicker –based on gen-pept database –Tm’s of selected oligos approx. the same –improved specificity

Oligonucleotide Quality Control pass fail Use of mass spectral analysis Use of mass spectral analysis –Identifies relative abundance –Ensures probe is of the expected mass based upon sequence Capillary Electrophoresis Capillary Electrophoresis –Identifies relative abundance of full- length product

Array Quality Control Spotted probes are 3’- labeled with dCTP-Cy3 using terminal deoxynucleotidyl transferase Spotted probes are 3’- labeled with dCTP-Cy3 using terminal deoxynucleotidyl transferase First and last array of the print-run are QC’d First and last array of the print-run are QC’d

Understanding sources of variability in microarray experiments ? ? ?

Sources of Variation Differences in identical treatments Differences in identical treatments Intrinsic biological variation Intrinsic biological variation Technical variation in extraction and labeling of RNA samples Technical variation in extraction and labeling of RNA samples Technical variation in hybridization Technical variation in hybridization Spot size variation Spot size variation Measurement error in scanning Measurement error in scanning

When graphing expression data, use log ratio (T/C) log 2 ratio (T/C) ratio (T/C) log 2 ratio (T/C)

log 2 T log 2 C Plotting expression data A M M= log ratio vs A=log geometric mean mean

Expression data-cont Low expressed Highly expressed log 2 (T i /C i ) Genes expressed up relative to reference by a factor of 32. Genes expressed down relative to reference by a factor of 1/32.

Differences Due to Treatment RNA isolation protocol differences RNA isolation protocol differences Cell-culture media changes Cell-culture media changes Expression differences over time Expression differences over time –Cell cycle genes (synchronization) Variables need to be minimized! Variables need to be minimized!

Biological Variability Self-self hybridizations of four independent biological replicates Biological variability of inhibitory PAS domain protein

Technical Variability Sample 1 Sample 2 Sample 1 Sample 3 Self-self hybridization (Cerebellar vs cerebellar) Self-self hybridization (Cerebellar vs cerebellar) –Sample 1 and 2 labeled together and hybridized on separate slides –Sample 3 labeled separately Arises from differences in labeling, efficiency in RT, hybridization, arrays, etc. Arises from differences in labeling, efficiency in RT, hybridization, arrays, etc.

Dye Effects Variation in quantum yield of fluorophores Variation in quantum yield of fluorophores Variation in the incorporation efficiency Variation in the incorporation efficiency Differential dye effects on hybridization Differential dye effects on hybridization Environmental Health Perspectives VOLUME 112 | NUMBER 4 | March 2004

Hybridization Variability

Printing Variability

Differences in Probe Performance Academic_1Academic_2ParaBioSysVendor Probe design algorithms will cause changes in the expression pattern Probe design algorithms will cause changes in the expression pattern Once a platform is chosen all future comparisons should be performed on the same platform Once a platform is chosen all future comparisons should be performed on the same platform Cross-platform comparisons as a means of validation Cross-platform comparisons as a means of validation

Differences Across Commercial Platforms P<0.001 Nucleic Acids Research, 2003, Vol. 31, No. 19,

Controlling Variability Experimental Plan

Increased Quality Control Probe QC Array QC Total RNA QC –denaturing agarose gel –Agilent Bioanalyzer Labeling QC

Controlling biological and technical variability with replication Average across replicates Average across replicates Essential to the estimation of variance Essential to the estimation of variance Critical for valid statistical analysis Integrin alpha 2b Pro-platelet basic protein

Controlling Dye Effects Dye-Swap Dye-Swap TC TC

Controlling Variability through Experimental Design Replication Replication –Spot –Multiple arrays per sample comparison (technical) Dye swapDye swap –Multiple samples per treatment group (biological) Increased precision and quality control Increased precision and quality control Estimate measurement error Estimate measurement error Estimate biological variation Estimate biological variation Pooling Pooling –Reduce biological variation

Controlling Variability through Experimental Design –cont. Normalize data to correct for systematic differences (spot intensity, location on array, hybridization,dye,scanner, scanner parameters…) on the same slide or between slides, which is not a result of biological variation between mRNA samples Normalize data to correct for systematic differences (spot intensity, location on array, hybridization,dye,scanner, scanner parameters…) on the same slide or between slides, which is not a result of biological variation between mRNA samples Minimize printing differences by using a contiguous series of slides from the same print run Minimize printing differences by using a contiguous series of slides from the same print run If wanting to do historical comparisons, use the same platform If wanting to do historical comparisons, use the same platform

Planning your experiment Experimental Aim Experimental Aim –Specific questions and priorities among them –How will the experiments answer the questions posed? Experimental logistics Experimental logistics –Types of total RNA samples Reference, control, cell line, tissue sample, treatment A….Reference, control, cell line, tissue sample, treatment A…. How will the samples be compared?How will the samples be compared? Number of arrays neededNumber of arrays needed Other Considerations Other Considerations –Plan of experimental process prior to hybridization: Sample isolation, RNA extraction, amplification, pooling, labelingSample isolation, RNA extraction, amplification, pooling, labeling –Limitations: number of arrays, amount of material –Extensibility (linking)

Planning your Experiment- cont Other Considerations-cont Other Considerations-cont –Controls: positive, negative, in-spike controls –Methods of verification: QRT-PCR, Northern, in situ hybridization,…QRT-PCR, Northern, in situ hybridization,… Performing the experiment Performing the experiment –Reagents (arrays-from same print run), equipment (scanners), order of hybridizations

Controls Positive Controls Positive Controls –used to ensure that target DNAs are labeled to an acceptable specific activity –single pool of all probe elements on array Negative Controls Negative Controls –used to assess the degree of non-specific cross- hybridization –probes derived from organisms with no known homologs/paralogs to the organism of study –derived in silico (alien sequences) In-spike controls In-spike controls –Known amounts of polyadenylated mRNAs added to each labeling reaction –Should not cross-hybridize with with any probe sequences Alien sequencesAlien sequences Spot-report (Stratagene)Spot-report (Stratagene) Lucidea ScoreCard (Amersham Biosciences)Lucidea ScoreCard (Amersham Biosciences) –Can be used to assess dynamic range of the system

Validation If you have failed to If you have failed to validate your array data, validate your array data, you have NOT completed you have NOT completed your analysis your analysis ParaBioSys has developed ParaBioSys has developed Primer Bank for QRT-PCR Primer Bank for QRT-PCR primer sequences primer sequenceshttp://pga.mgh.harvard.edu/primerbank/

Many thanks for your attention Glenn Short Microarray Core Massachusetts General Hospital Massachusetts General Hospital