Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.

Slides:



Advertisements
Similar presentations
Bioconductor in R with a expectation free dataset Transcriptomics - practical 2012.
Advertisements

High-dimensional data analysis: Microarrays and multiple testing Mark van de Wiel 1,2 1. Dep. of Mathematics, VU University Amsterdam 2. Dep. of Biostatistics.
Microarrays Pauliina Munne
Microarray Simultaneously determining the abundance of multiple(100s-10,000s) transcripts.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
DNA microarray and array data analysis
DNA Microarray Bioinformatics - #27612 Normalization and Statistical Analysis.
Microarray analysis Golan Yona ( original version by David Lin )
Chip arrays and gene expression data. With the chip array technology, one can measure the expression of 10,000 (~all) genes at once. Can answer questions.
Kate Milova MolGen retreat March 24, Microarray experiments: Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
Kate Milova MolGen retreat March 24, Microarray experiments. Database and Analysis Tools. Kate Milova cDNA Microarray Facility March 24, 2005.
GCB/CIS 535 Microarray Topics John Tobias November 8th, 2004.
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Introduce to Microarray
Pathway Informatics 6 th July, 2015 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University of.
Analysis of microarray data
Agenda Introduction to microarrays
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
The following slides have been adapted from to be presented at the Follow-up course on Microarray Data Analysis.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
Affymetrix vs. glass slide based arrays
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Agenda Introduction to microarrays
Bioconductor in R with a expectation free dataset Transcriptomics - practical 2014.
UBio Training Courses Micro-RNA web tools Gonzalo
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Tutorial session 3 Network analysis Exploring PPI networks using Cytoscape EMBO Practical Course Session 8 Nadezhda Doncheva and Piet Molenaar.
Model-based analysis of oligonucleotide arrays, dChip software Statistics and Genomics – Lecture 4 Department of Biostatistics Harvard School of Public.
Systems Biology through Pathway Statistics Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM Diepenbeek; May
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Other genomic arrays: Methylation, chIP on chip… UBio Training Courses.
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
GeWorkbench John Watkinson Columbia University. geWorkbench The bioinformatics platform of the National Center for the Multi-scale Analysis of Genomic.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Introduction to Microarrays. The Central Dogma.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Microarray Data Analysis The Bioinformatics side of the bench.
Gene Expression Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Genomic Signal Processing Dr. C.Q. Chang Dept. of EEE.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
CCLE Cancer Cell Line Encyclopedia Alexey Erohskin.
AN INTRODUCTION TO GENE EXPRESSION ANALYSIS BY MICROARRAY TECHNIQUE (PART II) DR. AYAT B. AL-GHAFARI MONDAY 10 TH OF MUHARAM 1436.
Roy Williams PhD Sanford | Burnham Medical Research Institute.
Pathway Informatics 30 th March, 2016 Ansuman Chattopadhyay, PhD Head, Molecular Biology Information Services Health Sciences Library System University.
Microarray Data Analysis Roy Williams PhD; Burnham Institute for Medical Research.
MESA A Simple Microarray Data Management Server. General MESA is a prototype web-based database solution for the massive amounts of initial data generated.
Ingenuity Pathway Analysis Alex Pico. Description "IPA is a software application that enables researchers to analyze and understand the complex biological.
Arrays How do they work ? What are they ?. WT Dwarf Transgenic Other species Arrays are inverted Northerns: Extract target RNA YFG Label probe + hybridise.
Transcriptomics History and practice.
Pathway Informatics 16th August, 2017
Tutorial 6 : RNA - Sequencing Analysis and GO enrichment
Functional Genomics in Evolutionary Research
Microarray Technology and Applications
Pathway Informatics December 5, 2018 Ansuman Chattopadhyay, PhD
Transcriptomics History and practice.
Getting the numbers comparable
Gene Expression Analysis
Single Sample Expression-Anchored Mechanisms Predict Survival in Head and Neck Cancer Yang et al Presented by Yves A. Lussier MD PhD The University.
Data Type 1: Microarrays
Presentation transcript:

Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute

Microarray Revolution

Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein would be more direct, but is currently harder. Measuring Gene Expression

General assumption of microarray technology Use mRNA transcript abundance level as a measure of expression for the corresponding gene Proportional to degree of gene expression

How to measure RNA abundance Several different approaches with similar themes Illumina bead array – highly redundant oligo array Affymetrix GeneChip – highly redundant oligo array Nimblegen – highly redundant long oligo array 2-colour array (very long cDNA; low redundancy) SAGE (random Sanger sequencing of cDNA library) Reborn as Next Gen RNA seq

The Illumina Beadarray Technology Highly redundant ~50 copies of a bead 60mer oligos Absolute expression Each array is deconvoluted using a colour coding tag system Human, Mouse, Rat, Custom

Affymetrix Technology Highly redundant (~25 short oligos per gene) Absolute expression PM-MM oligo system valuable for cross hybe detection Human, Mouse, E. coli, Yeast…….. Affy and illumina arrays have been systematically compared

Spotted Arrays Low redundancy cDNA and oligo Two dyes Cy5/Cy3 Relative expression Cost and custom

Single Colour Labelling

Microarrays in action off on

Areas Being Studied with Microarrays Differential gene expression between two (or more) sample types Similar gene expression across treatments Tumour sub-class identification using gene expression profiles Classification of malignancies into known classes Identification of “marker” genes that characterize different cell types Identification of genes associated with clinical outcomes (e.g. survival)

Experimental Design Design Experiment Replicates 2x 3 chips <2x 5 chips Perform Experiment Standardize conditions Dump outliers

Microarray Data Analysis Workflow Quality Control Normalize Data Set up experimental data Filter for differential expression Advanced analysis techniques- clustering Compare results to biology; Nextbio, GeneGo; IPA

Recommended Software Free Software – GenePattern -- powerful, many plug-in packages and pipelines -- good video examples/tutorials GeneSpring GX11 R-Bioconductor (with guidance) Hierarchical Cluster Explorer – easy clustering Cytoscape, GSEA – for pathway visualisation Partek IPA, Nextbio, GeneGo <= Burnham subscriptions!

Log Transformed Data 2/2 = 1log2(1) = 0 4/1=4log2(4) = +2 ¼=0.25log2(0.25) = -2 Transformation often performed before normalisation

After QC for low confidence genes (P<0.99) Note: ~50 replicate beads per array Median Outliers 25% quartile 75% quartile BAD CHIP BOXPLOT REPRESENTATION OF DATA SPREAD CHIP NUMBER SIGNAL INTENSITY

The effect of quantiles Normalisation on the filtered 36 data sets IMPORTANT: use non-linear normalisation >library(affy) >Qdata <- normalize.quantiles(Rawdata) All same range

Data Analysis Examples 1# Illumina arrays with GeneSpring GX11 2# Affymetrix data, with a GenePattern module Import, Quality Control, normalize Detect differentially expressed genes Pathway analysis

Illumina Analysis Workflow Check array hybridisation quality Direct Export file as “sample probe profile” Import into GENESPRING GX11 Genome Studio Application: process binary.idat files to txt Normalisation here is optional

GeneSpring GX11 features Guided workflows Pathways GSEA IPA integration Ontologies MySQL R script API

GeneSpring GX11 Create New Project Browse to and load Data Automated install of GenomeDef from Agilent repository

Illumina Advanced Workflow

Grouping Sample Replicates

Check Replicates Are Similar

Scatterplot of replicates

Scatterplot of differently treated samples

Filter genes on P-value

Significantly different genes in a Volcano plot

Significant Pathway Determination

Which types of genes are enriched in a cluster? Idea: Compare your cluster of genes with lists of genes with common properties (function, expression, location). Find how many genes overlap between your cluster and a gene list. Calculate the probability of obtaining the overlap by chance This measures if the enrichment is significant. This analysis provides an unbiased way of detecting connections between expression and function GeneOntology Cell cycle Our Cell cycle 15000

Send list to IPA for pathway Analysis

Significant Pathways sent to Ingenuity Pathway Analysis

Completed Analysis genelists Data Pathways

Affymetrix Workflow: GenePattern

Comparative Marker Selection

Paste the URLs for Data files

Send results to next module Viewer module

Outputs ranked list of genes List of Marker genes can be Filtered and exported

Nextbio Compares your Genelists to the Nextbio database Can reveal unexpected similarities between datasets Has a very good literature database connected to the results Contains data from model organisms

Ingenuity Pathway Analysis Detects networks in your data Allows you to look for connections between genes and drugs/small molecules Focused on Man and Mouse GeneGo High Quality hand annotated ontologies Has a very good literature database connected to the results Contains data from model organisms

Start a new core analysis

Ingenuity Data import

IPA determines functions

Overlay drug and disease data

Data Import to Nextbio

The Nextbio Report Page

What else does my gene do?

THE END Many thanks for coming!