Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.

Slides:



Advertisements
Similar presentations
Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Advertisements

Genetic Analysis of Genome-wide Variation in Human Gene Expression Morley M. et al. Nature 2004,430: Yen-Yi Ho.
Data integration across omics landscapes Bing Zhang, Ph.D. Department of Biomedical Informatics Vanderbilt University School of Medicine
Microarray Quality Assessment Issues in High-Throughput Data Analysis BIOS Spring 2010 Dr Mark Reimers.
Peter Tsai Bioinformatics Institute, University of Auckland
Data Analysis for High-Throughput Sequencing
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Gene Expression Chapter 9.
Public data - available for projects 6 data sets: –Human Tissues –Leukemia –Spike-in –FARO compendium – Yeast Cell Cycle –Yeast Rosetta Find one yourself.
Low-Level Analysis and QC Regional Biases Mark Reimers, NCI.
Parallel human genome analysis: Microarray-based expression monitoring of 1000 genes Mark Schena, Dari Shalon, Renu Heller, Andrew Chai, Patrick O. Brown,
‘Gene Shaving’ as a method for identifying distinct sets of genes with similar expression patterns Tim Randolph & Garth Tan Presentation for Stat 593E.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Comparative Genomic Hybridization (CGH). Outline Introduction to gene copy numbers and CGH technology DNA copy number alterations in breast cancer (Pollack.
Alternative Splicing As an introduction to microarrays.
1 Predicting Gene Expression from Sequence Michael A. Beer and Saeed Tavazoie Cell 117, (16 April 2004)
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Genomic signatures to guide the use of chemotherapeutics Authors: Anil Potti et. al Presenter: Jong Cheol Jeong.
Generate Affy.dat file Hyb. cRNA Hybridize to Affy arrays Output as Affy.chp file Text Self Organized Maps (SOMs) Functional annotation Pathway assignment.
DNA Microarrays Examining Gene Expression. Prof. GrossBiology 4 DNA MicroArrays DNA MicroArrays use hybridization technology to examine gene expression.
Affymetrix GeneChip Data Analysis Chip concepts and array design Improving intensity estimation from probe pairs level Clustering Motif discovering and.
Why microarrays in a bioinformatics class? Design of chips Quantitation of signals Integration of the data Extraction of groups of genes with linked expression.
Different Expression Multiple Hypothesis Testing STAT115 Spring 2012.
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb Affymetrix GeneChips Oligonucleotide.
Proteomics Informatics – Data Analysis and Visualization (Week 13)
Practical Issues in Microarray Data Analysis Mark Reimers National Cancer Institute Bethesda Maryland.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Gene Expression Data Qifang Xu. Outline cDNA Microarray Technology cDNA Microarray Technology Data Representation Data Representation Statistical Analysis.
Finish up array applications Move on to proteomics Protein microarrays.
Verna Vu & Timothy Abreo
Microarrays and Their Uses Brad Windle, Ph.D
HUMAN-MOUSE CONSERVED COEXPRESSION NETWORKS PREDICT CANDIDATE DISEASE GENES Ala U., Piro R., Grassi E., Damasco C., Silengo L., Brunner H., Provero P.
Epigenetic Analysis BIOS Statistics for Systems Biology Spring 2008.
Scenario 6 Distinguishing different types of leukemia to target treatment.
Biology-Driven Clustering of Microarray Data Applications to the NCI60 Data Set K.R. Coombes, K.A. Baggerly, D.N. Stivers, J. Wang, D. Gold, H.G. Sung,
A Short Overview of Microarrays Tex Thompson Spring 2005.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Lawrence Hunter, Ph.D. Director, Computational Bioscience Program University of Colorado School of Medicine
Intro to Microarray Analysis Courtesy of Professor Dan Nettleton Iowa State University (with some edits)
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
MCB 317 Genetics and Genomics Topic 11 Genomics. Readings Genomics: Hartwell Chapter 10 of full textbook; chapter 6 of the abbreviated textbook.
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Differential analysis of Eigengene Networks: Finding And Analyzing Shared Modules Across Multiple Microarray Datasets Peter Langfelder and Steve Horvath.
Statistics for Differential Expression Naomi Altman Oct. 06.
Whole Genome Approaches to Cancer 1. What other tumor is a given rare tumor most like? 2. Is tumor X likely to respond to drug Y?
Data Mining the Yeast Genome Expression and Sequence Data Alvis Brazma European Bioinformatics Institute.
Gene Expression Platforms for Global Co-Expression Analyses A Comparison of spotted cDNA microarrays, Affymetrix microarrays, and SAGE Obi Griffith, Erin.
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
A Report on CAMDA’01 Biointelligence Lab School of Computer Science and Engineering Seoul National University Kyu-Baek Hwang and Jeong-Ho Chang.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Comp. Genomics Recitation 10 4/7/09 Differential expression detection.
Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.
Example of a Functional Genomics Study Molecular Ecology ,
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
No reference available
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Examining Genetic Similarity and Difference of the Three Progressor Groups at the First and Middle Visits Nicole Anguiano BIOL398: Bioinformatics Laboratory.
Microarray Technology and Data Analysis Roy Williams PhD Sanford | Burnham Medical Research Institute.
Functional Genomics in Evolutionary Research
Functional Genomics in Evolutionary Research
Characterization of differentially expressed genes based on their COG classification. Characterization of differentially expressed genes based on their.
Identification of aging-related genes and affected biological processes. Identification of aging-related genes and affected biological processes. (A) Experimental.
Presentation transcript:

Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI

Outline The GNF tissue database The GNF tissue database Exploratory analysis - clustering Exploratory analysis - clustering Positional co-regulation Positional co-regulation Insight via co-regulation Insight via co-regulation Apoptotic configuration of tissues Apoptotic configuration of tissues Probe level analysis Probe level analysis

The GNF Expression Atlas Su et al ( PNAS 2004) hybridized 150 samples from 61 tissues to Affymetrix U133A and custom arrays Su et al ( PNAS 2004) hybridized 150 samples from 61 tissues to Affymetrix U133A and custom arrays Variation in gene expression (as proportion of transcriptome) Variation in gene expression (as proportion of transcriptome) 95% show at least one 2-fold change among 61 tissues 95% show at least one 2-fold change among 61 tissues 37% show more than 2-fold differences between lowest 10% and highest 10% 37% show more than 2-fold differences between lowest 10% and highest 10%

Clustering samples All biological replicates are nearest neighbors All biological replicates are nearest neighbors Dendrogram reflects discrepancy between healthy and cancerous Dendrogram reflects discrepancy between healthy and cancerous

Co-regulation of Nearby Genes Some groups of genes next to one another on chromosome show high correlation across tissues Some groups of genes next to one another on chromosome show high correlation across tissues

Significance of Co-regulation How often would such correlations happen ‘by chance’ - eg. by selecting genes at random? How often would such correlations happen ‘by chance’ - eg. by selecting genes at random? Three random measures would have correlation greater than 0.6 with p < ! Three random measures would have correlation greater than 0.6 with p < ! However 3 genes selected at random from atlas have probability ~ of having all corrs > 0.6 However 3 genes selected at random from atlas have probability ~ of having all corrs > 0.6 In 30,000 positions, we should see 30 In 30,000 positions, we should see regions of high correlation determined 156 regions of high correlation determined Many are paralogs Many are paralogs Perhaps 50% false discovery rate among the rest Perhaps 50% false discovery rate among the rest

Prediction of Function Zhang, et al (J. Biol, 2004, 3:21) hybridized 55 mouse tissues to spotted oligo arrays Zhang, et al (J. Biol, 2004, 3:21) hybridized 55 mouse tissues to spotted oligo arrays Hypothesis: genes with similar tissue expression patterns share similar function Hypothesis: genes with similar tissue expression patterns share similar function Able to recover prediction of GO biological process for known genes with better than 50% accuracy for many categories Able to recover prediction of GO biological process for known genes with better than 50% accuracy for many categories Extended prediction to 1,092 uncharacterized transcripts Extended prediction to 1,092 uncharacterized transcripts

Investigation of Poorly Characterized Gene - Top1MT 10-fold variation in expression (odd for a ‘housekeeping gene’) 10-fold variation in expression (odd for a ‘housekeeping gene’) >50 genes with expression highly correlated (.75) with Top1MT across tissue database >50 genes with expression highly correlated (.75) with Top1MT across tissue database Large proportion are splicing factors Large proportion are splicing factors Top1MT has an odd splice junction in intron 1, and may depend critically on abundant splicing factors Top1MT has an odd splice junction in intron 1, and may depend critically on abundant splicing factors

Apoptosis Patterns Majority of epithelial tissues show common pattern (indisposed to apoptosis) Majority of epithelial tissues show common pattern (indisposed to apoptosis) Blood cells show variety of patterns Blood cells show variety of patterns

Exploration of Probe Sets Examine correlation of probe sets across 150 samples Examine correlation of probe sets across 150 samples All but one probe verified to match latest Unigene build for gene All but one probe verified to match latest Unigene build for gene Probes organized by position in 3’ end Probes organized by position in 3’ end Red: 1; White: < 0

Quality of Arrays Regional bias images Regional bias images