Clustering of DNA Microarray Data Michael Slifker CIS 526.

Slides:



Advertisements
Similar presentations
Analysis of Microarray Genomic Data of Breast Cancer Patients Hui Liu, MS candidate Department of statistics Prof. Eric Suess, faculty mentor Department.
Advertisements

Cluster analysis for microarray data Anja von Heydebreck.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Gene Expression Chapter 9.
DNA Microarray Bioinformatics - #27611 Program Normalization exercise (from last week) Dimension reduction theory (PCA/Clustering) Dimension reduction.
SocalBSI 2008: Clustering Microarray Datasets Sagar Damle, Ph.D. Candidate, Caltech  Distance Metrics: Measuring similarity using the Euclidean and Correlation.
Microarray Data Preprocessing and Clustering Analysis
Discrimination and clustering with microarray gene expression data Terry Speed, Jane Fridlyand, Yee Hwa Yang and Sandrine Dudoit* Department of Statistics,
Bio277 Lab 2: Clustering and Classification of Microarray Data Jess Mar Department of Biostatistics Quackenbush Lab DFCI
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Dimension reduction : PCA and Clustering Slides by Agnieszka Juncker and Chris Workman.
Alizadeh et. al. (2000) Stephen Ayers 12/2/01. Clustering “Clustering is finding a natural grouping in a set of data, so that samples within a cluster.
Dimension reduction : PCA and Clustering Christopher Workman Center for Biological Sequence Analysis DTU.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Introduction to Hierarchical Clustering Analysis Pengyu Hong 09/16/2005.
Microarray Data Analysis
An Introduction to DNA Microarrays Jack Newton University of Alberta
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
1 April, 2005 Chapter C4.1 and C5.1 DNA Microarrays and Cancer.
DNA Microarrays Examining Gene Expression. Prof. GrossBiology 4 DNA MicroArrays DNA MicroArrays use hybridization technology to examine gene expression.
Supervised gene expression data analysis using SVMs and MLPs Giorgio Valentini
Cluster Analysis Hierarchical and k-means. Expression data Expression data are typically analyzed in matrix form with each row representing a gene and.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Introduction to Bioinformatics Algorithms Clustering and Microarray Analysis.
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
Analysis of microarray data
B IOINFORMATICS Dr. Aladdin HamwiehKhalid Al-shamaa Abdulqader Jighly Lecture 8 Analyzing Microarray Data Aleppo University Faculty of technical.
with an emphasis on DNA microarrays
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
CDNA Microarrays Neil Lawrence. Schedule Today: Introduction and Background 18 th AprilIntroduction and Background 25 th AprilcDNA Mircoarrays 2 nd MayNo.
The European Nutrigenomics Organisation Deciding and acting on quality of microarray experiments in genomics Chris Evelo BiGCaT Bioinformatics Maastricht.
Functional genomics + Data mining BCH364C/391L Systems Biology / Bioinformatics – Spring 2015 Edward Marcotte, Univ of Texas at Austin.
From motif search to gene expression analysis
DNA MICROARRAYS WHAT ARE THEY? BEFORE WE ANSWER THAT FIRST TAKE 1 MIN TO WRITE DOWN WHAT YOU KNOW ABOUT GENE EXPRESSION THEN SHARE YOUR THOUGHTS IN GROUPS.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
CDNA Microarrays MB206.
Microarrays.
Microarray - Leukemia vs. normal GeneChip System.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
Genomics I: The Transcriptome
Introduction to Statistical Analysis of Gene Expression Data Feng Hong Beespace meeting April 20, 2005.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Statistical Analysis of DNA Microarray. An Example of HDLSS in Genetics.
Lecture 7. Functional Genomics: Gene Expression Profiling using
An Overview of Clustering Methods Michael D. Kane, Ph.D.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
High-throughput omic datasets and clustering
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Computational Biology Clustering Parts taken from Introduction to Data Mining by Tan, Steinbach, Kumar Lecture Slides Week 9.
Molecular Classification of Cancer Class Discovery and Class Prediction by Gene Expression Monitoring.
Brad Windle, Ph.D Unsupervised Learning and Microarrays Web Site: Link to Courses and.
Prof. Yechiam Yemini (YY) Computer Science Department Columbia University (c)Copyrights; Yechiam Yemini; Lecture 2: Introduction to Paradigms 2.3.
Gene expression. Gene Expression 2 protein RNA DNA.
Genomic Signal Processing Dr. C.Q. Chang Dept. of EEE.
Example of a DNA array used to study gene expression (note green, yellow red colors; also note.
1 Microarray Clustering. 2 Outline Microarrays Hierarchical Clustering K-Means Clustering Corrupted Cliques Problem CAST Clustering Algorithm.
Microarray: An Introduction
C LUSTERING José Miguel Caravalho. CLUSTER ANALYSIS OR CLUSTERING IS THE TASK OF ASSIGNING A SET OF OBJECTS INTO GROUPS ( CALLED CLUSTERS ) SO THAT THE.
Gene Expression Analysis
Microarray - Leukemia vs. normal GeneChip System.
Dimension reduction : PCA and Clustering by Agnieszka S. Juncker
Functional Genomics in Evolutionary Research
Example of a DNA Array (note green, yellow red colors; also note that only part of the total array is depicted)
Dimension reduction : PCA and Clustering
Class Prediction Based on Gene Expression Data Issues in the Design and Analysis of Microarray Experiments Michael D. Radmacher, Ph.D. Biometric Research.
Presentation transcript:

Clustering of DNA Microarray Data Michael Slifker CIS 526

DNA Microarrays Measure gene expression in a sample for thousands of genes simultaneously Used to compare gene expression among samples –Between individuals or treatments –Over time –Between normal tissue and tumor –Assess normal biological variation

Microarray Process Single-stranded DNA is printed onto slide Extract mRNA from cells Experimental mRNA sample & reference sample are fluorescently labelled (Cy3-green, Cy5-red) RNA samples are hybridized onto slide – bind to complementary DNA Laser scanning – fluorescent labels allow relative levels of bound mRNA to be measured Gridding, background correction, log-ratio transformation, normalization, analysis (finally!)

Red = low expression relative to reference Green = high expression relative to reference Yellow = similar expression in two samples Black = no expression in either sample

Example DLBCL – Diffuse large B-cell lymphoma (Alizadeh et al, 2000) ~18,000 genes x 96 samples of normal and malignant leukocytes Clinical evidence of great heterogeneity in terms of survival Question: Are there subclasses of DLBCL that can be discovered by looking at gene expression profiles? (Answer: yes)

Why cluster? Very large numbers of genes and highly complex systems/pathways render clustering essential for interpretation and visualization Discover new tumor subclasses Describe common expression profiles (e.g., cell cycle)

What to cluster? Clustering genes: –Look for groups of genes with similar expression profiles – may identify genes that are involved in biochemical pathways Clustering samples: –Do clusters conform to known categories? –Can new structure be discovered (e.g., new subclasses of tumor)? Clustering both genes and samples at once

Clustering methods Hierarchical (agglomerative) – most common K-means, PAM Self-organizing maps (SOMs) PCA clustering Ensemble methods “Fuzzy” methods – genes can belong to more than one cluster Model-based methods (e.g., mixtures of Gaussians)

Challenges Noisy data in highly dimensional space Many choices of algorithm and algorithmic parameters –What distance measure? –What linkage? –How many clusters? How can we assess quality/reliability?

Ex.

Two main sample clusters can be seen Genes correspond to two different types of B-cell Clusters are associated with differential survival beyond traditional clinical indicators

Conclusions To be useful, clustering of microarray data must ultimately be informed by biology Large number of genes and complexity of pathways means clustering is an essential part of most microarray analyses There is no “best” method – choices of distance, linkage, algorithm, gene filtering criteria. As much art as science