Gene Expression Chapter 9.

Slides:



Advertisements
Similar presentations
BiGCaT Bioinformatics Hunting strategy of the bigcat.
Advertisements

Consider the following…  Do all of the cells in your body carry out the same processes?  Do all of the cells in your body make the same proteins?  Do.
1 MicroArray -- Data Analysis Cecilia Hansen & Dirk Repsilber Bioinformatics - 10p, October 2001.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Microarray Data Analysis Stuart M. Brown NYU School of Medicine.
Introduction to DNA Microarrays Todd Lowe BME 88a March 11, 2003.
Microarrays Dr Peter Smooker,
DNA Microarray: A Recombinant DNA Method. Basic Steps to Microarray: Obtain cells with genes that are needed for analysis. Isolate the mRNA using extraction.
Genome annotation. What we have GATCAATGATGATAGGAATTGAAAGTGTCTTAATTACAATCCCTGTGCAATTATTAATAACTTTTTTGTT CACCTGTTCCCAGAGGAAACCTCAAGCGGATCTAAAGGAGGTATCTCCTCAAAAGCATCCTCTAATGTCA.
Microarrays. Regulation of Gene Expression Cells respond to environment Heat Food Supply Responds to environmental conditions Various external messages.
Figure 1: (A) A microarray may contain thousands of ‘spots’. Each spot contains many copies of the same DNA sequence that uniquely represents a gene from.
Exam #2 Mean = 73% Median = 74% Mode = 90% A range: | | | | | | | | | B range: | | | | | | | | | C range: | | | | | | | D range: | | | | | | | | | | Failing:
Bacterial Physiology (Micr430)
Microarray Technology Types Normalization Microarray Technology Microarray: –New Technology (first paper: 1995) Allows study of thousands of genes at.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Alternative Splicing As an introduction to microarrays.
Inferring the nature of the gene network connectivity Dynamic modeling of gene expression data Neal S. Holter, Amos Maritan, Marek Cieplak, Nina V. Fedoroff,
What are microarrays? Microarrays consist of thousands of oligonucleotides or cDNAs that have been synthesized or spotted onto a solid substrate (nylon,
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Introduce to Microarray
Lecture 4 Microarray & Analysis Alizadeh et al. Nature 403 (2000)
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Gene Expression Analysis using Microarrays Anne R. Haake, Ph.D.
and analysis of gene transcription
By Moayed al Suleiman Suleiman al borican Ahmad al Ahmadi
Analysis of microarray data
Paola CASTAGNOLI Maria FOTI Microarrays. Applicazioni nella genomica funzionale e nel genotyping DIPARTIMENTO DI BIOTECNOLOGIE E BIOSCIENZE.
with an emphasis on DNA microarrays
Microarray Gene Expression Data Analysis A.Venkatesh CBBL Functional Genomics Chapter: 07.
DNA microarrays Each spot contains a picomole of a DNA ( moles) sequence.
Clustering of DNA Microarray Data Michael Slifker CIS 526.
Introduction to DNA Microarray Technology Steen Knudsen Uma Chandran.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
CDNA Microarrays MB206.
Data Type 1: Microarrays
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
Microarray Technology
Finish up array applications Move on to proteomics Protein microarrays.
Department of Statistics, University of California, Berkeley, and Division of Genetics and Bioinformatics, Walter and Eliza Hall Institute of Medical Research.
LSM3241: Bioinformatics and Biocomputing Lecture 8: Gene Expression Profiles and Microarray Data Analysis Prof. Chen Yu Zong Tel:
Microarray - Leukemia vs. normal GeneChip System.
CS491JH: Data Mining in Bioinformatics Introduction to Microarray Technology Technology Background Data Processing Procedure Characteristics of Data Data.
Microarrays and Gene Expression Analysis. 2 Gene Expression Data Microarray experiments Applications Data analysis Gene Expression Databases.
1 FINAL PROJECT- Key dates –last day to decided on a project * 11-10/1- Presenting a proposed project in small groups A very short presentation (Max.
CZ5211 Topics in Computational Biology Lecture 2: Gene Expression Profiles and Microarray Data Analysis Prof. Chen Yu Zong Tel:
Gene expression. The information encoded in a gene is converted into a protein  The genetic information is made available to the cell Phases of gene.
Gene Expression Analysis. 2 DNA Microarray First introduced in 1987 A microarray is a tool for analyzing gene expression in genomic scale. The microarray.
Microarrays and Gene Expression Arrays
Idea: measure the amount of mRNA to see which genes are being expressed in (used by) the cell. Measuring protein might be more direct, but is currently.
Microarray Technology. Introduction Introduction –Microarrays are extremely powerful ways to analyze gene expression. –Using a microarray, it is possible.
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Introduction to Microarrays Kellie J. Archer, Ph.D. Assistant Professor Department of Biostatistics
Gene expression & Clustering. Determining gene function Sequence comparison tells us if a gene is similar to another gene, e.g., in a new species –Dynamic.
Overview of Microarray. 2/71 Gene Expression Gene expression Production of mRNA is very much a reflection of the activity level of gene In the past, looking.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Computational Biology Clustering Parts taken from Introduction to Data Mining by Tan, Steinbach, Kumar Lecture Slides Week 9.
DNA Gene A Transcriptional Control Imprinting Histone Acetylation # of copies of RNA? Post Transcriptional Processing mRNA Stability Translational Control.
Proteome and Gene Expression Analysis Chapter 15 & 16.
ANALYSIS OF GENE EXPRESSION DATA. Gene expression data is a high-throughput data type (like DNA and protein sequences) that requires bioinformatic pattern.
Microarrays and Other High-Throughput Methods BMI/CS 576 Colin Dewey Fall 2010.
Gene expression and DNA microarrays No lab on Thursday. No class on Tuesday or Thursday next week –NCBI training Monday and Tuesday –Feb. 5 during class.
DNA Microarray Overview and Application. Table of Contents Section One : Introduction Section Two : Microarray Technique Section Three : Types of DNA.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Introduction to Oligonucleotide Microarray Technology
Microarray: An Introduction
Computational Biology
Protein Characterization/Purification
Lecture 11 By Shumaila Azam
Data Type 1: Microarrays
Presentation transcript:

Gene Expression Chapter 9

What is Gene Expression? The process of transcribing and translating a gene to yield a protein product Why are we interested in gene expression? Tells us which genes are involved in which functions Although all the cells in the human body have the same genome, only a fraction of the genes are being expressed for any given function.

Gene Expression Cells are different because of differential gene expression - proteome About 40% of human genes are expressed at one time. Gene is expressed by transcribing DNA into single-stranded mRNA - transcriptome mRNA is later translated into a protein

Molecular Biology Overview Nucleus Cell Chromosome Protein Gene (DNA) Gene (mRNA), single strand cDNA

Gene Expression Genes control cell behavior by controlling which proteins are made by a cell House keeping genes vs. cell/tissue specific genes Regulation: Transcriptional (promoters and enhancers) Post Transcriptional (RNA splicing, stability, localization -small non coding RNAs)

Gene Expression Regulation: Translational (3’UTR repressors, poly A tail) Post Transcriptional (RNA splicing, stability, localization -small non coding RNAs) Post Translational (Protein modification: carbohydrates, lipids, phosphorylation, hydroxylation, methlylation, precursor protein) cDNA

How do you measure Gene Expression?

Traditional Methods Northern Blotting Western Blotting RT-PCR Single RNA isolated Probed with labeled cDNA Western Blotting Multiple proteins Probed with antibodies to a specific protein RT-PCR Primers amplify specific cDNA transcripts

How do Microarrays work? New Technology (first paper: 1995) Allows study of thousands of genes at same time Glass slide of DNA molecules Molecule: string of bases (25 bp – 500 bp) uniquely identifies gene or unit to be studied

Gene Expression Microarrays The main types of gene expression microarrays: Short oligonucleotide arrays (Affymetrix) cDNA or spotted arrays (Brown/Botstein). Long oligonucleotide arrays (Agilent Inkjet); Fiber-optic arrays ...

Fabrications of Microarrays Size of a microscope slide Images: http://www.affymetrix.com/

Differing Conditions Ultimate Goal: Helps to: Understand expression level of genes under different conditions Helps to: Determine genes involved in a disease Pathways to a disease Used as a screening tool

Gene Conditions Cell types (brain vs. liver) Developmental (fetal vs. adult) Response to stimulus Gene activity (wild vs. mutant) Disease states (healthy vs. diseased)

Expressed Genes Genes under a given condition mRNA extracted from cells mRNA labeled Labeled mRNA is mRNA present in a given condition Labeled mRNA will hybridize (base pair) with corresponding sequence on slide

Two Different Types of Microarrays Custom spotted arrays (up to 20,000 sequences) cDNA Oligonucleotide High-density (up to 100,000 sequences) synthetic oligonucleotide arrays Affymetrix (25 bases)

Microarray Technology

Microarray Image Analysis Microarrays detect gene interactions: 4 colors: Green: high control Red: High sample Yellow: Equal Black: None Problem is to quantify image signals

Microarray Animations Davidson University: http://www.bio.davidson.edu/courses/genomics/chip/chip.html Imagecyte: http://www.imagecyte.com/array2.html

Microarray analysis Operation Principle: Samples are tagged with flourescent material to show pattern of sample-probe interaction (hybridization) Microarray may have 60K probe Microarray analysis

Microarray Processing sequence

Gene Expression Data Gene expression data on p genes for n samples mRNA samples sample1 sample2 sample3 sample4 sample5 … 1 0.46 0.30 0.80 1.51 0.90 ... 2 -0.10 0.49 0.24 0.06 0.46 ... 3 0.15 0.74 0.04 0.10 0.20 ... 4 -0.45 -1.03 -0.79 -0.56 -0.32 ... 5 -0.06 1.06 1.35 1.09 -1.09 ... Genes 3 Gene expression level of gene i in mRNA sample j Log (Red intensity / Green intensity) = Log(Avg. PM - Avg. MM)

Some possible applications? Sample from specific organ to show which genes are expressed Compare samples from healthy and sick host to find gene-disease connection Probes are sets of human pathogens for disease detection

Huge amount of data from single microarray If just two color, then amount of data on array with N probes is 2N Cannot analyze pixel by pixel Analyze by pattern – cluster analysis

Major Data Mining Techniques Link Analysis Associations Discovery Sequential Pattern Discovery Similar Time Series Discovery Predictive Modeling Classification Clustering

Some clustering methods and software Partitioning:K-Means, K-Medoids, PAM, CLARA … Hierarchical:Cluster, HAC、BIRCH、CURE、ROCK Density-based: CAST, DBSCAN、OPTICS、CLIQUE… Grid-based:STING、CLIQUE、WaveCluster… Model-based:SOM (self-organized map)、COBWEB、CLASSIT、AutoClass… Two-way Clustering Block clustering Actually a number of clustering methods have been proposed. I’ll go through some representative types in the following slides.

randomized row column both data clustered Eisen et al. Proc. Natl. Acad. Sci. USA 95 (1998) time

A dendrogram (tree) for clustered genes E.g. p=5 Let p = number of genes. 1. Calculate within class correlation. 2. Perform hierarchical clustering which will produce (2p-1) clusters of genes. 3. Average within clusters of genes. 4 Perform testing on averages of clusters of genes as if they were single genes. Cluster 6=(1,2) Cluster 7=(1,2,3) Cluster 8=(4,5) Cluster 9= (1,2,3,4,5) 14 1 2 3 4 5

A real case Nature Feb, 2000 Paper by Allzadeh. A et al Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling 6

Discovering sub-groups 7

Gene Expression is Time-Dependent Time Course Data basically, this mining task can be achieved by using clustering techniques. As shown in this sample clustering results, Each curve represents the expression of a gene over conducted experiments, and all genes are classify into six clusters. As you can see, the genes in a same group have very similar pattern, while each group differs from another pretty much.

Sample of time course of clustered genes time time time basically, this mining task can be achieved by using clustering techniques. As shown in this sample clustering results, Each curve represents the expression of a gene over conducted experiments, and all genes are classify into six clusters. As you can see, the genes in a same group have very similar pattern, while each group differs from another pretty much.

Limitations Cluster analyses: Single gene tests: Usually outside the normal framework of statistical inference Less appropriate when only a few genes are likely to change Needs lots of experiments Single gene tests: May be too noisy in general to show much May not reveal coordinated effects of positively correlated genes. Hard to relate to pathways

But a few Links Affymetrix www.affymetrix.com Stanford MicroArray Database http://smd.stanford.edu/resources/restech.shtml Yale Microarray Database http://www.med.yale.edu/microarray/ NCBI Gene Expression Omnibus http://www.ncbi.nlm.nih.gov/geo/ University of North Carolina Database https://genome.unc.edu/