Using Isoform-Sensitive Microarrays to Study Different Modes of Alternative Splicing Christina Zheng Ares Lab RNA Club September 14, 2006.

Slides:



Advertisements
Similar presentations
12/04/2017 RNA seq (I) Edouard Severing.
Advertisements

Microarray Pitfalls Stem Cell Network Microarray Course, Unit 3 October 2006.
Transcriptome Sequencing with Reference
Naveen K. Bansal and Prachi Pradeep Dept. of Math., Stat., and Comp. Sci. Marquette University Milwaukee, WI (USA)
Alternative Splicing Genomic DNA Sequence GmGm AAAAA Exon Intron Exon GmGm AAAAA Transcription mRNA RNA Processing pre-mRNA.
Microarray technology and analysis of gene expression data Hillevi Lindroos.
Tutorial 7 Genome browser. Free, open source, on-line broswer for genomes Contains ~100 genomes, from nematodes to human. Many tools that can be used.
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
Probe design for microarrays using OligoWiz. Sample Preparation Hybridization Array design Probe design Question Experimental Design Buy Chip/Array Statistical.
Differentially expressed genes
Exon selection factor Exon selection factor U2 snRNPU1 snRNP Intron 1 Overview of mRNA Splicing Exon 1 AGGU Exon 2 A AGG Factors such as U1 and U2 snRNP.
Microarrays and Cancer Segal et al. CS 466 Saurabh Sinha.
Gene Discovery & Genome Browsing
Data analytical issues with high-density oligonucleotide arrays A model for gene expression analysis and data quality assessment.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Characterizing Alternative Splicing With Respect To Protein Domains BME 220 Project Charlie Vaske.
ViaLogy Lien Chung Jim Breaux, Ph.D. SoCalBSI 2004 “ Improvements to Microarray Analytical Methods and Development of Differential Expression Toolkit ”
The Influence of Alternative Splicing in Protein Structure The fact that gene number is not significantly different between mammals and some invertebrates.
Modeling Functional Genomics Datasets CVM Lesson 1 13 June 2007Bindu Nanduri.
Review of important points from the NCBI lectures. –Example slides Review the two types of microarray platforms. –Spotted arrays –Affymetrix Specific examples.
Lecture 12 Splicing and gene prediction in eukaryotes
Interrogating the transcriptome in all its diversity
Different Expression Multiple Hypothesis Testing STAT115 Spring 2012.
Presented by Karen Xu. Introduction Cancer is commonly referred to as the “disease of the genes” Cancer may be favored by genetic predisposition, but.
June Detecting Alternative Splicing using the Human Affymetrix Exon Array 1.0 Instructors: Jennifer Barb, Zoila Rangel, Peter Munson June 15, 2009.
Special Topics in Genomics Lecture 1: Introduction Instructor: Hongkai Ji Department of Biostatistics
Genome of the week - Deinococcus radiodurans Highly resistant to DNA damage –Most radiation resistant organism known Multiple genetic elements –2 chromosomes,
Wfleabase.org/docs/tileMEseq0905.pdf Notes and statistics on base level expression May 2009Don Gilbert Biology Dept., Indiana University
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Probe-Level Data Normalisation: RMA and GC-RMA Sam Robson Images courtesy of Neil Ward, European Application Engineer, Agilent Technologies.
RNAseq analyses -- methods
Gene Level Expression Profiling Using Affymetrix Exon Arrays Alan Williams, Ph.D. Director Chip Design Affymetrix, Inc.
Analysis of Exon Arrays Slides provided by Dr. Yi Xing.
Verna Vu & Timothy Abreo
MPL Identification of alternative spliced mRNA variants related to cancers by genome-wide ESTs alignment KIM DAE SOO Oncogene Apr.
A A R H U S U N I V E R S I T E T Faculty of Agricultural Sciences Introduction to analysis of microarray data David Edwards.
Changes in Gene Regulation in Δ Zap1 Strain of Saccharomyces cerevisiae due to Cold Shock Jim McDonald and Paul Magnano.
Intro to Microarray Analysis Courtesy of Professor Dan Nettleton Iowa State University (with some edits)
Fea- ture Num- ber Feature NameFeature description 1 Average number of exons Average number of exons in the transcripts of a gene where indel is located.
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Proteomic Characterization of Alternative Splicing and Coding Polymorphism Nathan Edwards Center for Bioinformatics and Computational Biology University.
Glue Grant Human Transcriptome Array. 2 Affymetrix Confidential PNAS (9) ; published ahead of print February 11, 2011, doi: /pnas
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Alistair Chalk, Elisabet Andersson Stem Cell Biology and Bioinformatic Tools, DBRM, Karolinska Institutet, September Day 5-2 What bioinformatics.
RNA-Seq Primer Understanding the RNA-Seq evidence tracks on the GEP UCSC Genome Browser Wilson Leung08/2014.
MEME homework: probability of finding GAGTCA at a given position in the yeast genome, based on a background model of A = 0.3, T = 0.3, G = 0.2, C = 0.2.
Plant Biology Division Post-process of IMGAG M.t. 2.0 Release Affymetrix Medicago Probe set – IMGAG 2.0 / MTGI 8.0 Mapping Zhao Bioinformatics Lab.
Geuvadis achievements and contributions Robert Häsler, functional genomics.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
How can we find genes? Search for them Look them up.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Research about Alternative Splicing recently 楊佳熒.
Bioinformatics Workshops 1 & 2 1. use of public database/search sites - range of data and access methods - interpretation of search results - understanding.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Oigonucleotide (Affyx) Array Basics Joseph Nevins Holly Dressman Mike West Duke University.
Finding genes in the genome
Gene expression  Introduction to gene expression arrays Microarray Data pre-processing  Introduction to RNA-seq Deep sequencing applications RNA-seq.
From: Duggan et.al. Nature Genetics 21:10-14, 1999 Microarray-Based Assays (The Basics) Each feature or “spot” represents a specific expressed gene (mRNA).
Distinguishing active from non active genes: Main principle: DNA hybridization -DNA hybridizes due to base pairing using H-bonds -A/T and C/G and A/U possible.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
Canadian Bioinformatics Workshops
bacteria and eukaryotes
Statistical Applications in Biology and Genetics
Regulated Unproductive Splicing
Genome organization and Bioinformatics
Transcriptome analysis
lincRNAs: Genomics, Evolution, and Mechanisms
Nonsense-Mediated mRNA Decay (NMD)
Introduction to Alternative Splicing and my research report
Presentation transcript:

Using Isoform-Sensitive Microarrays to Study Different Modes of Alternative Splicing Christina Zheng Ares Lab RNA Club September 14, 2006

Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions

Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions

Splicing Arrays Used to assay and identify splicing changes associated with different biological conditions –muscle specific alternative splicing –alternative splicing associated with nonsense mediated decay The first splicing array was made in yeast –Clark et al. Science 2002 Mammalian splicing arrays – –Johnson et. al. Science 2003 – –Pan et. al. Mol. Cell 2004 –Li et. al. Cancer Research 2006 –Le et. al. Nucleic Acids Research 2004 –Sugnet et. al. PLoS 2006

Affymetrix Mouse Splicing Array 5 X mer probes probes are grouped intro probesets (6-10 probes) gene probesets - 8 – 10 probes placed in common regions exon probesets exon-exon junction probesets – 6 probesets across 30 nucleotides 15,000+ genes Sugnet et al. PLoS Comput. Bio inflexible probe selection greater chance of cross-hyb

Splicing Arrays – AS events All exon-exon junctions of human mRNA RefSeq – –Johnson et. al. Science 2003 Focused on simple cassette exon events – –Pan et. al. Mol. Cell 2004 Focused on simple AS events with two isoforms –Le et. al. Nucleic Acids Research 2004 –Ule et al. Nature Genetics 2005 –Sugnet et. al. PLoS 2006 –Li et. al. Cancer Research 2006 Skip to include ratio –one measurement for each event –not applicable to more complicated modes of AS

Difficulties with Splicing Arrays Greater potential of probe cross-hybridization –inflexibility in probe selection due to location of events exon probes – restricted to the alternative exon exon-exon junction probes – restricted to exon-exon junction Alternative splicing (AS) events –identifying/annotating them –analyzing different modes of AS more complex with a greater number of isoforms

Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions

Probe Remapping Tools used to remap onto the May 2004 mouse assembly –GMAP Wu et al. Bioinformatics 2005 –BLAT –home-made junction database used GMAP to align all mRNA and EST from unigene made a database of sequences and genomic coordinates of all exon-exon junctions Remapped probes –uniquely mapped 25mer: –multiple hits: (cross-hyb to other genes) –not mapped 25mer: missed exon-exon junction SNPs changed from old mouse assembly to new

Remapping Probes

Potential Cross-hybridization Potential cross-hybridization –BLAST ~400,000 uniquely mapped probes Cutoff for the level of similarity to other genes –how do different levels of similarity affect probe intensity? –took probes which only hit 2 genes hit 25nt to one gene hit at different level to another (24nt, 23nt, 22nt ….) –choose a cutoff based on the how the probe behavior in each class

Probe Analysis

Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions

Analysis of Affy Splicing Array Previous work –Ule et al. Nature Genetics 2005 –Sugnet et al. PLoS 2006 Focused on simple cassette exon events and or simple two isoform events Using a variation of skip to include ratio Array was designed with more complicated events

Splicing Event Probe Groupings Annotated AS events –exonwalk identifies and annotates events, no matter how complicated the event Mapped the probes onto annotated events 3418 AS events: –1 isoform: 2002 –2 isoforms: 892 –3 isoforms: 182 –4 isoforms: 95 –5 isoforms: 44 –6 or more isoforms: 203

Isoform Ratio Isoform 1Isoform 2 Isoform 3 Isoform Ratio (IR) = isoform i  isoform  isoform = isoform1+isoform2+isoform3 isoform1  isoform isoform2  isoform isoform3  isoform

Isoform Ratio Significance Analysis of Microarrays (SAM) –identify statistically significant IRs –based on a modified t test - ‘relative difference’, s = standard deviation; s 0 = small positive constant, s = standard deviation; s 0 = small positive constant q value - min false discovery rate (FDR) Storey J. Roy. Stat. Soc. Ser. B 2002 – (FDR) –the minimum FDR incurred for calling a specific isoform significant –analogous to p-value for false positive rate –can use a q-value as a specific cutoff much like a p-value x t - x c s+s 0 # of false positives # of significant isoforms

Isoform Ratio Identifying muscle specific AS events – C2C12 myoblast differentiation system Run samples on Affymetrix mouse splicing array C2C12 stem cells differentiate stem cells myo-tubule formation isolate control RNAisolate test RNA

Analysis Pipeline Background correction, normalization, and probe summarization –RMA (Irizarry et al. Biostatistics 2003) Grouping probesets into splicing events –mapping probesets onto annotated AS events –calculating IR Grouping probesets into genes –average of all probesets within a gene Use SAM (Tusher et al. PNAS 2001) to test significance differences between test and control –q-value (min false discovery rate) Storey J. Roy. Stat. Soc. Ser. B 2002 Display results on dataviewer

Splicing Array Dataviewer

GeneViewer

Muscle Specific AS events DnaJ (Hsp40) homolog Coro6, actin binding protein upregulated

AAAAA Multiple rounds of normal translation STOP Ribosome includeskip AAAAA STOP EJC Stop codon is in last exon EJC Premature stop codon (PTC) >50nt AAAAA STOP EJC AAAAA NMD STOP Ribosome EJC Example: PTB Isoform Expression Connection between AS and nonsense-mediated decay (NMD) Block NMD and assay for changes in individual isoform changes

Isoform Expression Isoform 1Isoform 2 Isoform 3 Isoform Expression (IE) = log (isoform i) – log (gene) gene =  probes in gene log (isoform1) – log(gene)log (isoform2) – log(gene)log (isoform3) – log(gene)

Analysis Pipeline Background correction, normalization, and probe summarization –RMA (Irizarry et al. Biostatistics 2003) Grouping probesets into splicing events –mapping probesets onto predefined AS events –calculating IE Grouping probesets into genes –average of all probesets within a gene Use SAM (Tusher et al. PNAS 2001) to test the significance between test and control –q-value (min false discovery rate) Display results on dataviewer

AS associated with NMD SAT1 - spermidine/spermine N1-acetyl transferase 1 –down regulates polyamine levels in the cell –the inclusion of an alternative exon throws it out of frame NMD –block NMD under conditions which SAT1 is needed polyamine and polyamine analog (BENSPM) expect inclusion of the exon the be repressed –missed by previous analysis methods because this event is an example of having probes for only one of the isoforms

Outline Isoform-sensitive microarrays (splicing arrays) –introduction –challenges Probe cross-hybridization –mapping of probes onto the genome –excluding potential cross-hybridizing probes Analysis of different modes of alternative splicing –annotation of different modes –using splicing arrays to study different modes Isoform Ratio (IR) Isoform Expression (IE) Future directions

Probe cross-hybridization –18bp cross-hyb level – behavior of exon probes vs exon-exon junction probes Different modes of AS –better classification of the more complicated modes Future Directions

Acknowledgements Ares Lab Manny Ares John-Paul Donohue Leslie Grate Roland Nagel Julie Ni Lily Shiue Charles Sugnet

Splicing Arrays Clark et. al. Science nt probes each intron-containing gene in yeast Splice Junction (SJ) Index = log - log (SJ mut ) (SJ wt ) (EX mut ) (EX wt ) Normalize out gene expression