RIP – T RANSCRIPT E XPRESSION L EVELS. O UTLINE RNA Immuno-Precipitation (RIP) NGS on RIP & its alternatives Alternate splicing Transcription as a graph.

Slides:



Advertisements
Similar presentations
RNA-Seq as a Discovery Tool
Advertisements

Functional Genomics with Next-Generation Sequencing
RNA-Seq based discovery and reconstruction of unannotated transcripts
RNAseq.
Transcriptional regulation and promoter analysis
Walk-thru of CAGE exercise Also at /tag_analysis/ /tag_analysis/
Peter Tsai Bioinformatics Institute, University of Auckland
Amplicon-Based Quasipecies Assembly Using Next Generation Sequencing Nick Mancuso Bassam Tork Computer Science Department Georgia State University.
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
How many transcripts does it take to reconstruct the splice graph? Introduction Alternative splicing is the process by which a single gene may be used.
Bayesian Filtering for Location Estimation D. Fox, J. Hightower, L. Liao, D. Schulz, and G. Borriello Presented by: Honggang Zhang.
Lecture 12 Splicing and gene prediction in eukaryotes
CLOSING DISCUSSION Advances and challenges in Protein-RNA: recognition, regulation, and prediction Discussion facilitator: Manny Ares Banff International.
LECTURE 2 Splicing graphs / Annoteted transcript expression estimation.
Physical Mapping of DNA Shanna Terry March 2, 2004.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
Amandine Bemmo 1,2, David Benovoy 2, Jacek Majewski 2 1 Universite de Montreal, 2 McGill university and Genome Quebec innovation centre Analyses of Affymetrix.
Variables: – T(p) - set of candidate transcripts on which pe read p can be mapped within 1 std. dev. – y(t) -1 if a candidate transcript t is selected,
1 Global Routing Method for 2-Layer Ball Grid Array Packages Yukiko Kubo*, Atsushi Takahashi** * The University of Kitakyushu ** Tokyo Institute of Technology.
Next Generation Sequencing and its data analysis challenges Background Alignment and Assembly Applications Genome Epigenome Transcriptome.
Adrian Caciula Department of Computer Science Georgia State University Joint work with Serghei Mangul (UCLA) Ion Mandoiu (UCONN) Alex Zelikovsky (GSU)
Small RNAs and their regulatory roles. Presented by: Chirag Nepal.
The iPlant Collaborative
I519 Introduction to Bioinformatics, Fall, 2012
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
A B IL-4(+) IL-4(-) IL-4(+) IL-4(-) ChIP-Seq (STAT6) Ramos IL-4 (+) P-value Ramos IL-4 (-) P-value BEAS2B IL-4 (+) P-value BEASB IL-4 (-) P-value fold.
Alternative Splicing (a review by Liliana Florea, 2005) CS 498 SS Saurabh Sinha 11/30/06.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Analysis of ChIP-Seq Data Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers.
Comparative transcriptomics of fungi Group Nicotiana Daan van Vliet, Dou Hu, Joost de Jong, Krista Kokki.
Motif Search and RNA Structure Prediction Lesson 9.
Nonlinear differential equation model for quantification of transcriptional regulation applied to microarray data of Saccharomyces cerevisiae Vu, T. T.,
PLANT BIOTECHNOLOGY & GENETIC ENGINEERING (3 CREDIT HOURS) LECTURE 13 ANALYSIS OF THE TRANSCRIPTOME.
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
CyVerse Workshop Transcriptome Assembly. Overview of work RNA-Seq without a reference genome Generate Sequence QC and Processing Transcriptome Assembly.
RNA Sequencing and transcriptome reconstruction Manfred G. Grabherr.
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Simultaneous identification of causal genes and dys-regulated pathways in complex diseases Yoo-Ah Kim, Stefan Wuchty and Teresa M Przytycka Paper to be.
Introduction to Next Generation Sequencing. Strategies For Interrogating the Transcriptome Known genes Predicted genes Surrogate strategy Exon verification.
Presented By: Farid, Alidoust Vahid, Akbari 18 th May IAUT University – Faculty.
Simon v RNA-Seq Analysis Simon v
RNA Quantitation from RNAseq Data
Sungkyunkwan University, School of Medicine.
The Transcriptional Landscape of the Mammalian Genome
1. SELECTION OF THE KEY GENE SET 2. BIOLOGICAL NETWORK SELECTION
Biases and their Effect on Biological Interpretation
MicroRNA signature in patients with eosinophilic esophagitis, reversibility with glucocorticoids, and assessment as disease biomarkers  Thomas X. Lu,
Gene expression from RNA-Seq
RNA-Seq analysis in R (Bioconductor)
Detect alternative splicing
Pick a Gene Assignment 4 Requirements
Gene expression estimation from RNA-Seq data
Artefacts and Biases in Gene Set Analysis
1 Department of Engineering, 2 Department of Mathematics,
Solve a system of linear equation in two variables
Reference based assembly
From: TopHat: discovering splice junctions with RNA-Seq
1 Department of Engineering, 2 Department of Mathematics,
MicroRNA signature in patients with eosinophilic esophagitis, reversibility with glucocorticoids, and assessment as disease biomarkers  Thomas X. Lu,
1 Department of Engineering, 2 Department of Mathematics,
Iterative resolution of multi-reads in multiple genomes
Artefacts and Biases in Gene Set Analysis
Working with RNA-Seq Data
Single Cell Regulatory Variation
Volume 10, Issue 7, Pages (February 2015)
Volume 30, Issue 1, Pages (January 2009)
Sequence Analysis - RNA-Seq 2
Volume 26, Issue 11, Pages e5 (March 2019)
Fig. 5 E2F1 also interacts with alternatively spliced transcripts from the MECOM gene. E2F1 also interacts with alternatively spliced transcripts from.
Presentation transcript:

RIP – T RANSCRIPT E XPRESSION L EVELS

O UTLINE RNA Immuno-Precipitation (RIP) NGS on RIP & its alternatives Alternate splicing Transcription as a graph Distribution of tags in exons Pipeline on RIP-seq dataset

RNA I MMUNO -P RECIPITATION (RIP) Global identification of multiple RNA targets of RNA-Binding Proteins (RBPs) Identify proteins associated with RNAs in RNP complexes Identify subsets of RNAs that are functionally- related and potentially co-regulated

H OW IS RIP PERFORMED ?

S EQUENCING ON RIP RIP-Chip Noisy May miss out rare transcripts RIP-RT-PCR PCR introduces mutations RIP tilting-arrays Very expensive Too sensitive to ‘transcriptional noise’

NGS ON RIP RIP-Seq A more complete and unbiased assessment of the global population of RNAs associated with a RNP complex Minimize sequencing bias and high backgrounds known to the previously-mentioned methods

A LTERNATE S PLICING A simple example Regions with the numbers of reads – Exon1: chr1: (5,1); – Exon2: chr1: (2,0); – Exon3: chr1: (3,0); Splice reads – chr1, , ,chr1, , ,3.0; – chr1, , ,chr1, , ,2.0; Exon1(5)Exon2(2)Exon3(3) Exon_Num(Tags)

A LTERNATE S PLICING A less ideal example Regions with the numbers of reads – Exon1: chr4: (29,0); – Exon2: chr4: (8,0); – Exon3: chr4: (116,1); Splice reads – chr4, , ,chr4, , ,3.0; – chr4, , ,chr4, , ,2.0; Exon1(29)Exon2(8)Exon3(116)

T RANSCRIPTION AS A G RAPH From RNA-seq data, check the overlap of the tags If a region has more than one tag, we call it an enriched region Nodes Using the splice reads, we will connect the enriched regions Edges

T RANSCRIPTION AS A G RAPH Represent transcriptome in a topologically sorted acyclic graph Some Observed Errors (RME005) Out-of-range edges in graphs Self-looping nodes Default action: Ignore them

D ISTRIBUTION OF T AGS IN E XONS rQuant – Courtesy of Regina Bohnert (FML, Tubingen)

RNA- SEQ RIP- SEQ The previous results are from RNA-seq Will we have similar observations on RIP-seq datasets? And possibly link the observations to transcription expression levels in transcriptome

P IPELINE ON RIP- SEQ DATASET 1. Dataset RME005 is used 2. Use TopHat / Eland to map RNA back to genome 3. Generate transcription-graphs for each transcript with alternate splicing 4. Express the paths of all transcriptions in the graph using a set of linear equations 5. Use R to solve the linear equations

A N EXAMPLE FROM RME005 There are two transcripts Path1: Exon1 -> Exon2 -> Exon4 Path2: Exon1 -> Exon3 -> Exon4 Exon1 - Exon4 have length L1 - L4, and have reads with number N1 - N4 S1-S4 are the numbers of splice reads Exon1Exon2Exon3 Exon4 N1 N4N3 N2 S1 S2 S3 S4

A SSUMPTIONS The transcript expression levels are: Path1: x1 Path2: x2 The read length = constant The reads are uniformly sampled from the transcripts Use density of reads instead of read_coverage Differentiate reads on both long & short exons

E QUATIONS FOR LINEAR PROGRAMMING Objective function: minimize the sum of d_i Constraints N1/L1 = x1 + x2 + d1 - d2 S1/R = x1 + d3 - d4 N2/L2 = x1 + d5 - d6 S2/R = x1 + d7 - d8 S3/R = x2 + d9 - d10 N3/L3 = x2 + d11 - d12 S4/R = x2 + d13 - d14 N4/L4 = x1 + x2 + d15 - d16 x1, x2 >= 0 d_i >= 0 The solution should be the values of x1, x2 and all d_i N1 N4N3N2 S1 S2 S3 S4

A NOTHER PROBLEM An implicit assumption on enriched regions in RME005 RIP is known to be ~10% efficient Noise will overwhelm true RNP-targets Should use total-RNA as control dataset True-positive regions from RIP should be relatively enriched with tags than

H ANDLING THE ASSUMPTION Obtain RNA-seq from the same source of transcriptome Directly compare both RNA-seq and RIP-seq data RIP-chip discriminate enriched region with >4-fold than RNA-chip data Maybe 4-fold is the magic number ? Current tag distribution observed by Dr Li Guoliang Non-uniform as opposed to what rQuant has observed on RNA-seq

Q&A