Verna Vu & Timothy Abreo

Slides:



Advertisements
Similar presentations
RNA-seq library prep introduction
Advertisements

Optimal designs for one and two-colour microarrays using mixed models
RNAseq.
We processed six samples in triplicate using 11 different array platforms at one or two laboratories. we obtained measures of array signal variability.
Transcriptome Sequencing with Reference
Peter Tsai Bioinformatics Institute, University of Auckland
Gene regulation in cancer 11/14/07. Overview The hallmark of cancer is uncontrolled cell proliferation. Oncogenes code for proteins that help to regulate.
RNA-seq: the future of transcriptomics ……. ?
Timothy H. W. Chan, Calum MacAulay, Wan Lam, Stephen Lam, Kim Lonergan, Steven Jones, Marco Marra, Raymond T. Ng Department of Computer Science, University.
Data Analysis for High-Throughput Sequencing
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Next-generation sequencing and PBRC. Next Generation Sequencer Applications DeNovo Sequencing Resequencing, Comparative Genomics Global SNP Analysis Gene.
Gene Expression Chapter 9.
Xiaole Shirley Liu STAT115, STAT215, BIO298, BIST520
Gene Expression And Regulation Bioinformatics January 11, 2006 D. A. McClellan
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Microarray Data Analysis Using R Studies in Tissue Databases Mark Reimers, NCI.
Microarray Analysis Jesse Mecham CS 601R. Microarray Analysis It all comes down to Experimental Design Experimental Design Preprocessing Preprocessing.
Microarrays: Theory and Application By Rich Jenkins MS Student of Zoo4670/5670 Year 2004.
Chris Chander, Luke Adea BioSci D145 Feb. 12, 2015
Post-Genomics Experimental Design CSC Gene Expression and Proteomics Simon Cockell & Cedric Simillion.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
High Throughput Sequencing
Real Time PCR = Quantitative PCR.
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
Li and Dewey BMC Bioinformatics 2011, 12:323
A Quantitative Overview to Gene Expression Profiling in Animal Genetics Armidale Animal Breeding Summer Course, UNE, Feb MPSS Massively Parallel.
Swiss Institute of Bioinformatics Institut Suisse de Bioinformatique LF Probe selection for Microarrays Considerations and pitfalls.
Ji-hye Choi August Introduction (2006) ABRF-NGS (the Association fo Biomolecular Resource Facilities next-generation sequencing study)
Transcriptome analysis With a reference – Challenging due to size and complexity of datasets – Many tools available, driven by biomedical research – GATK.
CDNA Microarrays MB206.
Gene expression and DNA microarrays Old methods. New methods based on genome sequence. –DNA Microarrays Reading assignment - handout –Chapter ,
The Center for Medical Genomics facilitates cutting-edge research with state-of-the-art genomic technologies for studying gene expression and genetics,
RNAseq analyses -- methods
Agenda Introduction to microarrays
Finish up array applications Move on to proteomics Protein microarrays.
Small RNAs and their regulatory roles. Presented by: Chirag Nepal.
Bioinformatics Expression profiling and functional genomics Part II: Differential expression Ad 27/11/2006.
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
1 Transcript modeling Brent lab. 2 Overview Of Entertainment  Gene prediction Jeltje van Baren  Improving gene prediction with tiling arrays Aaron Tenney.
Summarization of Oligonucleotide Expression Arrays BIOS Winter 2010.
Glue Grant Human Transcriptome Array. 2 Affymetrix Confidential PNAS (9) ; published ahead of print February 11, 2011, doi: /pnas
Systems Biology through Pathway Statistics Chris Evelo BiGCaT Bioinformatics Group – BMT-TU/e & UM Diepenbeek; May
1 Global expression analysis Monday 10/1: Intro* 1 page Project Overview Due Intro to R lab Wednesday 10/3: Stats & FDR - * read the paper! Monday 10/8:
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Figure SOM1. Functional roles of the genes affected in zmet2-m1 mutants. Although the genes localized on the intracellular membranes were slightly over-represented.
Design of Micro-arrays Lecture Topic 6. Experimental design Proper experimental design is needed to ensure that questions of interest can be answered.
Introduction to RNAseq
Microarray (Gene Expression) DNA microarrays is a technology that can be used to measure changes in expression levels or to detect SNiPs Microarrays differ.
Microarray analysis Quantitation of Gene Expression Expression Data to Networks BIO520 BioinformaticsJim Lund Reading: Ch 16.
Proteome and Gene Expression Analysis Chapter 15 & 16.
TOX680 Unveiling the Transcriptome using RNA-seq Jinze Liu.
Shankar Subramaniam University of California at San Diego Data to Biology.
The iPlant Collaborative
No reference available
Transcriptome What is it - genome wide transcript abundance How do you obtain it - Arrays + MPSS What do you do with it when you have it - ?
Statistical Analysis for Expression Experiments Heather Adams BeeSpace Doctoral Forum Thursday May 21, 2009.
Affymetrix User’s Group Meeting Boston, MA May 2005 Keynote Topics: 1. Human genome annotations: emergence of non-coding transcripts -tiling arrays: study.
Microarray: An Introduction
Reliable Identification of Genomic Variants from RNA-seq Data Robert Piskol, Gokul Ramaswami, Jin Billy Li PRESENTED BY GAYATHRI RAJAN VINEELA GANGALAPUDI.
Canadian Bioinformatics Workshops
RNA Quantitation from RNAseq Data
RNA-Seq for the Next Generation RNA-Seq Intro Slides
Biases and their Effect on Biological Interpretation
EXTENDING GENE ANNOTATION WITH GENE EXPRESSION
Working with RNA-Seq Data
Bo Li, Akshay Tambe, Sharon Aviran, Lior Pachter  Cell Systems 
Sequence Analysis - RNA-Seq 2
Derek de Rie and Imad Abuessaisa Presented by: Cassandra Derrick
Presentation transcript:

Verna Vu & Timothy Abreo

Study the transcriptome of transgenic mice vs wild type δC-doublecortin-like kinase (Dclk1) Transgenic mice express a constitutively active splice variant of doublecortin-like kinase 1 (Dclk1) gene. Goals: Study the transcriptome of wild type vs transgenic mice Compare microarray with DGE http://pubs.niaaa.nih.gov/publications/arh284/images/swart1.gif http://www.lintoninst.co.uk/Portals/0/productimages/276_0adeb.jpg δC-DCLK-short mice spend less time, move less in the open arms of the elevated plus maze→ δC-DCLK-short mice display a more anxious behavioral phenotype. http://www.lintoninst.co.uk http://pubs.niaaa.nih.gov/

Microarray Problems: 5 different microarray high back ground measures only relative abundances of transcripts only predefined sequences can be detected transcripts expressed at low levels can’t be detected 5 different microarray Affymetrix Agilent Illumina Applied Biosystems Home spotted http://dnachips-microarrays.wikispaces.com/file/view/flowchart.gif/143884121/flowchart.gif dnachips-microarrays.wikispaces.com/

Quantitative PCR analysis Cycle Based Sybr Green: Binds to all ds PCR products Gold standard: used to validate Problems Can’t Multiplex Many cycles to detect low frequency RNA transcripts, can be lost in background noise Can’t use on all transcripts real time PCR gold standard. can’t qPCR all genes but used to validate in this study a lot of work http://www.thermoscientificbio.com

Helpful Nomenclature Canonical sequence: sequences that reflects the most common choice of base at each position Noncanonical sequence: tags in the genome that map to any known exon either strand Transcripts per million: t.p.m.

Origins of Digital Gene Expression (DGE): SAGE Problems: Laborious Sequencing steps 100 k canonical tags would take up to a year Considerable financial investment needed http://www.sagenet.org/images/Sagenv1b.gif www.sagenet.org/

Digital Gene Expression (DGE) tag profiling Procedure 1st and 2nd strand cDNA synthesis using oligo-dT beads to capture polyadenylated RNA While still on beads, NlaIII was used to digest the DNA, makes cDNA fragment with 3’ most CATG GEX adapter 1 is ligated to the free 5’ end MmeI then cuts 17bp downstream of CATG site Gex adapter 2 ligate to free 3’ end PCR conducted using primers complementary to adapters, then sent for deep sequencing using Solexa/Illumina Whole Genome Sequencer NO CLONING First purify RNA, make cDNA, digest with first restriction enzyme, ligate first adapter to 3’ end, digest with other enzyme and ligate 5’ adapter

ENSEMBL transcripts To enable comparison, all canonical sequence tags and microarray probe sequences were put in FASTA format and then aligned to the ENSEMBL transcript database. Same samples used for DGE and microarray. http://en.wikipedia.org/wiki/FASTA_format http://en.wikipedia.org

Alternative Polyadenylation Transcripts with different 3’ ends that are separated by at least one restriction site can be differentiated with DGE 47% of detected ENSEMBL transcripts were discovered by more than one tag, most likely a result of alternative polyadenylation in 3’UTR 29% estimated previously based on EST sequences http://www.nature.com/nrg/journal/v14/n7/images_article/nrg3482-f2.jpg http://www.nature.com

Antisense Transcription Found evidence of bidirectional transcription in 51% of gene clusters when considering canonical and non-canonical tags of abundance >2 t.p.m. Antisense transcripts were expressed at high levels, although sense transcripts were still most abundant http://www.nature.com/nrg/journal/v14/n12/images/nrg3594-i1.jpg Importance: there could be gene transcripts in the opposite direction that we don’t know the function antisense tags that are found were not due to reverse transcriptase artifacts http://www.nature.com

Differentially Expressed Genes Sequencing of pooled samples caused problems; blood contamination in one sample Used Bayesian statistics to attempt to exclude genes that were not truly differentially expressed abundance of transcripts found to be highly expressed in blood

Biological Implications Used Gene Ontology consortium and DGE data to identify differentially regulated gene sets Most affected pathway: Disturbances of microtubule guided transport of SNARE containing synaptic vesicles due to changes in gene expression CaMK pathway was the second most affected pathway, possibly via feedback mechanisms involved in behavioral pathways in brain http://eferrari.blogs.lincoln.ac.uk/files/2013/02/SNAREs.png http://eferrari.blogs.lincoln.ac.uk

Dynamic Range The ratio between the largest and smallest possible values of a changeable quantity 3 to 4 orders of magnitude Lowest frequency but most consistently detected was 2 t.p.m. (0.3 copies per cell) limit of chips at lower range. due to noise. low signal linear portion of sigmoidal curve =DR

Comparison to SAGE and Microarray The effect of sequencing depth on detection of differentially expressed genes Simulated SAGE: randomly took 1/60 of DGE reads. Detected a decrease in 15 fold from 3179 to 200 reads Simulated SAGE: lowest: 91 t.p.m. DGE: average: >2 t.p.m. median: 4 t.p.m lowest: 0.8 t.pm. Microarray platforms median: 106 t.p.m Affymetrix had the most transcripts in common with DGE 11 different probes per transcript

Validation with qPCR 29 significant genes from DGE, 33 from microarray 43/62 showed some sort of variation between the wt and transgenic same direction only 5 were considered to be significant by both platforms To test for accuracy log 2 ratio: expression level of transgenic/wild type

Reproducibility The correlation between the normalized number of counts from the summed individual samples in their laboratory and the pool analyzed in the other laboratory were 0.98 wild-type 0.96 transgenic To test for precision Technique allows collaboration with other labs. False discovery rate was 8.5% determined the distribution of the differences between independent replicate measurements of log ratios between wild-type and transgenic samples

Why use DGE over Microarrays? unbiased view of the transcriptome detects high levels of differential polyadenylation and antisense transcription data are more precise & accurate than microarray data data analysis requires a lower number of preprocessing  facilitates interlaboratory comparisons high interlaboratory comparability of DGE data, probably due to the avoidance of hybridization processes (notoriously difficult to standardize) more sensitive in the detection of low-abundant transcripts and of small changes in gene expression. Absence of background signal and saturation effects unbiased view of the transcriptome detects high levels of differential polyadenylation and antisense transcription (undetectable with standard microarrays) data are more precise & accurate than microarray data data analysis requires a lower number of preprocessing steps (like background correction and normalization), which facilitates interlaboratory comparisons high interlaboratory comparability of DGE data, probably due to the avoidance of hybridization processes, which are notoriously difficult to standardize more sensitive in the detection of low-abundant transcripts and of small changes in gene expression. Absence of background signal and saturation effects (major causes of ratio compression on microarrays)

Implications: Enhancements in sequencing depth to improve accuracy in particular for low abundant transcripts Improvements in sensitivity, resolution, interlaboratory consistency Boost field of expression profiling Basic research and comparative genomics fields will benefit from major improvement of data portability Once held back by extensive and lengthy standardization issues

What the Future has in Store Deep sequencing→ robustness, comparability and richness of expression profiling data. Will boost collaborative, comparative and integrative genomics studies RNAseq http://static1.squarespace.com/static/526da5e4e4b039ceb7da1058/t/5317ed02e4b05fcac6a8d60b/1394076931330/human-dna.jpg static1.squarespace.com/

Questions? Criticism? http://www.clipartbest.com http://www.clipartbest.com/cliparts/jcx/pgp/jcxpgpnKi.png http://www.clipartbest.com