Presentation is loading. Please wait.

Presentation is loading. Please wait.

Manuel Holtgrewe Algorithmic Bioinformatics, Department of Mathematics and Computer Science PMSB Project: RNA-Seq Read Simulation.

Similar presentations


Presentation on theme: "Manuel Holtgrewe Algorithmic Bioinformatics, Department of Mathematics and Computer Science PMSB Project: RNA-Seq Read Simulation."— Presentation transcript:

1 Manuel Holtgrewe Algorithmic Bioinformatics, Department of Mathematics and Computer Science PMSB Project: RNA-Seq Read Simulation

2 2 Titel, Datum Overview The actual process of RNA-Seq is quite involved For simple simulation, we can break it down to two steps: -Simulate splicing of genes in the organism -Simulate sampling process by the sequencing machine.

3 3 Part A: Simulate Splicing Read in: -Reference Sequence -Gene/Exon/Intro annotation from GFF (GTF File) Generate transcriptome of organism -For each gene -For each mRNA -Splice together sequences from genome -Write out transcriptome to FASTA File Titel, Datum

4 4 Part B: Read Simulation Read in -Reference sequence (transcriptome) -Read in expression levels (aka counts) for each transcript Simulate Reads -Single-End: Sample uniformly with fixed length. -Paired-End: Sample paired, insert size follows normal distribution. -Error Model: Uniform Errors -Error Model: Read in FASTQ files, convert quality values into error probabilities, use this form position dependent errors. Write out simulated reads Titel, Datum


Download ppt "Manuel Holtgrewe Algorithmic Bioinformatics, Department of Mathematics and Computer Science PMSB Project: RNA-Seq Read Simulation."

Similar presentations


Ads by Google