Introduction to Illumina Sequencing

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Genome Biology for Programmers Lecture Series: Illumina Sequencing
The Good, Bad, and Ugly of Next-Gen Sequencing
Reference mapping and variant detection Peter Tsai Bioinformatics Institute, University of Auckland.
Next–generation DNA sequencing technologies – theory & practice
High-Throughput Sequencing Technologies
MCB Lecture #9 Sept 23/14 Illumina library preparation, de novo genome assembly.
The past, present, and future of DNA sequencing Dan Russell.
The 454 and Ion PGM at the Genomics Core Facility Dr. Deborah Grove, Director for Genetic Analysis Genomics Core Facility Huck Institutes of the Life Sciences.
Introduction to Short Read Sequencing Analysis
Canadian Bioinformatics Workshops
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
What Is Genomics? Genomics is the study of how the entire genome of a species functions as a unit and evolves over time. It is the study of life’s blueprint,
Greg Phillips Veterinary Microbiology
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Chem 395 Bioanalytical Chemistry
BME 130 – Genomes Lecture 4 Sequencing technology II Next generation sequencing.
High Throughput Sequencing
CS 6293 Advanced Topics: Current Bioinformatics
Next Generation DNA Sequencing Platforms: Evolving Tools for
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
NGS Data Generation Dr Laura Emery. Overview The NGS data explosion Sequencing technologies An example of a sequencing workflow Bioinformatics challenges.
Update on Next-Generation Sequencing
Molecular Biology Dr. Chaim Wachtel April 4, 2013.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
High Throughput Sequencing Methods and Concepts
MES Genome Informatics I - Lecture IV. NGS basics Sangwoo Kim, Ph.D. Assistant Professor, Severance Biomedical Research Institute, Yonsei University.
DNA Methylation mapping
Introduction to Short Read Sequencing Analysis
NEXT – GEN SEQUENCING TECHNIQUES
Ion Torrent and Minion Relatively low cost ‘next generation’ sequencing Wendy Smith School of Computing Science, Alan Ward Newcastle University, UK.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
A statistical base-caller for the Illumina Genome Analyzer Wally Gilks University of Leeds.
Detection of Genomic Rearrangements in K562 cells using Paired End Sequencing Rosa Maria Alvarez Massachusetts Institute of Technology Class of 2009.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
An efficient model-based base-calling algorithm for high-throughput sequencing Wei-Chun Kao and Yun S. Song, UC Berkeley Presented by: Frank Chun Yat Li.
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Gerton Lunter Wellcome Trust Centre for Human Genetics From calling bases to calling variants: Experiences with Illumina data.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Next-Generation Sequencing of Microbial Genomes and Metagenomes
Molecular Biology Dr. Chaim Wachtel May 28, 2015.
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
Sequence File Formats.
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Third Generation Sequencing. Today Illumina – Solexa sequencing technology 454 Life sciences – 454 sequencer Applied Biosystem – SOLiD system Tomorrow.
Canadian Bioinformatics Workshops
Library QA & QC Day 1, Video 3
BME 130 – Genomes Lecture 4 Sequencing technology II Next generation sequencing.
16S rRNA Experimental Design
Next-generation sequencing technology
DNA Sequencing Second generation techniques
Short Read Sequencing Analysis Workshop
Sequencing technologies
Illumina Processing Steven Leonard
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Next-generation sequencing technology
NGS technologies.
Sequencing technology and assembly
SOLEXA aka: Sequencing by Synthesis
2nd (Next) Generation Sequencing
High-throughput sequencing techniques
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
The impact of next-generation sequencing technology on genetics
High-Throughput Sequencing Technologies
High-Throughput Sequencing Technologies
BF nd (Next) Generation Sequencing
Canadian Bioinformatics Workshops
Presentation transcript:

Introduction to Illumina Sequencing Day 1, Video 2 Overview of Next-gen sequencing Introduction to Illumina sequencing Multiplexing Sequencing run statistics

Next-Gen Sequencing Millions of reactions performed in parallel Shorter read lengths, higher error rate Sample/library prep is required Many different approaches Illumina sequencing-by-synthesis (Solexa technology) Roche 454 pyrosequencing AB SOLID color-based sequencing by ligation Ion Torrent semiconductor sequencing Single-molecule sequencing (PacBio, MinION, etc)

Some general terminology SR: single-read sequencing, sequence from only one end PE: paired-end sequencing, sequence from both ends Adapters: DNA added to the ends of DNA/RNA fragments to be sequenced. The adapters allow the DNA/RNA to attach to the flowcell Index/barcode: used interchangeable to indicate sequence identifier for multiplexing PhiX: commercially available genomic library of PhiX bacteriophage DNA, commonly spiked into libraries

Steps to Illumina sequencing Library construction Fragment, attach adapter DNA Cluster generation Add to flow cell Bridge amplification Sequencing Single base at a time, imaging Data analysis Images transformed into basecalls and ‘reads’

Illumina sequencing SBS chemistry video http://www.illumina.com/technology/next-generation-sequencing/sequencing-technology.html

Clustering, the first step to sequencing

Sequencing by Synthesis overview

The importance of cluster density Well-spaced clusters easier to call Densely-packed clusters difficult to call Illumina reports “optimal” cluster density for each platform pM amounts of libraries are used for sequencing Accurate QC and quantification are essential!

Anatomy of a library P5 and P7 ends of adapters bind to flow cell DNA insert typically ranges 200-600 bp (<1kb) Different methods of indexing Inline (part of the insert) – any level of multiplexing Single index read (≤96) Dual index reads (384+)

Multiplexing – single index read

Multiplexing – dual index reads hf

Some terminology Clusters (raw): number of clusters detected through imaging Reads: the number of reads – some people refer to a cluster as a read (a DNA molecule), others refer to the number of sequences so for PE data this is 2 x DNA molecules % passed-filter (%PF): % of clusters or reads that pass a chastity filter (the useable clusters) %>=Q30: % of bases that have a quality score greater than 30 (e.g. high-quality reads) % aligned: percent of PF reads uniquely aligned to PhiX genome (should be close to the %PhiX spiked in) Error rate: calculated error rate based on alignment to PhiX Phasing/Prephasing: percentage of molecules in a cluster that fall behind (phasing) or ahead (prephasing) of the current cycle during sequencing

Run statistics - SAV df

Considerations for your library The first 25 bases of a read are used by the instrument Bases 1-4 used to create cluster ‘map’ – high diversity is critical Bases 1-12 used for phasing/prephasing calculations Quality scores and alignment to PhiX start at cycle 26 Phasing/prephasing increases with read length Cluster images grow with read length and PE turnaround

Illumina sequencing Based on reversible terminator chemistry Sequencing by synthesis (SBS) All 4 fluorescently labeled bases present