Sequencing Technologies

Slides:



Advertisements
Similar presentations
Vanderbilt Center for Quantitative Sciences Summer Institute Sequencing Analysis Yan Guo.
Advertisements

Next–generation DNA sequencing technologies – theory & practice
High-Throughput Sequencing Technologies
Next-generation sequencing
Canadian Bioinformatics Workshops
Next-generation sequencing – the informatics angle Gabor T. Marth Boston College Biology Department AGBT 2008 Marco Island, FL. February
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Greg Phillips Veterinary Microbiology
Bioinformatics for high-throughput DNA sequencing Gabor Marth Boston College Biology New grad student orientation Boston College September 8, 2009.
Next-generation sequencing: informatics & software aspects Gabor T. Marth Boston College Biology Department.
1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Chem 395 Bioanalytical Chemistry
Emily Buckhouse. Nitrogenous Bases Nucleosides  Base linked to a 2-deoxy-D-ribose at 1’ carbon Nucleotides Nucleosides with a phosphate at 5’ carbon.
Delon Toh. Pitfalls of 2 nd Gen Amplification of cDNA – Artifacts – Biased coverage Short reads – Medium ~100bp for Illumina – 700bp for 454.
Department of Bioinformatics and Computational Biology
CS 6293 Advanced Topics: Current Bioinformatics
Next Generation DNA Sequencing Platforms: Evolving Tools for
NEXT GENERATION SEQUENCING Technologies on Biomedical Research
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
Update on Next-Generation Sequencing
The impact of next-generation sequencing technology of genetics Elaine R. Mardis – 11 February Washington School of Medicine, Genome Sequencing Center.
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
High-Throughput Sequencing Technologies
Introduction to Next-Generation Sequencing
Next generation sequencing Xusheng Wang 4/29/2010.
High Throughput Sequencing Methods and Concepts
NEXT – GEN SEQUENCING TECHNIQUES
Module 1 Section 1.3 DNA Technology
High throughput sequencing: informatics & software aspects Gabor T. Marth Boston College Biology Department BI543 Fall 2013 January 29, 2013.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
A statistical base-caller for the Illumina Genome Analyzer Wally Gilks University of Leeds.
By: Muhammad Qayash Khan PhD Zoology, Ist Semester 13-arid-3853 Next-Generation Sequencing: Methodology and Application.
Genomics: READING genome sequences ASSEMBLY of the sequence ANNOTATION of the sequence carry out dideoxy sequencing connect seqs. to make whole chromosomes.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Molecular Biology Dr. Chaim Wachtel May 28, 2015.
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
EE Slawson Tempel, © WUSTL. STOP EE Slawson Tempel, © WUSTL.
Lecture 6. Functional Genomics: DNA microarrays and re-sequencing individual genomes by hybridization.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
GigAssembler. Genome Assembly: A big picture
Next-generation sequencing: the informatics angle
Next-generation sequencing: the informatics angle Gabor T. Marth Boston College Biology Department CHI Next-Generation Data Analysis meeting Providence,
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Third Generation Sequencing. Today Illumina – Solexa sequencing technology 454 Life sciences – 454 sequencer Applied Biosystem – SOLiD system Tomorrow.
Template preparation EE Slawson Tempel, © WUSTL. Template preparation EE Slawson Tempel, © WUSTL.
Statistical Genomics Zhiwu Zhang Washington State University Lecture 6: Genotype.
Introduction to Illumina Sequencing
Biochemistry 412 Overview of Genomics & Proteomics 20 January 2004.
Next-generation sequencing technology
Next generation sequencing
Lecture 6: Genotype by sequencing
Next Generation Sequencing
Sequencing technologies
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Next-generation sequencing technology
Sequencing Technologies
Lecture 6: Genotype by sequencing
SOLEXA aka: Sequencing by Synthesis
B3- Olympic High School Bioinformatics
2nd (Next) Generation Sequencing
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
The impact of next-generation sequencing technology on genetics
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
High-Throughput Sequencing Technologies
High-Throughput Sequencing Technologies
Next-generation DNA sequencing
BF nd (Next) Generation Sequencing
Presentation transcript:

Sequencing Technologies 2nd Generation (“NextGen”) Sequencing Technologies “Fantastic” bizarre or exotic; seeming more appropriate to a fairy tale than to reality or practical use

Read Length is Not As Important For Resequencing Jay Shendure

Paired End Reads are Important! Known Distance Read 1 Read 2 Repetitive DNA Unique DNA Paired read maps uniquely Single read maps to multiple positions

emulsion PCR emPCR Margulies M et al., (2006) Genome sequencing in microfabricated high-density picolitre reactors Nature 437, 376-380

Roche 454 Margulies M et al., (2006) Genome sequencing in microfabricated high-density picolitre reactors Nature 437, 376-380

OH P P P P P P P EE Slawson Tempel, © WUSTL

OH P P P P P P P EE Slawson Tempel, © WUSTL

OH P P P OH P P P OH P P P OH P P P OH P P P P P P P EE Slawson Tempel, © WUSTL

P P P OH P P P OH P P P P P P P EE Slawson Tempel, © WUSTL

P 3 2 1 OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

P Pyrophosphate OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P P P P P P P P EE Slawson Tempel, © WUSTL

ATP + luciferin P OH P P P P P P P P EE Slawson Tempel, © WUSTL

ATP + luciferin P OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P OH P OH OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P OH P OH OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P OH P OH OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P OH P P OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P P OH P OH P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P OH P P P P OH P P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P P P OH P OH P P P P P P P P P EE Slawson Tempel, © WUSTL

P OH P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

ATP + luciferin P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

ATP + luciferin P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

Brightness of flash is proportional to number of nucleotides added Flash is too bright 4-mer 3-mer Flash brightness 2-mer 1-mer TCACTTCAAGGGT… EE Slawson Tempel, © WUSTL

A T G C ~ 0.5 Gb/run Read length 350-400 bp 200 cycles Roche 454 EE Slawson Tempel, © WUSTL

Illumina Nebulizer ~ 400 bp EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

Flow cell 8 channels (“lanes”) Surface of flow cell coated with a lawn of oligo pairs

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

Each piece has a unique sequence EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

“bridge PCR” EE Slawson Tempel, © WUSTL

thousands of strands/cluster each cluster (“polony”) has a unique sequence EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

EE Slawson Tempel, © WUSTL

STOP P P P P P P P P P P P P EE Slawson Tempel, © WUSTL

Metzger M (2009) Nature Reviews Genetics 11: 31-46

STOP P P P STOP P P P STOP P P P STOP P P P EE Slawson Tempel, © WUSTL

P P P P OH P P P P P P P P P EE Slawson Tempel, © WUSTL STOP STOP STOP

P P P P OH P P P P P P P P P EE Slawson Tempel, © WUSTL STOP STOP STOP

P STOP P STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

P STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

P P P P P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL STOP

P P P P P P P P OH P P P P P P P P P P EE Slawson Tempel, © WUSTL STOP

P STOP P STOP STOP P P P P P P P P P P P P EE Slawson Tempel, © WUSTL

P STOP P P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP P P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP P P P P P P P P P P P EE Slawson Tempel, © WUSTL

STOP OH P P P P P P P P P P P EE Slawson Tempel, © WUSTL

G… © Illumina, EEST, © WUSTL

GC… © Illumina, EEST, © WUSTL

GCT… © Illumina, EEST, © WUSTL

GCTG… © Illumina, EEST, © WUSTL

GCTGA… © Illumina, EEST, © WUSTL

100+ Million Clusters Per Flow Cell 100 Microns

Camera time is the limiting step! Flowcell 8 lanes For picture taking: Each lane is broken up into 100 tiles, each fluor is imaged separately – 2400 pictures taken per cycle EE Slawson Tempel, © WUSTL

Chemistry problem 1: terminator is retained out of phase STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

Chemistry problem 2: fluor is retained OH P P P P P P P P P P EE Slawson Tempel, © WUSTL

Chemistry problem 2: fluor is retained STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

Chemistry problem 2: fluor is retained STOP P P P P P P P P P P EE Slawson Tempel, © WUSTL

Illumina >100 Gb/run HiSeq ~ 3 – 30 Gb/run GAII Read length 90x106 reads/lane * 102 bp/read = 9x109 bp/lane * 16 lanes/run = 144 Gb/run ~ 3 – 30 Gb/run GAII Read length 30 – 120 bp

ABI SOLiD Support Oligonucleotide Ligation Detection emPCR

ABI SOLiD Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402.

ABI SOLiD Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402.

ABI SOLID Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402.

ABI SOLiD Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402.

Mardis ER. (2008) Next-generation DNA sequencing methods Mardis ER. (2008) Next-generation DNA sequencing methods. Annu Rev Genomics Hum Genet. 2008;9:387-402.

Ion Torrent

Nature 475:348 (2011) ~100 bp reads 30 Mb/run 

Ion Torrent read quality

454, 7.4X, 24.5 Gb cost < $1M 3.3 million SNPs 10,654 cause aa substitution (7,648 different from Venter) 222,718 indels (2 to 40kb) 18 CNVs (26 kb to 1.6 Mb) carrier of 10 highly penetrant disease alleles

Illumina, 73X, 173 Gb contig N50 = 40 kb scaffold N50 = 1.3 Mb PMID: 20010809 Illumina, 73X, 173 Gb contig N50 = 40 kb scaffold N50 = 1.3 Mb

ABI SOLiD, 30X coverage 35 interchromosomal translocations PLoS Genetics 6: e1000832 (2010) ABI SOLiD, 30X coverage 107.5 Gb of raw data 55.51 Gb mapped to genome 35 interchromosomal translocations 1,315 structural variations (>100 bp) 191,743 small (<21 bp) indels 2,384,470 SNVs 512 genes homozygously mutated

a recessive EMS-induced mutation affecting egg shell morphology Genetics 2009 182: 25–32 a recessive EMS-induced mutation affecting egg shell morphology Illumina, 8X coverage 103 SNP differences between mutant and wt 9 non-synonomous 2 nonsense >> one in encore, an obvious candidate

Illumina 5.1 Gb of sequence 76 bp reads 40X coverage 30 volume 42 | number 1 | january 2010 Illumina 5.1 Gb of sequence 76 bp reads 40X coverage 4 affected individuals

RNA-Seq Pepke S, Wold B & Mortazavi A. (2009) Nature Methods 6:S22

ChIP-Seq Lefrançois P et al. (2009) Efficient yeast ChIP-Seq using multiplex short-read DNA sequencing. BMC Genomics 10:37

Plant Physiology, July 2009, Vol. 150, pp. 1541–1555

“Fabulous” 3rd Generation (“Next2Gen”) Sequencing Technologies having no basis in reality; mythical

. A T G C . A T G C . A T G C . A T G C

Helicos Single-molecule sequencing

Gupta PK. (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol. 26:602-11

Metzger M (2009) Nature Reviews Genetics 11: 31-46

Helicos 105 to 140 Megabases per hour ~ 35 bp average read length

(2009) Volume 27: 847 Helicos, 28X coverage, 84 Gb 2.8M SNPs 752 CNVs

Ion Torrent Single-molecule sequencing

Single-molecule sequencing - + Gupta PK. (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol. 26:602-11

Nanopore sequencing - + Gupta PK. (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol. 26:602-11

Nanopore sequencing Gupta PK. (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol. 26:602-11

Pacific Biosciences Single-molecule sequencing Eid et al 2008

detection volume 20 zeptoliters (10-21 liters). emission excitation ZMW: a hole, tens of nanometers in diameter, fabricated in a 100nm metal film deposited on a silicon dioxide substrate detection volume 20 zeptoliters (10-21 liters). PacBio technology backgrounder: http://www.pacificbiosciences.com/index.php?q=technology-introduction

PacBio technology backgrounder: http://www. pacificbiosciences

When the DNA polymerase encounters the nucleotide complementary to the next base in the template, it is incorporated into the growing DNA chain. During incorporation, the enzyme holds the nucleotide in the ZMWs detection volume for tens of milliseconds, orders of magnitude longer than the average diffusing nucleotide. The system detects this as a flash of bright light because the background is very low. The polymerase advances to the next base and the process continues to repeat PacBio technology backgrounder: http://www.pacificbiosciences.com/index.php?q=technology-introduction

multiple reads of the same molecule PacBio technology backgrounder: http://www.pacificbiosciences.com/index.php?q=technology-introduction

Eid J et al. (2009) Molecules Real-Time DNA Sequencing from Single Polymerase Molecules. Science 323, 133 PMID: 19023044

Does it work? 150 bp circular template ~93% raw accuracy 15x coverage 99.3% accuracy Eid et al., 2009

~ 2-5 bp/sec PacBio claims that, by 2013, the technology will be able to give a ‘raw’ human genome sequence in less than 3 min, and a complete high-quality sequence in 15 min. (http://www.bio-itworld.com/BioIT_Content.aspx?id=71746andamp;terms=Feb+12+2008+Pacific+Biosciences). Gupta PK. (2008) Single-molecule DNA sequencing technologies for future genomics research. Trends Biotechnol. 26:602-11

F. Sanger, S. Nicklen, and A. R. Coulson, Proc Natl Acad Sci U S A