High-Throughput Sequencing Technologies

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Polymerase Chain Reaction (PCR). PCR produces billions of copies of a specific piece of DNA from trace amounts of starting material. (i.e. blood, skin.
Next–generation DNA sequencing technologies – theory & practice
Next-generation sequencing
Canadian Bioinformatics Workshops
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Greg Phillips Veterinary Microbiology
Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon.
The SOLiD System: Next-Generation Sequencing Overview of the SOLiD System –  Scalable  Accurate Ultra High Throughput  Flexible  Mate Pairs.
Chem 395 Bioanalytical Chemistry
BME 130 – Genomes Lecture 4 Sequencing technology II Next generation sequencing.
CS 6293 Advanced Topics: Current Bioinformatics
The polymerase chain reaction (PCR) rapidly
Next Generation DNA Sequencing Platforms: Evolving Tools for
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
GENOME SEQUENCING. I. Genome sequencing The Sanger Method (1977) Denaturation +priming Polymerization.
Sequencing Errors and Biases Biological Sequence Analysis BNFO 691/602 Spring 2013 Mark Reimers.
Update on Next-Generation Sequencing
The impact of next-generation sequencing technology of genetics Elaine R. Mardis – 11 February Washington School of Medicine, Genome Sequencing Center.
Next Now-Generation Genomics: methods and applications for modern disease research Aaron J. Mackey, Ph.D. Center for Public Health.
High-Throughput Sequencing Technologies
DNA Technology and Genomics
Finishing the Human Genome
Analyzing your clone 1) FISH 2) “Restriction mapping” 3) Southern analysis : DNA 4) Northern analysis: RNA tells size tells which tissues or conditions.
Mapping protein-DNA interactions by ChIP-seq Zsolt Szilagyi Institute of Biomedicine.
High Throughput Sequencing Methods and Concepts
Library Preparation Application dependant, using standard molecular biological techniques. Fragment library oligo kit: (per library)$35 GeneAmp dNTP blend:
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
DNA Methylation mapping
Ion Torrent and Minion Relatively low cost ‘next generation’ sequencing Wendy Smith School of Computing Science, Alan Ward Newcastle University, UK.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Molecular Biology Dr. Chaim Wachtel May 28, 2015.
SEQUENCING – THE BENCHTOPS. Roche 454 Junior Same technology as 454 FLX Read length: 400 bases Paired-end 100,000 reads 12 hours (instrument time) Output.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
Trends Biomedical Science
Biology Chapter 9 & Honors Biology Chapter 13 Frontiers Of Biotechnology.
Lecture-5 ChIP-chip and ChIP-seq
Genomics I: The Transcriptome RNA Expression Analysis Determining genomewide RNA expression levels.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Introduction to Illumina Sequencing
BME 130 – Genomes Lecture 4 Sequencing technology II Next generation sequencing.
Next-generation sequencing technology
Research Techniques Made Simple: Next-Generation Sequencing:
Biotechnology.
DNA Sequencing Second generation techniques
Next generation sequencing
Next Generation Sequencing
Sequencing technologies
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Next-generation sequencing technology
PCR uses polymerases to copy DNA segments.
High-throughput sequencing techniques
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
PCR uses polymerases to copy DNA segments.
PCR uses polymerases to copy DNA segments.
High-Throughput Sequencing Technologies
High-Throughput Sequencing Technologies
Next-generation DNA sequencing
PCR uses polymerases to copy DNA segments.
PCR uses polymerases to copy DNA segments.
PCR uses polymerases to copy DNA segments.
Standard (Sanger) sequencing
PCR uses polymerases to copy DNA segments.
Presentation transcript:

High-Throughput Sequencing Technologies Biological Sequence Analysis BNFO 691/602 Spring 2014 Mark Reimers

Outline What can we do with next-generation sequencing? De novo sequencing of simple genomes Re-sequence individual variations Generate genome-wide quantitative data for a variety of assays What technologies are now available and which are up-and-coming? Roche, Illumina, SOLiD, Ion Torrent, etc…

What is High-Throughput Sequencing? Generating many thousands or millions of short (30 to 1,000 base) sequences by sequencing parts of longer (200+ base) DNA fragments Most research uses reads from one end of a fragment (single-end), but most technologies can be adapted to make paired-end reads on opposite strands __could add picture of fragment and read

Full Genome Re-sequencing has been done for many cancers and rare clinical disorders Many times mutations have been found in regulatory regions

Exome sequencing is a cost-effective to identify de novo protein coding mutations __ show how exome libraries are prepared exome libraries are prepared by capture on an array of synthetic oligos

Targeted re-sequencing of a few relevant genes can identify diverse critical mutations across a large number of cases

RNA-seq

ChIP-seq

DNA methylation profiling mC  C C  U After PCR C  C U  T PCR+Seq

DNAse Hyper-sensitivity DNAse I enzyme cuts DNA Much more likely to cut at open chromatin Two approaches: Cut slowly then fragment and sequence ends Cut rapidly then sequence short fragments

Mapping of chromatin interactions (5C) (courtesy Elemento lab)

HTS Technologies Roche-454 Illumina SOLiD Ion Torrent Newer Technologies Outlook

Founded by Jonathan Rothberg as a secret project (code-named ‘454’) within CuraGen

Roche 454 Sequencing Metzker, NG 2010

Roche 454 Sequencing

Roche 454 Peak Heights Data

Advantages & Drawbacks PRO Long reads are uniquely identifiable Relatively quick ~20 hours total CON Cost is relatively high Frequent errors in runs of bases Frequent G-A transitions

Best Uses of Roche 454 De novo small genome (prokaryote or small eukaryote genome) sequencing Metagenomics by16S profiling Used to be best for metagenomics by random sequencing new long reads from Illumina are competitive Targeted re-sequencing of small samples

Illumina (Solexa) Genome Analyzer and Flow Cell

Illumina On-Chip Amplification

Illumina (Solexa) Sequencing

Paired-End Illumina Method Paired-end reads are easy on Illumina because the clusters are generated by ligated linkers. Different linkers and primers are attached to each end

Advantages & Drawbacks PRO Very high throughput Most widespread technology so that comparisons seem easier CON Sequencing representation biases, especially at beginning Slow – up to a week for a run

Best Uses of Illumina Expression analysis (RNA-Seq) Chromatin Immunoprecipitation (ChIP-Seq) Metagenomics by random sequencing

SOLiD Sequencing by Oligonucleotide Ligation and Detection

SOLiD History George Church licensed his ‘polony’ technique to Agencourt Personal Genomics ABI acquired the SOLiD technology from Agencourt in 2006

SOLiD Preparation Steps Prepare either single or ‘mate-pair’ library from DNA fragments Attach library molecules to beads; amplify library by emulsion PCR Modify 3’ ends of clones; attach beads to surface

Emulsion PCR Emulsion PCR isolates individual DNA molecules along with primer-coated beads in aqueous droplets within an oil phase. A polymerase chain reaction (PCR) then coats each bead with clonal copies of the DNA molecule. The bead is immobilized for sequencing.

ABI SOLiD Sequencing Cycle

SOLiD Reads Each Base Twice Most bases are matched by two primers in different ligation cycles

SOLiD Color Coding Scheme Blue is color of homopolymer runs Mapping into color space If you translate color reads directly into base reads then every sequence with an error in the color calls will result in a frame-shift of the base calls. it is best to convert the reference sequence into color-space. There is one unambiguous conversion of a base reference sequence into color-space, but there are four possible conversions of a color string into base strings

Advantages & Drawbacks PRO Very high throughput Di-base ligation ensures built-in accuracy check Low error rate for low-coverage Can handle repetitive regions easily CON Strong cycle-dependent biases (can be modeled and partly overcome – see Wu et al, Nature Methods, 2011) Low quality color calls (Phred < 20) are common Reported problems with paired ends – most mapped tags don’t map to the same chromosome

Ion Torrent Sample Prep Emulsion PCR loads copies of unique sequences onto beads One bead is deposited in each well of a micro-machined plate

An Ion Torrent Chip

When a nucleotide is incorporated into a strand of DNA by a polymerase, a hydrogen ion is released From promotional material

Ion Torrent Sequencing Process As in 454, nucleotides are washed over the nascent strand in a prescribed sequence. Each time a nucleotide is incorporated, hydrogen ions are released and detected.

Ion Torrent Advantages & Drawbacks Loading Density PRO Very high throughput potential Very fast (an afternoon) CON Homopolymer runs are still a problem Very uneven loading of sequences wastes a lot of real estate on the chips No prospect of paired-ends Not many applications yet for their error model

Newest Machine – Ion Proton $150K per machine Ion Proton I chip has 165 million sensors Intended for exomes Ion Proton II chip has 660 million sensors 50X more than 318 chip Claim $1K genome this year

Newer Technologies Complete Genomics Pacific Biosciences Oxford Nanopore

Complete Genomics Service company only – no equipment sales ~$4,000 per human genome (2011 price) DNA Nanoball technology generates paired-end sequences plated at high density Sequenced by ligation

Pacific Biosciences Single-molecule real-time (SMRT) sequencing by circular strand technology using semiconductor technology Long reads promised at under $200 per genome High error rates reported

Oxford Nanopore Single-molecule sequencing by threading DNA through a protein nanopore GridION is a general technology for sequencing polymers by measuring current – can do polypeptides also