Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon.

Slides:



Advertisements
Similar presentations
Capillary Electrophoresis and the Human Genome
Advertisements

High-Throughput Sequencing Technologies
DNA Footprinting: Detecting DNA Protein Interactions DNA footprinting is another method that allows you to determine if a DNA binding protein interacts.
What if we want to know what allele(s) of beta-globin an individual has?
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Molecular Genomic Imaging Center (CEGS) Harvard / Wash U George Church, Rob Mitra Greg Porreca, Jay Shendure Sequencing by Ligation on Polony Beads with.
1 Next Generation Sequencing Itai Sharon November 11th, 2009 Introduction to Bioinformatics.
RNA-Seq An alternative to microarray. Steps Grow cells or isolate tissue (brain, liver, muscle) Isolate total RNA Isolate mRNA from total RNA (poly.
Gene Expression Analysis by SAGE. Gene Expression Some challenges: –Large number of genes How do you keep samples and equipment small and affordable?
DNA Sequencing How do you do it?. DNA Sequencing DNA sequencing – used to determine the actual DNA sequence of an organism. Using a computer, one can.
DNA Sequencing.
Lecture 10: DNA Quantitation.  Purpose of DNA quantitation  Quantitation Methods  Interchelating dyes  Slot blot  qPCR ▪ Taqman (Life Technologies.
7.1 cont’d: Sanger Sequencing SBI4UP MRS. FRANKLIN.
CS 6293 Advanced Topics: Current Bioinformatics
University of Oklahoma Genome Center4/14/12.
High-Throughput Sequencing Technologies
Automated DNA Sequencing LECTURE 7: Biotechnology; 3 Credit hours Atta-ur-Rahman School of Applied Biosciences (ASAB) National University of Sciences and.
DNA Sequencing Today, laboratories routinely sequence the order of nucleotides in DNA. DNA sequencing is done to: Confirm the identity of genes isolated.
Reading the Blueprint of Life
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
Chapter 19 – Molecular Genetic Analysis and Biotechnology
Recombinant DNA Technology………..
High Throughput Sequencing Methods and Concepts
DNA Fingerprinting. Use of DNA to Determine Identity DNA controls production of proteins DNA controls production of proteins Results in phenotype (eye.
Applications of DNA technology
Restriction Nucleases Cut at specific recognition sequence Fragments with same cohesive ends can be joined.
High Throughput Sequencing Methods and Concepts Cedric Notredame adapted from S.M Brown.
1 Chapter 2: DNA replication and applications DNA replication in the cell Polymerase chain reaction (PCR) Sequence analysis of DNA.
CHAPTER 7 DNA SEQUENCING - INTRODUCTION - SANGER DIDEOXY METHOD - AUTOMATED SEQUENCING - NEXT GENERATION OF SEQUENCING METHODS MISS NUR SHALENA SOFIAN.
Polymerase Chain Reaction (PCR) Developed in 1983 by Kary Mullis Major breakthrough in Molecular Biology Allows for the amplification of specific DNA fragments.
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
Sequencing DNA 1. Maxam & Gilbert's method (chemical cleavage) 2. Fred Sanger's method (dideoxy method) 3. AUTOMATED sequencing (dideoxy, using fluorescent.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
6.3 Advanced Molecular Biological Techniques 1. Polymerase chain reaction (PCR) 2. Restriction fragment length polymorphism (RFLP) 3. DNA sequencing.
Locating and sequencing genes
Chemical Synthesis, Amplification, and Sequencing of DNA (Part II)
1 PCR: identification, amplification, or cloning of DNA through DNA synthesis DNA synthesis, whether PCR or DNA replication in a cell, is carried out by.
Sanger or Dideoxy DNA Sequencing
PCR With PCR it is possible to amplify a single piece of DNA, or a very small number of pieces of DNA, over many cycles, generating millions of copies.
Laboratory: Unit 4: PCR for T-RFLP (pages 83-84) Lecture: Terminal Restriction Fragment Length Polymorphism (T-RFLP) Analysis In-Class Writing: peer review.
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao Chavan Maharashtra Open University, Nashik.
DNA Sequencing Hunter Jones, Mitchell Gage. What’s the point? In a process similar to PCR, DNA sequencing uses a mixture of temperature changes, enzymes.
DNA Sequencing A population of DNA fragments is generated. –One end is common to all fragments (the 5’ end of the sequencing primer). –The other end terminates.
Primers to map bsd deletion points on genomic DNA NameForwardReverse Amplicon size (base pairs) A15’-CCACGGATGGAGTGAGTTCT-3’5’-GCCCCCAAGATGAGGATTAT-3’931.
Arun Kumar. B M.Sc 1st Year Biotechnology SSBS
Statistical Genomics Zhiwu Zhang Washington State University Lecture 6: Genotype.
Next-generation sequencing technology
Next generation sequencing
Lecture 6: Genotype by sequencing
Figure 1. Single-molecule analysis through microfluidic enrichment of rolling circle amplification products. (A) Sample preparation and target detection.
Next Generation Sequencing
Next-generation sequencing technology
copying & sequencing DNA
Volume 1, Issue 1, Pages (February 2002)
AMPLIFYING AND ANALYZING DNA.
Lecture 6: Genotype by sequencing
SOLEXA aka: Sequencing by Synthesis
Primers 2 primers are required for exponential amplification
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
DNA Sequencing A population of DNA fragments is generated.
DNA Sequencing A population of DNA fragments is generated.
DNA Sequencing A population of DNA fragments is generated.
ULTRASEQUENCING. Next Generation Sequencing: methods and applications.
The impact of next-generation sequencing technology on genetics
DNA and the Genome Key Area 8a Genomic Sequencing.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
Mass and Molar Ratios of DNA
Standard (Sanger) sequencing
SBI4U0 Biotechnology.
Presentation transcript:

Accurate Multiplex Polony Sequencing of an Evolved Bacterial Genome Jay Shendure, Gregory J. Porreca, Nikos B. Reppas, Xiaoxia Lin, John P. McCutcheon. Abraham M. Rosenbaum, Michael D. Wang, Kun Zhang, Robi D. Mitra, George M. Church Science 2005 vol 309, pg

Authors - BA 1973: Duke University - PhD 1984 :Harvard University - Professor of Genetics at Harvard Medical School - Director of the Harvard/MIT DOE Genomes-to-Life Center - Director of the Harvard/MIT/WashU NHGRI CEGS George M. Church Jay Shendure - MD-PhD Student in the Church Lab

Background - Sanger Sequencing Developed By Fredrick Sanger Uses ddNTPs to block DNA synthesis DNA fragments run in gel and analyzed

Sequence analysis - Then Use radioactively labeled dNTPs Run samples in four columns Expose your film Manually determine sequence

Sequence analysis - Now Use fluorescently labeled ddNTPs Run samples together in a capillary Fluorescence detected with a laser Computer determines sequence

The Problem Electrophoretic methods may very well be reaching their limits New sequencing method need to be developed if we are to achieve the goal of the $1000 genome

The Problem Electrophoretic methods may very well be reaching their limits New sequencing method need to be developed if we are to achieve the goal of the $1000 genome The Solution: Use a cyclic array method

Step One: Generate DNA Fragments Fig. 1a Genomic DNA is sheared into ~1kb fragments (purified from a gel) Circularized using an “universal linker” containing Mme1 cut sites Rolling circle amplification Mme1 is a downstream cutter, digestion results in bps genomic DNA tags flanking the linker Universal sequences are then ligated to the 5’ and 3’ ends of the fragment

Step Two: Emulsion PCR Fig. 1b Critically dilute DNA into PCR mix containing 1µm paramagnetic beads containing one of the PCR primers Make water-oil emulsion, creates mini reaction chambers containing 1 bead and single DNA fragment PCR reactions will result in each bead being coated in a single fragment

Step Three: Enrichment & Monolayering Fig. 1c Non-magnetic, low density “capture” beads allow for enrichment of amplified fraction of PCR beads via centrifugation Beads are then immobilized and mounted for automated sequencing

Step Four: Cycles of Sequencing and Imaging Fig. 1d Sequencing is done in automated cycles Computer algorithm identifies each bead and determines which color it is fluorescing each round

How Sequencing Works Beads are incubated with one of four anchor primers flanking the genomic DNA tags Excess primer is washed away Supplementary Fig. 7a+b

How Sequencing Works Next a mixture of degenerate nonamers are ligated to the anchor primer Each is specific only at one position the “query position” and is labeled with a different fluorphore Excess nonamers are washed away Supplementary Fig. 7a+b

How Sequencing Works Image Strip off the ligated primer and repeat with new anchor Accurate for first 6 bases in 5’-3’ direction and 7 bases in 3’-5’ direction Identifies 26 bp per amplicon Map it against reference genome Supplementary Fig. 7a+b

Raw Data Acquisition and Base Calling Fig. 2a + b Top: Bright field image of beads Bottom: false colored depiction of four fluorescent images acquired during a single ligation reaction Image represents 0.01% of total area

Raw Data Acquisition and Base Calling Fig. 2c Tetrahedral representation of the data obtained from a single image cycle Data points clustered around the 4 potential base calls for a single position on the amplicon

Results Table 1 ~1.16 million of 1.6 million reads were accurately mapped to reference genome ~30.1 million bases of resequencing data with raw accuracy of 99.7% 91.4% of genome covered

Results Table random single nucleotide changes (SNC) were added to the reference sequence Data represents 2 independent sets of SNC Were able to identify all the SNC present in the dataset when there was at least 2x coverage

Results By looking through the data set for aberrantly mapped mate-paired tags, were able to identify rearrangements C: 776bp deletion E: heterogeneous inversion These were know features Fig. 3c+e

Results Detected 3 novel polymorphisms in the evolved strain These were confirmed by Sanger sequencing Table 2

Conclusions Current high-throughput centers sequence at a rate of 20 bases per instrument second and at a cost of $1.00 per kb This method has an overall sequence rate of ~140 bp/s and a cost of $0.08 per kb, in house enzyme production could drop this number even lower Open source technology. Reasonably cheap to set up equipment ~ $140,000