V Improvements to 3kb Long Insert Size Paired-End Library Preparation Naomi Park, Lesley Shirley, Michael Quail, Harold Swerdlow Wellcome Trust Sanger.

Slides:



Advertisements
Similar presentations
Next-Generation Sequencing: Methodology and Application
Advertisements

Section G Gene manipulation
Fosmid-pools and 40-kb mate-pairs Children’s Hospital Oakland Research Institute (CHORI) BACPAC Resources Center.
Next–generation DNA sequencing technologies – theory & practice
BCM208 Metabolic Biochemistry Topic 7: Gene metabolism and Expression.
MCB Lecture #9 Sept 23/14 Illumina library preparation, de novo genome assembly.
The 454 and Ion PGM at the Genomics Core Facility Dr. Deborah Grove, Director for Genetic Analysis Genomics Core Facility Huck Institutes of the Life Sciences.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
The SOLiD System: Next-Generation Sequencing Overview of the SOLiD System –  Scalable  Accurate Ultra High Throughput  Flexible  Mate Pairs.
Novel multi-platform next generation assembly methods for mammalian genomes The Baylor College of Medicine, Australian Government and University of Connecticut.
High Throughput Sequencing
CS 6293 Advanced Topics: Current Bioinformatics
11 © 2009 PerkinElmer © 2010 PerkinElmer November 20, 2012 DNA Services Overview.
Diabetes and Endocrinology Research Center The BCM Microarray Core Facility: Closing the Next Generation Gap Alina Raza 1, Mylinh Hoang 1, Gayan De Silva.
NGS Data Generation Dr Laura Emery. Overview The NGS data explosion Sequencing technologies An example of a sequencing workflow Bioinformatics challenges.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
De-novo Assembly Day 4.
Section G Gene manipulation
Introduction to next generation sequencing Rolf Sommer Kaas.
DNA Methylation mapping
Genomics – Next-Gen sequencing and Microarrays
Sequence assembly using paired- end short tags Pramila Ariyaratne Genome Institute of Singapore SOC-FOS-SICS Joint Workshop on Computational Analysis of.
Ion Torrent and Minion Relatively low cost ‘next generation’ sequencing Wendy Smith School of Computing Science, Alan Ward Newcastle University, UK.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Sequencing at the speed of life. Simple is beautiful.
Detection of Genomic Rearrangements in K562 cells using Paired End Sequencing Rosa Maria Alvarez Massachusetts Institute of Technology Class of 2009.
Chromatin Immunoprecipitation DNA Sequencing (ChIP-seq)
Stratton Nature 45: 719, 2009 Evolution of DNA sequencing technologies to present day DNA SEQUENCING & ASSEMBLY.
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Double-Ended Shotgun Sequencing of PA14 Daniel G. Lee 10/30/02.
Sequencing Kristian Stevens Mark Crepeau Charis Cardeno Charles H. Langley University of California, Davis Evolution.
Genomics Core Facility at UNH: High-Throughput Sequencing on the Illumina HiSeq 2500 Platform Project Consultation Sample Submission Library Creation Illumina.
1.Data production 2.General outline of assembly strategy.
billion-piece genome puzzle
Ultra-High Throughput DNA Sequencing on the 454/Roche GS-FLX
Locating and sequencing genes
Comparison between old generation and new generation of sequencing machines.
 CHANGE!! MGL Users Group meetings will now be on the 1 st Monday of each month 3:00-4:00 Room Note the change of time and room.
Lecture 12 RNA – seq analysis.
Adapter and quality trimming Mick Watson Director of ARK-Genomics The Roslin Institute.
Library QA & QC Day 1, Video 3
Arun Kumar. B M.Sc 1st Year Biotechnology SSBS
Introduction to Illumina Sequencing
RAD – technology overview Baird et al PLoS ONE.
Will 10x technology make us rethink genome assemblies?
Next-generation sequencing technology
Research Techniques Made Simple: Next-Generation Sequencing:
Detection of FLT3 Internal Tandem Duplication in Targeted, Short-Read-Length, Next- Generation Sequencing Data  David H. Spencer, Haley J. Abel, Christina.
Sequencing technologies
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Supplementary figures
Next-generation sequencing technology
Primer-Dimer Formation: The Problem and the Solution
Small RNA Sample Preparation
Detection of FLT3 Internal Tandem Duplication in Targeted, Short-Read-Length, Next- Generation Sequencing Data  David H. Spencer, Haley J. Abel, Christina.
mRNA Sequencing Sample Preparation
2nd (Next) Generation Sequencing
Hybrid Capture and Next-Generation Sequencing Identify Viral Integration Sites from Formalin-Fixed, Paraffin-Embedded Tissue  Eric J. Duncavage, Vincent.
ChIP DNA Sample Preparation
Molecular Diagnosis of Autosomal Dominant Polycystic Kidney Disease Using Next- Generation Sequencing  Adrian Y. Tan, Alber Michaeel, Genyan Liu, Olivier.
Massively Parallel Sequencing: The Next Big Thing in Genetic Medicine
Chapter 9 Molecular Genetic Techniques and Genomics
Next-Generation Sequencing Strategies Enable Routine Detection of Balanced Chromosome Rearrangements for Clinical Diagnostics and Genetic Research  Michael E.
RNA sequencing (RNA-Seq) and its application in ovarian cancer
Digital Gene Expression – Tag Profiling Sample Preparation
By Keisha Persaud.
BF nd (Next) Generation Sequencing
(Top) Construction of synthetic long read clouds with 10× Genomics technology. (Top) Construction of synthetic long read clouds with 10× Genomics technology.
Genomic DNA Sample Preparation
Genomic & RNA Profiling Core Facility
Presentation transcript:

v Improvements to 3kb Long Insert Size Paired-End Library Preparation Naomi Park, Lesley Shirley, Michael Quail, Harold Swerdlow Wellcome Trust Sanger Institute, Hinxton, Cambridge, UK Long insert size paired-end libraries, also known as mate-pair libraries, are highly valuable for determining the orientation and relative positions of contigs produced by de novo sequencing and assembly. Construction of these libraries can be problematic in the case of low sample availability and/or quality. Additionally, the number of libraries generated in parallel is limited by the complex nature of the library preparation steps. Preparation of a 3kb long insert size paired-end library can broadly be described in the following steps: Introduction At Sanger the Isolated mate-pair undergoes Illumina library preparation (end repair, A-tail, Illumina adapter ligation and amplification), before sequencing on the Miseq or Hiseq. The forward and reverse reads map to the reference genome, ~3kb apart, in an outward configuration: Shearing This poster summarises how adjustments made to steps within the library construction can improve the quality of the final mate-pair library, and/or enable library construction from as little as 800ng of gDNA of varying integrity. We have used the Hydroshear and Covaris to shear gDNA to 3kb with comparable size distributions. The same gDNA sample, sheared using Covaris blue miniTUBEs (manufacturer recommended settings) and the Hydroshear (std assembly, speed code 13, 20 cycles): Hydroshear Covaris The Hydroshear and Covaris both have unique advantages and disadvantages: AdvantageDisadvantage Hydroshear Flexibility - Parameters can be altered to generate desired fragment size Low throughput - frequent user intervention required Assemblies may become blocked by particulates Covaris miniTUBE Higher throughput – upto 24 samples can be sheared in 1 hour using LE220 without user intervention High recovery – samples are contained within the miniTUBE, minimising sample loss The ability to adjust parameters to generate desired fragment size is limited by miniTUBE type; red (5kb) and blue (3kb) Due to the advantages of the Covaris miniTUBE, we now use this method for 3kb mate-pair projects with limited sample availability and/or high sample number. For mate-pair projects in which the desired insert size cannot be generated using Covaris miniTUBEs, the Hydroshear is used. Circularisation The length and range of the material which undergoes circularisation during mate-pair construction determines the gap size of the paired reads of the final library. The method of size-selection used at Sanger is determined by the final library gap size required and the availability of input material. We have compared three different methods of 3kb mate-pair library size-selection, manual 0.7% agarose gel, 0.75% Pippin (Sage Science), and Ampure XP bead (resuspended in an in-house modified buffer). Pre size-selected, 3kb sheared fragments (red) are overlaid with post size selected fragments (blue) in the bioanalyser traces below: 3.4kb 3.9kb 1.0kb 2.0kb 3.4kb 1.0kb 2.5kb 3.4kb 1.0kb 1.5kb 0.7% Agarose gel 0.75% Pippin Ampure XP in modified buffer METHOD~YIELD (% of total input) Mapped Library ~20% In the case of ≥5 µg starting material, the Pippin 0.75% cassette is our method of choice. In contrast to a manual gel, it is complete in less than 2 hours and successfully removes smaller fragments. However, as the 0.75% Pippin cassette settings are dependant upon input quantity and size, these must be established first. In the case of µg starting material, inexpensive size-selection with Ampure XP in modified buffer removes the majority of fragments <1.5 kb. This enables a degree of size-selection when samples are limited, in addition to enabling an automated, high throughput size-selection method. 16 hr 1.5 hr 0.3hr ~80% Size-Selection Illumina’s mate-pair sequencing protocol enables the production of 2-5kb libraries from a single kit, which are ready for sequencing on the Illumina platform. Other sequencing platforms employ alternative methods of circularisation within their mate-pair protocols, which can be adapted to generate libraries for Illumina sequencing. We have sequenced Illumina adapted 3kb libraries, using circularisation methodology from Illumina, Life Technologies and Roche. A summary is shown in the following table: Illumina SOLiD 4SOLiD 5500 Roche 454 Circularisation Method 3’ Biotinylation Blunt Ended Ligation Random Shearing Adapter biotinylated 2bp hybridisation and ligation Nick translation and digestion Adapter biotinylated Longer bp hybridisation Nick translation and digestion Adapter biotinylated Cre/lox recombination Random shearing Positives Single kit High number of outward facing reads Sequence even either side of adapter High number of outward facing reads Greater efficiency of circularisation than SOLiD 4 Sequence even either side of adapter Cre/lox recombination is highly efficient for circularisation Negatives Biotin can insert into nicked DNA, causing inward facing reads Circularisation inefficient Sequence uneven surrounding junction Low yield from circularisation Requires both left and right adapter ligation, 50% of adapter ligated fragments will be unable to circularise Circularisation reaction involves heating to 70°C Requires both left and right adapter ligation, 50% of ligations will be unable to circularise Sequence uneven surrounding junction Future Work 1kb 2kb 3kb 4kb 5kb Due to the positives and negatives of each commercial circularisation method, we have designed our own method to address these. There is demand for mate-pair libraries on a larger scale. To address this we are looking to optimise a mate-pair workflow suitable for automation. Additionally, we are looking to optimise a method to improve the generation of larger mate-pair libraries.