Bacterial Genome Finishing Using Optical Mapping Dibyendu Kumar, Fahong Yu and William Farmerie Interdisciplinary Center for Biotechnology Research, University.

Slides:



Advertisements
Similar presentations
Accurate Assembly of Maize BACs Patrick S. Schnable Srinivas Aluru Iowa State University.
Advertisements

Assembly of DNA Graphs whose Edges are Helix Axes Phiset Sa-Ardyen*, Natasa Jonoska** and Nadrian C. Seeman* *New York University, New York, NY **University.
Sequencing a genome. Definition Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
GENOME SEQUENCING AND OBJECTIVES
Max BachourJessica Chen. Shotgun or 454 sequencing High throughput sequencing technique that can collect a large amount of data at a fast rate. Works.
Lecture 14 Genome sequencing projects
9 Genomics and Beyond Brief Chapter Outline
DNA Sequencing Lecture 9, Tuesday April 29, 2003.
Genome Sequence Assembly: Algorithms and Issues Fiona Wong Jan. 22, 2003 ECS 289A.
DNA Sequencing – “Plus and Minus” Plus –Incubate with T4 DNA Polymerase and single dNTP –T4 Polymerase degrades 3’ ends in absence of dNTP –Fractionated.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Class 02: Whole genome sequencing. The seminal papers ``Is Whole Genome Sequencing Feasible?'' ``Whole-Genome DNA.
DNA Sequencing. The Walking Method 1.Build a very redundant library of BACs with sequenced clone- ends (cheap to build) 2.Sequence some “seed” clones.
Mining SNPs from EST Databases Picoult-Newberg et al. (1999)
Novel multi-platform next generation assembly methods for mammalian genomes The Baylor College of Medicine, Australian Government and University of Connecticut.
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
Sequencing and Assembly Cont’d. CS273a Lecture 5, Aut08, Batzoglou Steps to Assemble a Genome 1. Find overlapping reads 4. Derive consensus sequence..ACGATTACAATAGGTT..
CS273a Lecture 4, Autumn 08, Batzoglou Hierarchical Sequencing.
CS273a Lecture 2, Autumn 10, Batzoglou DNA Sequencing (cont.)
Genome sequencing and assembling
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Sequencing a genome and Basic Sequence Alignment
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Reading the Blueprint of Life
De-novo Assembly Day 4.
Mouse Genome Sequencing
CS 394C March 19, 2012 Tandy Warnow.
CUGI Pilot Sequencing/Assembly Projects Christopher Saski.
A hierarchical approach to building contig scaffolds Mihai Pop Dan Kosack Steven L. Salzberg Genome Research 14(1), pp , 2004.
Phred/Phrap/Consed Analysis A User’s View Arthur Gruber International Training Course on Bioinformatics Applied to Genomic Studies Rio de Janeiro 2001.
Manipulating DNA.
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Laboratory for Molecular and Computational Genomics Optical Mapping of E-coli O157:H7 Alex Lim.
O PTICAL M APPING AS A M ETHOD OF W HOLE G ENOME A NALYSIS M AY 4, 2009 C OURSE : 22M:151 P RESENTED BY : A USTIN J. R AMME.
Sequence assembly using paired- end short tags Pramila Ariyaratne Genome Institute of Singapore SOC-FOS-SICS Joint Workshop on Computational Analysis of.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Sequencing a genome. Approximate Molecular Dynamics: New Algorithms with Applications in Protein Folding Author: Qun (Marc) Ma Predicting the 3D native.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
Cutting and Pasting DNA The cutters are called restriction enzymes, they cut DNA at specific nucleotide sequences.
SIZE SELECT SHEAR Shotgun DNA Sequencing (Technology) DNA target sample LIGATE & CLONE Vector End Reads (Mates) SEQUENCE Primer.
Sequencing a genome and Basic Sequence Alignment
The Changing Face of Sequencing
How will new sequencing technologies enable the HMP? Elaine Mardis, Ph.D. Associate Professor of Genetics Co-Director, Genome Sequencing Center Washington.
Problems of Genome Assembly James Yorke and Aleksey Zimin University of Maryland, College Park 1.
Wageningen, April 24-25, 2008 II Tomato Finishing Workshop Chromosome 12 Update ENEA, Rome University of Naples ‘Federico II’ CRIBI and Univ. of Padua.
Applied Bioinformatics Week 5. Topics Cleaning of Nucleotide Sequences Assembly of Nucleotide Reads.
Human Genome.
Lindsay A. Shearer1, Lorinda K
Mojavensis: Issues of Polymorphisms Chris Shaffer GEP 2009 Washington University.
COMPUTATIONAL GENOMICS GENOME ASSEMBLY
Accessing and visualizing genomics data
1. Assembly by alignment Instead of overlap-layout-consensus we use alignment-consensus 2.
Whole-Genome Optical Mapping
Chapter 5 Sequence Assembly: Assembling the Human Genome.
454 Genome Sequence Assembly and Analysis HC70AL S Brandon Le & Min Chen.
OPERA highthroughput paired-end sequences Reconstructing optimal genomic scaffolds with.
Learning Hidden Graphs Hung-Lin Fu 傅 恆 霖 Department of Applied Mathematics Hsin-Chu Chiao Tung Univerity.
JERI DILTS SUZANNA KIM HEMA NAGRAJAN DEEPAK PURUSHOTHAM AMBILY SIVADAS AMIT RUPANI LEO WU Genome Assembly Final Results
Cse587A/Bio 5747: L2 1/19/06 1 DNA sequencing: Basic idea Background: test tube DNA synthesis DNA polymerase (a natural enzyme) extends 2-stranded DNA.
Quality Control & Preprocessing of Metagenomic Data
Naotoshi Seo, Hiroshi Toyoizumi Performance Evaluation Laboratory
Genomics Sequencing genomes.
Denovo genome assembly of Moniliophthora roreri
Genome sequence assembly
6. Sequencing Genomes.
Single-molecule sequencing and chromatin conformation capture enable de novo reference assembly of the domestic goat genome.
CSCI 1810 Computational Molecular Biology 2018
A Physical Map, Including a BAC/PAC Clone Contig, of the Williams-Beuren Syndrome– Deletion Region at 7q11.23  Risa Peoples, Yvonne Franke, Yu-Ker Wang,
Presentation transcript:

Bacterial Genome Finishing Using Optical Mapping Dibyendu Kumar, Fahong Yu and William Farmerie Interdisciplinary Center for Biotechnology Research, University of Florida, Gainesville, FL Optical Mapping Method Optical circular XhoI map of a bacterial genome. The outermost red circle represents the consensus map created from the single-molecule maps shown as arcs. Different arc color is random and for contrast. Abstract Conclusions Optical chip containing single DNA molecule After digestion with restriction enzyme Size and order of fragments deduction Vertical lines indicate the location of restriction enzyme  Based on our experience, we strongly recommend including optical mapping in normal genome sequencing pipeline.  It is relatively efficient, very fast and independent way to validate a bacterial genome assembly.  Paired end reads are critical in building scaffold that can be aligned to optical map. Without paired end reads only a minority of contigs align to map. Many contigs remains as orphan.  Effectiveness of optical map depends on choice of enzyme used for mapping. Sometimes, with some finishing jobs second mapping is critical.  An optical map increases the speed of finishing and decreases the overall cost of the genome sequencing project.  No physical library, unknown Gap sizes  Sequence read length  sequencing errors such as, carry forward, homopolymer length, incomplete extension, etc  genome rearrangement Scaffolds Contigs Optical Mapping Limitation: Assembled Contig Length Anchoring and Orienting Contigs to Optical Map Detecting Misassembly Optical Comparative Mapping Challenges to Assembly Estimating Gap Size Optical map [PvuII] Fragment: 64 Length: 19,211 bo Cut Position: 423,252 The cost-efficiency of modern next-generation DNA sequencing technology allows investigators to undertake genome projects that were not affordable earlier. These sequencing techniques also bring new problems in genome assembly and finishing. Our core laboratory has several ongoing bacterial genome projects addressing a variety of challenges to genome assembly and closure. Several factors contribute to these challenges; including sequence repeats versus read length, intrinsic sequencing errors, and genome rearrangements. Together these factors complicate genome closure when using shotgun DNA sequencing data alone. In the absence of a physical map, we adopted whole-genome optical mapping as a tool to validate bacterial genome assemblies. OpGen, Inc. (Gaithersburg, Maryland) prepared the optical maps used in these projects. Briefly, an optical map is a complete genome restriction map deduced from a number of partial restriction maps. Optical maps are generated by spreading carefully extracted genomic DNA onto a treated glass surface containing many narrow channels, followed by digestion in situ with restriction enzymes. About 50–100 overlapping partial optical contigs are combined by alignment software to produce a contiguous whole genome restriction map. The contiguous optical map can be aligned and compared with the in silico restriction map of contigs obtained from whole-genome assembly. We successfully used optical mapping for guiding the closure of four closely related bacterial genomes. The optical map not only orient scaffolds but also allowed us to identify assembly errors, which was not possible using shotgun DNA sequencing data alone. Thus, we conclude that, in order to ensure the accuracy of a finished bacterial genome and to accelerate overall finishing process, optical mapping is an important tool to de-novo assemblies generated by next-generation DNA sequencing. Optical Mapping Limitation: Missing Fragments Detecting Deletion in Assembled Sequence Assembled Genome Optical Map Mismatch in Fragment length, missing repeat Assembled Genome Optical Map Missed fragment