Introduction to Sequencing

Slides:



Advertisements
Similar presentations
The Human Genome Project
Advertisements

Frontiers of Genetics Chapter 13.
Sequencing a genome. Definition Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
SEQUENCING-related topics 1. chain-termination sequencing 2. the polymerase chain reaction (PCR) 3. cycle sequencing 4. large scale sequencing stefanie.hartmann.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
DNA Sequencing. CS273a Lecture 3, Spring 07, Batzoglou DNA sequencing How we obtain the sequence of nucleotides of a species …ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
Sequencing Informatics Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics.
DNA Sequencing and Assembly. DNA sequencing How we obtain the sequence of nucleotides of a species …ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT CTAGCTAGACTACGTTTTA.
CS273a Lecture 2, Autumn 10, Batzoglou DNA Sequencing (cont.)
DNA Sequencing. CS273a Lecture 3, Autumn 08, Batzoglou DNA sequencing How we obtain the sequence of nucleotides of a species …ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT.
Human Genome Sequence and Variability Gabor T. Marth, D.Sc. Department of Biology, Boston College Medical Genomics Course – Debrecen, Hungary,
DNA Sequencing. Next few topics DNA Sequencing  Sequencing strategies Hierarchical Online (Walking) Whole Genome Shotgun  Sequencing Assembly Gene Recognition.
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Reading the Blueprint of Life
DNA Technology and Genomics
Genome Sequencing & App. of DNA Technologies Genomics is a branch of science that focuses on the interactions of sets of genes with the environment. –
Recombinant DNA Technology for the non- science major.
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Mouse Genome Sequencing
Chapter 13 Section 1 DNA Technology. DNA Identification Only.10% of the human genome varies from person to person 98% of our genetic makeup does not code.
Chapter 20 DNA Technology. DNA Cloning  Gene cloning allows scientists to work with small sections of DNA (single genes) in isolation. –Exactly what.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
© 2005 Prentice Hall Inc. / A Pearson Education Company / Upper Saddle River, New Jersey Chapter 3 Fundamentals of Mapping and Sequencing Basic principles.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
A Sequenciação em Análises Clínicas Polymerase Chain Reaction.
Chapter 21 Eukaryotic Genome Sequences
Restriction Fragments and Mapping Restriction Fragment Analysis – System used to compare the genes and DNA sequences between individuals in a population.
Review from last week. The Making of a Plasmid Plasmid: - a small circular piece of extra-chromosomal bacterial DNA, able to replicate - bacteria exchange.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Genome Characterization DNA sequence-ULTIMATE Map DNA sequencing-methods Assembly/sequencing BIO520 BioinformaticsJim Lund Assigned reading: Service 2006.
15.2, slides with notes to write down
Human Genome.
GENE SEQUENCING. INTRODUCTION CELL The cells contain the nucleus. The chromosomes are present within the nucleus.
Biotechnology and Genomics Chapter 16. Biotechnology and Genomics 2Outline DNA Cloning  Recombinant DNA Technology ­Restriction Enzyme ­DNA Ligase 
GENETIC ENGINEERING CHAPTER 20
Genetic Engineering Genetic engineering is also referred to as recombinant DNA technology – new combinations of genetic material are produced by artificially.
Locating and sequencing genes
DNA Technology and Genomics
Chapter 5 Sequence Assembly: Assembling the Human Genome.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Genome sequencing and annotation Week 2 reading assignment - pages 63-78, 93-98, Boxes 2.1 and don’t worry about details of similarity scoring.
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Objectives: Outline the steps involved in sequencing the genome of an organism. Outline how gene sequencing allows for genome wide comparisons between.
 We have mentioned previously that it is possible to insert DNA from one organism to another ◦ These are known as transgenic organisms  These have many.
Virginia Commonwealth University
Restriction Fragments and Mapping
DNA Sequencing -sayed Mohammad Amin Nourion -A’Kia Buford
Genomics Sequencing genomes.
15.2, slides with notes to write down
Pre-genomic era: finding your own clones
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
Section 2 Genetics and Biotechnology DNA Technology
Relationship between Genotype and Phenotype
The Human Genome Project
Bellwork: What is the human genome project. What was its purpose
Biology, 9th ed,Sylvia Mader
DNA Sequencing The DNA from the genome is chopped into bits- whole chromosomes are too large to deal with, so the DNA is broken into manageably-sized overlapping.
Genome sequencing informatics
DNA and the Genome Key Area 8a Genomic Sequencing.
A Sequenciação em Análises Clínicas
Biology, 9th ed,Sylvia Mader
Sequence the 3 billion base pairs of human
Relationship between Genotype and Phenotype
Presentation transcript:

Introduction to Sequencing BI420 – Introduction to Bioinformatics Introduction to Sequencing

The nuclear genome (chromosomes)

The genome sequence the primary template on which to outline functional features of our genetic code (genes, regulatory elements, secondary structure, tertiary structure, etc.)

Completed genomes Humans, mouse ~3,000 Mb D. melanogaster ~120 Mb C. elegans ~100 Mb E. coli ~5 Mb

Main genome sequencing strategies Whole-genome shotgun sequencing Celera Genomics, Inc. Clone-based shotgun sequencing Human Genome Project

Hierarchical genome sequencing BAC (bacterial artificial chromosome) library construction clone mapping shotgun subclone library construction sequencing sequence reconstruction (sequence assembly) Lander et al. Nature 2001

BACs The genome is broken down into chunks 100-200 kb long, and these are individually integrated into BACs (bacterial artificial chromosomes). The BACs are then grown in bacterial cells to produce a large number of duplicate copies of the 100-200kb chunk. The original location of each BAC in the human genome is “fingerprinted” based on the lengths of sequences generated when restriction enzymes are applied.

Clone mapping – “sequence ready” map Restriction enzymes (ex. A, B, C, D, E) are applied to the chunk, generating a set of fragments of particular lengths. These provide a “fingerprint” for where the chunk came from in the genome.

Hierarchical genome sequencing BAC library construction clone mapping shotgun subclone library construction sequencing/read processing sequence reconstruction (sequence assembly) Lander et al. Nature 2001

Shotgun subclone library construction cloning vector BAC primary clone subclone insert sequencing vector The BAC sequence is fragmented into short reads, and these are sequenced.

Hierarchical genome sequencing BAC library construction clone mapping shotgun subclone library construction sequencing/read processing sequence reconstruction (sequence assembly) Lander et al. Nature 2001

Traditional gel-based sequencing Polymerize off single-strand in the presence of some radioactive dideoxy nucleotides, which cap a DNA sequence and will leave a film signal due to radioactivity. Then run in a gel, which separates segments by length. Repeat 4 times with each of dideoxy A, C, G and T

Sequencing using fluorescence Use dideoxy nucleotides that fluoresce under UV light with a different color for each of A, C, G, T. Instead of gel do electrophoresis in a capillary tube.

Robotic automation Lander et al. Nature 2001

Base calling PHRED base = A Q = 40 Software such as PHRED is used to interpret the chromatogram and generate base calls.

Vector clipping Because the sequence was generated in a BAC, there may be some overhangs of bacterial sequence (pink). These should be removed.

Hierarchical genome sequencing BAC library construction clone mapping shotgun subclone library construction sequencing/read processing sequence reconstruction (sequence assembly) Lander et al. Nature 2001

Sequence assembly PHRAP Software such as PHRAP is used to find sequences which overlap one another

Repetitive DNA may confuse assembly

region of low sequence coverage and/or quality Sequence completion (finishing) region of low sequence coverage and/or quality gap CONSED, AUTOFINISH Software such as CONSED produces an assembled genome, taking into account base quality.

Main genome sequencing strategies Whole-genome shotgun sequencing Celera Genomics, Inc. Clone-based shotgun sequencing Human Genome Project

Whole-genome shotgun sequencing characteristics PROS WGS less labor-intensive than clone-based sequencing Faster Very effective in re-sequencing projects where the scaffold of the genome is known, e.g. human genome population studies CONS WGS has greater uncertainty in mapping of read positions Difficult to use WGS to de novo assemble genomes that have not previously been sequenced or where structure is not known (seawater sampling, soil sampling, or other metagenomic studies)

Current usage Whole genome shotgun is used for re-sequencing studies, i.e. those where at least one individual of the species has already been fully sequenced. This is because indviduals in a species have very similar genomes, making it easy to assemble any new individual off the existing scaffold.

DNA Sequencing: Instructor Demo Platform: UNIX (bioclass.bc.edu) Instructions: http://bioinformatics.bc.edu/~marth/BI820/pages/sequencingComputerSession.html Data: bioclass:~marth/CLONE.tar.gz or http://bioinformatics.bc.edu/~marth/Data/CLONE.tar.gz