Human Genome.

Slides:



Advertisements
Similar presentations
Genomics – The Language of DNA Honors Genetics 2006.
Advertisements

Sequencing a genome. Definition Determining the identity and order of nucleotides in the genetic material – usually DNA, sometimes RNA, of an organism.
DNA fingerprinting Every human carries a unique set of genes (except twins!) The order of the base pairs in the sequence of every human varies In a single.
SEQUENCING-related topics 1. chain-termination sequencing 2. the polymerase chain reaction (PCR) 3. cycle sequencing 4. large scale sequencing stefanie.hartmann.
Cloning lab results Cloning the human genome Physical map of the chromosomes Genome sequencing Integrating physical and recombination maps Polymorphic.
9 Genomics and Beyond Brief Chapter Outline
DNA Sequencing Lecture 9, Tuesday April 29, 2003.
Genome Sequence Assembly: Algorithms and Issues Fiona Wong Jan. 22, 2003 ECS 289A.
Physical Mapping I CIS 667 February 26, Physical Mapping A physical map of a piece of DNA tells us the location of certain markers  A marker is.
Class 02: Whole genome sequencing. The seminal papers ``Is Whole Genome Sequencing Feasible?'' ``Whole-Genome DNA.
DNA Sequencing. The Walking Method 1.Build a very redundant library of BACs with sequenced clone- ends (cheap to build) 2.Sequence some “seed” clones.
DNA Sequencing and Assembly
The Human Genome Race. Collins vs. Venter Collins Venter.
16 and 20 February, 2004 Chapter 9 Genomics Mapping and characterizing whole genomes.
3 September, 2004 Chapter 20 Methods: Nucleic Acids.
CS273a Lecture 4, Autumn 08, Batzoglou Hierarchical Sequencing.
DNA Sequencing and Assembly. DNA sequencing How we obtain the sequence of nucleotides of a species …ACGTGACTGAGGACCGTG CGACTGAGACTGACTGGGT CTAGCTAGACTACGTTTTA.
Human Genome Project. Basic Strategy How to determine the sequence of the roughly 3 billion base pairs of the human genome. Started in Various side.
CS273a Lecture 2, Autumn 10, Batzoglou DNA Sequencing (cont.)
Lecture 2. Genome sequencing What good is it? 9/2/09.
DNA Sequencing. Next few topics DNA Sequencing  Sequencing strategies Hierarchical Online (Walking) Whole Genome Shotgun  Sequencing Assembly Gene Recognition.
Genome sequencing and assembling
Genome sequencing. Vocabulary Bac: Bacterial Artificial Chromosome: cloning vector for yeast Pac, cosmid, fosmid, plasmid: cloning vectors for E. coli.
Genome Analysis Determine locus & sequence of all the organism’s genes More than 100 genomes have been analysed including humans in the Human Genome Project.
Molecular Cloning: Construction of a recombinant DNA
Last lecture summary. recombinant DNA technology DNA polymerase (copy DNA), restriction endonucleases (cut DNA), ligases (join DNA) DNA cloning – vector.
Today’s Lecture Genetic mapping studies: two approaches
BioInformatics (2). Physical Mapping - I Low resolution  Megabase-scale High resolution  Kilobase-scale or better Methods for low resolution mapping.
Reading the Blueprint of Life
DNA Technology- Cloning, Libraries, and PCR 17 November, 2003 Text Chapter 20.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
HAPLOID GENOME SIZES (DNA PER HAPLOID CELL) Size rangeExample speciesEx. Size BACTERIA1-10 Mb E. coli: Mb FUNGI10-40 Mb S. cerevisiae 13 Mb INSECTS.
Mouse Genome Sequencing
Chapter 19 – Molecular Genetic Analysis and Biotechnology
Large-scale genome projects
-The methods section of the course covers chapters 21 and 22, not chapters 20 and 21 -Paper discussion on Tuesday - assignment due at the start of class.
DNA Technology Chapter 20.
How do you identify and clone a gene of interest? Shotgun approach? Is there a better way?
Applications of DNA technology
Fig Chapter 12: Genomics. Genomics: the study of whole-genome structure, organization, and function Structural genomics: the physical genome; whole.
Last lecture summary. recombinant DNA technology DNA polymerase (copy DNA), restriction endonucleases (cut DNA), ligases (join DNA) DNA cloning – vector.
Steps in a genome sequencing project Funding and sequencing strategy source of funding identified / community drive development of sequencing strategy.
Sequencing a genome. Approximate Molecular Dynamics: New Algorithms with Applications in Protein Folding Author: Qun (Marc) Ma Predicting the 3D native.
Biological Motivation for Fragment Assembly Rhys Price Jones Anne R. Haake.
A Sequenciação em Análises Clínicas Polymerase Chain Reaction.
Status report on gap closure of the human chromosome 5 BAC map Authentication of C5 BAC maps Map and sequence status Gap status and steps used to close.
SIZE SELECT SHEAR Shotgun DNA Sequencing (Technology) DNA target sample LIGATE & CLONE Vector End Reads (Mates) SEQUENCE Primer.
MAPPING GENOMES – genetic, physical & cytological maps Genetic distance (in cM) 1 centimorgan = 1 map unit, corresponding to recombination frequency of.
19.1 Techniques of Molecular Genetics Have Revolutionized Biology
Used for detection of genetic diseases, forensics, paternity, evolutionary links Based on the characteristics of mammalian DNA Eukaryotic genome 1000x.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
DNA TECHNOLOGY AND GENOMICS CHAPTER 20 P
Linkage and Mapping. Figure 4-8 For linked genes, recombinant frequencies are less than 50 percent.
GENETIC ENGINEERING CHAPTER 20
Mojavensis: Issues of Polymorphisms Chris Shaffer GEP 2009 Washington University.
Genome Analysis Assaad text book slides only Lectures by F. Assaad can be downlaoded from muenchen.de/~farhah/index.htm.
Plan A Topics? 1.Making a probiotic strain of E.coli that destroys oxalate to help treat kidney stones in collaboration with Dr. Lucent and Dr. VanWert.
Chapter 5 Sequence Assembly: Assembling the Human Genome.
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
Genome Analysis. This involves finding out the: order of the bases in the DNA location of genes parts of the DNA that controls the activity of the genes.
Human Genome Project.
Pre-genomic era: finding your own clones
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
Material for Quiz 5: Chapter 8
Today’s Lecture Genetic mapping studies: two approaches
The Human Genome Project
Genomics Genetic Analysis on a Genome-wide (global) scale
New Class Offering.
Introduction to Sequencing
Sequence the 3 billion base pairs of human
Presentation transcript:

Human Genome

Human Genome Contents: 3200 Mb Genes: 1200 Mb Genes 48 Mb Related 1152 Mb: Pseudogenes, Gene Fragments, Introns Intergenic DNA 2000 Mb Interspersed Repeats 1400 Mb Microsatellite (short tandem repeats) 90 Mb Telomeres: End Sequences Centromeres: Single Nucleotide Polymorphisms

Chromosomes Shorter than DNA they contain Histones: DNA binding proteins Two Copies held together by centromeres Telomere: Terminal region Two humans differ by 0.1%

Donors HGP: Celera: 5 subjects (three men; two women) Opportunity advertised near labs First come; First Taken 5-10 samples for every one used No link between donor and sample Celera: 5 subjects (three men; two women) One Asian; One African-American; One Hispanic; Two Caucasians Craig Venter

Basic Technology Physical Mapping Cloning Shotgun Sequencing Computational Sequence Reassembly

STS High Resolution, Rapid, Simple 100 - 500 bp Collection of overlapping fragments Each point represented multiple times in random fragments Sequence must be known Unique in chromosome under study

Physical Mapping A set of clone fragments whose position relative to each other is known Restriction Maps: Relative locations of Restriction Sites Fluorescent in situ hybridization (FISH): Marker locations mapped by hybridizing probe to chromosomes Sequence Tagged Sites (STS): Positions of short sequences mapped by PCR or hybridization analysis of genome fragments Expressed Sequence Tags (EST): short sequences from cDNA clones

Genome cut into fragments Cloned as library in vector (red)

Hybridisation mapping:1 pick clones into a grid 2 hybridise to probe 1 3 hybridise to probe 2 4 build contigs In this case, two clones hybridised to both probes and thus they are predicted to overlap. Those hybridising to only one probe are predicted to extend out to the left or right.

Fingerprinting: Digest clones and run On gel Overlap by shared bands

Assembly of Contiguous DNA Sequence Shotgun Approach Contigs: Result of joining overlapping sequences Scaffold: Result of connecting contigs by filling in gaps BAC: Bacteria artificial chromosome vector: Inserts 100 - 200 kbs

Regional mapping

Regional mapping

Regional mapping Minimal tiling path selected for sequencing.

Restriction fragment fingerprinting Molecular weight marker every 5th lane Restriction fragment fingerprinting >20 kbp ~300 bp - BAC clones are grown in 96-well format - Hind III digest - 1% agarose

Contig assembly FPC* Overlap identification by restriction pattern similarities Facilitated contig assembly *Sanger Centre C. Soderlund, I Longden and R. Mott Clone A B C D E F G * All restriction fragments within a clone selected for the tiling path must be verified by their presence in overlapping clones. : insert fragments : vector fragments

BCM- HGSC

Shotgun Sequencing I :RANDOM PHASE Sheared DNA: 1.0-2.0 kb Bac Clone: 100-200 kb Random Reads Sequencing Templates: BCM- HGSC

Shotgun Sequencing II:ASSEMBLY Low Base Quality Single Stranded Region Mis-Assembly (Inverted) Sequence Gap Consensus BCM- HGSC

Shotgun Sequencing III: FINISHING Low Base Quality Single Stranded Region Mis-Assembly (Inverted) Sequence Gap Consensus BCM- HGSC

Shotgun Sequencing III: FINISHING Single Stranded Region Mis-Assembly (Inverted) Sequence Gap Consensus BCM- HGSC

Shotgun Sequencing III: FINISHING Mis-Assembly (Inverted) Sequence Gap Consensus BCM- HGSC

Shotgun Sequencing III: FINISHING Mis-Assembly (Inverted) Consensus BCM- HGSC

Shotgun Sequencing III: FINISHING High Accuracy Sequence: < 1 error/ 10,000 bases BCM- HGSC

Whole Genome Shotgun Sequencing Sheared DNA: 1.0-2.0 kb Whole Genome: 3,000 Mb Random Reads Sequencing Templates: BCM- HGSC

Whole Genome Shotgun Sequencing:Assembly Low Base Quality Single Stranded Region Mis-Assembly (Inverted) Sequence Gap Consensus BCM- HGSC

Whole Genome Shotgun Sequencing:Assembly Sequence Gap Low Base Quality Consensus BCM- HGSC

Random fragmentation of genome produces good sampling of its sequence space. Overlaps are identified, and subassembly of sequence takes place after cloning into universal vector.

Digested into Random Fragments

Cloned into Vector

Sequenced from know ends of plasmid (vector)

Assembled into contigs Assembled into contigs. Gaps and single-stranded regions identified for further study. Targeted for new sequencing. Double-Barreled: Both Strands.

In the gaps:

Whole-Genome Shotgun Sequencing Speed-up: Assembled Correctly? Avoid up-front mapping Huge amount of computer time to identify overlaps Have to reference a map Repeats are a problem: Leave out sequence between repeats Missing Reference End Sequence means Error

HGP Isolate large fragments in BACs with framework of landmark-based physical map Sequence on clone-by-clone basis Time-Consuming subcloning of random fragments and physical mapping

Sequence Reassembly Phrap Shortest Covering Superstring Map Assembly Overlap: Finding overlapping fragments Layout: ordering fragments Consensus: Sequences from layout