Sequencing a genome and Basic Sequence Alignment Lecture 10 1Global Sequence.

Slides:



Advertisements
Similar presentations
Prokaryotic Gene Regulation:
Advertisements

Frontiers of Genetics Chapter 13.
Regulation of eukaryotic gene sequence expression Lecture 6.
DNA Technology & Gene Mapping Biotechnology has led to many advances in science and medicine including the creation of DNA clones via recombinant clones,
Measuring the degree of similarity: PAM and blosum Matrix
Phylogenetic Trees Understand the history and diversity of life. Systematics. –Study of biological diversity in evolutionary context. –Phylogeny is evolutionary.
DNA sequences alignment measurement
Bioinformatics Unit 1: Data Bases and Alignments Lecture 2: “Homology” Searches and Sequence Alignments.
Finding Eukaryotic Open reading frames.
Sequence similarity.
Mutagenesis Methods Lily Peterson April 5 th, 2010.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
General Microbiology (Micr300) Lecture 11 Biotechnology (Text Chapters: ; )
1 Review Describe the process scientists use to copy DNA Use Analogies How is genetic engineering like computer programming 2 Review What is a transgenic.
Sequencing a genome and Basic Sequence Alignment Lecture 8 1Global Sequence.
Finding prokaryotic genes and non intronic eukaryotic genes
Sequencing a genome and Basic Sequence Alignment
Regulation of eukaryotic gene sequence expression
Assessment of sequence alignment Lecture Introduction The Dot plot Matrix visualisation matching tool: – Basics of Dot plot – Examples of Dot plot.
Concept 20.1: DNA cloning yields multiple copies of a gene or other DNA segment To work directly with specific genes, scientists prepare well-defined segments.
Chapter 2 Genes Encode RNAs and Polypeptides
Fig 11-1 Chapter 11: recombinant DNA and related techniques.
Objective 2: TSWBAT describe the basic process of genetic engineering and the applications of it.
Presentation on genome sequencing. Genome: the complete set of gene of an organism Genome annotation: the process by which the genes, control sequences.
Assessment of sequence alignment Lecture Introduction The Dot plot Matrix visualisation matching tool: – Basics of Dot plot – Examples of Dot plot.
Multiple Sequence Alignment May 12, 2009 Announcements Quiz #2 return (average 30) Hand in homework #7 Learning objectives-Understand ClustalW Homework#8-Due.
What is comparative genomics? Analyzing & comparing genetic material from different species to study evolution, gene function, and inherited disease Understand.
1 Genetics Faculty of Agriculture Instructor: Dr. Jihad Abdallah Topic 13:Recombinant DNA Technology.
CISC667, S07, Lec5, Liao CISC 667 Intro to Bioinformatics (Spring 2007) Pairwise sequence alignment Needleman-Wunsch (global alignment)
Chapter 15: Genetic Engineering
20.1 Structural Genomics Determines the DNA Sequences of Entire Genomes The ultimate goal of genomic research: determining the ordered nucleotide sequences.
DNA Technology. 1.Isolation – of the DNA containing the required gene 2.Insertion – of the DNA into a vector 3.Transformation – Transfer of DNA into a.
DNA alphabet DNA is the principal constituent of the genome. It may be regarded as a complex set of instructions for creating an organism. Four different.
Sequencing a genome and Basic Sequence Alignment
Chapter 13. The Central Dogma of Biology: RNA Structure: 1. It is a nucleic acid. 2. It is made of monomers called nucleotides 3. There are two differences.
Chapter 21 Eukaryotic Genome Sequences
Review from last week. The Making of a Plasmid Plasmid: - a small circular piece of extra-chromosomal bacterial DNA, able to replicate - bacteria exchange.
Recombinant DNA Technology and Genomics A.Overview: B.Creating a DNA Library C.Recover the clone of interest D.Analyzing/characterizing the DNA - create.
Sequence Alignment Csc 487/687 Computing for bioinformatics.
Chapter 17 From Gene to Protein. Gene Expression DNA leads to specific traits by synthesizing proteins Gene expression – the process by which DNA directs.
Highlights of DNA Technology. Cloning technology has many applications: Many copies of the gene are made Protein products can be produced.
DNA and the genetic code. Write out the following DNA organic bases and match them with their specific and complementary base pairs: A G T A C C C T G.
Biocomputation: Comparative Genomics Tanya Talkar Lolly Kruse Colleen O’Rourke.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
Concept 20.1: DNA cloning yields multiple copies of a gene or other DNA segment To work directly with specific genes, scientists prepare well-defined segments.
Genetic Engineering Genetic engineering is also referred to as recombinant DNA technology – new combinations of genetic material are produced by artificially.
Chapter 20: DNA Technology and Genomics - Lots of different techniques - Many used in combination with each other - Uses information from every chapter.
Chapter 3 The Interrupted Gene.
Genomics Part 1. Human Genome Project  G oal is to identify the DNA sequence of every gene in humans Genome  all the DNA in one cell of an organism.
Chapter 20 DNA Technology and Genomics. Biotechnology is the manipulation of organisms or their components to make useful products. Recombinant DNA is.
1 DNA and Biotechnology. 2 Outline DNA Structure and Function DNA Replication RNA Structure and Function – Types of RNA Gene Expression – Transcription.
D.N.A Describe how you would go about genetically engineering a bacterium to produce human epidermal growth factor (EGF), a protein used in treating burns.
Finding genes in the genome
Gene Technologies and Human ApplicationsSection 3 Section 3: Gene Technologies in Detail Preview Bellringer Key Ideas Basic Tools for Genetic Manipulation.
15 March 2016 Today’s Title: CW: Introduction to genetic engineering Learning Question: what is genetic engineering?
Recombinant DNA Techniques Playing god for fun and profit! Or What happens when a mummy test tube and a daddy test tube love each other very much.
DNA sequences alignment measurement Lecture 13. Introduction Measurement of “strength” alignment Nucleic acid and amino acid substitutions Measurement.
DNA Replication/Protein Synthesis Topic 2.7. DNA replication DNA molecule is unzipped down the middle by an enzyme Helicase Helicase: straightens out.
Techniques for Protein Sequence Alignment and Database Searching G P S Raghava Scientist & Head Bioinformatics Centre, Institute of Microbial Technology,
Title: Studying whole genomes Homework: learning package 14 for Thursday 21 June 2016.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
Section 3: Gene Technologies in Detail
Cloning Overview DNA can be cloned into bacterial plasmids for research or commercial applications. The recombinant plasmids can be used as a source of.
Chapter 20: DNA Technology and Genomics
Protein Synthesis Genetics.
Chapter 14 Bioinformatics—the study of a genome
What is RNA? Do Now: What is RNA made of?
DNA and the Genome Key Area 8a Genomic Sequencing.
Unit Genomic sequencing
Chapter 20: DNA Technology and Genomics
Presentation transcript:

Sequencing a genome and Basic Sequence Alignment Lecture 10 1Global Sequence

Annotation of sequences Once the gene sequence’s have been determined then the data must be annotated: (Klug 2010) – Identify regulatory regions – Other sequences of interest: exons/ introns, coding sequences (cds), polyA signal – In protein annotation there are mRNA sequences – Other organisms where the DNA sequence/ AA sequence is to found – Journals/Reference to where data came from. – Refer to NCBI DNA sequence AA sequence for Leukocyte Elastase Gene for more detailed annotation recordDNA sequenceAA sequence 2Global Sequence

DNA recombinant technology DNA recombinant technology is essential to produce DNA sequences that can be used to analyse chapter 17 (klug 2010): – sequences in genes, – regulatory sequences – large DNA strands. Some of the important terms in this field, found in include: – Cloning DNA: making copies of DNA. – Restriction enzymes: cuts DNA at specific sites : vary in size from sites of 4bp to 8bp or longer; 4 bp cuts into fragments of 256 bp in size ; of 8 b.p 4 8 (64,000) b.p. ; e.g. EcoR1 site: GAATTC – Restriction maps: map of restriction enzyme sites (refer to figure 17.5 klug) 3Global Sequence

DNA recombinant technology – Plasmid Vectors: help insert the DNA fragment that needs cloned into a host cell. Inside the host cell both the vector and the DNA fragment are cloned. In the example a DNA fragment is inserted into the plasmid. The plasmid then inserted into the cost cells and produces many copies of itself. – The LacZ gene is used as a marker. If markers is disrupted then it means that the host cell has a plasmid vector (recombinant plasmid) in it

Sequencing DNA strands dATP is an adenine base nucleic acid ddATP is a modified adenine base which has a coloured florescent marker attached. In has the added property of terminating the elongation if chosen instead of dATP During the process all possible lengths of chain are produced. Lengths are separated based on weight and analysed to give The complementary sequence of the template strand. [ note the sequences in part 1 and part4]

GENOMES: Sequencing and assembling Plasmids and other recombinant DNA technology only produce relatively small DNA segments. To sequence an organism’ s entire genome : – Must use the “shot gun” approach Shot gun approach requires two genetic technologies and one computational technique: – Restriction enzymes: cut up denatured (ss)DNA – Fast DNA sequencing of fragments (sequences) – Combining overlapping contiguous DNA sequences 6Global Sequence

Overlapping Contiguous Fragments Adapted from [1] p Global Sequence

Overlapping Fragments: example Original sentence: This is DT228 bioinformatics course. Cut 2 copies of the sentence into fragmentes This is course DT228 bioinformatics This is DT228 Bioinformatics course 8Global Sequence

Overlapping Fragments: example Check for overlaps (prefix and suffix) This is This is DT228 DT228 bioinformatics bioinformatics course course Result of alignment of fragments is: – This is DT228 bioinformatics course 9Global Sequence

Overlapping Fragments: example Reconstruct the sentence from the following 2 sentences (same as the original) which have been randomly fragmented. – molto questa lingue. – mondo. ha dodici anni e – sono nato nel posto – inglese e mi piace – megliore nel – parlo io un pochino – – nel mondo. Ha dodici anni e parlo – piace molto questa lingue. – nel posto meglioare – un pochino inglese e mi – sono nato 10Global Sequence

Example of Contigs alignment: The above diagram shows an DNA example of how overlapping contiguous sequences are aligned. However it is an oversimplification as actual segments are many times larger than shown and overlapping does not always happen at then end of ends of segments. Adapted from: Klug 7 th p Global Sequence

Pair-wise sequence Alignment: A simple global match The assignment of residues-residue corresponds: A Global match: align all of one sequence with another. The figure shows to sequences of nucleic acids. Some have the same base (nucleic acid ) and so there is a match at this position between the strands. This is represented by a vertical line and a blue highlight. Others do not match and have no vertical line and blue highlight: these are unmatched pairs and correspond to substitutions. In DNA nucleic acids transitions A  > G and T  > C are the most common than transversions This figure adapted from Klug is a comparison of a “leptin gene” from a dog (top) and a homo sapiens (bottom) This technique of global alignment matching is important in the area of: Comparative genomics, homologous gene analysis and the development of evolutionary trees. 12Global Sequence

Global alignment: a small different size Example 1 I am from Cork I am not from Cork **** ( 4 matches out of 18; based on length of bottom string) Global Sequence13 Example 2 I am ---- from Cork I am not from Cork **** ********** (14 matches out of 18; based on length of bottom string) A Global alignment between sequence of difference sizes requires the inclusions of gaps [dash] in order to optimise the matching process. Example 1 with not inclusion of gaps produces a much lower number of matches than example 2 which includes dashes. the assumption is that the both strands are homologous [ have a common ancestor; were the same sequence] but are now different through a series substitution [mismatch], Deletions /insertions [gaps]

Example of alignment Nucleic acids Global Sequence14 tAdapted from Klug p. 384

Sequence alignment: Amino Acids Global Sequence15 “*” match; “-” gap; “:” conserved substitution “.”semi-conserved substitution. In DNA the sequence is most important in relation to its functionality however in proteins its final structure is most significant; while it relates to the sequence but also to: The property of amino acids plays a significant part in the final configuration (refer to lecture 3 slide 5). Amino Acids with similar properties /structure will have overlapping “effects” on the final 3-D structure of the protein. Therefore the type of substitutions must be extended to included this and so you can have conserved and semi-conserved substitutions

Sequence Alignment: pairwise : a local Match A local Match : find a region in one sequence that matches a region of another overhangs at the end are not treated as gaps A local match is generally used if there is a larger difference in size between the sequences In example – global Scores is 9 out of 13; – Local score is 8 out of 10 ( no overhangs…) Example Global Sequence16

Sequence Alignment: pairwise : a motif match Motif (small region) match A motif match can find: a “perfect match between a small sequence and one or more regions in a larger sequence. This plays an important part in looking for repeating sequences [tandem repeats], and other “relatively small” regions that may be conserved between organisms The motif match like the others of course does not have to be “perfect” example not I am not from Cork *** Global Sequence17

Multiple alignment: many sequences Similar to the previous except you look for areas conserved between all the sequences in the alignment: My name is denis and I am from cork My name is kieran and I am not from cork We name the dog “canis familiaris” name Programs like clustaW are used to align multiple sequences which can be used to check for conserved motifs/sequences in many species: often for phylogenetic relationshipsclustaW Global Sequence18

Exam Questions Explain, using suitable examples, the “shot- gun” genomic alignment approach and why it has become the dominant method for analysing genomes. DNA Sequences alignment matching can take a number of forms; describe the different types of matching. Explain how the different types of point mutations are incorporated into the sequence alignment matching process. Discuss why the inclusions of “gaps” in to a matching alignment increase the degree of matching and explain what these gaps mean and what it means for the aligned sequences.