DOT PLOT Daniel Svozil. Software choice source: Bioinformatics for Dummies.

Slides:



Advertisements
Similar presentations
DNA Technology & Gene Mapping Biotechnology has led to many advances in science and medicine including the creation of DNA clones via recombinant clones,
Advertisements

Recombinant DNA technology
BLAST Sequence alignment, E-value & Extreme value distribution.
Sequence Comparison Intragenic - self to self. -find internal repeating units. Intergenic -compare two different sequences. Dotplot - visual alignment.
Mutagenesis Methods Lily Peterson April 5 th, 2010.
Bioinformatics Unit 1: Data Bases and Alignments Lecture 3: “Homology” Searches and Sequence Alignments (cont.) The Mechanics of Alignments.
©2003/04 Alessandro Bogliolo Primer design. ©2003/04 Alessandro Bogliolo Outline 1.Polymerase Chain Reaction 2.Primer design.
© Wiley Publishing All Rights Reserved. Working with a Single DNA Sequence.
Whole genome alignments Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas
Assessment of sequence alignment Lecture Introduction The Dot plot Matrix visualisation matching tool: – Basics of Dot plot – Examples of Dot plot.
Project I Verifying the restriction map of a DNA insert.
Interdisciplinary Center for Biotechnology Research
Polymerase Chain Reaction
PCR Primer Design Guidelines
© Wiley Publishing All Rights Reserved.
Chromosome 16: PV92 PCR. What is PCR? DNA replication gone crazy in a tube!DNA replication gone crazy in a tube! Makes many copies of target sequence.
Objective 2: TSWBAT describe the basic process of genetic engineering and the applications of it.
IN THE NAME OF GOD. PCR Primer Design Lecturer: Dr. Farkhondeh Poursina.
PCR- Polymerase chain reaction
PCR optimization. Primers – design must be good but influenced by template sequence Quality of template DNA/impurities Components of PCR may need to be.
Molecular Biology basics. Restriction enzymes Natural enzymes made by bacteria to protect against viral and other infections Each restriction enzyme recognizes.
Assessment of sequence alignment Lecture Introduction The Dot plot Matrix visualisation matching tool: – Basics of Dot plot – Examples of Dot plot.
Recombinant DNA Technology……….. BTEC3301. DNA Libraries How do you identify the gene of interest and clone only the DNA sequence you are interested? Read.
Genomic walking (1) To start, you need: -the DNA sequence of a small region of the chromosome -An adaptor: a small piece of DNA, nucleotides long.
Bioinformatics 生物信息学理论和实践 唐继军 北京林业大学计算生物学中心
1 Genetics Faculty of Agriculture Instructor: Dr. Jihad Abdallah Topic 13:Recombinant DNA Technology.
Recombinant DNA I Basics of molecular cloning Polymerase chain reaction cDNA clones and screening.
Tools of Bioinformatics
By: Kelly and Kathryn PCR. What exactly is PCR? PCR stands for “polymerase chain reaction” and is a lab technique used to clone segments of DNA. Two main.
Technological Solutions. In 1977 Sanger et al. were able to work out the complete nucleotide sequence in a virus – (Phage 0X174) This breakthrough allowed.
DNA Cloning and PCR.
Module 1 Section 1.3 DNA Technology
Dave Palmer Primer Design Dave Palmer
13-1 Changing the Living World
PCR provides a forensics tool for identifying colonies
CS5263 Bioinformatics Lecture 20 Practical issues in motif finding Final project.
FQ. DNA Replication and Repair.
Basic terms:  Similarity - measurable quantity. Similarity- applied to proteins using concept of conservative substitutions Similarity- applied to proteins.
BLAST: Basic Local Alignment Search Tool Altschul et al. J. Mol Bio CS 466 Saurabh Sinha.
Basic Local Alignment Search Tool BLAST Why Use BLAST?
PCR is used in; Cloning into plasmid vectors DNA sequencing Genetic screening DNA based phylogeny Functional analysis of genes Identification of DNA fingerprints.
PPT-1. Experiment Objective: The objective of this experiment is to amplify a DNA fragment by Polymerase Chain Reaction (PCR) and to clone the amplified.
Human Genomics. Writing in RED indicates the SQA outcomes. Writing in BLACK explains these outcomes in depth.
The Polymerase Chain Reaction (DNA Amplification)
Chapter 10: Genetic Engineering- A Revolution in Molecular Biology.
Polymerase Chain Reaction A process used to artificially multiply a chosen piece of genetic material. May also be known as DNA amplification. One strand.
Sequence Alignment.
Chapter 20 DNA Technology and Genomics. Biotechnology is the manipulation of organisms or their components to make useful products. Recombinant DNA is.
710.LC GRADUATE MOLECULAR BIOLOGY 10/31/2011. Lecture 4 Competency Test.
Lecturer: Bahiya Osrah Background PCR (Polymerase Chain Reaction) is a molecular biological technique that is used to amplify specific.
D. Darban, Ph.D Department of Microbiology School of Medicine Alborz University of Medical Sciences 1 Probe and Primer Design.
PCR Polymerase chain reaction. PCR is a method of amplifying (=copy) a target sequence of DNA.
Polymerase Chain Reaction (PCR). DNA DNA is a nucleic acid that is composed of two complementary nucleotide building block chains. The nucleotides are.
Polymerase Chain Reaction
Part 3 Gene Technology & Medicine
Success criteria - PCR By the end of this lesson we will be know:
Primer Design and Sequencing
PCR TECHNIQUE
Molecular Cloning.
Material for Quiz 5: Chapter 8
Chapter 14 Bioinformatics—the study of a genome
Step 1: amplification and cloning procedures
Introduction to Bioinformatics II
Basic Local Alignment Search Tool
Simulating Genetic Screening
Basic Local Alignment Search Tool (BLAST)
Molecular Cloning.
KEY CONCEPT Biotechnology relies on cutting DNA at specific places.
Sequence alignment, E-value & Extreme value distribution
Presentation transcript:

DOT PLOT Daniel Svozil

Software choice source: Bioinformatics for Dummies

Dotlet Learn by example – use the sequence from the Repeated domains

In this case, the darker the pixel, the lower the score. There will be a large number of pixels with low scores and only a few ones with high scores. Tune the grayscale in order to make the background noise (low scores) disappear and the similar regions stand out more clearly. To do this, use the histogram window. This represents the frequency of each score, over all the pixels, on linear (blue) and logarithmic (purple) scales. lowest possible score on the left and the highest on the right If the sequence has some similarity, there will be a smaller peak of higher scores. Semi-logarithmic plot makes it even more visible.

With the scrollbars below and above the histogram, respectively, bring the lower threshold just past the first peak, and the higher threshold just past the second peak. Now, the background noise has disappeared from the dots window, and the similar regions stand out more clearly.

Well matching residues – blue. The cursor can also be moved with the keyboard with the arrow keys, and with ' ' (down right), '[' (up right), and ']' (down left). Now play with all sequences in Dotlet exampes section, read the comment and try to understand:

Getting the right window size Long windows = clean plots. The size of a window should be within the same range as the size of the elements you’re looking for. For instance, if you’re looking for conserved domains in proteins, a size of 50 amino acids or higher is appropriate. Shorter windows are more sensitive but bring some noise with them. Start with a large window and narrow it a little until the signal you’re looking for appears.

More of Dotlet What is the UniProtKB database? What are the UniProtKB/Swiss-Prot and UniProtKB/TrEMBL? What is the difference between them? Using Dotlet, compare following two Uniprot sequences: P05049 (1 st sequence) and P08246 (2 nd sequence). Are these sequences homologous? What is the function of P05049? P05049 is a serine protease. Would you run a wet lab experiment to check the protease activity of P08246? You should check if these two sequences are homologous in the serine protease region. Do you see some homologous regions on the dotplot?

Working with a single DNA sequence

Removing vector sequences Contamination from your own vector sequence (as a responsible scientist, you’re expected to have this information) – you may search for the vector sequence you expect Cross-contamination by somebody else’s vector – search not only for the sequence you expect, but also for other possible vector sequences. Before working with your DNA sequence, you should always clean it with e.g. NCBI VecScreen Basically, it performs a blastn search against UniVec – database of vector sequences

VecScreen Sources of contamination Try sequence1.txt "No significant similarity found“ - a good news, indicates that the sequence does not contain any vector contamination sequences. sequence2.txt

the query sequence matches three vector sequences Let’s say we know the vector used for cloning: pCR2.1-TOPO. Which sequence would you remove? Remove this sequence and check the results on the cleaned sequence.

Clean sequence3.txt. What is the result? Such a sequence is generally considered as the esult of a chimeric clone – i.e. clone consisting two sequences. In this case, throwing it away is the safest thing to do! In sequence4.txt is a sequence you cloned in the vector pUC19. Is it contaminated? How would you clean it? VecScreen reports a strong match with the lactose operon genes from E. coli. Not from pUC19! However, this is ok as most commercial vectors are derived from the same initial natural plasmid and E. coli constructs. Their sequences are identical, and UniVec matches are reported in the ordedr they appear in the database.

Restriction map It is possible to cut DNA sequences using restriction enzymes. Each type of restriction enzyme recognizes and cuts a different sequence: EcoR1: GAATTC BamH1: GGATCC There are more than 900 different restriction enzymes, each with a different specificity The restriction map is the list of all potential cleavage sites in a DNA molecule

Restriction map To compute a restriction map is not that difficult. All you need to do is to look for exact matches of a given restriction-enzyme site within your sequence. Enzymes and sites are in the REBASE database Nebcutter - Webcutter - VIRS - Try to construct a restriction map of the sequence5.fasta.

PCR primer design DNA polymerase needs a template can only extend an existing piece of DNA (primer) always moves in the 5’ → 3’ direction Steps of PCR denaturation – 94°C annealing – 60°C extension – 72°C Heat Cool

PCR primer design DNA polymerase needs a template can only extend an existing piece of DNA (primer) always moves in the 5’ → 3’ direction Steps of PCR denaturation – 94°C annealing – 60°C extension – 72°C

PCR primer design DNA polymerase needs a template can only extend an existing piece of DNA (primer) always moves in the 5’ → 3’ direction Steps of PCR denaturation – 94°C annealing – 60°C extension – 72°C

Primers

GC content Primers with a 40-60% GC content ensure stable binding of primer/template. The presence of G or C bases at the 3′ end of primers (GC clamp) helps to promote correct binding at the 3′ end due to the stronger hydrogen bonding of G and C bases. However, strings of G and of C can form internal, non-Watson-Crick base pairs that disrupt stable primer binding. Generally, sequences containing more than three repeats of G or of C in sequence should be avoided in the first five bases from the 3′ end of the primer. A short run of G’s at or near the 5′ end of a primer will not disrupt stable binding because the 5′ positioning does not lead to involvement in disruptive secondary structures. It is best to select primers with a random base distribution.

Primers no secondary structures Presence of the primer secondary structures produced by intermolecular or intramolecular interactions can lead to poor or no yield of the product. e,g, hairpins, self dimers, cross dimers It is desirable to design specific primer pairs which do not assume secondary structures during the reaction. AutoDimer - screens primers for primer-dimer and hairpins erProgramHomepage.htm erProgramHomepage.htm source:

PCR Primer Design Pick some sequence from NCBI nucleotide (<1000 bp) and play with the primer design tool Primer3 – from After you’ve got your primers, you must verify they will not hybridize anywhere except you intend them to hybridize. e.g. primer sequences are not outside the gene you’re interested in or primers do not resemble a frequent repeats in DNA Technique for avoiding this problem: BLAST searches against the vector sequences, the genome sequences, their most common repeats.

PCR Primer Design PrimerBLAST at NCBI It uses Primer3 to design PCR primers and then submits them to BLAST search against user-selected database. The BLAST results are then automatically analyzed to avoid primer pairs that can cause amplification of targets other than the input template.