The genetic code.

Slides:



Advertisements
Similar presentations
Uses of Cloned Genes sequencing reagents (eg, probes) protein production insufficient natural quantities modify/mutagenesis library screening Expression.
Advertisements

Center for Biological Sequence Analysis Prokaryotic gene finding Marie Skovgaard Ph.D. student
Protein Synthesis (making proteins)
ATG GAG GAA GAA GAT GAA GAG ATC TTA TCG TCT TCC GAT TGC GAC GAT TCC AGC GAT AGT TAC AAG GAT GAT TCT CAA GAT TCT GAA GGA GAA AAC GAT AAC CCT GAG TGC GAA.
RNA Say Hello to DNA’s little friend!. EngageEssential QuestionExplain Describe yourself to long lost uncle. How do the mechanisms of genetics and the.
Supplementary Fig.1: oligonucleotide primer sequences.
Gene Mutations Worksheet
Transcription & Translation Worksheet
1 Essential Computing for Bioinformatics Bienvenido Vélez UPR Mayaguez Lecture 5 High-level Programming with Python Part II: Container Objects Reference:
Transcription and Translation
Transcription & Translation
Today… Genome 351, 8 April 2013, Lecture 3 The information in DNA is converted to protein through an RNA intermediate (transcription) The information in.
GENE MUTATIONS aka point mutations. DNA sequence ↓ mRNA sequence ↓ Polypeptide Gene mutations which affect only one gene Transcription Translation © 2010.
IGEM Arsenic Bioremediation Possibly finished biobrick for ArsR by adding a RBS and terminator. Will send for sequencing today or Monday.
Nature and Action of the Gene
FEATURES OF GENETIC CODE AND NON SENSE CODONS
Biological Dynamics Group Central Dogma: DNA->RNA->Protein.
More on translation. How DNA codes proteins The primary structure of each protein (the sequence of amino acids in the polypeptide chains that make up.
Undifferentiated Differentiated (4 d) Supplemental Figure S1.
Supplemental Table S1 For Site Directed Mutagenesis and cloning of constructs P9GF:5’ GAC GCT ACT TCA CTA TAG ATA GGA AGT TCA TTT C 3’ P9GR:5’ GAA ATG.
Lecture 10, CS5671 Neural Network Applications Problems Input transformation Network Architectures Assessing Performance.
PART 1 - DNA REPLICATION PART 2 - TRANSCRIPTION AND TRANSLATION.
TRANSLATION: information transfer from RNA to protein the nucleotide sequence of the mRNA strand is translated into an amino acid sequence. This is accomplished.
Today… Genome 351, 8 April 2013, Lecture 3 The information in DNA is converted to protein through an RNA intermediate (transcription) The information in.
LO: SWBAT explain how protein shape is determined and differentiate between the different types of mutations. DN: h/0 protein synthesis HW: Read pp #
Cell Division and Gene Expression
Passing Genetic Notes in Class CC106 / Discussion D by John R. Finnerty.
Transcription & Translation
DNA, RNA, and Protein Synthesis
Markert Biology  Molecules of DNA are composed of long chains of _______.
1 Human chromosomes: 50->250 million base pairs. Average gene: 3000 base pairs.
©1998 Timothy G. Standish From DNA To RNA To Protein Timothy G. Standish, Ph. D.
Parts is parts…. AMINO ACID building block of proteins contain an amino or NH 2 group and a carboxyl (acid) or COOH group PEPTIDE BOND covalent bond link.
Suppl. Figure 1 APP23 + X Terc +/- Terc +/-, APP23 + X Terc +/- G1Terc -/-, APP23 + X G1Terc -/- G2Terc -/-, APP23 + X G2Terc -/- G3Terc -/-, APP23 + and.
Structure and Function of DNA DNA Replication and Protein Synthesis.
GENE EXPRESSION. Transcription 1. RNA polymerase unwinds DNA 2. RNA polymerase adds RNA nucleotides (A ↔ U, G ↔ C) 3. mRNA is formed! DNA reforms a double.
 Molecules of DNA are composed of long chains of _______.
Example 1 DNA Triplet mRNA Codon tRNA anticodon A U A T A U G C G
DNA, RNA and Protein.
The Central Dogma Biology I.
Translation PROTEIN SYNTHESIS.
Whole process Step by step- from chromosomes to proteins.
DNA, RNA and Protein Synthesis
Please turn in your homework
Protein Synthesis DNA RNA Protein.
Modelling Proteomes.
The blueprint of life; from DNA to Protein
Where is Cytochrome C? What is the role? Where does it come from?
Supplementary information Table-S1 (Xiao)
Sequence – 5’ to 3’ Tm ˚C Genome Position HV68 TMER7 Δ mt. Forward
Laboratory Encounters in Plant Genomics
GENE MUTATIONS aka point mutations © 2016 Paul Billiet ODWS.
Review Sheet: DNA, RNA & Protein Synthesis
Supplementary Figure 1 – cDNA analysis reveals that three splice site alterations generate multiple RNA isoforms. (A) c.430-1G>C (IVS 6) results in 3.
Protein Synthesis Translation.
DNA By: Mr. Kauffman.
DNA and RNA.
PROTEIN SYNTHESIS RELAY
More on translation.
Fundamentals of Protein Structure
NOTE SHEET 13 – Protein Synthesis
Warm Up 3 2/5 Can DNA leave the nucleus?
Today’s notes from the student table Something to write with
Central Dogma and the Genetic Code
Laboratory Encounters in Plant Genomics
Python.
Station 2 Protein Synethsis.
DNA, RNA, Amino Acids, Proteins, and Genes!.
DNA to proteins.
Presentation transcript:

The genetic code

The Genetic Code Each amino acid is specified by a triplet of nucleotides, known as a codon.

The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly

ATG Met The Genetic Code Single methionine codon acts as initiator TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly ATG Met Single methionine codon acts as initiator

TAA Och TGA Umb TAG Amb Three nonsense codons act as stop signals The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly TAA Och TGA Umb TAG Amb Three nonsense codons act as stop signals

Some amino acids (e.g. leucine) have up to six codons The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly Some amino acids (e.g. leucine) have up to six codons

Some amino acids (e.g. proline) have four codons The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly Some amino acids (e.g. proline) have four codons

Some amino acids (e.g. glutamine) have two codons The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly Some amino acids (e.g. glutamine) have two codons

Tryptophan and methionine have one codon each The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly

The last ACT Thr nucleotide ACC Thr in a codon ACA Thr is often The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly The last nucleotide in a codon is often irrelevant ACT Thr ACC Thr ACA Thr ACG Thr

When the last nucleotide does matter, it is usually CAT His The Genetic Code When the last nucleotide does matter, it is usually only important whether it is a purine or pyrimidine TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly CAT His CAC His CAA Gln CAG Gln

The Genetic Code A nucleotide consists of a ribose sugar bonded to phosphoric acid, with a nitrogen base of either a pyrimidine (cytosine or thymine) or purine (adenine or guanine) as a side chain. A base called Uracil replaces all thymine bases in mRNA.

ATT Ile ATC Ile ATA Ile ATG Met With methionine and tryptophan, The Genetic Code TTT Phe TCT Ser TAT Tyr TGT Cys TTC Phe TCC Ser TAC Tyr TGC Cys TTA Leu TCA Ser TAA Och TGA Umb TTG Leu TCG Ser TAG Amb TGG Trp   CTT Leu CCT Pro CAT His CGT Arg CTC Leu CCC Pro CAC His CGC Arg CTA Leu CCA Pro CAA Gln CGA Arg CTG Leu CCG Pro CAG Gln CGG Arg ATT Ile ACT Thr AAT Asn AGT Ser ATC Ile ACC Thr AAC Asn AGC Ser ATA Ile ACA Thr AAA Lys AGA Arg ATG Met ACG Thr AAG Lys AGG Arg GTT Val GCT Ala GAT Asp GGT Gly GTC Val GCC Ala GAC Asp GGC Gly GTA Val GCA Ala GAA Glu GGA Gly GTG Val GCG Ala GAG Glu GGG Gly ATT Ile ATC Ile ATA Ile ATG Met With methionine and tryptophan, the exact base matters

Recommended supplementary reading The Genetic Code Recommended supplementary reading Chatty, readable account of how Crick and Brenner solved the mystery of the genetic code. This is not a textbook. It is Francis Crick’s autobiographical answer to James Watson’s book The Double Helix, which describes the search for the structure of DNA, and in which Watson notes the dictionary definition of a crick as “a pain in the neck”. Crick, F. What Mad Pursuit? 1989 (James Cameron-Gifford Library Q143.C7, George Green Library QH506.CRI)

How was the code deciphered? Most of the work to show the general form of the genetic code was done by Francis Crick and Sidney Brenner. Brenner Crick

How was the code deciphered? They started off with George Gamow’s arguments based on simple school arithmetic to show that the code was probably a triplet code. Crick Brenner

Why must the code be in triplets? There are only four nucleotides, therefore a singlet code (i.e. a code in which each nucleotide specifies an amino acid) could only encode four amino acids. However, there are twenty amino acids found in most proteins. Therefore, the code cannot be singlet in nature.

Why must the code be in triplets? G A T C G A T C If the code were doublet, then there would be four possible nucleotides in the first position and four in the second. This gives: 4 4 = 42 = 16 codons Still too few to encode 20 amino acids.

Why must the code be in triplets? If the code were triplet, then there would be four possible nucleotides in the first position, four in the second and four in the third. This gives: 4 4 4 = 43 = 64 permutations This is too many to encode 20 amino acids but the code could work if either some permutations are not used or if more than one encodes each amino acid (or both).

Why must the code be in triplets? Type of code Number of permutations Singlet 41 = 4 Doublet 42 = 16 Triplet 43 = 64 Quadruplet 44 = 256 Pentuplet 45 = 1024 Only the triplet code really looks feasible

How was the code deciphered? There are also different ways that the code can be read: It can be punctuated or unpunctuated. If it is unpunctuated it can be overlapping or non-overlapping.

GTCACCCATGGAGGTATCT An overlapping code 1 2 3 4 Once the first codon is set (e.g. GTC), the next one can only be one of four (TCA, TCG, TCT or TCC). This is a disadvantage.

A non-overlapping unpunctuated code GTCACCCATGGAGGTATCT 1 2 3 4 5 1 2 3 4 5 There are three ways to read this type of code, referred to as “reading frames”. This makes this type of code non-ideal.

A non-overlapping punctuated code GTCACCCATGGAGGTATCT 1 2 3 4 5 Here, one nucleotide (A) is used as a punctuation mark. This code has several advantages: 1. The reading frame is set by the punctuation. 2. Because only three nucleotides are used in codons, the number of coding permutations available is 33 = 27 amino acids

Is the code really overlapping? GTCACCCATGGAGGTATCT 1 2 3 4 Once the first amino acid is set, the next one can only be one of four. Therefore, certain amino acids could never be next to each other. This can be tested by experimentation

Is the code really overlapping? Francis Crick and Sidney Brenner did “nearest neighbour” analysis on real proteins. They found that any amino acid could be next to any other one. Therefore, the code cannot be overlapping.

Is the code punctuated? Francis Crick and Sidney Brenner went on to analyse a particular type of mutant that is induced by intercalating agents (e.g. acridine dye). Intercalating agents will insert themselves between the base pairs of DNA. These can stretch the base pairs apart during replication and cause an extra nucleotide to be inserted or one to be left out.

Is the code punctuated? They found a gene (the rII gene) that has special properties. It can tolerate several wrong codons in the early part of the coding sequence and still make an active protein as long as the later part of the coding sequence is correct.

Is the code punctuated? The mutations caused by intercalating agents fall into two classes, 1 and 2. Both cause a mutant phenotype in the rII gene. 1 2 Mutant phenotype

Is the code punctuated? Double mutants (two mutations in one gene) also cause a mutant phenotype in the rII gene. 1 2 Mutant phenotype

When the double mutant has two different kinds of mutation, they suppress each other and you get a non-mutant phenotype in the rII gene. 1 2 Wild type (non-mutant) phenotype

Remember that the mutations caused by acridine dyes result from the loss or gain of one nucleotide. They cause the reading frame to change and are called frame-shift mutations. The fact that they can arise means that there must be reading frames and that means that the code in unpunctuated.

GTCACCCATGGAGGTATCT GTCTACCCATGGAGGTATC How does this work? 1 2 3 4 5 Original code GTCTACCCATGGAGGTATC 1 2 3 4 5 Code with frame shift mutation All codons after the inserted nucleotide are wrong (some may be stop codons).

Two wrongs can make a right GTCACCCATGGAGGTATCT 1 2 3 4 5 Original code GTCTACCATGGAGGTATCT 1 2 3 4 5 Code with different frame shift mutations After second mutation, codons back in original frame.

Is the code triplet? Crick and Brenner went on to show that three frame shift mutations of the first type (insertion) or three of the second type (deletion) in the rII gene could also give a wild-type phenotype. This could only happen if the code was triplet. If the code was quadruplet then you would have to add or delete four nucleotides to reset the reading frame.

Three wrongs can make a right GTCACCCATGGAGGTATCT 1 2 3 4 5 Original code GTCTACTCACATGGAGGTA 1 2 3 4 5 6 Code with three similar frame shift mutations An extra codon is inserted and a few codons are wrong, then all of the rest are OK.

How was the code “cracked”? We must first consider how genetic information is used by the cell. In higher organisms (eukaryotes) the DNA is in the nucleus and the protein is made in the cytoplasm  there must be an intermediate. Messenger RNA (mRNA) moves from the nucleus to the cytoplasm and carries the genetic code.

How was the code “cracked”? The Central Dogma Francis Crick proposed the idea that genetic information moves in one direction and called this the central dogma of molecular genetics. replication DNA RNA Protein transcription translation

How was the code “cracked”? Cells can be broken open and the elements needed for protein synthesis can be isolated. When RNA is added, the protein encoded by that RNA is made. Artificial RNA can be made in the test tube and added to this system.

How was the code “cracked”? Cells can be broken open and the elements needed for protein synthesis can be isolated. When RNA is added, the protein encoded by that RNA is made. http://profiles.nlm.nih.gov/JJ/Views/Exhibit/documents/codeoflife.html Artificial RNA can be made in the test tube and added to this system. This work was done by Marshall Nirenberg and Har Gobind Khorana http://www.ucs.mun.ca/~c64dcp/Khorana.html Nirenberg Khorana

How was the code “cracked”? Nirenberg made simple RNA with the sequence: UUUUUUUUUUUUUUUUUUUUU When he put this into the test tube, he found that the protein made was a string of one type of amino acid, phenylalanine, joined together. Therefore the codon UUU (or TTT in DNA), encodes phenylalanine.

How was the code “cracked”? Similarly, RNA with the sequence: CCCCCCCCCCCCCCCCCCCCCC encodes a protein that is all proline. AAAAAAAAAAAAAAAAAAAA encodes a protein that is all lysine.

How was the code “cracked”? Khorana made less simple RNA with the sequence: UGUGUGUGUGUGUGUGUGUGU When he put this into the test tube, he found that the protein made was a string of two alternating amino acids, valine and cysteine. TGT = Cys GTG = Val

How was the code “cracked”? By successively more sophisticated experiments of this type, the amino acids specified by most of the 61 amino acid encoding triplets were identified. Final confirmation required experiments with another type of RNA, transfer RNA (tRNA), which is the subject of the next lecture.