Codon usage bias Ref: Chapter 9 Xuhua Xia dambe.bio.uottawa.ca.

Slides:



Advertisements
Similar presentations
. Class 1: Introduction. The Tree of Life Source: Alberts et al.
Advertisements

Chapter 4 Transcription and Translation. The Central Dogma.
© 2006 W.W. Norton & Company, Inc. DISCOVER BIOLOGY 3/e
The origins & evolution of genome complexity Seth Donoughe Lynch & Conery (2003)
TRANSLATION The process of converting the information stored in mRNA into a protein is called translation mRNA carries information from a gene to a structure.
The phylogenetics project data revealed! October 4, 2010 OEB 192.
GOOD MORNING! HAPPY TUESDAY!. WARM UP FOR TUES 10/28/14 ADV BIO 1. What is an anticodon and what is its function? 2. How do ribosomes assist protein synthesis?
A genome-wide perspective on translation of proteins Dec 2012 Regulatory Genomics Lecturer: Prof. Yitzhak Pilpel.
From Gene to Protein. Gene Expression Process by which DNA directs the synthesis of a protein 2 stages transcription translation All organisms One gene.
Chapter 13.2 (Pgs ): Ribosomes and Protein Synthesis
Biology 1060 Chapter 17 From Gene to Protein. Genetic Information Important: Fig Describe how genes control phenotype –E.g., explain dwarfism in.
Signposts for translation initiation: An illustration of formulating a research project Xuhua Xia
Xuhua Xia Signposts for translation initiation: An illustration of formulating a research project Xuhua Xia.
Gene Expression and Gene Regulation. The Link between Genes and Proteins At the beginning of the 20 th century, Garrod proposed: – Genetic disorders such.
Companion site for Biotechnology. by Clark Copyright © 2009 by Academic Press. All rights reserved. 1 Expression of Eukaryotic Proteins A bacterial Promoter/terminator.
1 TRANSCRIPTION AND TRANSLATION. 2 Central Dogma of Gene Expression.
Do Now: On the “Modeling DNA” handout, determine the complimentary DNA sequence and the mRNA sequence by using the sequence given.
Replication Transcription Translation
WARM UP What is the Central Dogma? What are the 3 differences between RNA and DNA? Where does transcription occur?
Chapter 17: From Gene to Protein Objectives 1. To understand the central dogma 2.To understand the process of transcription 3.To understand the purpose.
Chapter 7 Gene Expression and Control Part 2. Transcription: DNA to RNA  The same base-pairing rules that govern DNA replication also govern transcription.
Genes and How They Work Chapter The Nature of Genes information flows in one direction: DNA (gene)RNAprotein TranscriptionTranslation.
Chapter 10 Opener. Figure 10.1 Metabolic Diseases and Enzymes.
Codon usage bias Ref: Chapter 9
CHAPTER 9 Proteins and Their Synthesis CHAPTER 9 Proteins and Their Synthesis Copyright 2008 © W H Freeman and Company.
The Genetic Code. The DNA that makes up the human genome can be subdivided into information bytes called genes. Each gene encodes a unique protein that.
Translation 7.3. Translation the information coded in mRNA is translated to a polypeptide chain.
1 Codon Usage. 2 Discovering the codon bias 3 In the year 1980 Four researchers from Lyon analyzed ALL published mRNA sequences of more than about 50.
The Genetic Code! Theme: Organization. Cell as a Protein Factory.
Do Now: On the “Modeling DNA” handout, determine the complimentary DNA sequence and the mRNA sequence by using the sequence given.
Chapter 12-3: RNA & Protein Synthesis Essential Questions:  What are 3 types of RNA?  What is the function of 3 types of RNA?  What happens during transcription?
The Central Dogma of Molecular Biology DNA  RNA  Protein  Trait.
Part 5 Translation Translation: mRNA and the Genetic Code Triplet code- codon 3 sequential Nitrogen bases in mRNA 64 different mRNA codons.
From Gene to Protein Transcription and Translation.
CHAPTER 13 RNA & Protein Synthesis. GENE EXPRESSION  When a cell “reads” the DNA, it doesn’t directly say for example blue eyes.  What the DNA actually.
Chapter 17 From Gene to Protein. One gene, one protein Chapter 17 From Gene to Protein.
1. 2 Discovering the codon bias 3 Il codice genetico è DEGENERATO.
Discovering the codon bias
Part 5 Translation.
GENETICS.
Protein Synthesis 3.A.1.c. Genetic information flows from a sequence of nucleotides in a gene sequence of amino acids in a protein.
Transcription.
12.3 Assessment Answers.
Ch 10: Protein Synthesis DNA to RNA to Proteins
Basic concepts in molecular evolution
DNA vs RNA.
13.3 RNA & Gene Expression I. An Overview of Gene _____________ A. RNA
13.3 RNA & Gene Expression I. An Overview of Gene Expression A. RNA
Transcription and Translation
Gene Expression: from DNA to protein
RNA and Protein Synthesis
RNA and Protein Synthesis
Translation elongation, amino acid usage, and codon usage indices
Translation initiation and co-evolution between SD and aSD in bacteria
From Gene to Protein Chapter 17.
From Gene to Protein.
GENE REGULATION prokaryotic cells – have about 2,000 genes
Protein Synthesis Translation.
Protein Synthesis Translation.
Lecture4 Synthesis of Proteins. Lecture4 Synthesis of Proteins.
Gene Expression: From Gene to Protein
in the Gene’s Polypeptide Product
RNA and Protein Synthesis
Unit 7 Part 2 Notes: From Gene to Protein
Gene Expression aka Protein Synthesis.
Relationship between Genotype and Phenotype
AS Level Paper 1 and 2. A2 Level Paper 1 and 3 - Topics 1-4
Big Q: What role does the ribosome play in assembling proteins?
from nucleic acid language to amino acid language to PROTEIN language
Protein Synthesis.
Presentation transcript:

Codon usage bias Ref: Chapter 9 Xuhua Xia dambe.bio.uottawa.ca

Xuhua Xia Slide 2 Objectives Understand how codon usage bias affect translation efficiency and gene expression Biomedical relevance –Protein drugs in pharmaceutical industry –Transgenic experiments in agriculture Factors affecting codon usage bias Indices measuring codon usage bias Develop bioinformatic skills to study the genomic codon usage.

Xuhua Xia Slide 3 Codon Usage Bias Observation: Strongly biased codon usage in a variety of species ranging from viruses, mitochondria, plastids, prokaryotes and eukaryotes. Hypotheses: –Differential mutation hypothesis, e.g., Transcriptional hypothesis of codon usage (Xia 1996 Genetics 144: ) –Different selection hypothesis, e.g., (Xia 1998 Genetics 149: 37-44) Predictions: –From mutation hypothesis: Concordance between codon usage and mutation pressure –From Selection hypothesis: Concordance between differential availability of tRNA and differential codon usage. The concordance is stronger in highly expressed genes than lowly expressed genes (CAI is positively correlated with gene expression). UCC~tRNA~Gly GCC~tRNA~Gly UCC~tRNA~Gly Polycistronic mRNA Ribosome Gene 1 Gene 2 Gene 3 RNA polymerase Protein

Xuhua Xia Slide 4 Table 9-2, yeast Xia Bioinformatics and the cell.

Xuhua Xia Slide 5 Conflict: Initiation and Elongation Met codon usage from the 12 CDSs: AUA214 AUG 37 Possible tRNA Met/CAU, tRNA Met/UAU Vertebrate mitochondrial genome has only one tRNA Met. Which one to have? –tRNA Met/CAU : Good for initiation, but not efficient for AUA codons even with the C modified to 5-formylcytidine –tRNA Met/UAU : Good for AUA codons, but not good for initiation. –anticodon: CAU favoring the AUG codon Nature has chosen CAU: All mitochondrial genomes with a single tRNA Met has a CAU anticodon. Problem with AUA codons in translation? Xia et al PLoS One

Xuhua Xia Slide 6 Hypothesis and Predictions MetLeuGluLysGlnArgTrp AUAUUAGAAAAACAAAGAUGA Favoured by mutation, but not by tRNA- mediated selection because the first (wobble) position in tRNA anticodon is C. Favoured by mutation Also favoured by tRNA-mediated selection: the first (wobble) position of tRNA is U. Predictions: 1. Proportion of A-ending codons (or RSCU) should be smaller in the Met codon family than in other R-ending codon families: P NNA = N NNA /N NNG 2. Availability of tRNAMet/UAU should increase P AUA.

Xuhua Xia Slide 7 Selection against AUA codons Carullo, M. and Xia, X J Mol Evol 66:484–493.

Xia, X In: RS Singh et al.. Evolution in the fast lane: Rapidly evolving genes and genetic systems. Oxford University Press. Fig. 5. Relationship between P AUA and P UUA, highlighting the observation that P AUA is greater when both a tRNA Met/CAU and a tRNA Met/UAU are present than when only tRNA Met/CAU is present in the mtDNA, for bivalve species (a) and chordate species (b). The filled squares are for mtDNA containing both tRNA Met/CAU and tRNA Met/UAU genes, and the open triangles are for mtDNA without a tRNA Met/UAU gene.

Xuhua Xia Slide 9 Calculation of RSCU RSCU is codon-specific RSCU and proportion: Different scaling.

Xuhua Xia Slide 10 Calculation of CAI Compound 6- or 8-fold codon families should be broken into two codon families CAI is gene-specific. 0  CAI  1 CAI computed with different reference sets are not comparable. Problem with computing w as F i /F i.max : Suppose an amino acid is rarely used in highly expressed genes, then there is little selection on it, and the codon usage might be close to even, with w i  1. Now if we have a lowly expressed gene that happen to be made of entire of this amino acid, then the CAI for this lowly expressed gene would be 1, which is misleading. There has been no good alternative. Further research is needed. N 2,3,4 : Number of 2-, 3-, 4-fold codon families

Xuhua Xia Slide 11 Weak mRNA predictive power y = x R 2 = mRNA abundance Protein abundance FRS2 ENO1

Xuhua Xia Slide 12 Effect of Codon Usage Bias y = x R 2 = Codon usage bias Protein abundance FRS2 ENO1

Xuhua Xia Slide 13 Problems with CAI Formulation –Reference set –w = 0 Implementation –AUG –UGG –Multiple codon families for one amino acid Dependence on AT% Solutions (Xia, X Evolutionary Bioinformatics)

Xuhua Xia Slide 14 RSCU (HIV-1 vs Human) Fig. 1. Relative synonymous codon usage (RSCU) of HIV- 1 compared to RSCU of highly expressed human genes. Data points for codons ending with A, C, G or U are annotated with different combinations of colors and symbols. A-ending codons exhibit strong discordance in their usage between HIV-1 and human and are annotated with their coded amino acids. van Weringh et al MBE.

Xuhua Xia Slide 15 Research Observation on HIV-1: –Strong surplus of A-ending codon –High mutation rate Hypothesis: Strong A-biased mutation disrupting codon adaptation. –Prediction: Strong A-biased mutation (confirmed) If mutation rate is lower, then there will be better codon adaptation (The related HTLV-1 parasitizes the same cell as HIV-1, but have lower mutation rate: HTLV-1 genes should exhibit better codon adaptation)

Xuhua Xia Slide 16 RSCU (HTLV-1 vs Human) Relative synonymous codon usage (RSCU) of HTLV-1 compared to RSCU of highly expressed human genes. Data points for codons ending with A, C, G or U are annotated with different combinations of colors and symbols. A-ending codons exhibit strong discordance in their usage between HIV-1 and human and are annotated with their coded amino acids.

Table 2. Frequency of A residues, length and codon adaptation index (CAI) for the three HIV-1 early (tat, rev and nef) and five late (gag-pol, vif, vpu, vpr, and env) coding sequences (CDS). Gene CDS (bp) CAI tat rev nef gag pol vif vpr vpu env Any problem with the mutation hypothesis?