Evolution of gene function

Slides:



Advertisements
Similar presentations
IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.
Advertisements

Evolution of genomes.
Phylogenetics workshop: Protein sequence phylogeny week 2 Darren Soanes.
Whole Genome Duplications (Polyploidy) Made famous by S. Ohno, who suggested WGD can be a route to evolutionary innovation (focusing on neofunctionalization)
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Copyright, ©, 2002, John Wiley & Sons, Inc.,Karp/CELL & MOLECULAR BIOLOGY 3E The Stability of the Genome Duplication, Deletion, Transposition.
14 Molecular Evolution and Population Genetics
The origins & evolution of genome complexity Seth Donoughe Lynch & Conery (2003)
BIOE 109 Summer 2009 Lecture 6- Part II Molecular evolution.
[Bejerano Fall10/11] 1 Any Project reflections?
1 Functional prediction in proteins (purifying and positive selection)
Mutation and DNA Mutation = change(s) in the nucleotide/base sequence of DNA; may occur due to errors in DNA replication or due to the impacts of chemicals.
Online Counseling Resource YCMOU ELearning Drive… School of Architecture, Science and Technology Yashwantrao C havan Maharashtra Open University, Nashik.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Origins and impact of constraints in evolution of gene families Boris E. Shakhnovich and Eugene V.Koonin Genome Research 2006, October 19 Stella Veretnik.
Ultraconserved Elements in the Human Genome Bejerano, G., et.al. Katie Allen & Megan Mosher.
Generating Diversity: how genes and genomes evolve Erin “They call me Dr. Worm” Friedman 29 September 2005.
Evolution Chapters Evolution is both Factual and the basis of broader theory What does this mean? What are some factual examples of evolution?
Genome Organization and Evolution. Assignment For 2/24/04 Read: Lesk, Chapter 2 Exercises 2.1, 2.5, 2.7, p 110 Problem 2.2, p 112 Weblems 2.4, 2.7, pp.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
The Biology and Genetic Base of Cancer. 2 (Mutation)
Mehdi Layeghifard Evolutionary Mechanisms Underlying the Functional Divergence of Vertebrates’ Circadian Rhythm Genes.
Questions. 09_12_Mutation.jpg Gene Evolution Pages
Anatomy of a Genome Project A.Sequencing 1. De novo vs. ‘resequencing’ 2.Sanger WGS versus ‘next generation’ sequencing 3.High versus low sequence coverage.
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Cédric Notredame (08/12/2015) Molecular Evolution Cédric Notredame.
Selectionist view: allele substitution and polymorphism
Evolution at the Molecular Level
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
NEW TOPIC: MOLECULAR EVOLUTION.
Evolution at the Molecular Level. Outline Evolution of genomes Evolution of genomes Review of various types and effects of mutations Review of various.
Evolution of Duplicated Genomes Talline Martins
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
1 What forces constrain/drive protein evolution? Looking at all coding sequences across multiple genomes can shed considerable light on which forces contribute.
Eukaryotic genes are interrupted by large introns. In eukaryotes, repeated sequences characterize great amounts of noncoding DNA. Bacteria have compact.
De novo creation of new genes 1.Retrotransposition (+/- cooption of other sequences) AAAAA Pre-mRNA AAAAA Splicing to remove intron Reverse transcription.
Genetic Code and Interrupted Gene Chapter 4. Genetic Code and Interrupted Gene Aala A. Abulfaraj.
LBA ProtPars. LBA Prot Dist no Gamma and no alignment.
SC.912.L.16.3 DNA Replication. – During DNA replication, a double-stranded DNA molecule divides into two single strands. New nucleotides bond to each.
Primary Mechanism of Duplication : Unequal Crossing Over Crossing over Between Daughter Strands Addition (duplication) Deletion (tandom duplications)
Phylogeny and the Tree of Life
Evolution and Population Genetics
Evolution of Populations
Evolutionary genomics can now be applied beyond ‘model’ organisms
Evolution of eukaryotic genomes
Population Genetics direct extension of Mendel’s laws, molecular genetics, and the ideas of Darwin Instead of genetic transmission between individuals,
Genetics and Evolutionary Biology
Causes of Variation in Substitution Rates
The neutral theory of molecular evolution
Basics of Comparative Genomics
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Linkage and Linkage Disequilibrium
The Neutral Theory M. Kimura, 1968
Very important to know the difference between the trees!
Horizontal gene transfer and the history of life
SGN23 The Organization of the Human Genome
What are the Patterns Of Nucleotide Substitution Within Coding and
1. "HARD" Selection can 'cost' a population individuals:
Gene duplications: evolutionary role
Evolution of eukaryote genomes
Evolutionary Biology Concepts
Pedir alineamiento múltiple
Evolutionary genetics
Chapter 6 Clusters and Repeats.
1. Unequal Crossing-Over a. process: If homologs line up askew:
Basics of Comparative Genomics
Reminder The AP Exam registration is open in Naviance. The Exam is on Monday, May 13. I’ll let you know when the next test/homework will be.
Presentation transcript:

Evolution of gene function Divergent evolution: the importance of gene duplication a. Ohno’s model b. Subfunctionalization c. Neofunctionalization Introducing novelty: generation of entirely new proteins/functions a. Lateral gene transfer b. Domain fusion c. Intron junction evolution? d. New genes through TEs?

Evolution of gene function First, some key basic concepts: Selection acts on phenotypes, based on their fitness cost/advantage, to affect the population frequencies of the underlying genotypes. In the case of DNA sequence: Neutral substitutions = no effect on fitness, no effect on selection Deleterious substitutions = fitness cost * These are removed by purifying (negative) selection Advantageous substitutions = fitness advantage * These alleles are enriched for through adaptive (positive) selection

The Neutral Theory M. Kimura, 1968 Most DNA substitutions are likely to be neutral = no effect on fitness. They arise through new mutations. Given a ~constant mutation rate, can convert the # of substitutions into time of divergence since speciation = molecular clock theory. Neutral changes evolve by genetic drift, not natural selection. * Most are probably lost, some can become fixed in the population Purifying selection to remove deleterious changes is pervasive, while positive selection may be relatively rare.

The Nearly-Neutral Theory T. Ohta, 1973 Many slightly deleterious (or slightly advantageous) substitutions are not selected against efficiently if population sizes are large. Therefore, many substitutions that are nearly neutral can evolve mostly by drift. * Small populations are more subject to drift (e.g. random events). * Selection is ‘slower’ in large populations … therefore many weakly deleterious substitutions have yet to be removed by selection. ** Practically what this means is that SOME substitutions found in extant sequences can be slightly deleterious & have yet to be removed

therefore removed through purifying selection Most genes are under constraint = many substitutions are deleterious and therefore removed through purifying selection Constraint can be due to maintaining: * Protein function (e.g. catalytic site) * Protein folding & stability * Interactions with other proteins, molecules * Other features like translation efficiency, RNA folding, etc. Then how do new functions emerge? How can proteins evolve?

Evolution of gene function Divergent evolution: the importance of gene duplication a. Ohno’s model b. Subfunctionalization c. Neofunctionalization Introducing novelty: generation of entirely new proteins/functions a. Lateral gene transfer b. Domain fusion c. Intron junction evolution? d. New genes through TEs?

Most functions evolve through divergent evolution due to relaxed constraint Susumu Ohno (1970): gene duplication is the main route to neofunctionalization, where one copy is allowed to evolve an entirely new function. 1. Gene duplication 2. Brief period of complete redundancy & relaxed constraint for both genes x x xx X x x x xx 3. Often one copy is lost as a pseudogene x x x xx 3. Or one copy can evolve a new function

Most functions evolve through divergent evolution due to relaxed constraint Susumu Ohno (1970): gene duplication is the main route to neofunctionalization, where one copy is allowed to evolve an entirely new function. 1. Gene duplication 2. Brief period of complete redundancy & relaxed constraint for both genes x x xx X x x x xx 3. Often one copy is lost as a pseudogene x x x xx 3. Or one copy can evolve a new function Force & Lynch (1999) formalized the concept of subfunctionalization, where both copies evolve and the ancestral function becomes split between the paralogs

Segments often flanked by 1. Segmental (dispersed) duplication & recombination (Homologous or Illegitimate) Segments often flanked by repetitive sequence 2. Tandem duplication through replication slippage 3. Duplication through retrotransposition (= loss of introns & flanked by repeats) 4. Whole-genome duplication (WGD, covered in Lecture 5) Once a gene has been duplicated, gene conversion through recombination can obscure rates

* Identified gene duplicates (BLAST) in 9 taxa Science 2000 * Identified gene duplicates (BLAST) in 9 taxa * Dated duplicates based on # of silent substitutions (molecular clock) Ks (sometimes called Ds ): # of silent substitutions that encode SAME (synonymous) codon * often these changes are ASSUMED to be neutral** * given a constant rate of point mutations, Ks can be used to date a sequence ** now people realize that Ks can also be constrained by other things besides codon Ka (sometimes called DN ): # of substitutions encoding a nonsynonymous codon

The Ka/Ks ratio: a measure of constraint on coding sequences If we assume that Ks reflects the underlying neutral rate of change: Ka/Ks = 1 …. Rate of codon changes is the same as rate of silent changes * taken to mean NO constraint on gene sequence Ka/Ks < 1 …. Rate of codon changes is LESS than the rate of neutral change * implies deleterious codon changes were removed by purifying selection * therefore implies constraint on gene sequence Ka/Ks > 1 …. Rate of codon changes is the GREATER than rate of silent changes * implies codon changes have been selected for by positive selection Ks can also be used to date the age of sequences according to the ‘molecular clock’ theory

Science 2000 * Identified gene duplicates (BLAST) in 9 taxa * Dated duplicates based on # of silent substitutions (molecular clock) * Measured several features over ‘time’ (# silent substitutions) to show:

Lynch & Conery 2000

* Identified gene duplicates (BLAST) in 9 taxa Science 2000 * Identified gene duplicates (BLAST) in 9 taxa * Dated duplicates based on # of silent substitutions (molecular clock) * Measured several features over ‘time’ (# silent substitutions) to show: Duplicates experience brief window of relaxed constraint before reintroduction of purifying selection Average half-life of gene duplicates is ~4 million years In yeast and drosophila: rate of gene duplication: 0.002 - 0.02 per gene per million years, depending on species (e.g. if 13,000 genes = 31 new duplicates per genome per million years) … may be inflated if gene conversion makes ancient duplicates appear ‘young’ ** The estimated rate of gene duplication is on the same order as rate of new mutations!

X Fate of gene duplicates 1. Lost as a pseudogene x xx 2. Neofunctionalization x x x xx 3. Subfunctionalization x x x xx 4. Retained & conserved * Can be maintained due to advantage of increased dosage * Can promote regulatory innovation

Reverse transcription by TE polymerases De novo creation of new genes Retrotransposition (+/- cooption of other sequences) Often see short flanking repeats due to mechanism of TE integration Integration into the genome (in NUCLEUS) Reverse transcription by TE polymerases (in CYTOSOL) AAAAA Splicing to remove intron AAAAA Pre-mRNA

De novo creation of new genes Retrotransposition (+/- cooption of other sequences) Gene duplication into other sequences = chimeric structure/regulation

De novo creation of new genes Retrotransposition (+/- cooption of other sequences) Gene duplication into other sequences = chimeric structure/regulation Cooption of non-coding DNA (from introns, intergenic sequence)

De novo creation of new genes Retrotransposition (+/- cooption of other sequences) Gene duplication into other sequences = chimeric structure/regulation Cooption of non-coding DNA (from introns, intergenic sequence) Horizontal gene transfer (very prevalent in bacteria: Lecture 5) - also observed from bacterial parasites to insect hosts Challenge in distinguishing Novel Gene vs. missed orthology due to rapid evolution