Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.

Slides:



Advertisements
Similar presentations
IMPRS workshop Comparative Genomics 18 th -21 st of February 2013 Lecture 4 Positive selection.
Advertisements

Evolution of Populations
Background The demographic events experienced by populations influence their genealogical history and therefore the pattern of neutral polymorphism observable.
R ATES OF P OINT M UTATION. The rate of mutation = the number of new sequence variants arising in a predefined target region per unit time. Target region.
Chapter 19 Evolutionary Genetics 18 and 20 April, 2004
Sampling distributions of alleles under models of neutral evolution.
MIGRATION  Movement of individuals from one subpopulation to another followed by random mating.  Movement of gametes from one subpopulation to another.
Modeling Populations forces that act on allelic frequencies.
Random fixation and loss of heterozygosity
Change in frequency of the unbanded allele (q) as a function of q for island populations. Equilibrium points a)Strong selection for q, little migration.
Plant of the day! Pebble plants, Lithops, dwarf xerophytes Aizoaceae
Atelier INSERM – La Londe Les Maures – Mai 2004
Signatures of Selection
Population Genetics I. Evolution: process of change in allele
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
Molecular evolution:   how do we explain the patterns of variation observed in DNA sequences? how do we detect selection by comparing silent site substitutions.
14 Molecular Evolution and Population Genetics
From population genetics to variation among species: Computing the rate of fixations.
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
Scott Williamson and Carlos Bustamante
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Population Genetics 101 CSE280Vineet Bafna. Personalized genomics April’08Bafna.
Chapter 3 Substitution Patterns Presented by: Adrian Padilla.
1 Genetic Variability. 2 A population is monomorphic at a locus if there exists only one allele at the locus. A population is polymorphic at a locus if.
Hidenki Innan and Yuseob Kim Pattern of Polymorphism After Strong Artificial Selection in a Domestication Event Hidenki Innan and Yuseob Kim A Summary.
Molecular phylogenetics
- any detectable change in DNA sequence eg. errors in DNA replication/repair - inherited ones of interest in evolutionary studies Deleterious - will be.
Lecture 21: Tests for Departures from Neutrality November 9, 2012.
In the deterministic model, the time till fixation depends on the selective advantage, but fixation is guaranteed.
MIGRATION  Movement of individuals from one subpopulation to another followed by random mating.  Movement of gametes from one subpopulation to another.
Population assignment likelihoods in a phylogenetic and demographic model. Jody Hey Rutgers University.
Deviations from HWE I. Mutation II. Migration III. Non-Random Mating IV. Genetic Drift A. Sampling Error.
Models of Molecular Evolution III Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections 7.5 – 7.8.
Coalescent Models for Genetic Demography
Section 6 Maintenance of Genetic Diversity Levels of genetic diversity result from the joint impacts of: Mutation & migration adding variation Chance &
Selectionist view: allele substitution and polymorphism
Population genetics. coalesce 1.To grow together; fuse. 2.To come together so as to form one whole; unite: The rebel units coalesced into one army to.
N=50 s=0.150 replicates s>0 Time till fixation on average: t av = (2/s) ln (2N) generations (also true for mutations with negative “s” ! discuss among.
Lecture 20 : Tests of Neutrality
NEW TOPIC: MOLECULAR EVOLUTION.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Molecular evolution Part I: The evolution of macromolecules.
Restriction enzyme analysis The new(ish) population genetics Old view New view Allele frequency change looking forward in time; alleles either the same.
In populations of finite size, sampling of gametes from the gene pool can cause evolution. Incorporating Genetic Drift.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
Human survivorship Developed Developing Bob May (2007), TREE 22:
8 and 11 April, 2005 Chapter 17 Population Genetics Genes in natural populations.
Inferences on human demographic history using computational Population Genetic models Gabor T. Marth Department of Biology Boston College Chestnut Hill,
Data analyses Course code: ZOO560 Week 3
Detection of the footprint of natural selection in the genome
MIGRATION Movement of individuals from one subpopulation to another followed by random mating. Movement of gametes from one subpopulation to another followed.
Polymorphism Polymorphism: when two or more alleles at a locus exist in a population at the same time. Nucleotide diversity: P = xixjpij considers.
Signatures of Selection
The neutral theory of molecular evolution
Allele frequency Time.
Neutrality Test First suggested by Kimura (1968) and King and Jukes (1969) Shift to using neutrality as a null hypothesis in positive selection and selection.
Montgomery Slatkin  The American Journal of Human Genetics 
Human Chimp How does DNA evolve? Nucleotide substitutions
The Neutral Theory M. Kimura, 1968
Pipelines for Computational Analysis (Bioinformatics)
Deviations from HWE I. Mutation II. Migration III. Non-Random Mating
What are the Patterns Of Nucleotide Substitution Within Coding and
Statistical Modeling of Ancestral Processes
Testing the Neutral Mutation Hypothesis
MIGRATION Movement of individuals from one subpopulation to another followed by random mating. Movement of gametes from one subpopulation to another followed.
Genetic Drift, followed by selection can cause linkage disequilibrium
Montgomery Slatkin  The American Journal of Human Genetics 
Testing for Selective Neutrality
Evolution by Genetic Drift : Main Points (p. 231)
Evolution by Genetic Drift : Main Points (p. 231)
Presentation transcript:

Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between species Genes that exhibit many interspecific differences will also have high levels of intraspecific polymorphism. i.e.

Nonsynonymous Synonymous Fixed Differences Polymorphisms % % nonsynonymous fixed synonymous fixed nonsynonymous polymorphism synonymous polymorphism = McDonald-Krietman Test G6PDH from D. melanogaster and D. simulans. Eanes et al Neutral Prediction: % nonsynonymous

Frequency If most nonsynonymous substitutions are adaptive, then they will increase in frequency and be fixed more rapidly than neutral alleles. Time As a result, they spend less time in a polymorphic state, therefore contribute less to within species polymorphism. neutral allele advantageous allele

Nonsynonymous Synonymous Fixed Differences Polymorphisms % % Adh from D. melanogaster, D. simulans, and D. Yakuba MacDonald and Kreitman 1991 % nonsynonymous Another example (N = 6-12 alleles per species for the coding region).

t2t2 t3t3 t4t4 t5t5 Coalescent Process t m is time for coalescence from m to m-1 sequences Gene Tree

ab cd e fgh Coalescent Process The geneology of n sequences has 2(n-1) branches. Gene Tree n = number of external branches. n-2 are internal

How long will the coalescence process take? Simplest case: If pick two random gene copies, probability that the second is the same as the first is 1 / (2N). This is the probability that two alleles coalesce in previous generation. It follows that / (2N) is the probability that two sequences were derived from different sequences in the preceding generation. Therefore, the probability that 2 sequences derived from the same ancestor 2 generations ago (grandparent) is / (2N) x 1 / (2N). It can be shown that the probability that two sequences were derived from the same ancestor t generations ago is: [1 - 1 / (2N) t x (1 / (2N)] ~ (1 / (2N e (-t/(2N)

[1 - 1 / (2N) g-1 x (1 / (2N)] Because N is in denominator, the probability will depend on sample size Consider probability of common ancestry for: Generations agoProb(N=5) Prob(N=10) It can be shown that the average time back to common ancestry of a pair of genes in a diploid population is 2Ne, and the average time back to common ancestry of all gene copies is 4Ne generations.

Small pop Large pop Time back to common ancestor

The average degree of relationship increases with time. All of the gene copies in a population can be traced back to a single ancestral gene. A population will eventually become monomorphic for one allele or another, with this probability determined by initial allele frequencies. Coalescence with no mutation

Coalescence with mutation If each lineage experiences  mutations per generation, then the number of base pair differences between them will be #dif = 2  t ca. If the average time to coalescence is 2N for two randomly chosen gene copies, then #dif = 2  (2N). Therefore, expect the average number of base pair differences between gene copies to be greater in a larger population.

Total length of branches of gene tree I + L = J Internal branches External branches + = Total time length Now consider mutation among branches during the coalescent process.  i ) +  e ) =  Mutations internal branches + = Total number of mutations in gene tree Mutations external branches In theory: total number of mutations  equals the number of segregating sites (K)

Tajima’ s Test (1989):  - K / a V(  - K/a) D = Using the difference in estimates of polymorphism to detect deviation from neutrality. Normalizing factor Rationale:  and K are differentially influenced by the frequency of alleles. Testing for Selective Neutrality

D = 0 neutral prediction D > 0balancing selection D < 0directional selection Few alleles at intermediate frequency Many low frequency, variable alleles  K/a ><><

Fu and Li’s Test (1993):  i -  e / (a - 1) V[  i -  e / (a - 1) D = Using the difference in # mutations in gene tree to detect deviation from neutrality. Rationale: An equivalent number of mutations is expected between interior verses exterior branches of a neutral gene tree.

D = 0 neutral prediction D > 0balancing selection D < 0directional selection Few alleles at intermediate frequency Many low frequency, variable alleles  i  e ><><

Gene genealogies under no selection, positive selection, balancing selection, and background selection. No Selection : 7 neutral mutations accumulate since the time of the last common ancestor. D = 0

Consider the Effects of Selection on Neutral Sites Linked to a Selected Site Positive Selection : neutral variation at linked sites will be eliminated (swept away) as the advantageous allele quickly is fixed in the population. This process is also called hitch-hiking. Time D < 0

Consider the Effects of Selection on Neutral Sites Linked to a Selected Site Balancing Selection : neutral variation at linked sites accumulates during the long period of time that both allele lineages are maintained. Time D > 0

Consider the Effects of Selection on Neutral Sites Linked to a Selected Site Background Selection : gene lineages become extinct not only by chance, but because of deleterious mutations to which they are linked, which eliminates some gene copies. Time D < 0

Problem: Background selection and hitchhiking are contrasting processes that lead to the same pattern. How to differentiate? Dramatic examples of reduced polymorphism=hitchhiking. Less dramatic examples=background selection.