Trees & Topologies Chapter 3, Part 2. A simple lineage Consider a given gene of sample size n. How long does it take before this gene coalesces with another.

Slides:



Advertisements
Similar presentations
Markov chains Assume a gene that has three alleles A, B, and C. These can mutate into each other. Transition probabilities Transition matrix Probability.
Advertisements

Background The demographic events experienced by populations influence their genealogical history and therefore the pattern of neutral polymorphism observable.
Sampling distributions of alleles under models of neutral evolution.
Coalescence with Mutations Towards incorporating greater realism Last time we discussed 2 idealized models – Infinite Alleles, Infinite Sites A realistic.
Retinoblastoma by Michele Chasteen What is Retinoblastoma? n n Tumor of the eye that can occur at a high frequency in children and sporadically at an.
Preview What does Recombination do to Sequence Histories. Probabilities of such histories. Quantities of interest. Detecting & Reconstructing Recombinations.
N-gene Coalescent Problems Probability of the 1 st success after waiting t, given a time-constant, a ~ p, of success 5/20/2015Comp 790– Continuous-Time.
Lecture 23: Introduction to Coalescence April 7, 2014.
Atelier INSERM – La Londe Les Maures – Mai 2004
Forward Genealogical Simulations Assumptions:1) Fixed population size 2) Fixed mating time Step #1:The mating process: For a fixed population size N, there.
Exact Computation of Coalescent Likelihood under the Infinite Sites Model Yufeng Wu University of Connecticut ISBRA
DATA ANALYSIS Module Code: CA660 Lecture Block 2.
From population genetics to variation among species: Computing the rate of fixations.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
Dispersal models Continuous populations Isolation-by-distance Discrete populations Stepping-stone Island model.
March 2006Vineet Bafna CSE280b: Population Genetics Vineet Bafna/Pavel Pevzner
Combinatorics & the Coalescent ( ) Tree Counting & Tree Properties. Basic Combinatorics. Allele distribution. Polya Urns + Stirling Numbers. Number.
Extensions to Basic Coalescent Chapter 4, Part 1.
1 We should appreciate, cherish and cultivate blessings.
Extensions to Basic Coalescent Chapter 4, Part 2.
Phylogenetics and Coalescence Lab 9 October 24, 2012.
Population assignment likelihoods in a phylogenetic and demographic model. Jody Hey Rutgers University.
MAT 4830 Mathematical Modeling 4.1 Background on DNA
Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.
Phylogeny GENE why is coalescent theory important for understanding phylogenetics (species trees)? coalescent theory lets us test our assumptions.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Models and their benefits. Models + Data 1. probability of data (statistics...) 2. probability of individual histories 3. hypothesis testing 4. parameter.
Getting Parameters from data Comp 790– Coalescence with Mutations1.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Coalescent Models for Genetic Demography
Ch. 20 – Mechanisms of Evolution 20.1 – Population Genetics macro-evolution – evolution on a large scale, such as the evolution of new species from a common.
Introduction to History of Life. Biological evolution consists of change in the hereditary characteristics of groups of organisms over the course of generations.
Introduction to Genetics Biology 1-2. Genetics Genetics is the study of heredity and inheritable traits. Genetics is the study of heredity and inheritable.
FINE SCALE MAPPING ANDREW MORRIS Wellcome Trust Centre for Human Genetics March 7, 2003.
Probability Rules. We start with four basic rules of probability. They are simple, but you must know them. Rule 1: All probabilities are numbers between.
Population genetics. coalesce 1.To grow together; fuse. 2.To come together so as to form one whole; unite: The rebel units coalesced into one army to.
By Mireya Diaz Department of Epidemiology and Biostatistics for EECS 458.
Coalescent theory CSE280Vineet Bafna Expectation, and deviance Statements such as the ones below can be made only if we have an underlying model that.
Testing the Neutral Mutation Hypothesis The neutral theory predicts that polymorphism within species is correlated positively with fixed differences between.
Restriction enzyme analysis The new(ish) population genetics Old view New view Allele frequency change looking forward in time; alleles either the same.
Lecture PowerPoint Slides Basic Practice of Statistics 7 th Edition.
Chapter IV Statistic and simple probability. A. Meaning of probability 1.Random event 2.Simple event 3.Relative frequent and probability of an event Relative.
Fixed Parameters: Population Structure, Mutation, Selection, Recombination,... Reproductive Structure Genealogies of non-sequenced data Genealogies of.
A Little Intro to Statistics What’s the chance of rolling a 6 on a dice? 1/6 What’s the chance of rolling a 3 on a dice? 1/6 Rolling 11 times and not getting.
Copyright © 2010, 2007, 2004 Pearson Education, Inc. Lecture Slides Elementary Statistics Eleventh Edition and the Triola Statistics Series by.
Course 2 Probability Basics 7.9 and Theoretical Probability Theoretical Probability is the ratio of the number of ways an event can occur to the.
What information do they give us?. Discuss with someone next to you. Make a Simple sketch on your whiteboard.
Lecture 6 Genetic drift & Mutation Sonja Kujala
Lecture Slides Elementary Statistics Twelfth Edition
Gil McVean Department of Statistics
Polymorphism Polymorphism: when two or more alleles at a locus exist in a population at the same time. Nucleotide diversity: P = xixjpij considers.
Population Genetics As we all have an interest in genomic epidemiology we are likely all either in the process of sampling and ananlysising genetic data.
Montgomery Slatkin  The American Journal of Human Genetics 
COALESCENCE AND GENE GENEALOGIES
The Heritage of Pathogen Pressures and Ancient Demography in the Human Innate- Immunity CD209/CD209L Region  Luis B. Barreiro, Etienne Patin, Olivier Neyrolles,
Evolution and Classification
Testing the Neutral Mutation Hypothesis
S=.2 s=.1.
Lecture 2: Basic Population Genetics
The coalescent with recombination (Chapter 5, Part 1)
S=.2 s=.1.
Montgomery Slatkin  The American Journal of Human Genetics 
Trees & Topologies Chapter 3, Part 2
Trees & Topologies Chapter 3, Part 2
Outline Cancer Progression Models
Probability.
Complex Signatures of Natural Selection at the Duffy Blood Group Locus
Lactase Haplotype Diversity in the Old World
The Heritage of Pathogen Pressures and Ancient Demography in the Human Innate- Immunity CD209/CD209L Region  Luis B. Barreiro, Etienne Patin, Olivier Neyrolles,
Presentation transcript:

Trees & Topologies Chapter 3, Part 2

A simple lineage Consider a given gene of sample size n. How long does it take before this gene coalesces with another gene in the sample?

Single Lineage How many events pass before it coalesces with another gene?

Disjoint subsamples Consider a sample of size n that is divided into two disjoint subsamples, A and B of sizes k and n-k, respectively.

Disjoint Subsamples (cont’d) The probability that all genes in A find a MRCA coalescing with any gene in B is: The probability that one of the two samples finds a MRCA before coalescing with members of the other sample is:

Disjoint Subsamples (cont’d)

Jump Process of Disjoint Subsamples Jump processes: – (i,j) -> (i-1, j) with probability (i+1)/(i+j) – (i,j) -> (i,j-1) with probability (j-1)/(i+j) Process starts in (k, n-k) and continues until (1,j) for some j. Eventually jumps to (0,j) for some j and finally reaches (0,1), where 0 denotes that sample A has been fully absorbed into B.

Disjoint Subsamples Example Gene tree of the PHDA1 gene from a sample of Africans and non-Africans.

A sample partitioned by a mutation Now, consider a sample of size n where a polymorphism divides the sample into two disjoint subsamples, A and B, of size k and n-k, respectively.

Comparing the mean values Jump processes: (i,j) -> (i-1,j) with probability i/(i+j-1) (i,j) -> (i, j-1) with probability (j-1)/(i+j-1)

Unknown ancestral state If we do not know which of the two alleles is older, we have a slightly different situation. Probability that an allele found in frequency k out of n genes is the oldest is k/n. Probability that A carries the mutant allele is 1-k/n = (n-k)/n. Jump processes become: – (i,j) -> (i-1,j) with probability i/(i+j) – (i,j) -> (i, j-1) with probability j/(i+j)

The age of the MRCA for two sequences Now consider the situation of two sequences with S 2 = k segregating sites.

Probability of going from n ancestors to k ancestors Probability of different number of ancestors starting with seven ancestors at time 0.

Probability of going from n ancestors to k ancestors Probability of different number of ancestors starting with seven ancestors at time 0 and ending with 4 ancestors at a different time.

Probability of going from n ancestors to k ancestors Probability that a sample of three genes have two ancestors at time r.

Questions? Slides are available on the Wiki at: omp_ omp_