Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.

Trees & Topologies Chapter 3, Part 1

Terminology
Equivalence classes: a specific separation of a set of genes into disjoint sets that together cover the whole set of genes.
Jump process: describes which pair of genes coalesces at each coalescence event.
Waiting time process: the waiting time to the next coalescence event when there are k genes left.
2/19/2009 COMP 790 - Trees & Topologies

Coalescent Tree

Coalescent vs. Phylogenetic Trees
Phylogenetic tree: branch length = number of mutations.
Coalescent tree: branch length = time to coalescence (coalescent time × 2N generations × generation time).
Expected number of mutations = θ/2 × coalescent time.
[Figures: a rooted phylogenetic tree; four representations of a coalescent tree]
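The two branch-length conversions on this slide can be written out as a short sketch. The function and parameter names are illustrative and do not come from the slides, and the numbers in the usage lines are made-up inputs, not data from the course.

```python
def coalescent_time_to_years(coalescent_time, diploid_population_size, generation_years):
    """Real-time branch length: coalescent time x 2N generations x generation time."""
    return coalescent_time * 2 * diploid_population_size * generation_years


def expected_mutations(coalescent_time, theta):
    """Expected number of mutations on a branch: theta/2 per unit of coalescent time."""
    return theta / 2 * coalescent_time


# Hypothetical inputs: N = 10,000 diploid individuals, 25-year generations.
years = coalescent_time_to_years(1.0, 10_000, 25)
mutations = expected_mutations(2.0, 3.0)
```

Keeping the two conversions separate mirrors the slide: the same coalescent-time branch can be read either as calendar time or as an expected mutation count.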

Counting Trees & Topologies
C_k: number of coalescent topologies with k leaves.
B_k: number of binary unrooted tree topologies with k leaves.

Recursion Illustrated
[Figure: basic recursion for the number of unrooted tree topologies as a function of the number of leaves]

Recurrence Intuition
[Table: B_k and C_k for increasing k; the numerical entries are not preserved in this transcript]
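Since the formulas and the table did not survive in the transcript, the following is a minimal sketch of the two counts using the standard results, which are stated here as assumptions rather than taken from the slides. A coalescent topology is built by choosing one of the C(j, 2) pairs at each coalescence event, so C_k is the product of C(j, 2) for j = 2, ..., k. An unrooted binary tree with k-1 leaves has 2k-5 edges to which the k-th leaf can be attached, so B_k = (2k-5) B_{k-1} with B_3 = 1. The function names are illustrative.

```python
from math import comb


def coalescent_topologies(k):
    """C_k: product over j = 2..k of the number of pairs that can coalesce."""
    if k < 1:
        raise ValueError("k must be at least 1")
    count = 1
    for j in range(2, k + 1):
        count *= comb(j, 2)
    return count


def unrooted_topologies(k):
    """B_k via the recursion B_k = (2k - 5) * B_{k-1}, starting from B_3 = 1."""
    if k < 3:
        raise ValueError("the recursion is defined here for k >= 3")
    count = 1
    for j in range(4, k + 1):
        count *= 2 * j - 5
    return count


for k in (3, 4, 5, 6, 10):
    print(k, unrooted_topologies(k), coalescent_topologies(k))
```

Both counts grow faster than exponentially in k, and C_k exceeds B_k because a coalescent topology also records the order of the coalescence events.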

Gene Trees
A gene tree is a graph that shows the ancestral relationships between genes.
Gene trees are built under the infinite sites model (Ch. 5 discusses what happens without this assumption).
A gene tree is not a coalescent tree.
It clusters genes according to their type and mutation pattern.

Example Gene Tree
[Figure] Data set with five sequences and four segregating sites, shown with their relative positions. The tree is built up starting with the first site, adding one site at a time.

Building Gene Trees
1. Determine whether the data pass the 4-gamete test. If not, there cannot be a gene tree.
2. Treat each column as a binary number and sort the columns in decreasing order, with the largest binary number in column one.
3. Add the sequences one at a time, each with all its characters. The characters of a sequence to be added are a specific row, which is read right to left. The sequence is placed by tracing from the leaves towards the root. It has its own edges until the prefix is encountered where it coincides with the last added character.
4. The root is labeled with an open circle. It can be removed to form an unrooted tree.
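Steps 1 and 2 above can be sketched as follows. This is a minimal illustration, assuming the data are given as a list of rows (one per sequence) of 0/1 characters; the function names and the small data set are hypothetical and are not the table from the slides. The tree-building step itself is not implemented here.

```python
from itertools import combinations


def passes_four_gamete_test(rows):
    """Return False if any pair of sites shows all four gametes 00, 01, 10, 11."""
    n_sites = len(rows[0])
    for i, j in combinations(range(n_sites), 2):
        gametes = {(row[i], row[j]) for row in rows}
        if len(gametes) == 4:
            return False
    return True


def sort_columns_decreasing(rows):
    """Reorder sites so that the columns, read top to bottom as binary numbers, decrease."""
    n_sites = len(rows[0])
    columns = [tuple(row[i] for row in rows) for i in range(n_sites)]
    # Comparing 0/1 tuples element by element matches comparing the binary numbers.
    order = sorted(range(n_sites), key=lambda i: columns[i], reverse=True)
    return [[row[i] for i in order] for row in rows]


# Hypothetical data: three sequences, two segregating sites.
data = [[0, 1], [1, 0], [0, 0]]
compatible = passes_four_gamete_test(data)
sorted_data = sort_columns_decreasing(data)
```

Checking every pair of sites is quadratic in the number of sites, which is fine for small examples like the ones on these slides.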

Example
Given the following table, build a gene tree using steps 1 to 4 above.
[Table with columns A, B, C, D; the entries are not preserved in this transcript]

Nested Subsamples
Assume a sample A of size n is taken, and within that sample a subsample B of size m is taken, with m ≤ n.
The process describing the number of ancestors of B and of A starts in state (m, n) and jumps to either (m, n-1) or (m-1, n-1).

More nested subsamples
Probability that the MRCA of B is also the MRCA of A.
Special case: A is the whole population (n → ∞, or n = 2N with 2N large).

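More nested subsamples
[Table: P(MRCA of A = MRCA of B) for increasing subsample size m when A is the whole population. Surviving entries: 0/2 (no info), 1/3, 1/2, 2/3 = 0.67, 4/5 = 0.80, 9/10 = 0.90, …/15. The m values and the last entry are not preserved in this transcript.]
Remember: the time until the whole population has found an MRCA is 2 (in coalescent units), and the time until a sample of size two has found an MRCA is 1.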
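The nested-subsample process can be sketched as a recursion over the states (m, n). The transition probabilities and the closed form used below are standard results assumed here, since the formulas on the slides did not survive: from (m, n), the next coalescence joins two ancestors of B with probability m(m-1)/(n(n-1)), giving (m-1, n-1), and otherwise gives (m, n-1). The closed form (m-1)(n+1)/((m+1)(n-1)) and its limit (m-1)/(m+1) are likewise assumptions, and the function names are illustrative.

```python
from fractions import Fraction
from functools import lru_cache


@lru_cache(maxsize=None)
def prob_shared_mrca(m, n):
    """P(MRCA of subsample B is the MRCA of sample A), by recursion on (m, n)."""
    if not 1 <= m <= n:
        raise ValueError("need 1 <= m <= n")
    if m == n:
        return Fraction(1)  # every remaining ancestor of A is an ancestor of B
    if m == 1:
        return Fraction(0)  # B found its MRCA while A still has several ancestors
    p_within_b = Fraction(m * (m - 1), n * (n - 1))
    return (p_within_b * prob_shared_mrca(m - 1, n - 1)
            + (1 - p_within_b) * prob_shared_mrca(m, n - 1))


def closed_form(m, n):
    """Assumed closed form, valid for n >= 2."""
    return Fraction((m - 1) * (n + 1), (m + 1) * (n - 1))


def whole_population_limit(m):
    """Limit of the closed form as n grows without bound."""
    return Fraction(m - 1, m + 1)


for m in (1, 2, 3, 5, 9, 19):
    print(m, whole_population_limit(m))
```

Under this assumed limit, the surviving table entries 0/2, 1/3, 1/2, 2/3, 4/5 and 9/10 correspond to m = 1, 2, 3, 5, 9 and 19.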

Hanging Subtrees

Unbalanced Trees
Probability that the basal split into two lineages at the root of the tree results in the labeled, unordered partition (i, n-i), i = 1, 2, …, ⌊n/2⌋.
In large samples, unbalanced trees are unlikely.

Neanderthal Example
Nordborg (1998) studied the tree of a combined sample of 986 human mitochondrial sequences and 1 Neanderthal sequence.
Assuming random mating, the probability is 2/(986 × 985) ≈ 2 × 10^-6.
Nordborg pointed out that a large part of the human sample had already found a common ancestor during the time the sequenced Neanderthal lived (30,000 to [...],000 years ago).
For example, if there were 5 ancestors to the present human sample 30,000 years ago, the probability is 2/(5 × 4) = 10%.
This does not provide strong evidence against interbreeding between Neanderthals and humans.
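The two probabilities on this slide have the form 2/(n(n-1)). One way to read that form, offered here as an assumption rather than as Nordborg's derivation, is as the probability that one specified lineage among n takes part in no coalescence until the very last one, so that it splits off at the root. With k lineages left, the next coalescence avoids the specified lineage with probability (k-2)/k, and multiplying these factors for k = n, ..., 3 gives 2/(n(n-1)). The function name is illustrative, and n is used exactly as on the slide.

```python
from fractions import Fraction


def prob_lineage_joins_last(n):
    """Probability that one specified lineage out of n coalesces only at the root."""
    if n < 2:
        raise ValueError("need at least two lineages")
    prob = Fraction(1)
    # With k lineages left, the specified lineage is involved in the next
    # coalescence with probability (k - 1) / C(k, 2) = 2 / k.
    for k in range(n, 2, -1):
        prob *= Fraction(k - 2, k)
    return prob


p_full_sample = prob_lineage_joins_last(986)   # equals 2 / (986 * 985)
p_five_ancestors = prob_lineage_joins_last(5)  # equals 2 / (5 * 4)
```

The contrast between the two values is the point of the slide: the probability is tiny when computed on the full present-day sample but about 10% once only five ancestral lineages remain.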

Next Time
More Trees & Topologies:
- A single lineage
- Disjoint subsamples
- A sample partitioned by a mutation
- The probability of going from n ancestors to k ancestors