UPGMA and FM are distance based methods. UPGMA enforces the Molecular Clock Assumption. FM (Fitch-Margoliash) relieves that restriction, but still enforces.

Slides:



Advertisements
Similar presentations
Tree Building What is a tree ? How to build a tree ? Cladograms Trees
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
An Introduction to Phylogenetic Methods
Reading Phylogenetic Trees Gloria Rendon NCSA November, 2008.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic reconstruction
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
Lecture 7 – Algorithmic Approaches Justification: Any estimate of a phylogenetic tree has a large variance. Therefore, any tree that we can demonstrate.
Bioinformatics Algorithms and Data Structures
In addition to maximum parsimony (MP) and likelihood methods, pairwise distance methods form the third large group of methods to infer evolutionary trees.
The Tree of Life From Ernst Haeckel, 1891.
Maximum Likelihood Flips usage of probability function A typical calculation: P(h|n,p) = C(h, n) * p h * (1-p) (n-h) The implied question: Given p of success.
Maximum Parsimony.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Molecular Clocks, Base Substitutions, & Phylogenetic Distances.
Building Phylogenies Parsimony 1. Methods Distance-based Parsimony Maximum likelihood.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
Parsimony methods the evolutionary tree to be preferred involves ‘the minimum amount of evolution’ Edwards & Cavalli-Sforza Reconstruct all evolutionary.
Phylogenetic trees Sushmita Roy BMI/CS 576
Phylogenetic Analysis
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogeny Estimation: Traditional and Bayesian Approaches Molecular Evolution, 2003
Terminology of phylogenetic trees
Molecular phylogenetics
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Introduction to Phylogenetics
Reading Phylogenetic Trees
Calculating branch lengths from distances. ABC A B C----- a b c.
Using traveling salesman problem algorithms for evolutionary tree construction Chantal Korostensky and Gaston H. Gonnet Presentation by: Ben Snider.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
Phylogenetic Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics Figures from Higgs & Attwood.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Why do trees?. Phylogeny 101 OTUsoperational taxonomic units: species, populations, individuals Nodes internal (often ancestors) Nodes external (terminal,
Maximum Likelihood Given competing explanations for a particular observation, which explanation should we choose? Maximum likelihood methodologies suggest.
Phylogeny Ch. 7 & 8.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Interpreting Cladograms
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Maximum Parsimony Phenetic (distance based) methods are fast and often accurate but discard data and are not based on explicit character states at each.
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogenetic basis of systematics
Phylogeny and the Tree of Life
Goals of Phylogenetic Analysis
The Tree of Life From Ernst Haeckel, 1891.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogeny and the Tree of Life
Phylogeny and the Tree of Life
Chapter 20 Phylogenetic Trees. Chapter 20 Phylogenetic Trees.
Reading Phylogenetic Trees
Phylogeny and the Tree of Life
Lecture 7 – Algorithmic Approaches
Phylogenetic Trees Jasmin sutkovic.
Presentation transcript:

UPGMA and FM are distance based methods. UPGMA enforces the Molecular Clock Assumption. FM (Fitch-Margoliash) relieves that restriction, but still enforces the idea that the clades should include the entities that are closest to each other. There is another numerical method called Neighbor Joining (NJ – see p of K&R) but it involves a complicated combinatorial method for locating neighbors. The latter two are extensions and modifications of UPGMA. While distance based methods make sense in some contexts, the problem is that they try to reduce the relationship between all entities in a group to a collection of pairwise distances between them. We now look at an entirely different approach called character based methods. We only consider one example.

Method of Maximum Parsimony The idea behind this method is that looking at all possible trees that might relate a collection of sequences, it chooses only those that involve the fewest number of mutations to produce all of the sequences in the group. Let’s start with a simple case. Suppose we have four sequences and in a particular position (we will get to multiple sequences later) we have two sequences that have a T and two that have a C. Consider two possible trees – the numbers indicate from which sequence the nucleotide was taken CC C C TT TT Which tree is more likely to explain the evolution of the four sequences? ** * Suppose the ancestral sequence contained a C then the *’s indicate the branches where a mutation occurred. A similar result would occur if the ancestral sequence contained a T.

Still sticking with two nucleotides, C and T, consider the five taxa above. Suppose figures (a) and (b) are from one position in an MSA and (c) and (d) are from another. Parsimony shows that (a) is more likely than (b). However, parsimony does not help us in determining whether (c) is more likely than (d). On the other hand, considering the two positions together, using the parsimony criterion, we would choose the relationships shown in (a) and (c).

Note: The root in the constructions used as illlustrations in these slides is merely for convenience, Parsimony, like FM, produces and unrooted tree and an outgroup is generally used to find a root. Note: the number of *’s (denoting mutations is called the parsimony score of a tree). We do not need to restrict ourselves to only two different nucleotides at a site. Consider five taxa and three nucleotides. GATTA * ** G AA T T * * Score: 3 Score: 2 Actually, there are several trees for these five taxa with a parsimony score of 2.

Consider the following four short sequences: GTA, GCA, ATC,ACC Suppose the following tree is proposed: Start at the top of this tree * We observed an initial mutation in the middle nucleotide. We placed the * on the branch with the C, assuming that the ancestor had a T in that position. We could have assumed the ancestor had a C in the middle position and put a * on the other branch. The choice does not affect our final Parsimony Score.

Continuing, assuming the previous partial scoring ** This one is more tricky, because we have changes in two positions, the first and the third. Completing the marking of this tree ** * This tree has a Parsimony Score of 4. Can you find one with a lower score?

Generally, we deal with much longer sequences. However, not all sites will affect the number of mutations needed for a tree. It is obvious if all sites have the same nucleotide that this site will not contribute to the score. Less obvious is when all but one site have the same nucleotide, say A, and the remaining site has some other nucleotide. In this case regardless of the tree topology if we put an A at every interior vertex, we will have the minimum number of mutations. This leads to the following definition: A site is called informative if at least two different bases occur at least twice each among the sequences being considered. We generally choose between alternative trees that may be offered to explain an evolutionary event, by computing the parsimony score only on the basis of informative sites.

Maximum Parsimony is generally of little help in tree construction in that All possible trees must be considered (albeit, some can be rejected out of hand for a variety of good reasons.) Many times there are two or more trees that have the same parsimony score. It makes no use of the Jukes-Cantor or any other model of DNA mutation in its evaluation. Only criterion – simplicity is best. The best explanation of evolutionary history is the one with the fewest mutations. It is used in conjunction with distance methods when there are relatively few mutations obscuring previous mutations. Both parsimony methods and distance methods have their strong advocates, and a serious, sometimes acrimonious debate is still going on in the area of evolutionary tree reconstruction.

Read: Krane and Raymer Chapter 5 pp 98 – 111 Homework : Question on slide #6 (Can you take the sequences in the tree having a parsimony score of 4 and find a tree with a lower score.) K&R p115,116: 5.1, 5.2, 5.3