Multiple Alignment and Phylogenetic Trees

Slides:



Advertisements
Similar presentations
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Advertisements

Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
An Introduction to Phylogenetic Methods
Computing a tree Genome 559: Introduction to Statistical and Computational Genomics Prof. James H. Thomas.
Lecture 13 CS5661 Phylogenetics Motivation Concepts Algorithms.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
Phylogenetic Trees - I.
Plant Molecular Systematics (Phylogenetics). Systematics classifies species based on similarity of traits and possible mechanisms of evolution, a change.
Phylogenetic reconstruction
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
Lecture 7 – Algorithmic Approaches Justification: Any estimate of a phylogenetic tree has a large variance. Therefore, any tree that we can demonstrate.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Building phylogenetic trees Jurgen Mourik & Richard Vogelaars Utrecht University.
Phylogenetic reconstruction
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
Phylogenetic trees Sushmita Roy BMI/CS 576
9/1/ Ultrametric phylogenies By Sivan Yogev Based on Chapter 11 from “Inferring Phylogenies” by J. Felsenstein.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Terminology of phylogenetic trees
Molecular phylogenetics
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
1 Generalized Tree Alignment: The Deferred Path Heuristic Stinus Lindgreen
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Applied Bioinformatics Week 8 Jens Allmer. Practice I.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Phylogenetic Prediction Lecture II by Clarke S. Arnold March 19, 2002.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Introduction to Phylogenetics
Using traveling salesman problem algorithms for evolutionary tree construction Chantal Korostensky and Gaston H. Gonnet Presentation by: Ben Snider.
Evolutionary tree reconstruction
Phylogeny Ch. 7 & 8.
Phylogeny & the Tree of Life
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny and the Tree of Life
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogenetic basis of systematics
Phylogeny - based on whole genome data
Distance based phylogenetics
Phylogeny & the Tree of Life
Inferring a phylogeny is an estimation procedure.
Cladistics.
18.2 Modern Evolutionary Classification
Welcome to AP Biology Saturday Study Session
Systematics: Tree of Life
Inferring phylogenetic trees: Distance and maximum likelihood methods
Phylogenetic Trees.
Systematics: Tree of Life
Reading Phylogenetic Trees
Lecture 7 – Algorithmic Approaches
Phylogeny.
Molecular data assisted morphological analyses
Phylogenetic Trees Jasmin sutkovic.
Presentation transcript:

Multiple Alignment and Phylogenetic Trees Csc 487/687 Computing for Bioinformatics

Terms related to graphs Graph ― an abstract structure containing nodes and edges Path ― a consecutive set of edges Tree ― a connected graph with exactly one path between every two nodes Edge length ―a number assigned to each edge signifying in some sense the distance between the nodes connected by edge Path length ― the sum of the lengths of the edges that comprise the path We describe phylogenetic relationships as trees. Graph, path, connected graph. Root, leaf. Binary tree. Directed graph. Each edge has one direction. Phylogenetic tree, directed graph. Ancestor-descendent relationship. Maybe you assign numbers to each edge for signifying the distance between the nodes connected by edge. In phylogenetic trees, edge lengths means some measure of dissimilarity between two species. The time since their separation.

Deriving phylogenetic trees Phenetic approach Measuring a set of distances between species, and generate the tree by a hierarchical clustering procedure. Cladistic approach Consider possible pathways of evolution Infer the features of the ancestor at each node Choose an optimal tree according to some model of evolutionary changes. Genealogy There are two approach. Makes no reference to any historical model of the relationships. Similarity, genealogy.

Phenetic (clustering) approach a simple clustering procedure Determine a measure of the similarity or difference for all pairs Create a tree from the set of dissimilarity iteratively First choose two closely-related species, and insert a node to represent their common ancestor. Replace the two selected species by a set containing both Replace the distances from the pair to the others by the average distance of the two selected species to the others. It can be used to produce a tree even in the absence of evolutionary relationships. Departmental store has goods clustered into sections according the types of product. Given a set of species, This could depend on the physical body trait such as the difference of the average heights of members of two species. The number of different bases. For DNA sequence.

Phenetic (clustering) approach Example UPGMA method (Unweighted pair group method with arithmetic mean) The number of different bases as the measure of dissimilarity between each pair of species. Use a simple clustering procedure to derive the phylogenetic tree. The distance matrix. Symmetric, half. Nonzero distance. The first cluster. External evidence

Cladistic methods The goal is to select the correct tree by utilizing an explicit model of the evolutionary process. Maximum parsimony Maximum likelihood They are specialized to sequence data, starting from a multiple sequence alignment Anatomic. Postulate. Ancestry. Taxa

Maximum parsimony method An optimal tree as the one that introduces the fewest mutations. Several trees may postulate the same number of mutations Not unique answer

Maximum likelihood method Assign quantitative probabilities to mutational events The optimal tree is the one with the highest likelihood of generating the observed data Not just counting them

The problem of varying rates of evolution Subject to the errors if the rates of evolutionary change vary along different branches of the tree outgroup―a species more distantly related to all the species than any pair of them is to each other. Suppose that the four species A, B, C, D have the phylogenetic tree. Consistent with the dissimilarity matrix. Change very fast. Phylogeny doesn’t change. Incorrect phylogenetic tree. Primate, non-primate mammal such as a cow

Computational Consideration Cladistic methods―maximum parsimony and maximum likelihood―are more accurate than simpler clustering method But require large amount of computer time Approximate answers Calculated phylogenies are often approximations