Phylogenies Preliminaries Distance-based methods Parsimony Methods.

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
1 Dan Graur Methods of Tree Reconstruction. 2 3.
Brandon Andrews CS6030.  What is a phylogenetic tree?  Goals in a phylogenetic tree generator  Distance based method  Fitch-Margoliash Method Example.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic reconstruction
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
Molecular Phylogeny Fredj Tekaia Institut Pasteur
UPGMA Algorithm.  Main idea: Group the taxa into clusters and repeatedly merge the closest two clusters until one cluster remains  Algorithm  Add a.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Bioinformatics and Phylogenetic Analysis
In addition to maximum parsimony (MP) and likelihood methods, pairwise distance methods form the third large group of methods to infer evolutionary trees.
The Tree of Life From Ernst Haeckel, 1891.
07/05/2004 Evolution/Phylogeny Introduction to Bioinformatics MNW2.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Lecture 24 Inferring molecular phylogeny Distance methods
FPGA Acceleration of Gene Rearrangement Analysis Jason D. Bakos Dept. of Computer Science and Engineering University of South Carolina Columbia, SC USA.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Building Phylogenies Parsimony 2.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
Phylogenetic Analysis. 2 Phylogenetic Analysis Overview Insight into evolutionary relationships Inferring or estimating these evolutionary relationships.
Bioinformatics Basics Cyrus Chan, Peter Lo, David Lam Courtesy from LO Leung Yau’s original presentation.
Phylogenetic trees Sushmita Roy BMI/CS 576
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogenetic Analysis. 2 Introduction Intension –Using powerful algorithms to reconstruct the evolutionary history of all know organisms. Phylogenetic.
Terminology of phylogenetic trees
Molecular phylogenetics
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
PHYLOGENETIC TREES Dwyane George February 24,
Phylogentic Tree Evolution Evolution of organisms is driven by Diversity  Different individuals carry different variants of.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Molecular phylogenetics 1 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Lecture 25 - Phylogeny Based on Chapter 23 - Molecular Evolution Copyright © 2010 Pearson Education Inc.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Parsimony-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Phylogenetic Tree Reconstruction
Applied Bioinformatics Week 8 Jens Allmer. Practice I.
Molecular phylogenetics 4 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Chapter 8 Molecular Phylogenetics: Measuring Evolution.
Introduction to Phylogenetics
Calculating branch lengths from distances. ABC A B C----- a b c.
Phylogenetic Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics Figures from Higgs & Attwood.
Phylogeny Ch. 7 & 8.
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Phylogenetic Trees - Parsimony Tutorial #13
Construcción de cladogramas y Reconstrucción Filogenética
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
Part 9 Phylogenetic Trees
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Fitch-Margoliash Algorithm 1.From the distance matrix find the closest pair, e.g., A & B 2.Treat the rest of the sequences as a single composite sequence.
CSCE555 Bioinformatics Lecture 13 Phylogenetics II Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Introduction to Bioinformatics Resources for DNA Barcoding
Phylogenetic basis of systematics
Multiple Alignment and Phylogenetic Trees
Patterns in Evolution I. Phylogenetic
Phylogenetic Trees.
Phylogeny.
Lecture 19: Evolution/Phylogeny
Presentation transcript:

Phylogenies Preliminaries Distance-based methods Parsimony Methods

CS/BIO Introduction to Bioinformatics2 Phylogenetic Trees  Hypothesis about the relationship between organisms  Can be rooted or unrooted ABCDE AB C D E Time Root

CS/BIO Introduction to Bioinformatics3 Tree proliferation SpeciesNumber of Rooted TreesNumber of Unrooted Trees ,459,4252,027, ,458,046,767,8757,905,853,580,625 88,200,794,532,637,891,559,375221,643,095,476,699,771,875

CS/BIO Introduction to Bioinformatics4 Molecular phylogenetics  Specific genomic sequence variations (alleles) are much more reliable than phenotypic characteristics  More than one gene should be considered

CS/BIO Introduction to Bioinformatics5 An ongoing didactic  Pheneticists tend to prefer distance based metrics, as they emphasize relationships among data sets, rather than the paths they have taken to arrive at their current states.  Cladists are generally more interested in evolutionary pathways, and tend to prefer more evolutionarily based approaches such as maximum parsimony.

CS/BIO Introduction to Bioinformatics6 Distance matrix methods SpeciesABCD B9––– C811–– D121510– E

CS/BIO Introduction to Bioinformatics7 UPGMA  Similar to average-link clustering  Merge the closest two groups Replace the distances for the new, merged group with the average of the distance for the previous two groups  Repeat until all species are joined

CS/BIO Introduction to Bioinformatics8 UPGMA Step 1 SpeciesABCD B9––– C811–– D121510– E Merge D & E DE SpeciesABC B9–– C811– DE

CS/BIO Introduction to Bioinformatics9 UPGMA Step 2 Merge A & C DE SpeciesABC B9–– C811– DE AC SpeciesBAC 10– DE

CS/BIO Introduction to Bioinformatics10 UPGMA Steps 3 & 4 Merge B & AC DEAC SpeciesBAC 10– DE B Merge ABC & DE DEACB (((A,C)B)(D,E))

CS/BIO Introduction to Bioinformatics11 Parsimony approaches  Belong to the broader class of character based methods of phylogenetics  Emphasize simpler, and thus more likely evolutionary pathways I: GCGGACG II: GTGGACG C T III (C or T) C T III A (C or T)

CS/BIO Introduction to Bioinformatics12 Informative and uninformative sites Position Seq GGGGGG 2 GGGAGT 3 GGATAG 4 GATCAT For positions 5 & 6, it is possible to select more parsimonious trees – those that invoke less substitutions.

CS/BIO Introduction to Bioinformatics13 Parsimony methods  Enumerate all possible trees  Note the number of substitutions events invoked by each possible tree Can be weighted by transition/transversion probabilities, etc.  Select the most parsimonious

CS/BIO Introduction to Bioinformatics14 Branch and Bound methods  Key problem – number of possible trees grows enormous as the number of species gets large  Branch and bound – a technique that allows large numbers of candidate trees to be rapidly disregarded  Requires a “good guess” at the cost of the best tree

CS/BIO Introduction to Bioinformatics15 Branch and Bound for TSP  Find a minimum cost round-trip path that visits each intermediate city exactly once  NP-complete  Greedy approach: A,G,E,F,B,D,C,A = 251 A C F E D G B

CS/BIO Introduction to Bioinformatics16 Search all possible paths All paths A  G (20) A  G  F (88) AGFBAGFBAGFEAGFEAGFCAGFC A  G  E (55) A  B (46)A  C (93) A  C  B (175) A  C  B  E (257) ACDACDACFACF  Best estimate: 251

CS/BIO Introduction to Bioinformatics17 Parsimony – Branch and Bound  Use the UPGMA tree for an initial best estimate of the minimum cost (most parsimonious) tree  Use branch and bound to explore all feasible trees  Replace the best estimate as better trees are found  Choose the most parsimonious

CS/BIO Introduction to Bioinformatics18 Parsimony example Position Seq GGGGGG 2 GGGAGT 3 GGATAG 4 GATCAT All trees (1,2) [0] (1,3) [1] (1,4) [1] Position 5: Etc.