Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

. Class 9: Phylogenetic Trees. The Tree of Life Evolution u Many theories of evolution u Basic idea: l speciation events lead to creation of different.
Reading Phylogenetic Trees Gloria Rendon NCSA November, 2008.
Introduction to Phylogenies
Phylogenetic Analysis
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
BIO2093 – Phylogenetics Darren Soanes Phylogeny I.
Plant Molecular Systematics (Phylogenetics). Systematics classifies species based on similarity of traits and possible mechanisms of evolution, a change.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
Reading Phylogenetic Trees
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Classification systems have changed over time as information has increased. Section 2: Modern Classification K What I Know W What I Want to Find Out L.
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
. Comput. Genomics, Lecture 5b Character Based Methods for Reconstructing Phylogenetic Trees: Maximum Parsimony Based on presentations by Dan Geiger, Shlomo.
Chapter 2 Opener How do we classify organisms?. Figure 2.1 Tracing the path of evolution to Homo sapiens from the universal ancestor of all life.
Phylogenetic Analysis. 2 Phylogenetic Analysis Overview Insight into evolutionary relationships Inferring or estimating these evolutionary relationships.
Phylogenetic trees Sushmita Roy BMI/CS 576
TGCAAACTCAAACTCTTTTGTTGTTCTTACTGTATCATTGCCCAGAATAT TCTGCCTGTCTTTAGAGGCTAATACATTGATTAGTGAATTCCAATGGGCA GAATCGTGATGCATTAAAGAGATGCTAATATTTTCACTGCTCCTCAATTT.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Terminology of phylogenetic trees
Molecular phylogenetics
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Underlying Principles of Zoology Laws of physics and chemistry apply. Principles of genetics and evolution important. What is learned from one animal group.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Trees & Topologies Chapter 3, Part 1. Terminology Equivalence Classes – specific separation of a set of genes into disjoint sets covering the whole set.
Introduction to Phylogenetic Trees
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Introduction to Phylogenetics
Reading Phylogenetic Trees
CSCE555 Bioinformatics Lecture 12 Phylogenetics I Meeting: MW 4:00PM-5:15PM SWGN2A21 Instructor: Dr. Jianjun Hu Course page:
Calculating branch lengths from distances. ABC A B C----- a b c.
Phylogenetic Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics Figures from Higgs & Attwood.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Phylogeny Ch. 7 & 8.
Phylogeny & the Tree of Life
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
PHYLOGENY AND THE TREE OF LIFE CH 26. I. Phylogenies show evolutionary relationships A. Binomial nomenclature: – Genus + species name Homo sapiens.
Phylogenetic Trees - Parsimony Tutorial #13
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Probabilistic methods for phylogenetic tree reconstruction BMI/CS 576 Colin Dewey Fall 2015.
Probabilistic Approaches to Phylogenies BMI/CS 576 Sushmita Roy Oct 2 nd, 2014.
Phylogeny.
PHYOGENY & THE Tree of life Represent traits that are either derived or lost due to evolution.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Classification Biology I. Lesson Objectives Compare Aristotle’s and Linnaeus’s methods of classifying organisms. Explain how to write a scientific name.
Building Phylogenies. Phylogenetic (evolutionary) trees Human Gorilla Chimp Gibbon Orangutan Describe evolutionary relationships between species Cannot.
Chapter 26 Phylogeny and the Tree of Life
Phylogenetic trees. 2 Phylogeny is the inference of evolutionary relationships. Traditionally, phylogeny relied on the comparison of morphological features.
What is phylogenetic analysis and why should we perform it? Phylogenetic analysis has two major components: (1) Phylogeny inference or “tree building”
Bioinformatics Lecture 3 Molecular Phylogenetic By: Dr. Mehdi Mansouri Mehr 1395.
Molecular Evolution and Ebola
Molecular Evolution and Ebola
Evolutionary genomics can now be applied beyond ‘model’ organisms
Phylogenetic basis of systematics
Phylogeny & the Tree of Life
Multiple Alignment and Phylogenetic Trees
Cladogram notes.
Endeavour to reconstruct the characters of each hypothetical ancestor.
Molecular Evolution and Ebola
Reading Phylogenetic Trees
Unit Genomic sequencing
Phylogenetic Trees Jasmin sutkovic.
Presentation transcript:

Introduction to Phylogenetic trees Colin Dewey BMI/CS Fall 2015

Key concepts in this section What are phylogenies or phylogenetic trees? – Terminology such as extant, ancestral, branch point, branch length Why build phylogenetic trees? Algorithms to build phylogenetic trees – Distance-based methods – Parsimony methods Minimize the number of changes – Probabilistic methods Find the tree that best explains the data using probabilistic models

What is a tree? Graph theoretically: – Undirected case: graph without cycles – Directed case: underlying undirected graph is a tree Often it is required that indegree(v) ≤ 1 for all v

What are phylogenetic trees? A tree that describes evolutionary relationships among entities – Species, genes, strains This relationship is called “phylogeny” Leaves represent extant (current day) species Internal nodes represent ancestral species Phylogenetics: – The task for inferring the phylogenetic tree from observations in existing organisms

Why phylogenetic trees? Inform multiple sequence alignments Identify signatures of conservation of sequence Understand how organisms are related – Do humans and chimpanzees share a more recent common ancestor than do humans and gorillas? Ask how closely organisms are related – Humans and chimpanzees shard a common ancestor 5mya How specific functions/traits have evolved – What made us human?

From Tree of life aims to represents the phylogeny of all species on earth

Example Gene Tree: Globins

Tracing the evolution of the Ebola virus Ebola virus: a lethal human pathogen, fatality rate 78% 2014 Ebola epidemic in Africa – Until recently the largest known case happened in 1976 (318 cases) – Outbreak reported in Feb 2014 – As of last week, 11,312 deaths have been reported – Good news: no confirmed cases reported in week ending October 4th Key questions – Where did the pathogen come from? – How is it evolving? In a 2014 Science paper, researchers reported whole genome sequence alignment of 78 Ebola virus samples

Phylogenetic tree of the Ebola virus Gire et al, Science 2014 Three recent outbreaks from the same ancestor

Insights gained from sequence comparison “Genetic similarity across the sequenced 2014 samples suggests a single transmission from the natural reservoir, followed by human-to-human transmission during the outbreak” “..data suggest that the Sierra Leone outbreak stemmed from the introduction of two genetically distinct viruses from Guinea around the same time…” “..the catalog of 395 mutations, including 50 fixed nonsynonymous changes with 8 at positions with high levels of conservation across ebola viruses, provides a starting point for such studies” Gire et al., Science 2014

Phylogenetic tree basics Leaves represent entities (genes, species, individuals/strains) being compared – the term taxon (taxa plural) is used to refer to these when they represent species and broader classifications of organisms – For example if taxa are species, the tree is a species tree Internal nodes are ancestral units Phylogenetic trees can be rooted or unrooted – the root represents the common ancestor In a rooted tree, path from root to a node represents an evolutionary path – Gives directionality to evolutionary time An unrooted tree specifies relationships among taxa, but lacks directionality information

Tree basics Branch Leaf node: Extant Internal node: Ancestral For a species tree, internal nodes represent speciation events Unrooted treeRooted tree Each tree topology represents a different evolutionary history Branch length Branch length describes the evolutionary divergence between two nodes Root

Tree counting A rooted binary tree with n leaf nodes has – n-1 internal nodes – 2n-2 edges/branches An unrooted binary tree with n leaf nodes has – n-2 internal nodes – 2n-3 edges/branches – A root can be added to any of these branches to give 2n-3 rooted trees for any unrooted tree E.g. for n=3 there is one unrooted tree and three rooted trees

Tree counting An unrooted tree Possible positions for rootRooted trees

Tree counting Instead of adding a root we could add a branch for the n+1 th taxon

Tree counting A tree with 3 nodes can be grown in (2*3)-3=3 ways to make a tree of 4 nodes Each tree with 4 nodes can be grown in (2*4)-3=5 ways to make a tree of 5 nodes – So we have 3*5 trees Each tree of 5 nodes can be grown in (2*5)-3=7 – So we have 3*5*7 In general for n nodes we can have – (1)*(3)*(5)*...(2n-5) unrooted trees

Number of Possible Trees given n sequences, there are possible unrooted trees and possible rooted trees This grows very fast – For n=10, we have 2 million unrooted trees – For n=20, we have 2.2*10 20

Constructing phylogenetic trees Phylogenetic tree construction – Given observations of n taxonomical units infer the tree that best describes the evolutionary relationships among the units Three types of methods – Distance based methods – Parsimony methods – Probabilistic approaches