Introduction to Phylogenetics

Slides:



Advertisements
Similar presentations
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Advertisements

1 Dan Graur Methods of Tree Reconstruction. 2 3.
GENE TREES Abhita Chugh. Phylogenetic tree Evolutionary tree showing the relationship among various entities that are believed to have a common ancestor.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Phylogenetics - Distance-Based Methods CIS 667 March 11, 2204.
Phylogenetic reconstruction
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Molecular Evolution Revised 29/12/06
Tree Reconstruction.
Lecture 7 – Algorithmic Approaches Justification: Any estimate of a phylogenetic tree has a large variance. Therefore, any tree that we can demonstrate.
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Bioinformatics and Phylogenetic Analysis
. Class 9: Phylogenetic Trees. The Tree of Life D’après Ernst Haeckel, 1891.
Phylogenetic Analysis. 2 Phylogenetic Analysis Overview Insight into evolutionary relationships Inferring or estimating these evolutionary relationships.
Phylogenetic trees Sushmita Roy BMI/CS 576
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic Analysis. 2 Introduction Intension –Using powerful algorithms to reconstruct the evolutionary history of all know organisms. Phylogenetic.
Molecular Phylogeny and Evolution.
Terminology of phylogenetic trees
Molecular phylogenetics
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetics Alexei Drummond. CS Friday quiz: How many rooted binary trees having 20 labeled terminal nodes are there? (A) (B)
1 Dan Graur Molecular Phylogenetics Molecular phylogenetic approaches: 1. distance-matrix (based on distance measures) 2. character-state.
plants animals monera fungi protists protozoa invertebrates vertebrates mammals Five kingdom system (Haeckel, 1879)
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Phylogenetic trees School B&I TCD Bioinformatics May 2010.
BINF6201/8201 Molecular phylogenetic methods
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
OUTLINE Phylogeny UPGMA Neighbor Joining Method Phylogeny Understanding life through time, over long periods of past time, the connections between all.
Phylogenetic Trees  Importance of phylogenetic trees  What is the phylogenetic analysis  Example of cladistics  Assumptions in cladistics  Frequently.
Building phylogenetic trees. Contents Phylogeny Phylogenetic trees How to make a phylogenetic tree from pairwise distances  UPGMA method (+ an example)
Evolutionary Biology Concepts Molecular Evolution Phylogenetic Inference BIO520 BioinformaticsJim Lund Reading: Ch7.
Calculating branch lengths from distances. ABC A B C----- a b c.
Phylogenetic Analysis Gabor T. Marth Department of Biology, Boston College BI420 – Introduction to Bioinformatics Figures from Higgs & Attwood.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Why do trees?. Phylogeny 101 OTUsoperational taxonomic units: species, populations, individuals Nodes internal (often ancestors) Nodes external (terminal,
Phylogeny Ch. 7 & 8.
Phylogeny & the Tree of Life
Phylogenetic trees Sushmita Roy BMI/CS 576 Sep 23 rd, 2014.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Phylogenetics.
Phylogenetic Trees - Parsimony Tutorial #13
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
{ Early Earth and the Origin of Life Chapter 15.  The Earth formed 4.6 billion years ago  Earliest evidence for life on Earth  Comes from 3.5 billion-year-old.
Distance-Based Approaches to Inferring Phylogenetic Trees BMI/CS 576 Colin Dewey Fall 2010.
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny and the Tree of Life
Introduction to Bioinformatics Resources for DNA Barcoding
Evolutionary genomics can now be applied beyond ‘model’ organisms
Phylogenetic basis of systematics
Multiple Alignment and Phylogenetic Trees
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogenetic Trees.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogeny and the Tree of Life
Phylogeny and the Tree of Life
Phylogeny and the Tree of Life
Chapter 20 Phylogenetic Trees. Chapter 20 Phylogenetic Trees.
Reading Phylogenetic Trees
Chapter 19 Molecular Phylogenetics
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Molecular data assisted morphological analyses
Warm-Up Contrast adaptive radiation vs. convergent evolution? Give an example of each. What is the correct sequence from the most comprehensive to least.
Phylogenetic Trees Jasmin sutkovic.
Phylogeny and the Tree of Life
1 2 Biology Warm Up Day 6 Turn phones in the baskets
Presentation transcript:

Introduction to Phylogenetics

Phylogenetics Phylogenetics is the study of evolutionary relationships among a group of organisms. Phylogenetic analysis is the means of inferring or estimating these relationships. The result of phylogenetic studies is a hypothesis about the evolutionary history of taxonomic groups: their phylogeny

Phylogenetic tree The evolutionary history inferred from phylogenetic analysis is usually depicted as branching, treelike diagrams that represent an estimated pedigree of the inherited relationships among molecules (‘‘gene trees’’), organisms, or both. Human Mouse Rat Fugu Tetraodon Zebrafish Frog Species tree

Why phylogenetic trees? Phylogenetic trees are extremely handy tools used by biologists to understand: - the composition of genomes, - relationships among genes in humans and other species, - the functions of proteins that run our cells, - historical relationships among diverse species, - the processes that generate unique body shapes, - the origins of remarkable abilities in living organisms,

Types of trees Rooted trees reflect the most basal ancestor of the tree in question Unrooted trees do not imply a known ancestral root.

Terminologies Internal branch Node (common ancestors, hypothetical) Nucleotide substitution or amino acid substitution Internal branch Node (common ancestors, hypothetical) Leaf/operational taxonomic units (OTUs) Root External branch outgroup

Terminologies Nodes: represent taxonomic units. Internal nodes correspond to ancestral species that are not part of the data. Internal branch: between 2 nodes. Internal nodes are connected by internal branches. External branch: between a node and a leaf (OTUs). Leaves are connected to the rest of the tree by the external branches emanating from an internal node. Leaf/operational taxonomic units (OTUs): The observed species (corresponding to the data) appear at the tip of the branches. Species/genes/population. Horizantal branch length: The branch lenght usually represents the number of changes that have occured in that branch.

Outgroup: Is a taxon that is clearly more distantly related to the taxa of interest than any of them is to another of these taxa. Tree topology: The branching pattern of a tree is called the topology. Clade: A set of all taxa derived from particular common ancestor. Cladogenesis: The process of branching. Unscaled trees: Branch lengths are not proportional to the number of changes. Scaled trees: Branch lengths are proportional to the number of changes

Altering the position of root, changes the meaning of Phylogenetic tree D B D C A B B A C D A B C D C D A B A B C D C and D branch late C and D branch early

Changing the taxon order doesn't matter

Types of trees Simply shows relative recency of common ancestor A cladogram with branch lengths A dendogram having all tips equidistant from root

Types of Phylogenetic Data Biomolecular sequences: DNA, RNA, amino acid, in a multiple alignment Molecular markers (e.g., SNPs, RFLPs, etc.) Morphology Gene order and content

Molecular Data Taxa Characters Species A ATGGCTATTCTTATAGTACG There are two types of molecular data: characters and distances Characters: can be a nucleotide / amino acid at a site in DNA /protein sequence, or the presence or absence of deletion or insertion at a site. That is each nucleotide/amino acid site in a DNA/protein sequence can be consider a character site. Taxa Characters Species A ATGGCTATTCTTATAGTACG Species B ATCGCTAGTCTTATATTACA Species C TTCACTAGACCT--TGGTCCA Species D TTGACCAGACCT--TGGTCCG Species E TTGACCAGTTCT-- TAGTTCG

Making trees using character-based methods The main idea of character based methods is to search for a tree that requires the smallest number of evolutionary changes to explain the differences among the OTUs under study. Such a tree is called maximum parsimonious (“simple”) tree.

As an example of tree-building using maximum parsimony, consider these four taxa: AAG AAA GGA AGA How might they have evolved from a common ancestor such as AAA?

Tree-building methods: Maximum parsimony

Molecular Data Distance: the other type of data are distance data, which are computed from DNA or amino acid sequence data . These data are also called the distance matrix data, because the distance are usually presented in the matrix from.

Tree construction using distances

The simplest distance based method is UPGMA UPGMA employs a sequential clustring algorithm, in which local topological relationships are inferred in order of decreasing similarity and a phylogenetic tree is built in a stepwise manner. First we identify the two OTUs that are most similar to each other (having the shortest distance) and treat them as a new single OTU. Such an OTU is is referred to as a composite OTU. Subsequently from among the new group of OTUs, we identify the pair with the highest similarity and so on, until only two OTUs are left.

UPGMA

UPGMA Consider a case of four OTUs. The pairwise evolutionary distances are given by the following matrix: OTU 1 2 3 4 d12 d13 d14 d23 d24 d34

UPGMA: Construct a phylogenetic tree following UPGMA algorithm.

Distance Matrix A B C D E F 2 4 6 8

TRY THIS!!!

Neighbor joining Method Neighbor-joining (Saitou and Nei, 1987) is a method that is related to the cluster method but does not require the data to be ultrametric. In other words it does not require that all lineages have diverged by equal amounts. The method is especially suited for datasets comprising lineages with largely varying rates of evolution.

Neighbor joining Method The neighbor-joining method Is especially useful for making a tree having a large number of taxa. Begin by placing all the taxa in a star-like structure.

Compute pairwise distances among all OTUs. Retain the pair with smallest distance (neighbors). Group i and j in the tree. Connect these neighbors to other OTUs via an internal branch, XY. When two nodes are linked, their common ancestral node is added to the tree and the terminal nodes with their respective branches are removed from the tree.

Neighbor joining Method

Applications of phylogeny "Species" trees recover the genealogy of taxa, individuals of a population, etc. Internal nodes represent speciation or other taxonomic events. Species trees should contain sequences from only orthologous genes.

"Gene" trees represent the evolutionary history of the genes included in the study. Gene trees can provide evidence for gene duplication events, as well as speciation events. Sequences from different homologs can be included in a gene tree; the subsequent analyses should cluster orthologs, thus demonstrating the evolutionary history of the orthologs.

Tools/Softwares CLUSTALW PHYLIP PAUP Mega5 http://www.ebi.ac.uk/Tools/services/web/toolform.ebi?tool=clustalw2 PHYLIP http://evolution.genetics.washington.edu/phylip.html PAUP http://paup.csit.fsu.edu/ Mega5 http://www.ncbi.nlm.nih.gov/pubmed/21546353