Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson and David Bryant Presented by Peggy Wang.

Slides:



Advertisements
Similar presentations
Tree Building What is a tree ? How to build a tree ? Cladograms Trees
Advertisements

A Separate Analysis Approach to the Reconstruction of Phylogenetic Networks Luay Nakhleh Department of Computer Sciences UT Austin.
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
Bioinformatics Phylogenetic analysis and sequence alignment The concept of evolutionary tree Types of phylogenetic trees Measurements of genetic distances.
Reading Phylogenetic Trees Gloria Rendon NCSA November, 2008.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
BIO2093 – Phylogenetics Darren Soanes Phylogeny I.
Summer Bioinformatics Workshop 2008 Comparative Genomics and Phylogenetics Chi-Cheng Lin, Ph.D., Professor Department of Computer Science Winona State.
Phylogenetic reconstruction
Reading Phylogenetic Trees
Molecular Evolution Revised 29/12/06
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
Distance Methods. Distance Estimates attempt to estimate the mean number of changes per site since 2 species (sequences) split from each other Simply.
The (Supertree) of Life: Procedures, Problems, and Prospects Presented by Usman Roshan.
Haplotyping via Perfect Phylogeny Conceptual Framework and Efficient (almost linear-time) Solutions Dan Gusfield U.C. Davis RECOMB 02, April 2002.
Branch lengths Branch lengths (3 characters): A C A A C C A A C A C C Sum of branch lengths = total number of changes.
Tree Evaluation Tree Evaluation. Tree Evaluation A question often asked of a data set is whether it contains ‘significant cladistic structure’, that is.
Fast Computation of the Exact Hybridization Number of Two Phylogenetic Trees Yufeng Wu and Jiayin Wang Department of Computer Science and Engineering University.
Chapter 11 Multiple Regression.
Ranking by Odds Ratio A Probability Model Approach let be a Boolean random variable: document d is relevant to query q otherwise Consider document d as.
. Comput. Genomics, Lecture 5b Character Based Methods for Reconstructing Phylogenetic Trees: Maximum Parsimony Based on presentations by Dan Geiger, Shlomo.
Probabilistic methods for phylogenetic trees (Part 2)
Building Phylogenies Parsimony 2.
Building Phylogenies Distance-Based Methods. Methods Distance-based Parsimony Maximum likelihood.
Phylogenetic trees Sushmita Roy BMI/CS 576
What Is Phylogeny? The evolutionary history of a group.
Multiple Sequence Alignments and Phylogeny.  Within a protein sequence, some regions will be more conserved than others. As more conserved,
Phylogenetic analyses Kirsi Kostamo. The aim: To construct a visual representation (a tree) to describe the assumed evolution occurring between and among.
Phylogenetic Analysis. 2 Introduction Intension –Using powerful algorithms to reconstruct the evolutionary history of all know organisms. Phylogenetic.
Terminology of phylogenetic trees
Molecular phylogenetics
P HYLOGENETIC T REE. OVERVIEW Phylogenetic Tree Phylogeny Applications Types of phylogenetic tree Terminology Data used to build a tree Building phylogenetic.
Molecular evidence for endosymbiosis Perform blastp to investigate sequence similarity among domains of life Found yeast nuclear genes exhibit more sequence.
Phylogenetic Analysis. General comments on phylogenetics Phylogenetics is the branch of biology that deals with evolutionary relatedness Uses some measure.
Computational Biology, Part D Phylogenetic Trees Ramamoorthi Ravi/Robert F. Murphy Copyright  2000, All rights reserved.
Phylogenetics and Coalescence Lab 9 October 24, 2012.
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Summarising Sets of Phylogenies Consensus Trees and Split/Consensus Networks Aidan Budd EMBL Heidelberg Friday July 2nd 2010 Basic Molecular Evolution.
Applied Bioinformatics Week 8 Jens Allmer. Practice I.
Molecular phylogenetics 4 Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Sections
Introduction to Phylogenetics
Reading Phylogenetic Trees
The bootstrap, consenus-trees, and super-trees Phylogenetics Workhop, August 2006 Barbara Holland.
Ch.6 Phylogenetic Trees 2 Contents Phylogenetic Trees Character State Matrix Perfect Phylogeny Binary Character States Two Characters Distance Matrix.
More statistical stuff CS 394C Feb 6, Today Review of material from Jan 31 Calculating pattern probabilities Why maximum parsimony and UPGMA are.
Gene tree discordance and multi-species coalescent models Noah Rosenberg December 21, 2007 James Degnan Randa Tao David Bryant Mike DeGiorgio.
1 Population Genetics Basics. 2 Terminology review Allele Locus Diploid SNP.
Chapter 10 Phylogenetic Basics. Similarities and divergence between biological sequences are often represented by phylogenetic trees Phylogenetics is.
Introduction to Phylogenetic trees Colin Dewey BMI/CS 576 Fall 2015.
Phylogeny Ch. 7 & 8.
Applied Bioinformatics Week 8 Jens Allmer. Theory I.
Phylogenetic Trees - Parsimony Tutorial #13
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
1 CAP5510 – Bioinformatics Phylogeny Tamer Kahveci CISE Department University of Florida.
Why use phylogenetic networks?
by d. gusfield v. bansal v. bafna y. song presented by vikas taliwal
Evolutionary genomics can now be applied beyond ‘model’ organisms
Phylogenetic basis of systematics
394C, Spring 2012 Jan 23, 2012 Tandy Warnow.
Multiple Alignment and Phylogenetic Trees
Methods of molecular phylogeny
Cladistics (Ch. 22) Based on phylogenetics – an inferred reconstruction of evolutionary history.
Molecular Clocks Rose Hoberman.
Summary and Recommendations
Reading Phylogenetic Trees
September 1, 2009 Tandy Warnow
Phylogenetic Trees Jasmin sutkovic.
Summary and Recommendations
1 2 Biology Warm Up Day 6 Turn phones in the baskets
Presentation transcript:

Application of Phylogenetic Networks in Evolutionary Studies Daniel H. Huson and David Bryant Presented by Peggy Wang

The plan  Terminology  Split networks: What are they? How can they be interpreted? Phylogenetic inference  SplitsTree4

A tree of terms..

Terminology  phylogenetic network any network in which taxa are represented by nodes and their evolutionary relationships are represented by edges

Types of networks  phylogenetic tree Leaf labeled tree that represents the evolutionary history of a set of taxa, possibly with branch(edge) lengths, either unrooted or rooted.  reticulate network Phylogenetic tree + additional edges. Nodes with more than two parents represent reticulate events such as hybridization, recombination and hgt  split network Represents incompatible and ambiguous signals in a data set. Parallel edges represents splits computed from data. Incompatible splits may result in nodes that do not represent ancestral species.

Wait… whats a split again?  split A partition of the taxa into two nonempty subsets, such as the partition obtained when we remove a branch from a phylogenetic tree.  split network (formally) For a given taxon set X and set of splits S, we define a split network N to be a connected graph in which some of the nodes are labeled by taxa and all edges are labeled by splits.

 Removing all edges associated with a given split s in S divides N into two connected components, one part containing all taxa on one side of S and the other part containing all taxa on the other side.  The edges along any shortest path in N are all associated with different splits.  A split network contains exactly the same information as a list of splits with a weight for each split.

 Every split network represents a unique collection of splits.  A given collection of splits can have many different split network representations.  The interpretation of the network depends on how the splits were constructed and assigned weights…

Interpreting Split Networks: Representing multiple trees  We can use split networks to summarize a large collection of trees. Code each individual tree as a collection of splits Define a summary set of splits Represent the set using a split network.  Consensus networks Constructed from all splits appearing in at least some fixed proportion of input trees

Interpreting Split Networks: Representing multiple trees  Confidence sets Assign an interval for the weights of each split A tree is contained within the split network N if (1)Every split in the tree is a split in the network (2)For every split in the tree, the corresponding branch length is contained within the corresponding interval (3)For every split in the network not in the tree, the assigned interval contains zero.. N

Interpreting Split Networks: Representing multiple trees Geometric interpretation: Index splits from 1 to m. Tree can be coded as a point in m-dimensional space: the ith coordinate is the length of the ith split, or 0 if that split is not present in the tree.

Interpreting Splits Networks: Networks and systematic error  Sampling error Random error resulting from a small sample size (number of sites). Deal with these errors using nonparametric bootstrap, multiple samples from posterior distribution  Systematic error Mistakes in the assumptions of a model or method which cause data to be misinterpreted. Likely to occur with large, multigene, heterogeneous data sets. How to deal with these errors?

Interpreting Splits Networks: Networks and systematic error  Phylogenetic inference (1)Construct a split network using the best available model and method. (2)Determine if the network is significantly different from a tree. (3)If the tree is significantly non-treelike, then there is probably an error in the model. If possible, improve the model and try again. (4)If the network is treelike, and there is no significant sampling error, the continue with a tree-based phylogenetic analysis.

Reticulate Networks 2 disagreeing trees Split network represents all splits present in either of the two trees Reticulate network Explains the differences in the two trees using 3 reticulation events

SplitsTree4  Integrates a wide range of phylogenetic network and phylogenetic tree methods, inference tools, data management utilities, and validation methods.  Included methods for inferring split networks: From character data. Median networks, parsimony splits, spectral analysis From distance matrices. Split decomposition and neighbor-net From sets of trees. Consensus networks and supernetworks.  Also constructs other types of phylogenetic networks, eg recombination and hybridization networks  User friendly?!

Example 1: Heterogeneous Evolution Jukes-Cantor p=0.75 q=0.05 0<r<0.4

Example 2: Animal Phylogeny Coelomate hypothesis Ecdysozoa hypothesis

More examples.. Dusky dolphins 60 variables (sites of DNA) 35 haplotypes Neighbor-joining tree with bootstrap values Consensus network of 3 MP trees Split decomposition network 95% confidence networkMedian networkNeighbor-net network

Conclusion!  Split networks are useful for visualization.  However they are not useful for making conclusive phylogenetic analysis.  SplitsTree4 encompasses many tools, but are they really that useful?