The Evolution Trees From: Computational Biology by R. C. T. Lee S. J. Shyu Department of Computer Science Ming Chuan University.

Slides:



Advertisements
Similar presentations
Great Theoretical Ideas in Computer Science
Advertisements

Introduction to Graph Theory Instructor: Dr. Chaudhary Department of Computer Science Millersville University Reading Assignment Chapter 1.
Great Theoretical Ideas in Computer Science for Some.
1 Discrete Structures & Algorithms Graphs and Trees: III EECE 320.
Lecture 24 Coping with NPC and Unsolvable problems. When a problem is unsolvable, that's generally very bad news: it means there is no general algorithm.
Approximation Algorithms for TSP
NP-Complete Problems Polynomial time vs exponential time
Combinatorial Algorithms
Minimum Spanning Trees Kun-Mao Chao ( 趙坤茂 ) Department of Computer Science and Information Engineering National Taiwan University, Taiwan
Great Theoretical Ideas in Computer Science.
1 Discrete Structures & Algorithms Graphs and Trees: II EECE 320.
Approximation Algorithms
. Phylogeny II : Parsimony, ML, SEMPHY. Phylogenetic Tree u Topology: bifurcating Leaves - 1…N Internal nodes N+1…2N-2 leaf branch internal node.
Approximation Algorithms: Combinatorial Approaches Lecture 13: March 2.
Approximation Algorithms: Concepts Approximation algorithm: An algorithm that returns near-optimal solutions (i.e. is "provably good“) is called an approximation.
Approximation Algorithms Lecture for CS 302. What is a NP problem? Given an instance of the problem, V, and a ‘certificate’, C, we can verify V is in.
Great Theoretical Ideas in Computer Science.
5 - 1 Chap 5 The Evolution Trees Evolutionary Tree.
CSE 326: Data Structures NP Completeness Ben Lerner Summer 2007.
Approximation Algorithms for the Traveling Salesperson Problem.
1 NP-Complete Problems Polynomial time vs exponential time –Polynomial O(n k ), where n is the input size (e.g., number of nodes in a graph, the length.
NP-Complete Problems (Fun part)
Chapter 5 The Evolution Trees.
9-1 Chapter 9 Approximation Algorithms. 9-2 Approximation algorithm Up to now, the best algorithm for solving an NP-complete problem requires exponential.
9-1 Chapter 9 Approximation Algorithms. 9-2 Approximation algorithm Up to now, the best algorithm for solving an NP-complete problem requires exponential.
Approximation Algorithms Motivation and Definitions TSP Vertex Cover Scheduling.
Approximation Algorithms Ola Svensson. Course Information Goal: – Learn the techniques used by studying famous applications Graduate Course FDD
1 NP-Complete Problems (Fun part) Polynomial time vs exponential time –Polynomial O(n k ), where n is the input size (e.g., number of nodes in a graph,
Approximation Algorithms
Phylogenetic trees Sushmita Roy BMI/CS 576
Programming & Data Structures
Theory of Computing Lecture 10 MAS 714 Hartmut Klauck.
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
The Traveling Salesperson Problem Algorithms and Networks.
The Traveling Salesman Problem Approximation
1 Introduction to Approximation Algorithms. 2 NP-completeness Do your best then.
Advanced Algorithm Design and Analysis (Lecture 13) SW5 fall 2004 Simonas Šaltenis E1-215b
CSE 326: Data Structures NP Completeness Ben Lerner Summer 2007.
Princeton University COS 423 Theory of Algorithms Spring 2001 Kevin Wayne Approximation Algorithms These lecture slides are adapted from CLRS.
Using traveling salesman problem algorithms for evolutionary tree construction Chantal Korostensky and Gaston H. Gonnet Presentation by: Ben Snider.
CSE 589 Part VI. Reading Skiena, Sections 5.5 and 6.8 CLR, chapter 37.
Projects Network Theory VLSI PSM 1. Network 1. Steiner trees
1 Approximation Algorithm Updated on 2012/12/25. 2 Approximation Algorithm Up to now, the best algorithm for solving an NP-complete problem requires exponential.
Approximation Algorithms for TSP Tsvi Kopelowitz 1.
Approximation Algorithms Guo QI, Chen Zhenghai, Wang Guanhua, Shen Shiqi, Himeshi De Silva.
Approximation Algorithms
Distance-based methods for phylogenetic tree reconstruction Colin Dewey BMI/CS 576 Fall 2015.
Approximation Algorithms by bounding the OPT Instructor Neelima Gupta
Instructor Neelima Gupta Table of Contents Introduction to Approximation Algorithms Factor 2 approximation algorithm for TSP Factor.
CSE 589 Applied Algorithms Spring 1999 Prim’s Algorithm for MST Load Balance Spanning Tree Hamiltonian Path.
Traveling Salesman Problem DongChul Kim HwangRyol Ryu.
TU/e Algorithms (2IL15) – Lecture 11 1 Approximation Algorithms.
Optimization problems such as
Hamiltonian Cycle and TSP
Hamiltonian Cycle and TSP
The Evolution Trees (Part I)
Great Theoretical Ideas in Computer Science
Greedy Algorithms / Minimum Spanning Tree Yin Tat Lee
Approximation Algorithms
Graph Algorithm.
Approximation Algorithms for TSP
Discrete Mathematics for Computer Science
The Tree of Life From Ernst Haeckel, 1891.
Minimum Spanning Trees
Minimum Spanning Trees
Minimum Spanning Trees
Approximation Algorithms
Minimum Spanning Trees
Presentation transcript:

The Evolution Trees From: Computational Biology by R. C. T. Lee S. J. Shyu Department of Computer Science Ming Chuan University

S. J. Shyu Biological Assumption All species evolve from a common ancestor. Root: the suspected common ancestor Leaves: species (alive) Internal nodes: unknown species Length on edge ( a, b ): the time needed to evolve from a to b

S. J. Shyu Assumptions on Evolution Trees for Computer Scientists (I) Rooted evolution tree Degree of each internal node is 3, except the root. Unrooted evolution tree Degree of each internal node is 3.

S. J. Shyu Assumptions on Evolution Trees for Computer Scientists (II) The input is a distance matrix among all of the species. The distances satisfy the triangular inequality relationship. Depending upon different conditions, different ETs will be constructed to reflect the distances among species. Let d ( s i, s j ) ( dt ( s i, s j )) be the distance between s i and s i in the distance matrix (some evolution tree). Then dt ( i, j )  d ( i, j ). If the ET is rooted, then the distance from the root too all leaves are the same.

S. J. Shyu How many unrooted evolution trees are there? (I) NE ( n )=2 n -3 (by induction) Whenever a new species is added, #edgs+=2.

S. J. Shyu How many unrooted evolution trees are there? (II) Every edge is possible to be split to add a new species. Number of unrooted ETs for n species: TU ( n +1)=(2 n -3) TU ( n )or TU ( n )=(2 n -5) TU ( n -1)

S. J. Shyu How many rooted evolution trees are there? (I) Every edge in an unrooted ET is possible to be split to add a root to turn the ET into a rooted one.

S. J. Shyu How many rooted evolution trees are there? (II) Number of rooted ETs for n species: TR ( n )=(2 n -3) TU ( n ) TU ( n )= TU ( n +1) # of rooted ETs is much larger than that of unrooted ETs. It is desirable to consider unrooted Ets. Still, we cannot explain evolution by an unrooted tree.

S. J. Shyu Transforming unrooted ETs into rooted ones Add a species which is exceeding different from the species analyzed. This outlier species causes a long link that can be used to identify a root.

S. J. Shyu Distance matrix vs. evoluiton trees The input of an evolution tree problem is a distance matrix. We are asked to construct an evolution tree to perfectly reflect these distances. The goodness of an evolution tree is evaluated under some criterion.

S. J. Shyu Criteria of evoluiton trees Let d ( s i, s j ) ( dt ( s i, s j )) be the distance between s i and s j in the distance matrix (some evolution tree). 1. Minimax ETs max. of ( dt ( s i, s j )- d ( s i, s j )) is minimized 2. Minisum ETs total sum of all pairs of distances (  dt ( s i, s j )) is minimized 3. Minisize ETs total length of the tree is minimized

S. J. Shyu The complexity of ET problems MinimaxMinisumMinisize UnrootedNP-complete Unknown Rooted O(n2)O(n2) NP-complete

S. J. Shyu A Minimax rooted ET Algorithm Idea: Preserve the longest distance Suppose d ( s i, s j ) is the longest distance in the input matrix. dt ( s i, s j ) = d ( s i, s j ) Recursively apply for T i and T j Which species are in T i ( T i )?

S. J. Shyu Minimax rooted ET Algorithm

S. J. ShyuExample

5.4 The determination of weights when the ET structure is given What’s the minisize unrooted ET if the ET structure is given? As above for minisize unrooted ET As above for minimax unrooted ET How to determine the structure of ET is a problem. (# of possible ET is exponential to n.) By linear programming Open

S. J. ShyuExamples

5.5 UPGMA for rooted ETs A heuristic to determine a reasonably good structure of rooted ETs.

S. J. Shyu Example (UPGMA)

S. J. Shyu 5.6 The neighbor joining method for unrooted ETs A heuristic to determine a reasonably good structure of unrooted ETs.

S. J. Shyu Example (NJ) (I) w ( x, s 1 )=1/3( d ( s 1, s 2 )+ d ( s 1, s 3 )+ d ( s 1, s 4 )) =1/3(4+4+3)=3.67 (the mean of the distances from this species to all other species) w ( x, s i ) = average( s i ) = (  d ( s i, s j ))/( n -1) ijij 1 center

S. J. Shyu Example (NJ) (II) Geometrical center OC - NC = =2.34

S. J. Shyu Example (NJ) (III) OC - NC = (average( s 1 )+average( s 2 )) - (average( s 1 )+average( s 2 )+ d ( s 1,s 2 ))/2 =(average( s 1 )+average( s 2 )- d ( s 1,s 2 ))/2 d ( s 1,s 2 ) is preserved

S. J. Shyu 5.6 An Approximation Algorithm for an unrooted minisize ET unrooted minisize ET: no polynomial algorithm; never been proved to be NP-complete An approximation algorithm with size smaller than twice of the size of an optimal solution:

S. J. Shyu Example (I) Minimal spanning tree an unrooted minisize ET With error ratio  1

S. J. Shyu Proof of correctness (I) Evolution tree? Degree of each internal node=3 dt ( s i, s j )  d ( s i, s j ) dt ( s i, s j ) = d MST ( s i, s j ) d MST ( s i, s j )  dt ( s i, s j ) (triangular inequality)  1 Error ratio  1

S. J. Shyu Hamiltonian cycle and Euler cycle Hamiltonian cycle: a cycle visiting all of the nodes in G =( V, E ) exactly once Euler cycle: a cycle traversing each edge in G =( V, E ) exactly once

S. J. Shyu Proof of correctness (II)  1 Error ratio  1  1. |MST|  |TSP| P  Delete any edge in TSP  |P|<|TSP|  |MST|  |P|  2. |A|=|MST|  |TSP|  2 3. |TSP|  2|OPT| 4. Duplicate every edge of a tree  there is an Euler tour (Et)  2 5. T: optimal ET  |Et|  2|OPT|  6. CEt: the cycle of species of Et  |CEt|  |Et| ( dt ( s i, s j )  d ( s i, s j ))  2 7. |A|=|MST|  |TSP|  |CEt|  |Et|  2|OPT|

S. J. Shyu 5.6 The minimal spanning tree preservation approach for ET construction D : input distance matrix D t : distance matrix of the ET MST preservation: MST( D ) is an MST( D t )

S. J. Shyu Example (I)

S. J. Shyu Example (II)