Evaluating the Fossil Record with Model Phylogenies Cladistic relationships can be determined without ideas about stratigraphic completeness; implied gaps.

Slides:



Advertisements
Similar presentations
Introduction to molecular dating methods. Principles Ultrametricity: All descendants of any node are equidistant from that node For extant species, branches,
Advertisements

LG 4 Outline Evolutionary Relationships and Classification
Tree Building What is a tree ? How to build a tree ? Cladograms Trees
Phylogenetic Tree A Phylogeny (Phylogenetic tree) or Evolutionary tree represents the evolutionary relationships among a set of organisms or groups of.
 Aim in building a phylogenetic tree is to use a knowledge of the characters of organisms to build a tree that reflects the relationships between them.
1 General Phylogenetics Points that will be covered in this presentation Tree TerminologyTree Terminology General Points About Phylogenetic TreesGeneral.
Fossils & Evolution Chapter 41 Ch. 4—Key concepts Systematics is the study of the kinds (diversity) of organisms and of the evolutionary relationships.
Phylogeny and Systematics
BIO2093 – Phylogenetics Darren Soanes Phylogeny I.
Warm-Up 3/24 What is a derived characteristic? What is a clade?
Phylogenetic reconstruction
Reconstructing and Using Phylogenies
PHYLOGENY AND SYSTEMATICS
Chapter 26 – Phylogeny & the Tree of Life
Phylogenetic trees Level 3 Molecular Evolution and Bioinformatics Jim Provan Page and Holmes: Chapter 2.
Chapter 20 Cladograms.
Maximum Likelihood. Likelihood The likelihood is the probability of the data given the model.
Molecular Evolution Revised 29/12/06
BIOE 109 Summer 2009 Lecture 4- Part II Phylogenetic Inference.
The Simple Regression Model
Tree Evaluation Tree Evaluation. Tree Evaluation A question often asked of a data set is whether it contains ‘significant cladistic structure’, that is.
Classification and phylogeny
What Is Phylogeny? The evolutionary history of a group.
Terminology of phylogenetic trees
Molecular phylogenetics
How classification works
Bioinformatics 2011 Molecular Evolution Revised 29/12/06.
Underlying Principles of Zoology Laws of physics and chemistry apply. Principles of genetics and evolution important. What is learned from one animal group.
 Read Chapter 4.  All living organisms are related to each other having descended from common ancestors.  Understanding the evolutionary relationships.
Jargon Brian O’Meara EEB464 Fall From BBC Life of Birds Channel.
Biostatistics Class 6 Hypothesis Testing: One-Sample Inference 2/29/2000.
Systematics and the Phylogenetic Revolution Chapter 23.
Lecture 2: Principles of Phylogenetics
Introduction to Phylogenetics
Tree Shape Different phylogenies can have very different shapes. Shape typically in terms of symmetry: sister taxa with comparable diversity stem from.
GENE 3000 Fall 2013 slides wiki. wiki. wiki.
What is a synapomorphy?. Terms systematics [taxonomy, phylogenetics] phylogeny/phylogenetic tree cladogram tips, branches, nodes homology apomorphy synapomorhy.
Phylogenies Reconstructing the Past. The field of systematics Studies –the mechanisms of evolution evolutionary agents –the process of evolution speciation.
Phylogeny & the Tree of Life
Phylogenetics: General Outline Basic methods: –Parsimony optimization –Maximum likelihood –Bayesian methods Matrix structure: –Parameters affecting character.
Selecting Genomes for Reconstruction of Ancestral Genomes Louxin Zhang Department of Mathematics National University of Singapore.
Ayesha M.Khan Spring Phylogenetic Basics 2 One central field in biology is to infer the relation between species. Do they possess a common ancestor?
Most Likely Rates given Phylogeny L[  |001] =  0 x (P[  A ] + P[  B ]) +  1 x (P[  C ] + P[  D ])
Chapter 13 Understanding research results: statistical inference.
PHYOGENY & THE Tree of life Represent traits that are either derived or lost due to evolution.
Chapter 9: Introduction to the t statistic. The t Statistic The t statistic allows researchers to use sample data to test hypotheses about an unknown.
Lesson Overview Lesson Overview Modern Evolutionary Classification 18.2.
Chapter 10: The t Test For Two Independent Samples.
Phylogeny & Systematics The study of the diversity and relationships among organisms.
Biology of Invertebrates The basics of Cladistics.
Section 2: Modern Systematics
Phylogeny & the Tree of Life
Phylogenetics Scientists who study systematics are interested in phylogeny, or the ancestral relationships between species. Grouping organisms by similarity.
Statistical inference: distribution, hypothesis testing
Section 2: Modern Systematics
26.3 Shared Characters Are Used To Construct Phylogenetic Trees
Cladistics (Ch. 22) Based on phylogenetics – an inferred reconstruction of evolutionary history.
Endeavour to reconstruct the characters of each hypothetical ancestor.
Cladistics.
Modern Evolutionary Classification 18-2
Systematics: Tree of Life
I. Statistical Tests: Why do we use them? What do they involve?
Systematics: Tree of Life
Phylogeny and the Tree of Life
18.2 Modern Systematics I. Traditional Systematics
Chapter 20 Phylogenetic Trees. Chapter 20 Phylogenetic Trees.
Phylogenetics Chapter 26.
Phylogeny & Systematics
1 2 Biology Warm Up Day 6 Turn phones in the baskets
Presentation transcript:

Evaluating the Fossil Record with Model Phylogenies Cladistic relationships can be determined without ideas about stratigraphic completeness; implied gaps might be useful for evaluating stratigraphy.

Evaluating the Fossil Record with Model Phylogenies Sum of range extensions / ghosts = stratigraphic debt sensu Fisher (1992).

Evaluating the Fossil Record with Model Phylogenies Many metrics attempting to quantify sampling make naïve assumptions about the minimum possible gaps!

Tree-based evaluations of the fossil record Phylogeny can be estimated independently of stratigraphic distributions –Necessarily implies gaps in the record Two basic types of metrics: –Consistency: measures general agreement between predicted and observed orders of appearance; –Gap: measure the sum of gaps implied by a phylogeny.

Tree-based Assessments of Sampling: Stratigraphic Consistency Index Consistent node: one in which the sister taxon appears prior to the node; SCI = Consistent nodes / All nodes

Tree-based Assessments of Sampling: Relative Completeness Index RCI = 1 - (∑ Gaps / ∑ Ranges)

Tree-based Assessments of Sampling: Gap Excess Ratio GER = (M-g)/(M-m) where: –M = maximum possible gaps (= ∑first appearances); –g = implied gaps; –m = minimum possible gaps.

Tree-based Assessments of Sampling: Manhattan Stratigraphic Metric MSM = m/g where: –g = implied gaps; –m = minimum possible gaps. Based on consistency index.

Relationships between Sampling & Tree-Based Sampling Metrics from Simulations 32 taxa with =0.50,  =0.45 & budding cladogenesis.

Relationships between Sampling & Tree-Based Sampling Metrics from Simulations RCI & SCI reflect sampling; GER & (especially) MSM do not.

Properties of the Components to Metrics: Gaps Sum of gaps increases exponentially as sampling gets worse.

Properties of the Components to Metrics: Minimum Gaps Sum of minimum gaps also increases exponentially as sampling gets worse.

Properties of the Components to Metrics: Maximum Gaps Sum of maximum gaps also increases exponentially as sampling gets worse.

Properties of the Components to Metrics: Sum of Ranges Sum of ranges decreases exponentially, but with minimum determined by the number of taxa.

Problem: People often forget that we do not always have gaps! If taxa have good fossil records, then many trees will have minimum possible gaps of 0.

Ignoring Ancestors greatly exaggerates implied Range Extensions Based on 1000 simulations of 32 sampled OTU’s at each R (sampling rate per time unit) with = 0.5 &  = 0.45 per unit

Ignoring Ancestors greatly exaggerates implied Range Extensions The expectations for wide range of preservation rates become indistinguishable.

Ignoring Ancestors greatly exaggerates implied Range Extensions Distortion is huge at sampling levels thought to be typical for marine invertebrates and even some land vertebrates.

Ignoring Ancestors greatly exaggerates implied Range Extensions This is not the case if one accommodates ancestors.

Relationships between Sampling & Tree-Based Sampling Metrics Failing to account for ancestors makes things worse…

Using stratigraphic data to assess phylogenies Stratocladistics: minimize stratigraphic gaps and homoplasies. Confidence Interval Sieving: rejects trees with gaps exceeding 95% confidence intervals (a la Strauss & Sadler 1989). Stratolikelihood: determines the probability of stratigraphic distributions given tree and sampling rates.

Stratocladistics First and last stratigraphic occurrences of each taxon noted. A gap through an interval treated as evidence against a phylogeny equal to that of an extra morphological change. “Stratigraphic debt” reduced by ancestor- descendant relationships as well as by altering cladistic topology. Generates phylogeny, not just a cladogram.

Stratocladistics Sampled ranges of 6 taxa.

Stratocladistics 6 taxa coded for 7 characters (each row a character).

Stratocladistics Parsimony tree for 6 taxa given matrix.

Stratocladistics Phylogeny matching parsimony tree; 8 steps, but gaps (= 3 units of strat. debt) or 11 “steps” overall.

Stratocladistics Phylogeny matching parsimony tree; B set as ancestor to C because it has no apomorphies.

Stratocladistics D not considered ancestral because it has an apomorphy; however, that causes 2 gaps.

Stratocladistics Making D ancestral increases steps to 9 but reduces strat. debt to 1, giving a total score of 10.

Stratocladistics Making E ancestral saves 1 step and induces 1 gap.

Stratocladistics No total savings, but making E ancestral reduces unsampled ancestors (another parsimony criterion).

Assumptions of Stratocladistics Probability of a character changing comparable to probability of a unit of stratigraphic debt. –(ln P [gap] + ln P[stasis]) ≤ ln P[change] Probability of all gaps has the same meaning throughout the tree.

Confidence Interval Sieving Probability of gaps assessed based on confidence intervals; –Number of sampling opportunities over gap considered. If there are no opportunities, then there really is no gap. –Probability of missing a taxon n times assessed given the number of finds and the number of possible finds within its range; –Separate “time scales” used for different geographic / environmental units.

Confidence Interval Sieving If significant gaps exists between a “younger” sister taxon and an “older” species, then apomorphies will be reversed; –This lengthens the tree and makes it possible for another tree to be shorter; –The most poorly sampled member of a clade used to formulate CI for that clade; If significant gaps exist between sister clades, then the tree is simply rejected. Shortest tree with no significant gaps is taken.

“Horizon Scales” for Different sampling realms “Height” measures number of sampling opportunities; the “duration” of a time interval can be very different in different sampling realms.

Confidence Interval Sieving Case simplest for bifurcations…

Confidence Interval Sieving … but not much different for polytomy.

Confidence Interval Sieving Example of how stratigraphy rejects one phylogeny in favor of another.

Confidence Interval Sieving Assumptions Strength of characters uniting a clade ignored; –Gap supported by slowly evolving characters treated no different than a gap supported by highly homoplastic ones; –Degree of significance no considered. Method simply rejects hypotheses; it does not show how well they predict data.

Stratolikelihood Exact probability of gaps calculated given sampling opportunities. Likelihoods of gaps based on sampling rates within lineages; –Because sampling rate is unknown, the rate and gap can be maximized; –Shifts in sampling rates within lineages or within clades taken into account. L[  | stratigraphy] x L[  | morphology] = L[  | data]

Sampling Rates (  ) of Stratolikelihood Given that a taxon is found n=7 times in R=11 horizons, the most likely sampling rate is not 7/11, but instead is 5/9…..

Sampling Rates (  ) of Stratolikelihood (assessment from simulations) … as n/R chronically overestimates R. This is because we do not know the true duration over which we made those n finds.

Use sampling rate (  ) maximizing the probability of a sampling gap AND of the observed finds i.e., use n / D (where D is the number of finds over the hypothesized duration).

Finding Variable  in Stratolikelihood Within lineages, one can test whether  differs significantly early or late in a stratigraphic range.

Stratolikelihood Like stratocladistics, tree evaluated “equally” by both morphologic and stratigraphic data. Like confidence interval sieving, importance of gap depends on the density of sampling and in which sampling realm the gaps should exist. Unlike either, it allows different characters to present different levels of evidence against phylogeny.

Using Inferred Ancestors to test Hypotheses about Speciation Patterns Hypotheses about different modes of speciation make different predictions about morphotypes distributions.

If Anagenesis and Bifurcation predominate, then we expect ancestral morphotypes to predate derived morphotypes Note: Phylogenetic & stratigraphic patterns can only be consistent with anagenesis - imperfect sampling means that we cannot rule out co-existence.

If Budding cladogenesis predominates, then we expect ancestral morphotypes to co- exist with descendant morphotypes. Note: Within the context of a given cladogram, stratigraphy can reject non-budding relationship between two species!