Sampling trees from evolutionary models

Hartmann, Klaas; Wong, D; Stadler, T

File(s) under permanent embargo

Sampling trees from evolutionary models

journal contribution

posted on 2023-05-17, 04:00 authored by Klaas HartmannKlaas Hartmann, Wong, D, Stadler, T

Abstract.—A wide range of evolutionary models for species-level (and higher) diversification have been developed. These models can be used to test evolutionary hypotheses and provide comparisons with phylogenetic trees constructed from real data. To carry out these tests and comparisons, it is often necessary to sample, or simulate, trees from the evolutionary models. Sampling trees from these models is more complicated than it may appear at first glance, necessitating careful consideration and mathematical rigor. Seemingly straightforward sampling methods may produce trees that have systematically biased shapes or branch lengths. This is particularly problematic as there is no simple method for determining whether the sampled trees are appropriate. In this paper, we show why a commonly used simple sampling approach (SSA)—simulating trees forward in time until n species are first reached—should only be applied to the simplest pure birth model, the Yule model. We provide an alternative general sampling approach (GSA) that can be applied to most other models. Furthermore, we introduce the constant-rate birth–death model sampling approach, which samples trees very efficiently from a widely used class of models.We explore the bias produced by SSA and identify situations in which this bias is particularly pronounced. We show that using SSA can lead to erroneous conclusions: When using the inappropriate SSA, the variance of a gradually evolving trait does not correlate with the age of the tree; when the correct GSA is used, the trait variance correlates with tree age. The algorithms presented here are available in the Perl Bio: Phylo package, as a stand-alone program TreeSample, and in the R TreeSim package. [Algorithms; distribution; evolutionary models; phylogenetic trees; sampling; simulating.]

History

Publication title

Systematic Biology

Volume

59

Issue

4

Pagination

465-476

ISSN

1063-5157

Department/School

Institute for Marine and Antarctic Studies

Publisher

Oxford University Press

Place of publication

Great Clarendon Street, Oxford OX2 6DP, England

Rights statement

The definitive publisher-authenticated version is available online at: www.oxfordjournals.org

Repository Status

Restricted

Socio-economic Objectives

Expanding knowledge in the biological sciences

Usage metrics

Keywords

Algorithms distribution evolutionary models phylogenetic trees sampling simulating

Licence

In Copyright

Exports

RefWorks

BibTeX

Ref. manager

Endnote

DataCite

NLM

DC

File(s) under permanent embargo

Sampling trees from evolutionary models

History

Publication title

Volume

Issue

Pagination

ISSN

Department/School

Publisher

Place of publication

Rights statement

Repository Status

Socio-economic Objectives

Usage metrics

Categories

Keywords

Licence

Exports