Lambert Amaury, Stadler Tanja
UPMC Univ Paris 06, Laboratoire de Probabilités et Modèles Aléatoires, CNRS UMR 7599, France; Collège de France, Center for Interdisciplinary Research in Biology, CNRS UMR 7241, Paris, France.
Theor Popul Biol. 2013 Dec;90:113-28. doi: 10.1016/j.tpb.2013.10.002. Epub 2013 Oct 21.
Forward-in-time models of diversification (i.e., speciation and extinction) produce phylogenetic trees that grow "vertically" as time goes by. Pruning the extinct lineages out of such trees leads to natural models for reconstructed trees (i.e., phylogenies of extant species). Alternatively, reconstructed trees can be modelled by coalescent point processes (CPPs), where trees grow "horizontally" by the sequential addition of vertical edges. Each new edge starts at some random speciation time and ends at the present time; speciation times are drawn from the same distribution independently. CPPs lead to extremely fast computation of tree likelihoods and simulation of reconstructed trees. Their topology always follows the uniform distribution on ranked tree shapes (URT). We characterize which forward-in-time models lead to URT reconstructed trees and among these, which lead to CPP reconstructed trees. We show that for any "asymmetric" diversification model in which speciation rates only depend on time and extinction rates only depend on time and on a non-heritable trait (e.g., age), the reconstructed tree is CPP, even if extant species are incompletely sampled. If rates additionally depend on the number of species, the reconstructed tree is (only) URT (but not CPP). We characterize the common distribution of speciation times in the CPP description, and discuss incomplete species sampling as well as three special model cases in detail: (1) the extinction rate does not depend on a trait; (2) rates do not depend on time; (3) mass extinctions may happen additionally at certain points in the past.
时间向前的多样化模型(即物种形成和灭绝)生成的系统发育树会随着时间推移“垂直”生长。从这些树中剪除灭绝的谱系会得到重建树(即现存物种的系统发育)的自然模型。或者,重建树可以通过合并点过程(CPPs)来建模,其中树通过依次添加垂直边“水平”生长。每条新边从某个随机的物种形成时间开始,到当前时间结束;物种形成时间独立地从相同分布中抽取。CPPs使得树似然的计算极其快速,并且能够模拟重建树。它们的拓扑结构总是遵循排名树形状上的均匀分布(URT)。我们刻画了哪些时间向前的模型会导致URT重建树,以及在这些模型中,哪些会导致CPP重建树。我们表明,对于任何“不对称”的多样化模型,其中物种形成率仅取决于时间,灭绝率仅取决于时间和一个非遗传性状(例如年龄),即使现存物种采样不完全,重建树也是CPP。如果速率还取决于物种数量,重建树是(仅)URT(但不是CPP)。我们刻画了CPP描述中物种形成时间的共同分布,并详细讨论了不完全物种采样以及三个特殊的模型情况:(1)灭绝率不取决于性状;(2)速率不取决于时间;(3)过去某些时候可能会额外发生大规模灭绝。