Rabosky Daniel L, Mitchell Jonathan S, Chang Jonathan
Department of Ecology and Evolutionary Biology and Museum of Zoology, University of Michigan, Ann Arbor, MI 48109, USA.
Department of Ecology and Evolutionary Biology, University of California, Los Angeles, CA 90095, USA.
Syst Biol. 2017 Jul 1;66(4):477-498. doi: 10.1093/sysbio/syx037.
Bayesian analysis of macroevolutionary mixtures (BAMM) is a statistical framework that uses reversible jump Markov chain Monte Carlo to infer complex macroevolutionary dynamics of diversification and phenotypic evolution on phylogenetic trees. A recent article by Moore et al. (MEA) reported a number of theoretical and practical concerns with BAMM. Major claims from MEA are that (i) BAMM's likelihood function is incorrect, because it does not account for unobserved rate shifts; (ii) the posterior distribution on the number of rate shifts is overly sensitive to the prior; and (iii) diversification rate estimates from BAMM are unreliable. Here, we show that these and other conclusions from MEA are generally incorrect or unjustified. We first demonstrate that MEA's numerical assessment of the BAMM likelihood is compromised by their use of an invalid likelihood function. We then show that "unobserved rate shifts" appear to be irrelevant for biologically plausible parameterizations of the diversification process. We find that the purportedly extreme prior sensitivity reported by MEA cannot be replicated with standard usage of BAMM v2.5, or with any other version when conventional Bayesian model selection is performed. Finally, we demonstrate that BAMM performs very well at estimating diversification rate variation across the ${\sim}$20% of simulated trees in MEA's data set for which it is theoretically possible to infer rate shifts with confidence. Due to ascertainment bias, the remaining 80% of their purportedly variable-rate phylogenies are statistically indistinguishable from those produced by a constant-rate birth-death process and were thus poorly suited for the summary statistics used in their performance assessment. We demonstrate that inferences about diversification rates have been accurate and consistent across all major previous releases of the BAMM software. We recognize an acute need to address the theoretical foundations of rate-shift models for phylogenetic trees, and we expect BAMM and other modeling frameworks to improve in response to mathematical and computational innovations. However, we remain optimistic that that the imperfect tools currently available to comparative biologists have provided and will continue to provide important insights into the diversification of life on Earth.
宏观进化混合的贝叶斯分析(BAMM)是一种统计框架,它使用可逆跳跃马尔可夫链蒙特卡罗方法来推断系统发育树上多样化和表型进化的复杂宏观进化动态。摩尔等人最近发表的一篇文章(MEA)报告了对BAMM的一些理论和实际问题。MEA的主要观点是:(i)BAMM的似然函数不正确,因为它没有考虑未观察到的速率变化;(ii)速率变化数量的后验分布对先验过于敏感;(iii)BAMM估计的多样化速率不可靠。在这里,我们表明MEA的这些以及其他结论通常是不正确或不合理的。我们首先证明,MEA对BAMM似然性的数值评估因使用无效的似然函数而受到影响。然后我们表明,“未观察到的速率变化”对于多样化过程的生物学合理参数化似乎无关紧要。我们发现,MEA报告的所谓极端先验敏感性在BAMM v2.5的标准使用中无法重现,在进行传统贝叶斯模型选择时,使用任何其他版本也无法重现。最后,我们证明,在MEA数据集中约20%的模拟树上,BAMM在估计多样化速率变化方面表现非常出色,从理论上讲,在这些树上可以自信地推断速率变化。由于确定偏差,他们其余80%所谓的可变速率系统发育在统计上与恒定速率生灭过程产生的系统发育无法区分,因此不太适合用于他们性能评估中使用的汇总统计。我们证明,在BAMM软件的所有主要先前版本中,关于多样化速率的推断都是准确和一致的。我们认识到迫切需要解决系统发育树速率变化模型的理论基础问题,并且我们预计BAMM和其他建模框架将随着数学和计算创新而得到改进。然而,我们仍然乐观地认为,目前比较生物学家可用的不完美工具已经并将继续为地球上生命的多样化提供重要见解。