McConway Kevin J, Sims Hallie J
Department of Statistics, The Open University, Milton Keynes MK7 6AA, United Kingdom.
Evolution. 2004 Jan;58(1):12-23. doi: 10.1111/j.0014-3820.2004.tb01569.x.
Observed variations in rates of taxonomic diversification have been attributed to a range of factors including biological innovations, ecosystem restructuring, and environmental changes. Before inferring causality of any particular factor, however, it is critical to demonstrate that the observed variation in diversity is significantly greater than that expected from natural stochastic processes. Relative tests that assess whether observed asymmetry in species richness between sister taxa in monophyletic pairs is greater than would be expected under a symmetric model have been used widely in studies of rate heterogeneity and are particularly useful for groups in which paleontological data are problematic. Although one such test introduced by Slowinski and Guyer a decade ago has been applied to a wide range of clades and evolutionary questions, the statistical behavior of the test has not been examined extensively, particularly when used with Fisher's procedure for combining probabilities to analyze data from multiple independent taxon pairs. Here, certain pragmatic difficulties with the Slowinski-Guyer test are described, further details of the development of a recently introduced likelihood-based relative rates test are presented, and standard simulation procedures are used to assess the behavior of the two tests in a range of situations to determine: (1) the accuracy of the tests' nominal Type I error rate; (2) the statistical power of the tests; (3) the sensitivity of the tests to inclusion of taxon pairs with few species; (4) the behavior of the tests with datasets comprised of few taxon pairs; and (5) the sensitivity of the tests to certain violations of the null model assumptions. Our results indicate that in most biologically plausible scenarios, the likelihood-based test has superior statistical properties in terms of both Type I error rate and power, and we found no scenario in which the Slowinski-Guyer test was distinctly superior, although the degree of the discrepancy varies among the different scenarios. The Slowinski-Guyer test tends to be much more conservative (i.e., very disinclined to reject the null hypothesis) in datasets with many small pairs. In most situations, the performance of both the likelihood-based test and particularly the Slowinski-Guyer test improve when pairs with few species are excluded from the computation, although this is balanced against a decline in the tests' power and accuracy as fewer pairs are included in the dataset. The performance of both tests is quite poor when they are applied to datasets in which the taxon sizes do not conform to the distribution implied by the usual null model. Thus, results of analyses of taxonomic rate heterogeneity using the Slowinski-Guyer test can be misleading because the test's ability to reject the null hypothesis (equal rates) when true is often inaccurate and its ability to reject the null hypothesis when the alternative (unequal rates) is true is poor, particularly when small taxon pairs are included. Although not always perfect, the likelihood-based test provides a more accurate and powerful alternative as a relative rates test.
分类学多样化速率中观察到的变化归因于一系列因素,包括生物创新、生态系统重组和环境变化。然而,在推断任何特定因素的因果关系之前,至关重要的是要证明观察到的多样性变化显著大于自然随机过程所预期的变化。评估单系类群中姐妹分类单元之间观察到的物种丰富度不对称是否大于对称模型下预期值的相对检验,已在速率异质性研究中广泛使用,并且对于古生物学数据存在问题的类群特别有用。尽管十年前Slowinski和Guyer引入的一种此类检验已应用于广泛的进化枝和进化问题,但该检验的统计行为尚未得到广泛研究,特别是当与Fisher概率合并程序一起用于分析来自多个独立分类单元对的数据时。本文描述了Slowinski - Guyer检验存在的某些实际困难,介绍了最近引入的基于似然性的相对速率检验的进一步发展细节,并使用标准模拟程序评估这两种检验在一系列情况下的行为,以确定:(1)检验的名义I型错误率的准确性;(2)检验的统计功效;(3)检验对包含少量物种的分类单元对的敏感性;(4)检验对由少量分类单元对组成的数据集的行为;(5)检验对某些违反零模型假设的敏感性。我们的结果表明,在大多数生物学上合理的情况下,基于似然性的检验在I型错误率和功效方面具有更好的统计特性,并且我们没有发现Slowinski - Guyer检验明显更优的情况,尽管不同情况下差异程度有所不同。在包含许多小分类单元对的数据集中,Slowinski - Guyer检验往往更为保守(即非常不愿意拒绝零假设)。在大多数情况下,当计算中排除少量物种的分类单元对时,基于似然性的检验以及特别是Slowinski - Guyer检验的性能都会提高,尽管随着数据集中包含的分类单元对数量减少,检验的功效和准确性会下降。当将这两种检验应用于分类单元大小不符合通常零模型所隐含分布的数据集时,它们的性能都相当差。因此,使用Slowinski - Guyer检验分析分类学速率异质性的结果可能会产生误导,因为该检验在实际为真时拒绝零假设(等速率)的能力往往不准确,而在备择假设(不等速率)为真时拒绝零假设的能力很差,特别是当包含小分类单元对时。尽管并非总是完美,但基于似然性的检验作为相对速率检验提供了一种更准确、更强大的替代方法。