Suppr超能文献

模型误设与拓扑结构的概率检验:来自经验数据集的证据

Model misspecification and probabilistic tests of topology: evidence from empirical data sets.

作者信息

Buckley Thomas R

机构信息

Department of Biology, Duke University, Durham, North Carolina, USA, and Landcare Research, Mt. Albert, Auckland, New Zealand.

出版信息

Syst Biol. 2002 Jun;51(3):509-23. doi: 10.1080/10635150290069922.

Abstract

Probabilistic tests of topology offer a powerful means of evaluating competing phylogenetic hypotheses. The performance of the nonparametric Shimodaira-Hasegawa (SH) test, the parametric Swofford-Olsen-Waddell-Hillis (SOWH) test, and Bayesian posterior probabilities were explored for five data sets for which all the phylogenetic relationships are known with a very high degree of certainty. These results are consistent with previous simulation studies that have indicated a tendency for the SOWH test to be prone to generating Type 1 errors because of model misspecification coupled with branch length heterogeneity. These results also suggest that the SOWH test may accord overconfidence in the true topology when the null hypothesis is in fact correct. In contrast, the SH test was observed to be much more conservative, even under high substitution rates and branch length heterogeneity. For some of those data sets where the SOWH test proved misleading, the Bayesian posterior probabilities were also misleading. The results of all tests were strongly influenced by the exact substitution model assumptions. Simple models, especially those that assume rate homogeneity among sites, had a higher Type 1 error rate and were more likely to generate misleading posterior probabilities. For some of these data sets, the commonly used substitution models appear to be inadequate for estimating appropriate levels of uncertainty with the SOWH test and Bayesian methods. Reasons for the differences in statistical power between the two maximum likelihood tests are discussed and are contrasted with the Bayesian approach.

摘要

拓扑结构的概率检验为评估相互竞争的系统发育假说提供了一种强大的方法。针对五个数据集探究了非参数的 Shimodaira-Hasegawa(SH)检验、参数化的 Swofford-Olsen-Waddell-Hillis(SOWH)检验以及贝叶斯后验概率的性能,这五个数据集的所有系统发育关系都具有非常高的确定性。这些结果与之前的模拟研究一致,之前的研究表明,由于模型错误设定以及分支长度异质性,SOWH 检验倾向于产生 I 类错误。这些结果还表明,当原假设实际上正确时,SOWH 检验可能会对真实拓扑结构过度自信。相比之下,即使在高替换率和分支长度异质性的情况下,SH 检验也更为保守。对于一些 SOWH 检验被证明具有误导性的数据集,贝叶斯后验概率也具有误导性。所有检验的结果都受到确切替换模型假设的强烈影响。简单模型,尤其是那些假设位点间速率均匀的模型,具有更高的 I 类错误率,并且更有可能产生误导性的后验概率。对于其中一些数据集,常用的替换模型似乎不足以通过 SOWH 检验和贝叶斯方法估计适当的不确定性水平。讨论了两种最大似然检验在统计功效上存在差异的原因,并与贝叶斯方法进行了对比。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验