Suppr超能文献

基于一般马尔可夫模型的低参数系统发育推断。

Low-parameter phylogenetic inference under the general markov model.

机构信息

School of Mathematics and Physics, University of Tasmania, Hobart 7001, Australia.

出版信息

Syst Biol. 2013 Jan 1;62(1):78-92. doi: 10.1093/sysbio/sys072. Epub 2012 Aug 22.

Abstract

In their 2008 and 2009 articles, Sumner and colleagues introduced the "squangles"-a small set of Markov invariants for phylogenetic quartets. The squangles are consistent with the general Markov (GM) model and can be used to infer quartets without the need to explicitly estimate all parameters. As the GM model is inhomogeneous and hence nonstationary, the squangles are expected to perform well compared with standard approaches when there are changes in base composition among species. However, the GM model assumes constant rates across sites, so the squangles should be confounded by data generated with invariant sites or other forms of rate-variation across sites. Here we implement the squangles in a least-squares setting that returns quartets weighted by either confidence or internal edge lengths, and we show how these weighted quartets can be used as input into a variety of supertree and supernetwork methods. For the first time, we quantitatively investigate the robustness of the squangles to breaking of the constant rates-across-sites assumption on both simulated and real data sets; and we suggest a modification that improves the performance of the squangles in the presence of invariant sites. Our conclusion is that the squangles provide a novel tool for phylogenetic estimation that is complementary to methods that explicitly account for rate-variation across sites, but rely on homogeneous-and hence stationary-models.

摘要

在他们 2008 年和 2009 年的文章中,Sumner 和同事们引入了“squangles”-一组用于系统发育四分体的小马尔可夫不变量。squangles 与通用马尔可夫(GM)模型一致,可以用于推断四分体,而无需显式估计所有参数。由于 GM 模型是非均匀的,因此是非平稳的,因此与标准方法相比,当物种之间的碱基组成发生变化时,squangles 的表现应该更好。然而,GM 模型假设跨位点的速率恒定,因此 squangles 应该与具有不变位点或跨位点其他形式的速率变化的数据混淆。在这里,我们在最小二乘设置中实现了 squangles,该设置返回由置信度或内部边缘长度加权的四分体,我们展示了如何将这些加权四分体用作各种超树和超网络方法的输入。我们首次定量研究了 squangles 在模拟和真实数据集上打破跨位点恒定速率假设的稳健性;并提出了一种改进方法,在存在不变位点的情况下提高了 squangles 的性能。我们的结论是,squangles 为系统发育估计提供了一种新颖的工具,与明确考虑跨位点速率变化的方法互补,但依赖于均匀且因此稳定的模型。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验