Suppr超能文献

可扩展的带外部约束的种系发生树推断。

Scalable Species Tree Inference with External Constraints.

机构信息

Department of Computer Science, University of Illinois Urbana-Champaign, Urbana, Illinois, USA.

出版信息

J Comput Biol. 2022 Jul;29(7):664-678. doi: 10.1089/cmb.2021.0543. Epub 2022 Feb 21.

Abstract

Species tree inference is a basic step in biological discovery, but discordance between gene trees creates analytical challenges and large data sets create computational challenges. Although there is generally some information available about the species trees that could be used to speed up the estimation, only one species tree estimation method that addresses gene tree discordance-ASTRAL-J, a recent development in the ASTRAL family of methods-is able to use this information. Here we describe two new methods, NJst-J and FASTRAL-J, that can estimate the species tree, given a partial knowledge of the species tree in the form of a nonbinary unrooted constraint tree. We show that both NJst-J and FASTRAL-J are much faster than ASTRAL-J and we prove that all three methods are statistically consistent under the multispecies coalescent model subject to this constraint. Our extensive simulation study shows that both FASTRAL-J and NJst-J provide advantages over ASTRAL-J: both are faster (and NJst-J is particularly fast), and FASTRAL-J is generally at least as accurate as ASTRAL-J. An analysis of the Avian Phylogenomics Project data set with 48 species and 14,446 genes presents additional evidence of the value of FASTRAL-J over ASTRAL-J (and both over ASTRAL), with dramatic reductions in running time (20 hours for default ASTRAL, and minutes or seconds for ASTRAL-J and FASTRAL-J, respectively).

摘要

物种树推断是生物发现的基本步骤,但基因树之间的不和谐会带来分析上的挑战,而大型数据集则会带来计算上的挑战。虽然通常有一些关于物种树的信息可以用来加速估计,但只有一种方法——ASTRAL-J,这是 ASTRAL 方法家族的最新发展——能够利用这些信息。在这里,我们描述了两种新的方法,NJst-J 和 FASTRAL-J,它们可以在部分了解物种树的情况下(以非二进制无根约束树的形式)估计物种树。我们表明,NJst-J 和 FASTRAL-J 都比 ASTRAL-J 快得多,并且我们证明了在多物种合并模型下,这三种方法在受到这种约束的情况下都是统计一致的。我们广泛的模拟研究表明,FASTRAL-J 和 NJst-J 都比 ASTRAL-J 具有优势:它们都更快(NJst-J 尤其快),并且 FASTRAL-J 通常至少与 ASTRAL-J 一样准确。对包含 48 个物种和 14446 个基因的鸟类系统发育基因组学项目数据集的分析进一步证明了 FASTRAL-J 优于 ASTRAL-J(以及优于 ASTRAL)的价值,运行时间显著缩短(默认 ASTRAL 运行 20 小时,而 ASTRAL-J 和 FASTRAL-J 分别为几分钟或几秒钟)。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验