Suppr超能文献

StarBeast3:多物种合并下的自适应并行贝叶斯推断。

StarBeast3: Adaptive Parallelized Bayesian Inference under the Multispecies Coalescent.

机构信息

School of Computer Science, University of Auckland, 9 Symonds Street Level 1 Student Commons, Auckland 1010, New Zealand.

出版信息

Syst Biol. 2022 Jun 16;71(4):901-916. doi: 10.1093/sysbio/syac010.

Abstract

As genomic sequence data become increasingly available, inferring the phylogeny of the species as that of concatenated genomic data can be enticing. However, this approach makes for a biased estimator of branch lengths and substitution rates and an inconsistent estimator of tree topology. Bayesian multispecies coalescent (MSC) methods address these issues. This is achieved by constraining a set of gene trees within a species tree and jointly inferring both under a Bayesian framework. However, this approach comes at the cost of increased computational demand. Here, we introduce StarBeast3-a software package for efficient Bayesian inference under the MSC model via Markov chain Monte Carlo. We gain efficiency by introducing cutting-edge proposal kernels and adaptive operators, and StarBeast3 is particularly efficient when a relaxed clock model is applied. Furthermore, gene-tree inference is parallelized, allowing the software to scale with the size of the problem. We validated our software and benchmarked its performance using three real and two synthetic data sets. Our results indicate that StarBeast3 is up to one-and-a-half orders of magnitude faster than StarBeast2, and therefore more than two orders faster than *BEAST, depending on the data set and on the parameter, and can achieve convergence on large data sets with hundreds of genes. StarBeast3 is open-source and is easy to set up with a friendly graphical user interface. [Adaptive; Bayesian inference; BEAST 2; effective population sizes; high performance; multispecies coalescent; parallelization; phylogenetics.].

摘要

随着基因组序列数据的日益丰富,将物种的系统发育推断为串联基因组数据可能很有吸引力。然而,这种方法会导致分支长度和替代率的估计值存在偏差,并且树拓扑结构的估计值不一致。贝叶斯多物种合并(MSC)方法解决了这些问题。这是通过在物种树内约束一组基因树,并在贝叶斯框架下共同推断来实现的。然而,这种方法需要增加计算需求。在这里,我们介绍了 StarBeast3,这是一个用于通过马尔可夫链蒙特卡罗法在 MSC 模型下进行高效贝叶斯推断的软件包。我们通过引入最先进的提议核和自适应操作符来提高效率,并且当应用松弛时钟模型时,StarBeast3 的效率特别高。此外,基因树推断是并行化的,允许软件根据问题的大小进行扩展。我们验证了我们的软件并使用三个真实数据集和两个合成数据集对其性能进行了基准测试。我们的结果表明,StarBeast3 的速度比 StarBeast2 快一个半数量级,因此比*BEAST 快两个数量级以上,具体取决于数据集和参数,并且可以在具有数百个基因的大型数据集上实现收敛。StarBeast3 是开源的,并且具有友好的图形用户界面,易于设置。 [自适应;贝叶斯推断;BEAST 2;有效种群大小;高性能;多物种合并;并行化;系统发生学。]。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c19/9248896/c121a38e9166/syac010f1.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验