• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于简约引导的树提议加速贝叶斯系统发育推断的收敛。

Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.

机构信息

Key Laboratory of Vertebrate Evolution and Human Origins, Institute of Vertebrate Paleontology and Paleoanthropology, Chinese Academy of Sciences, 142 XizhimenWai Street, Beijing 100044, China.

Center for Excellence in Life and Paleoenvironment, Chinese Academy of Sciences, 142 XizhimenWai Street, Beijing 100044, China.

出版信息

Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.

DOI:10.1093/sysbio/syaa002
PMID:31985810
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7440752/
Abstract

Sampling across tree space is one of the major challenges in Bayesian phylogenetic inference using Markov chain Monte Carlo (MCMC) algorithms. Standard MCMC tree moves consider small random perturbations of the topology, and select from candidate trees at random or based on the distance between the old and new topologies. MCMC algorithms using such moves tend to get trapped in tree space, making them slow in finding the globally most probable trees (known as "convergence") and in estimating the correct proportions of the different types of them (known as "mixing"). Here, we introduce a new class of moves, which propose trees based on their parsimony scores. The proposal distribution derived from the parsimony scores is a quickly computable albeit rough approximation of the conditional posterior distribution over candidate trees. We demonstrate with simulations that parsimony-guided moves correctly sample the uniform distribution of topologies from the prior. We then evaluate their performance against standard moves using six challenging empirical data sets, for which we were able to obtain accurate reference estimates of the posterior using long MCMC runs, a mix of topology proposals, and Metropolis coupling. On these data sets, ranging in size from 357 to 934 taxa and from 1740 to 5681 sites, we find that single chains using parsimony-guided moves usually converge an order of magnitude faster than chains using standard moves. They also exhibit better mixing, that is, they cover the most probable trees more quickly. Our results show that tree moves based on quick and dirty estimates of the posterior probability can significantly outperform standard moves. Future research will have to show to what extent the performance of such moves can be improved further by finding better ways of approximating the posterior probability, taking the trade-off between accuracy and speed into account. [Bayesian phylogenetic inference; MCMC; parsimony; tree proposal.].

摘要

跨树空间采样是使用马尔可夫链蒙特卡罗 (MCMC) 算法进行贝叶斯系统发育推断的主要挑战之一。标准的 MCMC 树移动考虑拓扑的小随机扰动,并随机选择候选树或基于旧拓扑和新拓扑之间的距离进行选择。使用此类移动的 MCMC 算法往往会被困在树空间中,使得它们在找到全局最可能的树(称为“收敛”)和正确估计它们的不同类型的比例(称为“混合”)方面速度较慢。在这里,我们引入了一类新的移动,它们基于简约得分提出树。从简约得分导出的提议分布是候选树的条件后验分布的快速计算但粗糙的近似。我们通过模拟证明,简约引导的移动正确地从先验中对拓扑的均匀分布进行采样。然后,我们使用六个具有挑战性的经验数据集来评估它们与标准移动的性能,对于这些数据集,我们能够使用长 MCMC 运行、拓扑提案的混合和 Metropolis 耦合来获得后验的准确参考估计。在这些数据集上,大小从 357 到 934 个分类单元,从 1740 到 5681 个位点,我们发现使用简约引导移动的单个链通常比使用标准移动的链快一个数量级。它们还表现出更好的混合性,即它们更快地覆盖最可能的树。我们的结果表明,基于后验概率快速而粗略的估计的树移动可以显著优于标准移动。未来的研究将不得不展示通过找到更好的方法来近似后验概率,在准确性和速度之间进行权衡,这种移动的性能可以在多大程度上进一步提高。[贝叶斯系统发育推断;MCMC;简约;树提议。]。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/4689235ffead/syaa002f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/874146dab6b9/syaa002f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/9d1703711c9b/syaa002f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/bb8341382ea2/syaa002f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/f4f38841ce17/syaa002f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/a4e36edface1/syaa002f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/280841c807e5/syaa002f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/87cdbfc041e6/syaa002f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/ab9d471ca304/syaa002f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/4689235ffead/syaa002f9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/874146dab6b9/syaa002f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/9d1703711c9b/syaa002f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/bb8341382ea2/syaa002f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/f4f38841ce17/syaa002f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/a4e36edface1/syaa002f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/280841c807e5/syaa002f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/87cdbfc041e6/syaa002f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/ab9d471ca304/syaa002f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9d64/7440752/4689235ffead/syaa002f9.jpg

相似文献

1
Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.基于简约引导的树提议加速贝叶斯系统发育推断的收敛。
Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.
2
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.
3
Adaptive Tree Proposals for Bayesian Phylogenetic Inference.自适应树提议用于贝叶斯系统发育推断。
Syst Biol. 2021 Aug 11;70(5):1015-1032. doi: 10.1093/sysbio/syab004.
4
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。
Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.
5
Quantifying MCMC exploration of phylogenetic tree space.量化马尔可夫链蒙特卡罗方法对系统发育树空间的探索。
Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27.
6
Efficiency of Markov chain Monte Carlo tree proposals in Bayesian phylogenetics.贝叶斯系统发育学中马尔可夫链蒙特卡罗树提议的效率
Syst Biol. 2008 Feb;57(1):86-103. doi: 10.1080/10635150801886156.
7
Searching for convergence in phylogenetic Markov chain Monte Carlo.在系统发育马尔可夫链蒙特卡罗方法中寻找收敛性。
Syst Biol. 2006 Aug;55(4):553-65. doi: 10.1080/10635150600812544.
8
Efficient Bayesian Species Tree Inference under the Multispecies Coalescent.多物种溯祖模型下的高效贝叶斯物种树推断
Syst Biol. 2017 Sep 1;66(5):823-842. doi: 10.1093/sysbio/syw119.
9
Phylogenetic MCMC algorithms are misleading on mixtures of trees.系统发育马尔可夫链蒙特卡罗算法在树的混合模型上具有误导性。
Science. 2005 Sep 30;309(5744):2207-9. doi: 10.1126/science.1115493.
10
Geometric ergodicity of a hybrid sampler for Bayesian inference of phylogenetic branch lengths.用于系统发育分支长度贝叶斯推断的混合采样器的几何遍历性
Math Biosci. 2015 Oct;268:9-21. doi: 10.1016/j.mbs.2015.07.002. Epub 2015 Aug 7.

引用本文的文献

1
Algorithms to reconstruct past indels: The deletion-only parsimony problem.重建过去插入缺失的算法:仅删除的简约问题。
PLoS Comput Biol. 2025 Jul 28;21(7):e1012585. doi: 10.1371/journal.pcbi.1012585. eCollection 2025 Jul.
2
Bounding the Softwired Parsimony Score of a Phylogenetic Network.对系统发生网络的软布线简约得分进行限定。
Bull Math Biol. 2024 Aug 22;86(10):121. doi: 10.1007/s11538-024-01350-9.
3
Data integration in Bayesian phylogenetics.贝叶斯系统发育学中的数据整合。

本文引用的文献

1
Fast Fitch-Parsimony Algorithms for Large Data Sets.适用于大数据集的快速简约算法。
Cladistics. 1998 Dec;14(4):387-400. doi: 10.1111/j.1096-0031.1998.tb00346.x.
2
A biologist's guide to Bayesian phylogenetic analysis.生物学家贝叶斯系统发育分析指南。
Nat Ecol Evol. 2017 Oct;1(10):1446-1454. doi: 10.1038/s41559-017-0280-x. Epub 2017 Sep 21.
3
RevBayes: Bayesian Phylogenetic Inference Using Graphical Models and an Interactive Model-Specification Language.RevBayes:使用图形模型和交互式模型规范语言进行贝叶斯系统发育推断
Annu Rev Stat Appl. 2023;10:353-377. doi: 10.1146/annurev-statistics-033021-112532. Epub 2022 Sep 28.
4
How Trustworthy Is Your Tree? Bayesian Phylogenetic Effective Sample Size Through the Lens of Monte Carlo Error.你的树有多可靠?从蒙特卡罗误差角度看贝叶斯系统发育有效样本量。
Bayesian Anal. 2024 Jun;19(2):565-593. doi: 10.1214/22-ba1339. Epub 2024 Apr 9.
5
The Limits of the Constant-rate Birth-Death Prior for Phylogenetic Tree Topology Inference.《系统发育树拓扑推断中恒定速率 Birth-Death 先验的局限性》。
Syst Biol. 2024 May 27;73(1):235-246. doi: 10.1093/sysbio/syad075.
6
Representing and extending ensembles of parsimonious evolutionary histories with a directed acyclic graph.用有向无环图表示和扩展简约进化历史的集合。
J Math Biol. 2023 Oct 25;87(5):75. doi: 10.1007/s00285-023-02006-3.
7
Online tree expansion could help solve the problem of scalability in Bayesian phylogenetics.在线树扩展可以帮助解决贝叶斯系统发生学中的可扩展性问题。
Syst Biol. 2023 Nov 1;72(5):1199-1206. doi: 10.1093/sysbio/syad045.
8
Handling Logical Character Dependency in Phylogenetic Inference: Extensive Performance Testing of Assumptions and Solutions Using Simulated and Empirical Data.处理系统发育推断中的逻辑字符相关性:使用模拟和经验数据对假设和解决方案进行广泛的性能测试。
Syst Biol. 2023 Jun 17;72(3):662-680. doi: 10.1093/sysbio/syad006.
9
Recent progress on methods for estimating and updating large phylogenies.关于估计和更新大型系统发育树的方法的最新进展。
Philos Trans R Soc Lond B Biol Sci. 2022 Oct 10;377(1861):20210244. doi: 10.1098/rstb.2021.0244. Epub 2022 Aug 22.
10
StarBeast3: Adaptive Parallelized Bayesian Inference under the Multispecies Coalescent.StarBeast3:多物种合并下的自适应并行贝叶斯推断。
Syst Biol. 2022 Jun 16;71(4):901-916. doi: 10.1093/sysbio/syac010.
Syst Biol. 2016 Jul;65(4):726-36. doi: 10.1093/sysbio/syw021. Epub 2016 May 28.
4
Quantifying MCMC exploration of phylogenetic tree space.量化马尔可夫链蒙特卡罗方法对系统发育树空间的探索。
Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27.
5
ExaBayes: massively parallel bayesian tree inference for the whole-genome era.ExaBayes:全基因组时代的大规模并行贝叶斯树推断
Mol Biol Evol. 2014 Oct;31(10):2553-6. doi: 10.1093/molbev/msu236. Epub 2014 Aug 18.
6
BEAST 2: a software platform for Bayesian evolutionary analysis.BEAST 2:用于贝叶斯进化分析的软件平台。
PLoS Comput Biol. 2014 Apr 10;10(4):e1003537. doi: 10.1371/journal.pcbi.1003537. eCollection 2014 Apr.
7
The estimation of tree posterior probabilities using conditional clade probability distributions.使用条件分支概率分布估计树后验概率。
Syst Biol. 2013 Jul;62(4):501-11. doi: 10.1093/sysbio/syt014. Epub 2013 Mar 11.
8
Revisiting the phylogeny of papilionoid legumes: New insights from comprehensively sampled early-branching lineages.重新探讨豆目蝶形花科植物的系统发育:综合采样早期分支谱系的新见解。
Am J Bot. 2012 Dec;99(12):1991-2013. doi: 10.3732/ajb.1200380. Epub 2012 Dec 8.
9
A total-evidence approach to dating with fossils, applied to the early radiation of the hymenoptera.基于化石的全证据年代测定方法及其在膜翅目早期辐射中的应用
Syst Biol. 2012 Dec 1;61(6):973-99. doi: 10.1093/sysbio/sys058. Epub 2012 Jun 20.
10
Coalescence patterns of endemic Tibetan species of stream salamanders (Hynobiidae: Batrachuperus).特有西藏溪流蝾螈(小鲵科:Batrachuperus)的合并模式。
Mol Ecol. 2012 Jul;21(13):3308-24. doi: 10.1111/j.1365-294X.2012.05606.x. Epub 2012 May 9.