• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种通过广义系统发育剪枝算法的拓扑边缘复合似然法。

A topology-marginal composite likelihood via a generalized phylogenetic pruning algorithm.

作者信息

Jun Seong-Hwan, Nasif Hassan, Jennings-Shaffer Chris, Rich David H, Kooperberg Anna, Fourment Mathieu, Zhang Cheng, Suchard Marc A, Matsen Frederick A

机构信息

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, USA.

Department of Statistics, University of Washington, Seattle, USA.

出版信息

Algorithms Mol Biol. 2023 Jul 31;18(1):10. doi: 10.1186/s13015-023-00235-1.

DOI:10.1186/s13015-023-00235-1
PMID:37525243
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10391877/
Abstract

Bayesian phylogenetics is a computationally challenging inferential problem. Classical methods are based on random-walk Markov chain Monte Carlo (MCMC), where random proposals are made on the tree parameter and the continuous parameters simultaneously. Variational phylogenetics is a promising alternative to MCMC, in which one fits an approximating distribution to the unnormalized phylogenetic posterior. Previous work fit this variational approximation using stochastic gradient descent, which is the canonical way of fitting general variational approximations. However, phylogenetic trees are special structures, giving opportunities for efficient computation. In this paper we describe a new algorithm that directly generalizes the Felsenstein pruning algorithm (a.k.a. sum-product algorithm) to compute a composite-like likelihood by marginalizing out ancestral states and subtrees simultaneously. We show the utility of this algorithm by rapidly making point estimates for branch lengths of a multi-tree phylogenetic model. These estimates accord with a long MCMC run and with estimates obtained using a variational method, but are much faster to obtain. Thus, although generalized pruning does not lead to a variational algorithm as such, we believe that it will form a useful starting point for variational inference.

摘要

贝叶斯系统发育学是一个计算上具有挑战性的推理问题。经典方法基于随机游走马尔可夫链蒙特卡罗(MCMC),即在树参数和连续参数上同时进行随机提议。变分系统发育学是MCMC的一种有前景的替代方法,其中将一个近似分布拟合到未归一化的系统发育后验分布上。先前的工作使用随机梯度下降来拟合这种变分近似,这是拟合一般变分近似的标准方法。然而,系统发育树是特殊的结构,为高效计算提供了机会。在本文中,我们描述了一种新算法,该算法直接推广了费尔斯滕森剪枝算法(又称和积算法),通过同时边缘化祖先状态和子树来计算类似复合似然。我们通过快速对多树系统发育模型的分支长度进行点估计来展示该算法的实用性。这些估计与长时间的MCMC运行结果以及使用变分方法获得的估计结果一致,但获得速度要快得多。因此,尽管广义剪枝本身不会导致变分算法,但我们相信它将成为变分推理的一个有用起点。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/e6e025400fb6/13015_2023_235_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/345f0190638f/13015_2023_235_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/c61fa0232487/13015_2023_235_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/56c7b8d04290/13015_2023_235_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/2842aebe7e52/13015_2023_235_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/02e799465603/13015_2023_235_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/e6e025400fb6/13015_2023_235_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/345f0190638f/13015_2023_235_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/c61fa0232487/13015_2023_235_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/56c7b8d04290/13015_2023_235_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/2842aebe7e52/13015_2023_235_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/02e799465603/13015_2023_235_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6644/10391877/e6e025400fb6/13015_2023_235_Fig6_HTML.jpg

相似文献

1
A topology-marginal composite likelihood via a generalized phylogenetic pruning algorithm.一种通过广义系统发育剪枝算法的拓扑边缘复合似然法。
Algorithms Mol Biol. 2023 Jul 31;18(1):10. doi: 10.1186/s13015-023-00235-1.
2
Efficient Bayesian Species Tree Inference under the Multispecies Coalescent.多物种溯祖模型下的高效贝叶斯物种树推断
Syst Biol. 2017 Sep 1;66(5):823-842. doi: 10.1093/sysbio/syw119.
3
Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics.评估系统发育学中的概率编程和快速变分贝叶斯推理。
PeerJ. 2019 Dec 18;7:e8272. doi: 10.7717/peerj.8272. eCollection 2019.
4
Variational Hamiltonian Monte Carlo via Score Matching.通过得分匹配的变分哈密顿蒙特卡罗方法
Bayesian Anal. 2018 Jun;13(2):485-506. doi: 10.1214/17-ba1060. Epub 2017 Jul 25.
5
Using Parsimony-Guided Tree Proposals to Accelerate Convergence in Bayesian Phylogenetic Inference.基于简约引导的树提议加速贝叶斯系统发育推断的收敛。
Syst Biol. 2020 Sep 1;69(5):1016-1032. doi: 10.1093/sysbio/syaa002.
6
An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。
Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.
7
Systematic Exploration of the High Likelihood Set of Phylogenetic Tree Topologies.系统探索系统发育树拓扑结构的高似然集。
Syst Biol. 2020 Mar 1;69(2):280-293. doi: 10.1093/sysbio/syz047.
8
Variational Supertrees for Bayesian Phylogenetics.变分超级树在贝叶斯系统发生学中的应用。
Bull Math Biol. 2024 Aug 5;86(9):114. doi: 10.1007/s11538-024-01338-5.
9
Geometric ergodicity of a hybrid sampler for Bayesian inference of phylogenetic branch lengths.用于系统发育分支长度贝叶斯推断的混合采样器的几何遍历性
Math Biosci. 2015 Oct;268:9-21. doi: 10.1016/j.mbs.2015.07.002. Epub 2015 Aug 7.
10
An Annealed Sequential Monte Carlo Method for Bayesian Phylogenetics.退火序贯蒙特卡罗方法在贝叶斯系统发育学中的应用。
Syst Biol. 2020 Jan 1;69(1):155-183. doi: 10.1093/sysbio/syz028.

引用本文的文献

1
Finding high posterior density phylogenies by systematically extending a directed acyclic graph.通过系统地扩展有向无环图来寻找高后验密度系统发育树。
Algorithms Mol Biol. 2025 Feb 28;20(1):2. doi: 10.1186/s13015-025-00273-x.
2
Leveraging graphical model techniques to study evolution on phylogenetic networks.利用图形模型技术研究系统发育网络上的进化。
Philos Trans R Soc Lond B Biol Sci. 2025 Feb 13;380(1919):20230310. doi: 10.1098/rstb.2023.0310. Epub 2025 Feb 20.
3
Accurate Bayesian phylogenetic point estimation using a tree distribution parameterized by clade probabilities.

本文引用的文献

1
Automatic Differentiation is no Panacea for Phylogenetic Gradient Computation.自动微分并不是解决系统发育梯度计算的万能药。
Genome Biol Evol. 2023 Jun 1;15(6). doi: 10.1093/gbe/evad099.
2
Gradients Do Grow on Trees: A Linear-Time O(N)-Dimensional Gradient for Statistical Phylogenetics.梯度确实长在树上:统计系统发生学的一种线性时间 O(N)维梯度。
Mol Biol Evol. 2020 Oct 1;37(10):3047-3060. doi: 10.1093/molbev/msaa130.
3
19 Dubious Ways to Compute the Marginal Likelihood of a Phylogenetic Tree Topology.19 种计算系统发育树拓扑结构边际似然的可疑方法。
使用由分支概率参数化的树分布进行精确的贝叶斯系统发育点估计。
PLoS Comput Biol. 2025 Feb 13;21(2):e1012789. doi: 10.1371/journal.pcbi.1012789. eCollection 2025 Feb.
4
Finding high posterior density phylogenies by systematically extending a directed acyclic graph.通过系统地扩展有向无环图来寻找高后验密度系统发育树。
ArXiv. 2024 Nov 18:arXiv:2411.09074v2.
5
Representing and extending ensembles of parsimonious evolutionary histories with a directed acyclic graph.用有向无环图表示和扩展简约进化历史的集合。
J Math Biol. 2023 Oct 25;87(5):75. doi: 10.1007/s00285-023-02006-3.
Syst Biol. 2020 Mar 1;69(2):209-220. doi: 10.1093/sysbio/syz046.
4
BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics.BEAGLE 3:为统计系统发生学的高性能计算库提供改进的性能、可扩展性和可用性。
Syst Biol. 2019 Nov 1;68(6):1052-1061. doi: 10.1093/sysbio/syz020.
5
Virus genomes reveal factors that spread and sustained the Ebola epidemic.病毒基因组揭示了埃博拉疫情传播和持续的因素。
Nature. 2017 Apr 20;544(7650):309-315. doi: 10.1038/nature22040. Epub 2017 Apr 12.
6
Quantifying MCMC exploration of phylogenetic tree space.量化马尔可夫链蒙特卡罗方法对系统发育树空间的探索。
Syst Biol. 2015 May;64(3):472-91. doi: 10.1093/sysbio/syv006. Epub 2015 Jan 27.
7
The estimation of tree posterior probabilities using conditional clade probability distributions.使用条件分支概率分布估计树后验概率。
Syst Biol. 2013 Jul;62(4):501-11. doi: 10.1093/sysbio/syt014. Epub 2013 Mar 11.
8
Ultrafast approximation for phylogenetic bootstrap.快速近似的系统发育自举法。
Mol Biol Evol. 2013 May;30(5):1188-95. doi: 10.1093/molbev/mst024. Epub 2013 Feb 15.
9
Hessian calculation for phylogenetic likelihood based on the pruning algorithm and its applications.基于剪枝算法的系统发育似然性的黑塞矩阵计算及其应用。
Stat Appl Genet Mol Biol. 2012 Sep 25;11(4):Article 14. doi: 10.1515/1544-6115.1779.
10
Guided tree topology proposals for Bayesian phylogenetic inference.贝叶斯系统发育推断的引导树拓扑提议。
Syst Biol. 2012 Jan;61(1):1-11. doi: 10.1093/sysbio/syr074. Epub 2011 Aug 9.