• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自动微分并不是解决系统发育梯度计算的万能药。

Automatic Differentiation is no Panacea for Phylogenetic Gradient Computation.

机构信息

Australian Institute for Microbiology and Infection, University of Technology Sydney, Ultimo, NSW, Australia.

Centre for Computational Evolution, The University of Auckland, Auckland, New Zealand.

出版信息

Genome Biol Evol. 2023 Jun 1;15(6). doi: 10.1093/gbe/evad099.

DOI:10.1093/gbe/evad099
PMID:37265233
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10282121/
Abstract

Gradients of probabilistic model likelihoods with respect to their parameters are essential for modern computational statistics and machine learning. These calculations are readily available for arbitrary models via "automatic differentiation" implemented in general-purpose machine-learning libraries such as TensorFlow and PyTorch. Although these libraries are highly optimized, it is not clear if their general-purpose nature will limit their algorithmic complexity or implementation speed for the phylogenetic case compared to phylogenetics-specific code. In this paper, we compare six gradient implementations of the phylogenetic likelihood functions, in isolation and also as part of a variational inference procedure. We find that although automatic differentiation can scale approximately linearly in tree size, it is much slower than the carefully implemented gradient calculation for tree likelihood and ratio transformation operations. We conclude that a mixed approach combining phylogenetic libraries with machine learning libraries will provide the optimal combination of speed and model flexibility moving forward.

摘要

概率模型似然相对于其参数的梯度对于现代计算统计学和机器学习至关重要。这些计算可以通过在 TensorFlow 和 PyTorch 等通用机器学习库中实现的“自动微分”来轻松地为任意模型完成。尽管这些库经过了高度优化,但与特定于系统发育学的代码相比,它们的通用性是否会限制其算法复杂度或在系统发育学情况下的实现速度尚不清楚。在本文中,我们比较了六个用于系统发育似然函数的梯度实现,分别孤立地和作为变分推断过程的一部分进行比较。我们发现,尽管自动微分可以在树大小上大致呈线性扩展,但它比针对树似然和比值变换操作精心实现的梯度计算要慢得多。我们得出结论,一种结合系统发育学库和机器学习库的混合方法将为未来的速度和模型灵活性提供最佳组合。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df9a/10282121/d8c4d559ec2c/evad099f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df9a/10282121/6e0c3f0234bf/evad099f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df9a/10282121/d8c4d559ec2c/evad099f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df9a/10282121/6e0c3f0234bf/evad099f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/df9a/10282121/d8c4d559ec2c/evad099f2.jpg

相似文献

1
Automatic Differentiation is no Panacea for Phylogenetic Gradient Computation.自动微分并不是解决系统发育梯度计算的万能药。
Genome Biol Evol. 2023 Jun 1;15(6). doi: 10.1093/gbe/evad099.
2
Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics.评估系统发育学中的概率编程和快速变分贝叶斯推理。
PeerJ. 2019 Dec 18;7:e8272. doi: 10.7717/peerj.8272. eCollection 2019.
3
Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。
Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.
4
Model Selection and Parameter Inference in Phylogenetics Using Nested Sampling.使用嵌套抽样进行系统发育学中的模型选择和参数推断。
Syst Biol. 2019 Mar 1;68(2):219-233. doi: 10.1093/sysbio/syy050.
5
RAxML-III: a fast program for maximum likelihood-based inference of large phylogenetic trees.RAxML-III:一个基于最大似然法推断大型系统发育树的快速程序。
Bioinformatics. 2005 Feb 15;21(4):456-63. doi: 10.1093/bioinformatics/bti191. Epub 2004 Dec 17.
6
19 Dubious Ways to Compute the Marginal Likelihood of a Phylogenetic Tree Topology.19 种计算系统发育树拓扑结构边际似然的可疑方法。
Syst Biol. 2020 Mar 1;69(2):209-220. doi: 10.1093/sysbio/syz046.
7
Gradients Do Grow on Trees: A Linear-Time O(N)-Dimensional Gradient for Statistical Phylogenetics.梯度确实长在树上:统计系统发生学的一种线性时间 O(N)维梯度。
Mol Biol Evol. 2020 Oct 1;37(10):3047-3060. doi: 10.1093/molbev/msaa130.
8
Efficient Detection of Repeating Sites to Accelerate Phylogenetic Likelihood Calculations.高效检测重复位点以加速系统发育似然性计算
Syst Biol. 2017 Mar 1;66(2):205-217. doi: 10.1093/sysbio/syw075.
9
Many-core algorithms for statistical phylogenetics.用于统计系统发育学的多核算法。
Bioinformatics. 2009 Jun 1;25(11):1370-6. doi: 10.1093/bioinformatics/btp244. Epub 2009 Apr 15.
10
Hessian calculation for phylogenetic likelihood based on the pruning algorithm and its applications.基于剪枝算法的系统发育似然性的黑塞矩阵计算及其应用。
Stat Appl Genet Mol Biol. 2012 Sep 25;11(4):Article 14. doi: 10.1515/1544-6115.1779.

引用本文的文献

1
Leveraging graphical model techniques to study evolution on phylogenetic networks.利用图形模型技术研究系统发育网络上的进化。
Philos Trans R Soc Lond B Biol Sci. 2025 Feb 13;380(1919):20230310. doi: 10.1098/rstb.2023.0310. Epub 2025 Feb 20.
2
Many-core algorithms for high-dimensional gradients on phylogenetic trees.用于系统发育树上高维梯度的多核算法。
Bioinformatics. 2024 Feb 1;40(2). doi: 10.1093/bioinformatics/btae030.
3
A topology-marginal composite likelihood via a generalized phylogenetic pruning algorithm.一种通过广义系统发育剪枝算法的拓扑边缘复合似然法。

本文引用的文献

1
Stan: A Probabilistic Programming Language.斯坦:一种概率编程语言。
J Stat Softw. 2017;76. doi: 10.18637/jss.v076.i01. Epub 2017 Jan 11.
2
Variational Phylodynamic Inference Using Pandemic-scale Data.基于大流行规模数据的变异系统发育推断。
Mol Biol Evol. 2022 Aug 3;39(8). doi: 10.1093/molbev/msac154.
3
Variational inference using approximate likelihood under the coalescent with recombination.使用重组下合并近似似然的变分推断。
Algorithms Mol Biol. 2023 Jul 31;18(1):10. doi: 10.1186/s13015-023-00235-1.
Genome Res. 2021 Nov;31(11):2107-2119. doi: 10.1101/gr.273631.120. Epub 2021 Aug 23.
4
Relaxed Random Walks at Scale.大规模松弛随机游走。
Syst Biol. 2021 Feb 10;70(2):258-267. doi: 10.1093/sysbio/syaa056.
5
Gradients Do Grow on Trees: A Linear-Time O(N)-Dimensional Gradient for Statistical Phylogenetics.梯度确实长在树上:统计系统发生学的一种线性时间 O(N)维梯度。
Mol Biol Evol. 2020 Oct 1;37(10):3047-3060. doi: 10.1093/molbev/msaa130.
6
Evaluating probabilistic programming and fast variational Bayesian inference in phylogenetics.评估系统发育学中的概率编程和快速变分贝叶斯推理。
PeerJ. 2019 Dec 18;7:e8272. doi: 10.7717/peerj.8272. eCollection 2019.
7
19 Dubious Ways to Compute the Marginal Likelihood of a Phylogenetic Tree Topology.19 种计算系统发育树拓扑结构边际似然的可疑方法。
Syst Biol. 2020 Mar 1;69(2):209-220. doi: 10.1093/sysbio/syz046.
8
BEAGLE 3: Improved Performance, Scaling, and Usability for a High-Performance Computing Library for Statistical Phylogenetics.BEAGLE 3:为统计系统发生学的高性能计算库提供改进的性能、可扩展性和可用性。
Syst Biol. 2019 Nov 1;68(6):1052-1061. doi: 10.1093/sysbio/syz020.
9
Stochastic Variational Inference for Bayesian Phylogenetics: A Case of CAT Model.贝叶斯系统发生学的随机变分推断:CAT 模型案例。
Mol Biol Evol. 2019 Apr 1;36(4):825-833. doi: 10.1093/molbev/msz020.
10
Bayesian phylogenetic and phylodynamic data integration using BEAST 1.10.使用BEAST 1.10进行贝叶斯系统发育和系统动力学数据整合。
Virus Evol. 2018 Jun 8;4(1):vey016. doi: 10.1093/ve/vey016. eCollection 2018 Jan.