• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于快速似然的方法,用于从连续特征数据中估计大型系统发育树。

A fast likelihood approach for estimation of large phylogenies from continuous trait data.

机构信息

Division of Biostatistics, College of Public Health, The Ohio State University, United States; Department of Statistics, The Ohio State University, United States.

Center for Medical Informatics, Yale University, United States.

出版信息

Mol Phylogenet Evol. 2021 Aug;161:107142. doi: 10.1016/j.ympev.2021.107142. Epub 2021 Mar 11.

DOI:10.1016/j.ympev.2021.107142
PMID:33713799
Abstract

Despite the recent availability of large-scale genomic data for many individuals, few methods for phylogenetic inference are both computationally efficient and highly accurate for trees with hundreds of taxa. Model-based methods such as those developed in the maximum likelihood and Bayesian frameworks are especially time-consuming, as they involve both computationally intensive calculations on fixed phylogenies and searches through the space of possible phylogenies, and they are known to scale poorly with the addition of taxa. Here, we propose a fast approximation to the maximum likelihood estimator that directly uses continuous trait data, such as allele frequency data. The approximation works by first computing the maximum likelihood estimates of some internal branch lengths, and then inferring the tree-topology using these estimates. Our approach is more computationally efficient than existing methods for such data while still achieving comparable accuracy. This method is innovative in its use of the mathematical properties of tree-topologies for inference, and thus serves as a useful addition to the collection of methods available for estimating phylogenies from continuous trait data.

摘要

尽管最近有大量的个体基因组数据可供使用,但对于具有数百个分类群的系统发育树,很少有计算效率高且高度准确的系统发育推断方法。基于模型的方法,如最大似然法和贝叶斯框架中开发的方法,特别耗时,因为它们既涉及固定系统发育树上的计算密集型计算,又涉及可能系统发育空间的搜索,并且已知随着分类群的增加而扩展不佳。在这里,我们提出了一种快速逼近最大似然估计量的方法,该方法可以直接使用连续特征数据,例如等位基因频率数据。该逼近方法的工作原理是首先计算某些内部分支长度的最大似然估计量,然后使用这些估计量推断树拓扑结构。与现有方法相比,我们的方法在处理此类数据时更具计算效率,同时仍能达到相当的准确性。该方法在利用拓扑结构的数学性质进行推断方面具有创新性,因此是从连续特征数据估计系统发育的可用方法集合中的有用补充。

相似文献

1
A fast likelihood approach for estimation of large phylogenies from continuous trait data.一种基于快速似然的方法,用于从连续特征数据中估计大型系统发育树。
Mol Phylogenet Evol. 2021 Aug;161:107142. doi: 10.1016/j.ympev.2021.107142. Epub 2021 Mar 11.
2
Reliable estimation of tree branch lengths using deep neural networks.利用深度神经网络可靠估计树枝长度。
PLoS Comput Biol. 2024 Aug 5;20(8):e1012337. doi: 10.1371/journal.pcbi.1012337. eCollection 2024 Aug.
3
Fast likelihood calculation for multivariate Gaussian phylogenetic models with shifts.具有转移的多元高斯系统发育模型的快速似然计算。
Theor Popul Biol. 2020 Feb;131:66-78. doi: 10.1016/j.tpb.2019.11.005. Epub 2019 Dec 2.
4
Systematic Exploration of the High Likelihood Set of Phylogenetic Tree Topologies.系统探索系统发育树拓扑结构的高似然集。
Syst Biol. 2020 Mar 1;69(2):280-293. doi: 10.1093/sysbio/syz047.
5
Online Phylogenetics with matOptimize Produces Equivalent Trees and is Dramatically More Efficient for Large SARS-CoV-2 Phylogenies than de novo and Maximum-Likelihood Implementations.在线系统发育学与 matOptimize 产生等效的树,并且比从头开始和最大似然实现对大型 SARS-CoV-2 系统发育更有效率。
Syst Biol. 2023 Nov 1;72(5):1039-1051. doi: 10.1093/sysbio/syad031.
6
Challenges in Species Tree Estimation Under the Multispecies Coalescent Model.多物种溯祖模型下物种树估计的挑战
Genetics. 2016 Dec;204(4):1353-1368. doi: 10.1534/genetics.116.190173.
7
Coalescent-based species tree inference from gene tree topologies under incomplete lineage sorting by maximum likelihood.基于最大似然法的不完全谱系分选下基于基因树拓扑结构的合并种系树推断。
Evolution. 2012 Mar;66(3):763-775. doi: 10.1111/j.1558-5646.2011.01476.x. Epub 2011 Nov 2.
8
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.细节中的魔鬼:分支长度先验和似然模型之间的相互作用影响了 Psoraceae 系统发育中的节点支持和分支长度。
Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24.
9
Maximum likelihood estimates of species trees: how accuracy of phylogenetic inference depends upon the divergence history and sampling design.最大似然估计物种树:系统发育推断的准确性如何取决于分歧历史和采样设计。
Syst Biol. 2009 Oct;58(5):501-8. doi: 10.1093/sysbio/syp045. Epub 2009 Aug 20.
10
Branch length estimation and divergence dating: estimates of error in Bayesian and maximum likelihood frameworks.支长估计和分歧日期:贝叶斯和最大似然框架中的误差估计。
BMC Evol Biol. 2010 Jan 11;10:5. doi: 10.1186/1471-2148-10-5.