• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

贝叶斯估计在系统发育重建中的应用。

Bayes estimators for phylogenetic reconstruction.

机构信息

Lane Center for Computational Biology, Carnegie Mellon University, Mellon Institute Building, 4400 Fifth Avenue, Pittsburgh, PA 15213, USA.

出版信息

Syst Biol. 2011 Jul;60(4):528-40. doi: 10.1093/sysbio/syr021. Epub 2011 Apr 6.

DOI:10.1093/sysbio/syr021
PMID:21471560
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3114872/
Abstract

Tree reconstruction methods are often judged by their accuracy, measured by how close they get to the true tree. Yet, most reconstruction methods like maximum likelihood (ML) do not explicitly maximize this accuracy. To address this problem, we propose a Bayesian solution. Given tree samples, we propose finding the tree estimate that is closest on average to the samples. This "median" tree is known as the Bayes estimator (BE). The BE literally maximizes posterior expected accuracy, measured in terms of closeness (distance) to the true tree. We discuss a unified framework of BE trees, focusing especially on tree distances that are expressible as squared euclidean distances. Notable examples include Robinson-Foulds (RF) distance, quartet distance, and squared path difference. Using both simulated and real data, we show that BEs can be estimated in practice by hill-climbing. In our simulation, we find that BEs tend to be closer to the true tree, compared with ML and neighbor joining. In particular, the BE under squared path difference tends to perform well in terms of both path difference and RF distances.

摘要

树重建方法通常通过其准确性进行评估,准确性的衡量标准是它们与真实树的接近程度。然而,像最大似然法 (ML) 这样的大多数重建方法并没有明确地最大化这个准确性。为了解决这个问题,我们提出了一个贝叶斯解决方案。给定树样本,我们建议找到平均而言最接近样本的树估计值。这个“中位数”树被称为贝叶斯估计器 (BE)。BE 实际上最大化了后验预期准确性,以与真实树的接近程度(距离)来衡量。我们讨论了 BE 树的统一框架,特别关注可表示为平方欧几里得距离的树距离。值得注意的例子包括罗宾逊-福尔德 (RF) 距离、四分体距离和平方路径差。使用模拟和真实数据,我们表明可以通过爬山法在实践中估计 BE。在我们的模拟中,我们发现与 ML 和邻居连接相比,BE 往往更接近真实树。特别是,在平方路径差下的 BE 在路径差和 RF 距离方面表现良好。

相似文献

1
Bayes estimators for phylogenetic reconstruction.贝叶斯估计在系统发育重建中的应用。
Syst Biol. 2011 Jul;60(4):528-40. doi: 10.1093/sysbio/syr021. Epub 2011 Apr 6.
2
Accurate phylogenetic tree reconstruction from quartets: a heuristic approach.基于四重奏的准确系统发育树重建:一种启发式方法。
PLoS One. 2014 Aug 12;9(8):e104008. doi: 10.1371/journal.pone.0104008. eCollection 2014.
3
An efficient algorithm for approximating geodesic distances in tree space.一种用于逼近树空间测地距离的有效算法。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Sep-Oct;8(5):1196-207. doi: 10.1109/TCBB.2010.121.
4
Efficiencies of the NJp, Maximum Likelihood, and Bayesian Methods of Phylogenetic Construction for Compositional and Noncompositional Genes.NJp、最大似然和贝叶斯方法在构建组成和非组成基因系统发育中的效率。
Mol Biol Evol. 2016 Jun;33(6):1618-24. doi: 10.1093/molbev/msw042. Epub 2016 Feb 28.
5
The effect of ambiguous data on phylogenetic estimates obtained by maximum likelihood and Bayesian inference.歧义数据对最大似然法和贝叶斯推断得出的系统发育估计的影响。
Syst Biol. 2009 Feb;58(1):130-45. doi: 10.1093/sysbio/syp017. Epub 2009 May 22.
6
Data-specific substitution models improve protein-based phylogenetics.基于数据的替代模型可提高基于蛋白质的系统发育分析。
PeerJ. 2023 Aug 8;11:e15716. doi: 10.7717/peerj.15716. eCollection 2023.
7
Comparison of Boolean analysis and standard phylogenetic methods using artificially evolved and natural mt-tRNA sequences from great apes.使用人工进化和自然产生的大型猿类 mt-tRNA 序列对布尔分析和标准系统发育方法进行比较。
Mol Phylogenet Evol. 2012 Apr;63(1):193-202. doi: 10.1016/j.ympev.2012.01.010. Epub 2012 Jan 26.
8
Missing data in phylogenetic analysis: reconciling results from simulations and empirical data.系统发育分析中的缺失数据:协调模拟结果与实证数据
Syst Biol. 2011 Oct;60(5):719-31. doi: 10.1093/sysbio/syr025. Epub 2011 Mar 28.
9
Performance comparison between k-tuple distance and four model-based distances in phylogenetic tree reconstruction.在系统发育树重建中,k元组距离与四种基于模型的距离之间的性能比较。
Nucleic Acids Res. 2008 Mar;36(5):e33. doi: 10.1093/nar/gkn075. Epub 2008 Feb 22.
10
Evaluating the Performance of Probabilistic Algorithms for Phylogenetic Analysis of Big Morphological Datasets: A Simulation Study.评估概率算法在大型形态数据集系统发育分析中的性能:一项模拟研究。
Syst Biol. 2020 Nov 1;69(6):1088-1105. doi: 10.1093/sysbio/syaa020.

引用本文的文献

1
Tropical Logistic Regression Model on Space of Phylogenetic Trees.热带进化树空间上的逻辑回归模型。
Bull Math Biol. 2024 Jul 2;86(8):99. doi: 10.1007/s11538-024-01327-8.
2
Molecular Evolution of Aralkylamine N-Acetyltransferase in Fish: A Genomic Survey.鱼类中芳烷基胺N-乙酰基转移酶的分子进化:一项基因组调查
Int J Mol Sci. 2015 Dec 31;17(1):51. doi: 10.3390/ijms17010051.
3
A Bayesian Supertree Model for Genome-Wide Species Tree Reconstruction.一种用于全基因组物种树重建的贝叶斯超树模型。
Syst Biol. 2016 May;65(3):397-416. doi: 10.1093/sysbio/syu082. Epub 2014 Oct 3.
4
Point estimates in phylogenetic reconstructions.系统发育重建中的点估计。
Bioinformatics. 2014 Sep 1;30(17):i534-40. doi: 10.1093/bioinformatics/btu461.
5
Looking for trees in the forest: summary tree from posterior samples.在森林中寻找树木:从后验样本中汇总树。
BMC Evol Biol. 2013 Oct 4;13:221. doi: 10.1186/1471-2148-13-221.
6
Statistical phylogenetic tree analysis using differences of means.使用均值差异的统计系统发育树分析。
Front Neurosci. 2010 Aug 3;4. doi: 10.3389/fnins.2010.00047. eCollection 2010.

本文引用的文献

1
A fast algorithm for computing geodesic distances in tree space.一种用于计算树空间测地距离的快速算法。
IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):2-13. doi: 10.1109/TCBB.2010.3.
2
Quartets MaxCut: a divide and conquer quartets algorithm.四重体最大切割:一种分而治之的四重体算法。
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):704-18. doi: 10.1109/TCBB.2008.133.
3
A justification for reporting the majority-rule consensus tree in Bayesian phylogenetics.贝叶斯系统发育学中报告多数规则一致树的理由。
Syst Biol. 2008 Oct;57(5):814-21. doi: 10.1080/10635150802422308.
4
An exact algorithm for the geodesic distance between phylogenetic trees.一种用于计算系统发育树之间测地距离的精确算法。
J Comput Biol. 2008 Jul-Aug;15(6):577-91. doi: 10.1089/cmb.2008.0068.
5
A novel test for host-symbiont codivergence indicates ancient origin of fungal endophytes in grasses.一种用于宿主-共生体共分化的新型测试表明禾本科植物中真菌内生菌的古老起源。
Syst Biol. 2008 Jun;57(3):483-98. doi: 10.1080/10635150802172184.
6
On the optimality of the neighbor-joining algorithm.关于邻接法算法的最优性。
Algorithms Mol Biol. 2008 Apr 30;3:5. doi: 10.1186/1748-7188-3-5.
7
A molecular assessment of phylogenetic relationships and lineage accumulation rates within the family Salamandridae (Amphibia, Caudata).蝾螈科(两栖纲,有尾目)系统发育关系及谱系积累速率的分子评估
Mol Phylogenet Evol. 2006 Nov;41(2):368-83. doi: 10.1016/j.ympev.2006.05.008. Epub 2006 May 19.
8
Improving the efficiency of SPR moves in phylogenetic tree search methods based on maximum likelihood.提高基于最大似然法的系统发育树搜索方法中SPR移动的效率。
Bioinformatics. 2005 Dec 15;21(24):4338-47. doi: 10.1093/bioinformatics/bti713. Epub 2005 Oct 18.
9
Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting.系统发育推断的平衡最小进化方法的理论基础及其与加权最小二乘树拟合的关系。
Mol Biol Evol. 2004 Mar;21(3):587-98. doi: 10.1093/molbev/msh049. Epub 2003 Dec 23.
10
A simple, fast, and accurate algorithm to estimate large phylogenies by maximum likelihood.一种通过最大似然法估计大型系统发育树的简单、快速且准确的算法。
Syst Biol. 2003 Oct;52(5):696-704. doi: 10.1080/10635150390235520.