在森林中寻找树木：从后验样本中汇总树。

Looking for trees in the forest: summary tree from posterior samples.

机构信息

Department of Computer Science, University of Auckland, Auckland New Zealand.

出版信息

BMC Evol Biol. 2013 Oct 4;13:221. doi: 10.1186/1471-2148-13-221.

DOI:10.1186/1471-2148-13-221

PMID:24093883

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC3853548/

Abstract

BACKGROUND

Bayesian phylogenetic analysis generates a set of trees which are often condensed into a single tree representing the whole set. Many methods exist for selecting a representative topology for a set of unrooted trees, few exist for assigning branch lengths to a fixed topology, and even fewer for simultaneously setting the topology and branch lengths. However, there is very little research into locating a good representative for a set of rooted time trees like the ones obtained from a BEAST analysis.

RESULTS

We empirically compare new and known methods for generating a summary tree. Some new methods are motivated by mathematical constructions such as tree metrics, while the rest employ tree concepts which work well in practice. These use more of the posterior than existing methods, which discard information not directly mapped to the chosen topology. Using results from a large number of simulations we assess the quality of a summary tree, measuring (a) how well it explains the sequence data under the model and (b) how close it is to the "truth", i.e to the tree used to generate the sequences.

CONCLUSIONS

Our simulations indicate that no single method is "best". Methods producing good divergence time estimates have poor branch lengths and lower model fit, and vice versa. Using the results presented here, a user can choose the appropriate method based on the purpose of the summary tree.

摘要

背景

贝叶斯系统发生分析会生成一组树，这些树通常会被压缩为一棵代表整个集合的树。有许多方法可用于为一组无根树选择代表拓扑结构，而很少有方法可用于为固定拓扑结构分配分支长度，甚至更少的方法可用于同时设置拓扑结构和分支长度。但是，对于像从 BEAST 分析中获得的有根时间树这样的集合，几乎没有研究如何找到一个好的代表。

结果

我们通过实证比较了生成汇总树的新方法和已知方法。一些新方法是基于树度量等数学结构而提出的，而其他方法则采用在实践中效果良好的树概念。这些方法比现有的方法使用更多的后验信息，后者会丢弃与所选拓扑结构没有直接映射的信息。使用大量模拟的结果，我们评估了汇总树的质量，衡量了（a）它在模型下解释序列数据的程度，以及（b）它与“真实”树的接近程度，即用于生成序列的树。

结论

我们的模拟表明，没有一种方法是“最佳”的。产生良好分歧时间估计的方法具有较差的分支长度和较低的模型拟合度，反之亦然。使用此处呈现的结果，用户可以根据汇总树的目的选择适当的方法。

相似文献

Looking for trees in the forest: summary tree from posterior samples.在森林中寻找树木：从后验样本中汇总树。

BMC Evol Biol. 2013 Oct 4;13:221. doi: 10.1186/1471-2148-13-221.

Identifying the rooted species tree from the distribution of unrooted gene trees under the coalescent.在溯祖理论下，根据无根基因树的分布确定有根物种树。

J Math Biol. 2011 Jun;62(6):833-62. doi: 10.1007/s00285-010-0355-7. Epub 2010 Jul 23.

Estimating species trees using approximate Bayesian computation.使用近似贝叶斯计算估计物种树。

Mol Phylogenet Evol. 2011 May;59(2):354-63. doi: 10.1016/j.ympev.2011.02.019. Epub 2011 Mar 21.

Inferring rooted species trees from unrooted gene trees using approximate Bayesian computation.使用近似贝叶斯计算从未根基因树推断有根物种树。

Mol Phylogenet Evol. 2017 Nov;116:13-24. doi: 10.1016/j.ympev.2017.07.017. Epub 2017 Aug 2.

Accuracy of estimated phylogenetic trees from molecular data. II. Gene frequency data.基于分子数据的系统发育树估计的准确性。II. 基因频率数据。

J Mol Evol. 1983;19(2):153-70. doi: 10.1007/BF02300753.

Anomalous unrooted gene trees.异常无根基因树。

Syst Biol. 2013 Jul;62(4):574-90. doi: 10.1093/sysbio/syt023. Epub 2013 Apr 10.

Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions.从基因树构建物种树：利用估计的基因树分布重建物种系统发育的贝叶斯后验分布。

Syst Biol. 2007 Jun;56(3):504-14. doi: 10.1080/10635150701429982.

The K tree score: quantification of differences in the relative branch length and topology of phylogenetic trees.K树得分：系统发育树相对分支长度和拓扑结构差异的量化。

Bioinformatics. 2007 Nov 1;23(21):2954-6. doi: 10.1093/bioinformatics/btm466. Epub 2007 Sep 22.

Inferring Metric Trees from Weighted Quartets via an Intertaxon Distance.通过种间距离从加权四分体推断度量树

Bull Math Biol. 2020 Jul 16;82(7):97. doi: 10.1007/s11538-020-00773-4.

An Efficient Independence Sampler for Updating Branches in Bayesian Markov chain Monte Carlo Sampling of Phylogenetic Trees.一种用于在系统发育树的贝叶斯马尔可夫链蒙特卡罗采样中更新分支的高效独立采样器。

Syst Biol. 2016 Jan;65(1):161-76. doi: 10.1093/sysbio/syv051. Epub 2015 Jul 30.

引用本文的文献

The emergence of NY10: insights into the 2012 West Nile Virus outbreak in the United States.NY10的出现：对2012年美国西尼罗河病毒疫情的洞察

Virus Evol. 2025 May 14;11(1):veaf037. doi: 10.1093/ve/veaf037. eCollection 2025.

A phylogenetic classification of the Je language family.热依语族的系统发生分类。

Open Res Eur. 2025 May 19;5:29. doi: 10.12688/openreseurope.19346.2. eCollection 2025.

Bayesian phylodynamic inference of population dynamics with dormancy.具有休眠的种群动态的贝叶斯系统发育动力学推断

Proc Natl Acad Sci U S A. 2025 May 6;122(18):e2501394122. doi: 10.1073/pnas.2501394122. Epub 2025 May 2.

Accurate Bayesian phylogenetic point estimation using a tree distribution parameterized by clade probabilities.使用由分支概率参数化的树分布进行精确的贝叶斯系统发育点估计。

PLoS Comput Biol. 2025 Feb 13;21(2):e1012789. doi: 10.1371/journal.pcbi.1012789. eCollection 2025 Feb.

Bayesian phylodynamic inference of population dynamics with dormancy.具有休眠的群体动态的贝叶斯系统发育动力学推断

bioRxiv. 2025 Jan 22:2025.01.19.633741. doi: 10.1101/2025.01.19.633741.

Ancient genomes reveal a deep history of Treponema pallidum in the Americas.古代基因组揭示了梅毒螺旋体在美洲的悠久历史。

Nature. 2025 Apr;640(8057):186-193. doi: 10.1038/s41586-024-08515-5. Epub 2024 Dec 18.

Emergence of the B.1.214.2 SARS-CoV-2 lineage with an Omicron-like spike insertion and a unique upper airway immune signature.出现了具有类似奥密克戎刺突插入和独特上呼吸道免疫特征的 B.1.214.2 谱系 SARS-CoV-2 病毒。

BMC Infect Dis. 2024 Oct 10;24(1):1139. doi: 10.1186/s12879-024-09967-w.

Estimating the mean in the space of ranked phylogenetic trees.估计排序系统发育树空间中的均值。

Bioinformatics. 2024 Aug 2;40(8). doi: 10.1093/bioinformatics/btae514.

COI Barcodes combined with multilocus data for representative Aporia taxa shed light on speciation in the high altitude Irano-Turanian mountain plateaus (Lepidoptera: Pieridae).COI 条码与多基因数据相结合，对代表性 Aporia 类群进行研究，揭示了高海拔伊朗-图兰高原（鳞翅目：Pieridae）的物种形成。

BMC Ecol Evol. 2024 Aug 3;24(1):105. doi: 10.1186/s12862-024-02294-3.

Towards Reliable Detection of Introgression in the Presence of Among-Species Rate Variation.在存在种间速率变异的情况下，实现基因渐渗的可靠检测。

Syst Biol. 2024 Oct 30;73(5):769-788. doi: 10.1093/sysbio/syae028.

本文引用的文献

BEAST 2: a software platform for Bayesian evolutionary analysis.BEAST 2：用于贝叶斯进化分析的软件平台。

PLoS Comput Biol. 2014 Apr 10;10(4):e1003537. doi: 10.1371/journal.pcbi.1003537. eCollection 2014 Apr.

The estimation of tree posterior probabilities using conditional clade probability distributions.使用条件分支概率分布估计树后验概率。

Syst Biol. 2013 Jul;62(4):501-11. doi: 10.1093/sysbio/syt014. Epub 2013 Mar 11.

Bayes estimators for phylogenetic reconstruction.贝叶斯估计在系统发育重建中的应用。

Syst Biol. 2011 Jul;60(4):528-40. doi: 10.1093/sysbio/syr021. Epub 2011 Apr 6.

A fast algorithm for computing geodesic distances in tree space.一种用于计算树空间测地距离的快速算法。

IEEE/ACM Trans Comput Biol Bioinform. 2011 Jan-Mar;8(1):2-13. doi: 10.1109/TCBB.2010.3.

DendroPy: a Python library for phylogenetic computing.DendroPy：一个用于系统发育计算的 Python 库。

Bioinformatics. 2010 Jun 15;26(12):1569-71. doi: 10.1093/bioinformatics/btq228. Epub 2010 Apr 25.

DensiTree: making sense of sets of phylogenetic trees.DensiTree：解析一组系统发生树。

Bioinformatics. 2010 May 15;26(10):1372-3. doi: 10.1093/bioinformatics/btq110. Epub 2010 Mar 12.

BEAST: Bayesian evolutionary analysis by sampling trees.BEAST：通过抽样树进行贝叶斯进化分析。

BMC Evol Biol. 2007 Nov 8;7:214. doi: 10.1186/1471-2148-7-214.

Summarizing a posterior distribution of trees using agreement subtrees.使用一致子树总结树的后验分布。

Syst Biol. 2007 Aug;56(4):578-90. doi: 10.1080/10635150701485091.

Bayesian phylogenetic inference via Markov chain Monte Carlo methods.通过马尔可夫链蒙特卡罗方法进行贝叶斯系统发育推断。

Biometrics. 1999 Mar;55(1):1-12. doi: 10.1111/j.0006-341x.1999.00001.x.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

在森林中寻找树木：从后验样本中汇总树。

Looking for trees in the forest: summary tree from posterior samples.

机构信息

出版信息

BACKGROUND

RESULTS

CONCLUSIONS

背景

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献