基于最小进化原理的快速准确的系统发育重建算法。

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle.

作者信息

Desper Richard, Gascuel Olivier

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 45 Center Drive, Bethesda, MD 20892, USA.

出版信息

J Comput Biol. 2002;9(5):687-705. doi: 10.1089/106652702761034136.

DOI:10.1089/106652702761034136

PMID:12487758

Abstract

The Minimum Evolution (ME) approach to phylogeny estimation has been shown to be statistically consistent when it is used in conjunction with ordinary least-squares (OLS) fitting of a metric to a tree structure. The traditional approach to using ME has been to start with the Neighbor Joining (NJ) topology for a given matrix and then do a topological search from that starting point. The first stage requires O(n(3)) time, where n is the number of taxa, while the current implementations of the second are in O(p n(3)) or more, where p is the number of swaps performed by the program. In this paper, we examine a greedy approach to minimum evolution which produces a starting topology in O(n(2)) time. Moreover, we provide an algorithm that searches for the best topology using nearest neighbor interchanges (NNIs), where the cost of doing p NNIs is O(n(2) + p n), i.e., O(n(2)) in practice because p is always much smaller than n. The Greedy Minimum Evolution (GME) algorithm, when used in combination with NNIs, produces trees which are fairly close to NJ trees in terms of topological accuracy. We also examine ME under a balanced weighting scheme, where sibling subtrees have equal weight, as opposed to the standard "unweighted" OLS, where all taxa have the same weight so that the weight of a subtree is equal to the number of its taxa. The balanced minimum evolution scheme (BME) runs slower than the OLS version, requiring O(n(2) x diam(T)) operations to build the starting tree and O(p n x diam(T)) to perform the NNIs, where diam(T) is the topological diameter of the output tree. In the usual Yule-Harding distribution on phylogenetic trees, the diameter expectation is in log(n), so our algorithms are in practice faster that NJ. Moreover, this BME scheme yields a very significant improvement over NJ and other distance-based algorithms, especially with large trees, in terms of topological accuracy.

摘要

系统发育估计的最小进化（ME）方法已被证明，当它与将度量拟合到树结构的普通最小二乘法（OLS）结合使用时，在统计上是一致的。使用ME的传统方法是从给定矩阵的邻接归并（NJ）拓扑开始，然后从该起点进行拓扑搜索。第一阶段需要O(n(3))时间，其中n是分类单元的数量，而第二阶段的当前实现是O(p n(3))或更多，其中p是程序执行的交换次数。在本文中，我们研究了一种最小进化的贪心方法，该方法在O(n(2))时间内生成一个起始拓扑。此外，我们提供了一种使用最近邻交换（NNI）搜索最佳拓扑的算法，其中执行p次NNI的成本是O(n(2) + p n)，即在实际中是O(n(2))，因为p总是远小于n。贪心最小进化（GME）算法与NNI结合使用时，生成的树在拓扑准确性方面与NJ树相当接近。我们还研究了在平衡加权方案下的ME，其中兄弟子树具有相等的权重，这与标准的“未加权”OLS相反，在OLS中所有分类单元具有相同的权重，因此子树的权重等于其分类单元的数量。平衡最小进化方案（BME）的运行速度比OLS版本慢，构建起始树需要O(n(2) x diam(T))操作，执行NNI需要O(p n x diam(T))操作，其中diam(T)是输出树的拓扑直径。在系统发育树通常的尤尔 - 哈丁分布中，直径期望为log(n)，所以我们的算法在实际中比NJ更快。此外，就拓扑准确性而言，这种BME方案相对于NJ和其他基于距离的算法有非常显著的改进，尤其是对于大树。

相似文献

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle.

J Comput Biol. 2002;9(5):687-705. doi: 10.1089/106652702761034136.

Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting.

Mol Biol Evol. 2004 Mar;21(3):587-98. doi: 10.1093/molbev/msh049. Epub 2003 Dec 23.

Robustness of phylogenetic inference based on minimum evolution.

Bull Math Biol. 2010 Oct;72(7):1820-39. doi: 10.1007/s11538-010-9510-y. Epub 2010 May 7.

Consistency of topological moves based on the balanced minimum evolution principle of phylogenetic inference.

IEEE/ACM Trans Comput Biol Bioinform. 2009 Jan-Mar;6(1):110-7. doi: 10.1109/TCBB.2008.37.

A rapid heuristic algorithm for finding minimum evolution trees.

Mol Phylogenet Evol. 2000 Aug;16(2):173-9. doi: 10.1006/mpev.1999.0728.

A multi-neighbor-joining approach for phylogenetic tree reconstruction and visualization.

Genet Mol Res. 2005 Sep 30;4(3):525-34.

Accuracy guarantees for phylogeny reconstruction algorithms based on balanced minimum evolution.

IEEE/ACM Trans Comput Biol Bioinform. 2013 May-Jun;10(3):576-83. doi: 10.1109/TCBB.2013.39.

Optimality of the neighbor joining algorithm and faces of the balanced minimum evolution polytope.

Bull Math Biol. 2011 Nov;73(11):2627-48. doi: 10.1007/s11538-011-9640-x. Epub 2011 Mar 4.

Clearcut: a fast implementation of relaxed neighbor joining.

Bioinformatics. 2006 Nov 15;22(22):2823-4. doi: 10.1093/bioinformatics/btl478. Epub 2006 Sep 18.

Efficiencies of fast algorithms of phylogenetic inference under the criteria of maximum parsimony, minimum evolution, and maximum likelihood when a large number of sequences are used.

Mol Biol Evol. 2000 Aug;17(8):1251-8. doi: 10.1093/oxfordjournals.molbev.a026408.

引用本文的文献

Antimalarial drug resistance and population structure of Plasmodium falciparum in Mozambique using genomic surveillance at health facilities in 2021 and 2022.

Sci Rep. 2025 Aug 11;15(1):29335. doi: 10.1038/s41598-025-02166-w.

Highly replicating hepatitis C virus variants emerge in immunosuppressed patients causing severe disease.

Res Sq. 2025 Jun 9:rs.3.rs-6194507. doi: 10.21203/rs.3.rs-6194507/v1.

First Mitogenome of the Critically Endangered Arabian Leopard ().

Animals (Basel). 2025 May 27;15(11):1562. doi: 10.3390/ani15111562.

Parallel sensory compensation following independent subterranean colonization by groundwater salamanders ().

Proc Natl Acad Sci U S A. 2025 Jun 10;122(23):e2504850122. doi: 10.1073/pnas.2504850122. Epub 2025 Jun 3.

Bayesian Inference of Phylogenetic Distances: Revisiting the Eigenvalue Approach.

Bull Math Biol. 2025 Jan 23;87(2):32. doi: 10.1007/s11538-024-01403-z.

Comparative genetic mapping and a consensus interspecific genetic map reveal strong synteny and collinearity within the genus.

Front Plant Sci. 2024 Dec 16;15:1475965. doi: 10.3389/fpls.2024.1475965. eCollection 2024.

A regression based approach to phylogenetic reconstruction from multi-sample bulk DNA sequencing of tumors.

PLoS Comput Biol. 2024 Dec 4;20(12):e1012631. doi: 10.1371/journal.pcbi.1012631. eCollection 2024 Dec.

Early separation and parallel clonal selection of dedifferentiated and well-differentiated components in dedifferentiated liposarcoma.

Neoplasia. 2025 Jan;59:101074. doi: 10.1016/j.neo.2024.101074. Epub 2024 Nov 25.

Resolving tumor evolution: a phylogenetic approach.

J Natl Cancer Cent. 2024 Mar 21;4(2):97-106. doi: 10.1016/j.jncc.2024.03.001. eCollection 2024 Jun.

The Metabolism of Genus Decoded by Comparative Genomics.

Microorganisms. 2024 Jul 20;12(7):1487. doi: 10.3390/microorganisms12071487.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

基于最小进化原理的快速准确的系统发育重建算法。

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle.

作者信息

Desper Richard, Gascuel Olivier

机构信息

National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health, 45 Center Drive, Bethesda, MD 20892, USA.

出版信息

J Comput Biol. 2002;9(5):687-705. doi: 10.1089/106652702761034136.

DOI:10.1089/106652702761034136

PMID:12487758

Abstract

摘要

基于最小进化原理的快速准确的系统发育重建算法。

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

基于最小进化原理的快速准确的系统发育重建算法。

Fast and accurate phylogeny reconstruction algorithms based on the minimum-evolution principle.

作者信息

机构信息

出版信息

相似文献

引用本文的文献