An experimental study of Quartets MaxCut and other supertree methods.

Author information

Swenson M Shel, Suri Rahul, Linder C Randal, Warnow Tandy

Affiliations

Department of Computer Science, The University of Texas at Austin, Austin TX, USA.

Publication information

Algorithms Mol Biol. 2011 Apr 19;6:7. doi: 10.1186/1748-7188-6-7.

Abstract

BACKGROUND

Supertree methods represent one of the major ways by which the Tree of Life can be estimated, but despite many recent algorithmic innovations, matrix representation with parsimony (MRP) remains the main algorithmic supertree method.
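As an illustration of what MRP takes as input, the sketch below builds the standard matrix-representation encoding from a set of rooted source trees: each internal edge of each source tree becomes one binary character, scored `1` for taxa inside the clade, `0` for taxa in that tree but outside the clade, and `?` for taxa missing from that tree. The nested-tuple tree representation and the function names (`leaves`, `clusters`, `mrp_matrix`) are assumptions made for this example, not part of the study; the parsimony search that MRP then runs on the matrix is not shown.

```python
def leaves(tree):
    """Return the set of leaf labels of a nested-tuple tree."""
    if isinstance(tree, str):
        return {tree}
    return set().union(*(leaves(child) for child in tree))


def clusters(tree):
    """Return the leaf sets of all internal nodes other than the root."""
    found = []

    def visit(node, is_root):
        if isinstance(node, str):
            return
        if not is_root:
            found.append(frozenset(leaves(node)))
        for child in node:
            visit(child, False)

    visit(tree, True)
    return found


def mrp_matrix(source_trees):
    """Encode source trees as an MRP character matrix (taxon -> string)."""
    all_taxa = sorted(set().union(*(leaves(t) for t in source_trees)))
    rows = {taxon: [] for taxon in all_taxa}
    for tree in source_trees:
        present = leaves(tree)
        for cluster in clusters(tree):
            for taxon in all_taxa:
                if taxon not in present:
                    rows[taxon].append("?")  # taxon absent from this source tree
                elif taxon in cluster:
                    rows[taxon].append("1")  # taxon inside the clade
                else:
                    rows[taxon].append("0")  # taxon in the tree, outside the clade
    return {taxon: "".join(chars) for taxon, chars in rows.items()}


# Two hypothetical source trees with overlapping taxon sets.
example_trees = [(("A", "B"), ("C", "D")), (("B", "C"), "E")]
matrix = mrp_matrix(example_trees)
for taxon, row in matrix.items():
    print(taxon, row)
```

The resulting matrix would normally be handed to a maximum parsimony heuristic, and the trees it returns are summarized as the supertree.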

RESULTS

We evaluated the performance of several supertree methods based upon the Quartets MaxCut (QMC) method of Snir and Rao and showed that two of these methods usually outperform MRP and five other supertree methods that we studied, under many realistic model conditions. However, the QMC-based methods have scalability issues that may limit their utility on large datasets. We also observed that taxon sampling impacted supertree accuracy, with poor results obtained when all of the source trees were only sparsely sampled. Finally, we showed that the popular optimality criterion of minimizing the total topological distance of the supertree to the source trees is only weakly correlated with supertree topological accuracy. Therefore evaluating supertree methods on biological datasets is problematic.
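The optimality criterion mentioned above, the total topological distance from a supertree to its source trees, can be sketched as follows. This is a minimal illustration that assumes the Robinson-Foulds (bipartition) distance as the topological distance; the abstract does not say which distance was used. Each source tree is compared with the supertree restricted to that source tree's taxa. Trees are nested tuples of taxon names, and all function names are hypothetical.

```python
def leaves(tree):
    """Return the set of leaf labels of a nested-tuple tree."""
    if isinstance(tree, str):
        return {tree}
    return set().union(*(leaves(child) for child in tree))


def restrict(tree, keep):
    """Prune the tree to the taxa in `keep`, suppressing unary nodes."""
    if isinstance(tree, str):
        return tree if tree in keep else None
    kids = [r for r in (restrict(c, keep) for c in tree) if r is not None]
    if not kids:
        return None
    if len(kids) == 1:
        return kids[0]
    return tuple(kids)


def bipartitions(tree):
    """Return the non-trivial bipartitions of the tree, viewed as unrooted.

    Each bipartition is stored as the side that excludes a fixed anchor taxon.
    """
    taxa = leaves(tree)
    anchor = min(taxa)
    splits = set()

    def visit(node):
        if isinstance(node, str):
            return {node}
        below = set()
        for child in node:
            below |= visit(child)
        side = below if anchor not in below else taxa - below
        if len(side) >= 2 and len(taxa - side) >= 2:
            splits.add(frozenset(side))
        return below

    visit(tree)
    return splits


def rf_distance(tree_a, tree_b):
    """Robinson-Foulds distance between two trees on the same taxon set."""
    return len(bipartitions(tree_a) ^ bipartitions(tree_b))


def total_distance(supertree, source_trees):
    """Sum of RF distances between each source tree and the restricted supertree."""
    return sum(
        rf_distance(restrict(supertree, leaves(source)), source)
        for source in source_trees
    )


# Hypothetical supertree and source trees.
supertree = ((("A", "B"), "C"), ("D", "E"))
sources = [(("A", "B"), ("C", "D")), (("A", "C"), ("B", "E"))]
print(total_distance(supertree, sources))  # prints 2
```

A supertree with a low value of this score fits its source trees well. The finding reported here is that such a fit is only weakly correlated with accuracy against the true tree, which is why the score alone is a poor basis for ranking methods on biological data, where the true tree is unknown.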

CONCLUSIONS

Our results show that supertree methods that improve upon MRP are possible, and that an effort should be made to produce scalable and robust implementations of the most accurate supertree methods. Also, because topological accuracy depends upon taxon sampling strategies, attempts to construct very large phylogenetic trees using supertree methods should consider the selection of source tree datasets, as well as supertree methods. Finally, since supertree topological error is only weakly correlated with the supertree's topological distance to its source trees, development and testing of supertree methods presents methodological challenges.

References cited in this article

1
The Parsimony Ratchet, a New Method for Rapid Parsimony Analysis.
Cladistics. 1999 Dec;15(4):407-414. doi: 10.1111/j.1096-0031.1999.tb00277.x.
2
Quartets MaxCut: a divide and conquer quartets algorithm.
IEEE/ACM Trans Comput Biol Bioinform. 2010 Oct-Dec;7(4):704-18. doi: 10.1109/TCBB.2008.133.
3
Robinson-Foulds supertrees.
Algorithms Mol Biol. 2010 Feb 24;5:18. doi: 10.1186/1748-7188-5-18.
4
A simulation study comparing supertree and combined analysis methods using SMIDGen.
Algorithms Mol Biol. 2010 Jan 4;5:8. doi: 10.1186/1748-7188-5-8.
5
Improved heuristics for minimum-flip supertree construction.
Evol Bioinform Online. 2007 Feb 28;2:347-56.
6
Short quartet puzzling: a new quartet-based phylogeny reconstruction algorithm.
J Comput Biol. 2008 Jan-Feb;15(1):91-103. doi: 10.1089/cmb.2007.0103.
7
PhySIC: a veto supertree method with desirable properties.
Syst Biol. 2007 Oct;56(5):798-817. doi: 10.1080/10635150701639754.
8
Imputing supertrees and supernetworks from quartets.
Syst Biol. 2007 Feb;56(1):57-67. doi: 10.1080/10635150601167013.
9
RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.
Bioinformatics. 2006 Nov 1;22(21):2688-90. doi: 10.1093/bioinformatics/btl446. Epub 2006 Aug 23.
10
The evolution of supertrees.
Trends Ecol Evol. 2004 Jun;19(6):315-22. doi: 10.1016/j.tree.2004.03.015.
