Suppr超能文献

构建多数规则超树。

Constructing majority-rule supertrees.

作者信息

Dong Jianrong, Fernández-Baca David, McMorris F R

机构信息

Department of Computer Science, Iowa State University, Ames, IA 50011, USA.

出版信息

Algorithms Mol Biol. 2010 Jan 4;5:2. doi: 10.1186/1748-7188-5-2.

Abstract

BACKGROUND

Supertree methods combine the phylogenetic information from multiple partially-overlapping trees into a larger phylogenetic tree called a supertree. Several supertree construction methods have been proposed to date, but most of these are not designed with any specific properties in mind. Recently, Cotton and Wilkinson proposed extensions of the majority-rule consensus tree method to the supertree setting that inherit many of the appealing properties of the former.

RESULTS

We study a variant of one of Cotton and Wilkinson's methods, called majority-rule (+) supertrees. After proving that a key underlying problem for constructing majority-rule (+) supertrees is NP-hard, we develop a polynomial-size exact integer linear programming formulation of the problem. We then present a data reduction heuristic that identifies smaller subproblems that can be solved independently. While this technique is not guaranteed to produce optimal solutions, it can achieve substantial problem-size reduction. Finally, we report on a computational study of our approach on various real data sets, including the 121-taxon, 7-tree Seabirds data set of Kennedy and Page.

CONCLUSIONS

The results indicate that our exact method is computationally feasible for moderately large inputs. For larger inputs, our data reduction heuristic makes it feasible to tackle problems that are well beyond the range of the basic integer programming approach. Comparisons between the results obtained by our heuristic and exact solutions indicate that the heuristic produces good answers. Our results also suggest that the majority-rule (+) approach, in both its basic form and with data reduction, yields biologically meaningful phylogenies.

摘要

背景

超树方法将来自多个部分重叠树的系统发育信息组合成一个更大的系统发育树,称为超树。迄今为止,已经提出了几种超树构建方法,但其中大多数在设计时并未考虑任何特定属性。最近,科顿和威尔金森将多数规则一致树方法扩展到超树设置,继承了前者的许多吸引人的属性。

结果

我们研究了科顿和威尔金森的一种方法的变体,称为多数规则(+)超树。在证明构建多数规则(+)超树的一个关键潜在问题是NP难问题之后,我们开发了该问题的多项式规模精确整数线性规划公式。然后,我们提出了一种数据约简启发式方法,该方法可以识别可以独立解决的较小子问题。虽然这种技术不能保证产生最优解,但它可以显著减小问题规模。最后,我们报告了对我们的方法在各种真实数据集上的计算研究,包括肯尼迪和佩奇的121分类单元、7棵树的海鸟数据集。

结论

结果表明,我们的精确方法对于中等规模的输入在计算上是可行的。对于更大的输入,我们的数据约简启发式方法使得处理远远超出基本整数规划方法范围的问题成为可能。我们的启发式方法得到的结果与精确解之间的比较表明,该启发式方法能产生较好的答案。我们的结果还表明,多数规则(+)方法,无论是其基本形式还是经过数据约简后,都能产生具有生物学意义的系统发育树。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8392/2826330/a259a9871c07/1748-7188-5-2-1.jpg

相似文献

1
Constructing majority-rule supertrees.
Algorithms Mol Biol. 2010 Jan 4;5:2. doi: 10.1186/1748-7188-5-2.
2
Split-based computation of majority-rule supertrees.
BMC Evol Biol. 2011 Jul 13;11:205. doi: 10.1186/1471-2148-11-205.
3
Robinson-Foulds supertrees.
Algorithms Mol Biol. 2010 Feb 24;5:18. doi: 10.1186/1748-7188-5-18.
4
Performance of flip supertree construction with a heuristic algorithm.
Syst Biol. 2004 Apr;53(2):299-308. doi: 10.1080/10635150490423719.
5
Majority-rule supertrees.
Syst Biol. 2007 Jun;56(3):445-52. doi: 10.1080/10635150701416682.
7
Invariant transformers of Robinson and Foulds distance matrices for Convolutional Neural Network.
J Bioinform Comput Biol. 2022 Aug;20(4):2250012. doi: 10.1142/S0219720022500123. Epub 2022 Jul 6.
8
Improved heuristics for minimum-flip supertree construction.
Evol Bioinform Online. 2007 Feb 28;2:347-56.
9
PhySIC_IST: cleaning source trees to infer more informative supertrees.
BMC Bioinformatics. 2008 Oct 4;9:413. doi: 10.1186/1471-2105-9-413.
10
Semi-strict supertrees.
Cladistics. 2002 Oct;18(5):514-525. doi: 10.1111/j.1096-0031.2002.tb00289.x.

引用本文的文献

1
Linear-time algorithms for phylogenetic tree completion under Robinson-Foulds distance.
Algorithms Mol Biol. 2020 Apr 13;15:6. doi: 10.1186/s13015-020-00166-1. eCollection 2020.
2
A new fast method for inferring multiple consensus trees using k-medoids.
BMC Evol Biol. 2018 Apr 5;18(1):48. doi: 10.1186/s12862-018-1163-8.
3
Split-based computation of majority-rule supertrees.
BMC Evol Biol. 2011 Jul 13;11:205. doi: 10.1186/1471-2148-11-205.
4
An ILP solution for the gene duplication problem.
BMC Bioinformatics. 2011 Feb 15;12 Suppl 1(Suppl 1):S14. doi: 10.1186/1471-2105-12-S1-S14.

本文引用的文献

1
Semi-strict supertrees.
Cladistics. 2002 Oct;18(5):514-525. doi: 10.1111/j.1096-0031.2002.tb00289.x.
2
Properties of majority-rule supertrees.
Syst Biol. 2009 Jun;58(3):360-7. doi: 10.1093/sysbio/syp032. Epub 2009 Jul 3.
4
Mixed integer linear programming for maximum-parsimony phylogeny inference.
IEEE/ACM Trans Comput Biol Bioinform. 2008 Jul-Sep;5(3):323-31. doi: 10.1109/TCBB.2008.26.
5
PhySIC: a veto supertree method with desirable properties.
Syst Biol. 2007 Oct;56(5):798-817. doi: 10.1080/10635150701639754.
6
Efficiently computing the Robinson-Foulds metric.
J Comput Biol. 2007 Jul-Aug;14(6):724-35. doi: 10.1089/cmb.2007.R012.
7
Majority-rule supertrees.
Syst Biol. 2007 Jun;56(3):445-52. doi: 10.1080/10635150701416682.
8
Properties of supertree methods in the consensus setting.
Syst Biol. 2007 Apr;56(2):330-7. doi: 10.1080/10635150701245370.
9
The delayed rise of present-day mammals.
Nature. 2007 Mar 29;446(7135):507-12. doi: 10.1038/nature05634.
10
Integer programming approaches to haplotype inference by pure parsimony.
IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):141-54. doi: 10.1109/TCBB.2006.24.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验