重建一棵使和解最小化的超级基因树。

Reconstructing a SuperGeneTree minimizing reconciliation.

作者信息

Lafond Manuel, Ouangraoua Aïda, El-Mabrouk Nadia

出版信息

BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S4. doi: 10.1186/1471-2105-16-S14-S4. Epub 2015 Oct 2.

DOI:10.1186/1471-2105-16-S14-S4

PMID:26451911

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4602317/

Abstract

Combining a set of trees on partial datasets into a single tree is a classical method for inferring large phylogenetic trees. Ideally, the combined tree should display each input partial tree, which is only possible if input trees do not contain contradictory phylogenetic information. The simplest version of the supertree problem is thus to state whether a set of trees is compatible, and if so, construct a tree displaying them all. Classically, supertree methods have been applied to the reconstruction of species trees. Here we rather consider reconstructing a super gene tree in light of a known species tree S. We define the supergenetree problem as finding, among all supertrees displaying a set of input gene trees, one supertree minimizing a reconciliation distance with S. We first show how classical exact methods to the supertree problem can be extended to the supergenetree problem. As all these methods are highly exponential, we also exhibit a natural greedy heuristic for the duplication cost, based on minimizing the set of duplications preceding the first speciation event. We then show that both the supergenetree problem and its restriction to minimizing duplications preceding the first speciation are NP-hard to approximate within a n1-ϵ factor, for any 0 < ϵ < 1. Finally, we show that a restriction of this problem to uniquely labeled speciation gene trees, which is relevant to many biological applications, is also NP-hard. Therefore, we introduce new avenues in the field of supertrees, and set the theoretical basis for the exploration of various algorithmic aspects of the problems.

摘要

将部分数据集上的一组树合并为一棵单一的树是推断大型系统发育树的经典方法。理想情况下，合并后的树应展示每棵输入的部分树，只有当输入树不包含相互矛盾的系统发育信息时才有可能。因此，超树问题的最简单版本是说明一组树是否兼容，如果兼容，则构建一棵展示所有这些树的树。传统上，超树方法已应用于物种树的重建。在这里，我们考虑根据已知的物种树S重建一棵超基因树。我们将超基因树问题定义为在展示一组输入基因树的所有超树中，找到一棵与S的和解距离最小的超树。我们首先展示了如何将超树问题的经典精确方法扩展到超基因树问题。由于所有这些方法都是高度指数级的，我们还基于最小化第一次物种形成事件之前的重复集，展示了一种针对重复成本的自然贪婪启发式方法。然后我们表明，对于任何0 < ϵ < 1，超基因树问题及其对最小化第一次物种形成之前的重复的限制在n1-ϵ因子内难以近似求解。最后，我们表明将这个问题限制在唯一标记的物种形成基因树上，这与许多生物学应用相关，也是NP难的。因此，我们在超树领域引入了新的途径，并为探索该问题的各种算法方面奠定了理论基础。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be1b/4602317/cbedb405a6b2/1471-2105-16-S14-S4-1.jpg

相似文献

Reconstructing a SuperGeneTree minimizing reconciliation.

BMC Bioinformatics. 2015;16 Suppl 14(Suppl 14):S4. doi: 10.1186/1471-2105-16-S14-S4. Epub 2015 Oct 2.

Gene Tree Construction and Correction Using SuperTree and Reconciliation.

IEEE/ACM Trans Comput Biol Bioinform. 2018 Sep-Oct;15(5):1560-1570. doi: 10.1109/TCBB.2017.2720581. Epub 2017 Jun 27.

Cubic time algorithms of amalgamating gene trees and building evolutionary scenarios.

Biol Direct. 2012 Dec 22;7:48. doi: 10.1186/1745-6150-7-48.

Reconstructing protein and gene phylogenies using reconciliation and soft-clustering.

J Bioinform Comput Biol. 2017 Dec;15(6):1740007. doi: 10.1142/S0219720017400078. Epub 2017 Oct 19.

Algorithms: simultaneous error-correction and rooting for gene tree reconciliation and the gene duplication problem.

BMC Bioinformatics. 2012 Jun 25;13 Suppl 10(Suppl 10):S14. doi: 10.1186/1471-2105-13-S10-S14.

Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees.

IEEE/ACM Trans Comput Biol Bioinform. 2019 Jul-Aug;16(4):1077-1090. doi: 10.1109/TCBB.2017.2710342. Epub 2017 Jun 1.

Split-based computation of majority-rule supertrees.

BMC Evol Biol. 2011 Jul 13;11:205. doi: 10.1186/1471-2148-11-205.

Inferring duplication episodes from unrooted gene trees.

BMC Genomics. 2018 May 8;19(Suppl 5):288. doi: 10.1186/s12864-018-4623-z.

COSPEDTree: COuplet Supertree by Equivalence Partitioning of Taxa Set and DAG Formation.

IEEE/ACM Trans Comput Biol Bioinform. 2015 May-Jun;12(3):590-603. doi: 10.1109/TCBB.2014.2366778.

Minimum-flip supertrees: complexity and algorithms.

IEEE/ACM Trans Comput Biol Bioinform. 2006 Apr-Jun;3(2):165-73. doi: 10.1109/TCBB.2006.26.

引用本文的文献

Evolution through segmental duplications and losses: a Super-Reconciliation approach.

Algorithms Mol Biol. 2020 May 26;15:12. doi: 10.1186/s13015-020-00171-4. eCollection 2020.

Inferring duplication episodes from unrooted gene trees.

BMC Genomics. 2018 May 8;19(Suppl 5):288. doi: 10.1186/s12864-018-4623-z.

本文引用的文献

Joint amalgamation of most parsimonious reconciled gene trees.

Bioinformatics. 2015 Mar 15;31(6):841-8. doi: 10.1093/bioinformatics/btu728. Epub 2014 Nov 6.

PANTHER in 2013: modeling the evolution of gene function, and other gene attributes, in the context of phylogenetic trees.

Nucleic Acids Res. 2013 Jan;41(Database issue):D377-86. doi: 10.1093/nar/gks1118. Epub 2012 Nov 27.

MRL and SuperFine+MRL: new supertree methods.

Algorithms Mol Biol. 2012 Jan 26;7(1):3. doi: 10.1186/1748-7188-7-3.

SuperFine: fast and accurate supertree estimation.

Syst Biol. 2012 Mar;61(2):214-27. doi: 10.1093/sysbio/syr092. Epub 2011 Sep 20.

Proteinortho: detection of (co-)orthologs in large-scale analysis.

BMC Bioinformatics. 2011 Apr 28;12:124. doi: 10.1186/1471-2105-12-124.

MetaPhOrs: orthology and paralogy predictions from multiple phylogenetic evidence using a consistency-based confidence score.

Nucleic Acids Res. 2011 Mar;39(5):e32. doi: 10.1093/nar/gkq953. Epub 2010 Dec 11.

PhylomeDB v3.0: an expanding repository of genome-wide collections of trees, alignments and phylogeny-based orthology and paralogy predictions.

Nucleic Acids Res. 2011 Jan;39(Database issue):D556-60. doi: 10.1093/nar/gkq1109. Epub 2010 Nov 12.

SuperTriplets: a triplet-based supertree approach to phylogenomics.

Bioinformatics. 2010 Jun 15;26(12):i115-23. doi: 10.1093/bioinformatics/btq196.

Robinson-Foulds supertrees.

Algorithms Mol Biol. 2010 Feb 24;5:18. doi: 10.1186/1748-7188-5-18.

Databases of homologous gene families for comparative genomics.

BMC Bioinformatics. 2009 Jun 16;10 Suppl 6(Suppl 6):S3. doi: 10.1186/1471-2105-10-S6-S3.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

重建一棵使和解最小化的超级基因树。

Reconstructing a SuperGeneTree minimizing reconciliation.

作者信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献