Joint amalgamation of most parsimonious reconciled gene trees.

Authors

Scornavacca Celine, Jacox Edwin, Szöllősi Gergely J

Affiliations

ISEM, UM2-CNRS-IRD, Place Eugène Bataillon 34095 Montpellier, France, Institut de Biologie Computationnelle (IBC), 95 rue de la Galéra, 34095 Montpellier, France and ELTE-MTA 'Lendület' Biophysics Research Group 1117 Bp., Pázmány P. stny. 1A., Budapest, Hungary.

Publication information

Bioinformatics. 2015 Mar 15;31(6):841-8. doi: 10.1093/bioinformatics/btu728. Epub 2014 Nov 6.

DOI:10.1093/bioinformatics/btu728

PMID:25380957

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4380024/

Abstract

MOTIVATION

Traditionally, gene phylogenies have been reconstructed solely on the basis of molecular sequences; this, however, often does not provide enough information to distinguish between statistically equivalent relationships. To address this problem, several recent methods have incorporated information on the species phylogeny in gene tree reconstruction, leading to dramatic improvements in accuracy. Probabilistic methods are able to estimate all model parameters but are computationally expensive, whereas parsimony methods, which are generally more efficient computationally, require a prior estimate of parameters and of the statistical support.

RESULTS

Here, we present the Tree Estimation using Reconciliation (TERA) algorithm, a parsimony-based, species-tree-aware method for gene tree reconstruction that relies on a scoring scheme combining duplication, transfer and loss costs with an estimate of the sequence likelihood. TERA explores all reconciled gene trees that can be amalgamated from a sample of gene trees. Using a large-scale simulated dataset, we demonstrate that TERA achieves the same accuracy as the corresponding probabilistic method while being faster, and outperforms other parsimony-based methods in both accuracy and speed. Running TERA on a set of 1099 homologous gene families from complete cyanobacterial genomes, we find that incorporating knowledge of the species tree results in a two-thirds reduction in the number of apparent transfer events.
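The idea of scoring trees "amalgamated from a sample of gene trees" can be illustrated with a small sketch. It is not the TERA implementation: the abstract does not specify how the sequence-likelihood estimate is computed or how it is weighted against event costs. The sketch assumes that tree support is approximated by conditional clade probabilities estimated from the sample, that event counts for a candidate reconciliation are already known, and that the two parts are added with a hypothetical `weight`. All function names, the cost values and the toy trees are illustrative assumptions.

```python
import math
from collections import Counter

# Rooted binary trees are nested 2-tuples of leaf names, e.g. (("A", "B"), "C").

def clade_of(tree):
    """Return the set of leaves below a node."""
    if isinstance(tree, str):
        return frozenset([tree])
    left, right = tree
    return clade_of(left) | clade_of(right)

def splits(tree):
    """Yield (clade, {left_clade, right_clade}) for every internal node."""
    if isinstance(tree, str):
        return
    left, right = tree
    l, r = clade_of(left), clade_of(right)
    yield l | r, frozenset([l, r])
    yield from splits(left)
    yield from splits(right)

def count_splits(sample):
    """Count how often each clade and each resolution of it occurs in the sample."""
    clade_counts, split_counts = Counter(), Counter()
    for tree in sample:
        for clade, split in splits(tree):
            clade_counts[clade] += 1
            split_counts[(clade, split)] += 1
    return clade_counts, split_counts

def log_ccp(tree, clade_counts, split_counts):
    """Log of the product of conditional clade probabilities of a tree.

    Returns -inf if the tree uses a split never seen in the sample,
    i.e. if it cannot be amalgamated from the sample.
    """
    total = 0.0
    for clade, split in splits(tree):
        seen = split_counts.get((clade, split), 0)
        if seen == 0:
            return -math.inf
        total += math.log(seen / clade_counts[clade])
    return total

def joint_score(n_dup, n_transfer, n_loss, log_support,
                costs=(2.0, 3.0, 1.0), weight=1.0):
    """Assumed joint score (lower is better): event costs minus weighted log support."""
    dup_cost, transfer_cost, loss_cost = costs  # hypothetical cost values
    events = n_dup * dup_cost + n_transfer * transfer_cost + n_loss * loss_cost
    return events - weight * log_support

# Two sampled trees that resolve the clades ABC and DEF differently.
sample = [
    ((("A", "B"), "C"), (("D", "E"), "F")),
    (("A", ("B", "C")), ("D", ("E", "F"))),
]
clade_counts, split_counts = count_splits(sample)

# This tree is in neither sample tree, yet every one of its splits was observed.
amalgamated = ((("A", "B"), "C"), ("D", ("E", "F")))
support = log_ccp(amalgamated, clade_counts, split_counts)
print(math.exp(support))  # support estimated from the sample
print(joint_score(n_dup=1, n_transfer=0, n_loss=2, log_support=support))
```

The toy sample shows why amalgamation enlarges the search space: the third tree recombines observed clade resolutions and so receives non-zero support even though it was never sampled, while a tree containing an unobserved split is excluded.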
