Terraces in species tree inference from gene trees.

Affiliation

Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka, 1205, Bangladesh.

Publication information

BMC Ecol Evol. 2024 Nov 4;24(1):135. doi: 10.1186/s12862-024-02309-z.

DOI:10.1186/s12862-024-02309-z
PMID:39497030
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11533290/
Abstract

A terrace in a phylogenetic tree space is a region where all trees contain the same set of subtrees, due to certain patterns of missing data among the taxa sampled, resulting in an identical optimality score for a given data set. This was first investigated in the context of phylogenetic tree estimation from sequence alignments using maximum likelihood (ML) and maximum parsimony (MP). It was later extended to the species tree inference problem from a collection of gene trees, where a set of equally optimal species trees was referred to as a "pseudo" species tree terrace which does not consider the topological proximity of the trees in terms of the induced subtrees resulting from certain patterns of missing data. In this study, we mathematically characterize species tree terraces and investigate the mathematical properties and conditions that lead multiple species trees to induce/display an identical set of locus-specific subtrees owing to missing data. We report that species tree terraces are agnostic to gene tree heterogeneity. Therefore, we introduce and characterize a special type of gene tree topology-aware terrace which we call "peak terrace". Moreover, we empirically investigated various challenges and opportunities related to species tree terraces through extensive empirical studies using simulated and real biological data. We demonstrate the prevalence of species tree terraces and the resulting ambiguity created for tree search algorithms. Remarkably, our findings indicate that the identification of terraces could potentially lead to advances that enhance the accuracy of summary methods and provide reasonably accurate branch support.
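The defining condition in the abstract (several species trees displaying an identical set of locus-specific subtrees because of missing data) can be illustrated with a minimal Python sketch. It is not the authors' implementation: the nested-tuple tree encoding, the function names (`leaves`, `clades`, `induced_splits`, `same_terrace`), and the toy taxa are all assumptions made for illustration. Trees are treated as unrooted and compared through their nontrivial bipartitions after restriction to each locus's taxon set.

```python
from itertools import chain

# Assumption: a tree is a nested tuple whose leaves are taxon-name strings,
# e.g. (("A", "B"), ("C", ("D", "E"))). The root position is ignored.

def leaves(tree):
    """Return the set of leaf labels of a nested-tuple tree."""
    if isinstance(tree, str):
        return frozenset([tree])
    return frozenset(chain.from_iterable(leaves(child) for child in tree))

def clades(tree):
    """Yield the leaf set below every internal node of the tree."""
    if isinstance(tree, str):
        return
    yield leaves(tree)
    for child in tree:
        yield from clades(child)

def induced_splits(tree, taxa):
    """Nontrivial bipartitions of the tree after restricting it to `taxa`."""
    taxa = frozenset(taxa) & leaves(tree)
    splits = set()
    for clade in clades(tree):
        side = clade & taxa
        other = taxa - side
        # A split is informative only if both sides keep at least two taxa.
        if len(side) >= 2 and len(other) >= 2:
            splits.add(frozenset([side, other]))
    return frozenset(splits)

def same_terrace(tree_a, tree_b, locus_taxa):
    """True if both trees induce the same subtree for every locus taxon set."""
    return all(
        induced_splits(tree_a, taxa) == induced_splits(tree_b, taxa)
        for taxa in locus_taxa
    )

# Toy data (hypothetical): two loci with different missing taxa.
locus_taxa = [{"A", "B", "C", "D"}, {"A", "B", "E"}]
tree1 = (("A", "B"), ("C", ("D", "E")))
tree2 = (("A", "B"), (("C", "E"), "D"))
tree3 = (("A", "C"), ("B", ("D", "E")))

print(same_terrace(tree1, tree2, locus_taxa))  # True: E's placement is unconstrained
print(same_terrace(tree1, tree3, locus_taxa))  # False: locus 1 separates them
```

Note that `same_terrace` takes only the taxon set of each locus and never the gene tree topologies, which mirrors the abstract's observation that species tree terraces are agnostic to gene tree heterogeneity.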

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/e79ce168ecb4/12862_2024_2309_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/8d36cc9985d7/12862_2024_2309_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/e190058de24b/12862_2024_2309_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/70bdcfe5ed10/12862_2024_2309_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/00093fba8ea1/12862_2024_2309_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/db8af9149d6e/12862_2024_2309_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/83aac6a1e891/12862_2024_2309_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/c40beedd4b7c/12862_2024_2309_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/9d9fa1d563cc/12862_2024_2309_Fig9_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/cae7718708f8/12862_2024_2309_Fig10_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bcf6/11533290/864fc2168873/12862_2024_2309_Fig11_HTML.jpg
