• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于最小化深度融合和最大化四重奏一致性的基因树估计种系发生树:比较研究和伪种系发生树阶地的存在。

Species Tree Estimation from Gene Trees by Minimizing Deep Coalescence and Maximizing Quartet Consistency: A Comparative Study and the Presence of Pseudo Species Tree Terraces.

机构信息

Department of Computer Science and Engineering, Bangladesh University of Engineering and Technology, Dhaka 1205, Bangladesh.

Applied Statistics and Data Science (ASDS), Department of Statistics, Jahangirnagar University, Dhaka 1342, Bangladesh.

出版信息

Syst Biol. 2021 Oct 13;70(6):1213-1231. doi: 10.1093/sysbio/syab026.

DOI:10.1093/sysbio/syab026
PMID:33844023
Abstract

Species tree estimation from multilocus data sets is extremely challenging, especially in the presence of gene tree heterogeneity across the genome due to incomplete lineage sorting (ILS). Summary methods have been developed which estimate gene trees and then combine the gene trees to estimate a species tree by optimizing various optimization scores. In this study, we have extended and adapted the concept of phylogenetic terraces to species tree estimation by "summarizing" a set of gene trees, where multiple species trees with distinct topologies may have exactly the same optimality score (i.e., quartet score, extra lineage score, etc.). We particularly investigated the presence and impacts of equally optimal trees in species tree estimation from multilocus data using summary methods by taking ILS into account. We analyzed two of the most popular ILS-aware optimization criteria: maximize quartet consistency (MQC) and minimize deep coalescence (MDC). Methods based on MQC are provably statistically consistent, whereas MDC is not a consistent criterion for species tree estimation. We present a comprehensive comparative study of these two optimality criteria. Our experiments, on a collection of data sets simulated under ILS, indicate that MDC may result in competitive or identical quartet consistency score as MQC, but could be significantly worse than MQC in terms of tree accuracy-demonstrating the presence and impacts of equally optimal species trees. This is the first known study that provides the conditions for the data sets to have equally optimal trees in the context of phylogenomic inference using summary methods. [Gene tree; incomplete lineage sorting; phylogenomic analysis, species tree; summary method.].

摘要

从多基因数据集估计物种树极具挑战性,特别是在由于不完全谱系分选(ILS)而导致基因组中存在基因树异质性的情况下。已经开发了汇总方法来估计基因树,然后通过优化各种优化分数来合并基因树以估计物种树。在这项研究中,我们通过“汇总”一组基因树,将系统发育阶地的概念扩展并应用于物种树估计,其中具有不同拓扑结构的多个物种树可能具有完全相同的最优性得分(即四分体得分,额外谱系得分等)。我们特别研究了在考虑 ILS 的情况下,使用汇总方法从多基因数据估计物种树中具有相同最优性的树的存在和影响。我们分析了两种最流行的 ILS 感知优化标准:四分体一致性最大化(MQC)和深度合并最小化(MDC)。基于 MQC 的方法在统计学上是可证明一致的,而 MDC 不是物种树估计的一致标准。我们对这两个最优性标准进行了全面的比较研究。我们在 ILS 下模拟的数据集上的实验表明,MDC 可能会导致四分体一致性得分与 MQC 相当或相同,但在树准确性方面可能比 MQC 差很多-证明了具有相同最优性的物种树的存在和影响。这是首次在使用汇总方法进行基因组推断的上下文中提供数据集具有相同最优树的条件的研究。[基因树;不完全谱系分选;系统发育基因组分析;物种树;汇总方法。]

相似文献

1
Species Tree Estimation from Gene Trees by Minimizing Deep Coalescence and Maximizing Quartet Consistency: A Comparative Study and the Presence of Pseudo Species Tree Terraces.基于最小化深度融合和最大化四重奏一致性的基因树估计种系发生树:比较研究和伪种系发生树阶地的存在。
Syst Biol. 2021 Oct 13;70(6):1213-1231. doi: 10.1093/sysbio/syab026.
2
Consistency properties of species tree inference by minimizing deep coalescences.通过最小化深度合并来推断物种树的一致性属性。
J Comput Biol. 2011 Jan;18(1):1-15. doi: 10.1089/cmb.2010.0102.
3
SVDquest: Improving SVDquartets species tree estimation using exact optimization within a constrained search space.SVDquest:在约束搜索空间内使用精确优化提高 SVDquartets 种系树估计。
Mol Phylogenet Evol. 2018 Jul;124:122-136. doi: 10.1016/j.ympev.2018.03.006. Epub 2018 Mar 9.
4
A comparative study of SVDquartets and other coalescent-based species tree estimation methods.SVDquartets与其他基于溯祖理论的物种树估计方法的比较研究。
BMC Genomics. 2015;16 Suppl 10(Suppl 10):S2. doi: 10.1186/1471-2164-16-S10-S2. Epub 2015 Oct 2.
5
To Include or Not to Include: The Impact of Gene Filtering on Species Tree Estimation Methods.包含还是不包含:基因过滤对物种树估计方法的影响。
Syst Biol. 2018 Mar 1;67(2):285-303. doi: 10.1093/sysbio/syx077.
6
Quartet Based Gene Tree Imputation Using Deep Learning Improves Phylogenomic Analyses Despite Missing Data.基于四重奏的深度学习基因树推断在存在缺失数据的情况下仍能改进系统发育基因组分析。
J Comput Biol. 2022 Nov;29(11):1156-1172. doi: 10.1089/cmb.2022.0212. Epub 2022 Sep 1.
7
Phylogenomic species tree estimation in the presence of incomplete lineage sorting and horizontal gene transfer.存在不完全谱系分选和水平基因转移情况下的系统发育基因组物种树估计
BMC Genomics. 2015;16 Suppl 10(Suppl 10):S1. doi: 10.1186/1471-2164-16-S10-S1. Epub 2015 Oct 2.
8
Species tree inference by minimizing deep coalescences.通过最小化深度合并来推断物种树。
PLoS Comput Biol. 2009 Sep;5(9):e1000501. doi: 10.1371/journal.pcbi.1000501. Epub 2009 Sep 11.
9
Estimating optimal species trees from incomplete gene trees under deep coalescence.在深度溯祖情况下从不完整基因树估计最优物种树。
J Comput Biol. 2012 Jun;19(6):591-605. doi: 10.1089/cmb.2012.0037.
10
Algorithms for MDC-based multi-locus phylogeny inference: beyond rooted binary gene trees on single alleles.基于MDC的多位点系统发育推断算法:超越单等位基因上的有根二叉基因树。
J Comput Biol. 2011 Nov;18(11):1543-59. doi: 10.1089/cmb.2011.0174. Epub 2011 Oct 28.

引用本文的文献

1
Leveraging Weighted Quartet Distributions for Enhanced Species Tree Inference from Genome-Wide Data.利用加权四重奏分布从全基因组数据中增强物种树推断
Genome Biol Evol. 2025 Sep 2;17(9). doi: 10.1093/gbe/evaf159.
2
Terraces in species tree inference from gene trees.从基因树上推断物种树的阶。
BMC Ecol Evol. 2024 Nov 4;24(1):135. doi: 10.1186/s12862-024-02309-z.