Robustness to divergence time underestimation when inferring species trees from estimated gene trees.

Author affiliations

Department of Integrative Biology, University of California, Berkeley, CA 94720, USA; Department of Biology, Pennsylvania State University, University Park, PA 16802, USA; and Department of Mathematics and Statistics, University of New Mexico, 1 University of New Mexico, Albuquerque, NM 87131, USA.

Publication information

Syst Biol. 2014 Jan 1;63(1):66-82. doi: 10.1093/sysbio/syt059. Epub 2013 Aug 29.

DOI: 10.1093/sysbio/syt059
PMID: 23988674
Abstract

To infer species trees from gene trees estimated from phylogenomic data sets, tractable methods are needed that can handle dozens to hundreds of loci. We examine several computationally efficient approaches (MP-EST, STAR, STEAC, STELLS, and STEM) for inferring species trees from gene trees estimated using maximum likelihood (ML) and Bayesian approaches. Among the methods examined, we found that topology-based methods often performed better using ML gene trees and methods employing coalescent times typically performed better using Bayesian gene trees, with MP-EST, STAR, STEAC, and STELLS outperforming STEM under most conditions. We examine why the STEM tree (also called GLASS or Maximum Tree) is less accurate on estimated gene trees by comparing estimated and true coalescence times, performing species tree inference using simulations, and analyzing a great ape data set while keeping track of false positive and false negative rates for inferred clades. We find that although true coalescence times are more ancient than speciation times under the multispecies coalescent model, estimated coalescence times are often more recent than speciation times. This underestimation can lead to increased bias and lack of resolution with increased sampling (either alleles or loci) when gene trees are estimated with ML. The problem appears to be less severe using Bayesian gene-tree estimates.
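
The reasoning about underestimated coalescence times can be illustrated with a toy simulation. The sketch below is not the authors' simulation design: it assumes a single pair of species, a hypothetical speciation time `tau`, exponentially distributed waiting times before coalescence, and additive Gaussian error standing in for gene-tree estimation error. It uses the minimum coalescence time across loci as the divergence estimate, which is the idea behind GLASS / Maximum Tree style methods. All names and parameter values are illustrative assumptions.

```python
import random

def min_coalescence_time(times):
    """Divergence estimate for one species pair: the minimum coalescence time over loci."""
    return min(times)

def simulate_loci(n_loci, tau=1.0, mean_wait=1.0, noise_sd=0.3, seed=0):
    """Return (true_times, estimated_times) for n_loci independent loci.

    Assumptions (illustrative only):
      - true coalescence time = tau + exponential waiting time, so it is never below tau
      - estimated time = true time + Gaussian error, truncated at zero
    """
    rng = random.Random(seed)
    true_times = [tau + rng.expovariate(1.0 / mean_wait) for _ in range(n_loci)]
    est_times = [max(0.0, t + rng.gauss(0.0, noise_sd)) for t in true_times]
    return true_times, est_times

TAU = 1.0
true_times, est_times = simulate_loci(500, tau=TAU)

# Use nested subsets of the same loci so the effect of adding loci is directly comparable.
results = []
for n in (5, 50, 500):
    true_min = min_coalescence_time(true_times[:n])
    est_min = min_coalescence_time(est_times[:n])
    results.append((n, true_min, est_min))
    print(f"loci={n:4d}  min true time={true_min:.3f}  min estimated time={est_min:.3f}")
```

Under these assumptions the minimum of the true times approaches `tau` from above as loci are added, whereas the minimum of the noisy estimates can only stay the same or move lower, and it tends to fall below `tau`. This mirrors the abstract's point that more sampling can increase bias when estimated coalescence times are too recent.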
