• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

核苷酸替代模型假设对非参数自举支持度估计的影响。

The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support.

作者信息

Buckley Thomas R, Cunningham Clifford W

机构信息

Department of Biology, Duke University, Durham, North Carolina, USA.

出版信息

Mol Biol Evol. 2002 Apr;19(4):394-405. doi: 10.1093/oxfordjournals.molbev.a004094.

DOI:10.1093/oxfordjournals.molbev.a004094
PMID:11919280
Abstract

The use of parameter-rich substitution models in molecular phylogenetics has been criticized on the basis that these models can cause a reduction both in accuracy and in the ability to discriminate among competing topologies. We have explored the relationship between nucleotide substitution model complexity and nonparametric bootstrap support under maximum likelihood (ML) for six data sets for which the true relationships are known with a high degree of certainty. We also performed equally weighted maximum parsimony analyses in order to assess the effects of ignoring branch length information during tree selection. We observed that maximum parsimony gave the lowest mean estimate of bootstrap support for the correct set of nodes relative to the ML models for every data set except one. For several data sets, we established that the exact distribution used to model among-site rate variation was critical for a successful phylogenetic analysis. Site-specific rate models were shown to perform very poorly relative to gamma and invariable sites models for several of the data sets most likely because of the gross underestimation of branch lengths. The invariable sites model also performed poorly for several data sets where this model had a poor fit to the data, suggesting that addition of the gamma distribution can be critical. Estimates of bootstrap support for the correct nodes often increased under gamma and invariable sites models relative to equal rates models. Our observations are contrary to the prediction that such models cause reduced confidence in phylogenetic hypotheses. Our results raise several issues regarding the process of model selection, and we briefly discuss model selection uncertainty and the role of sensitivity analyses in molecular phylogenetics.

摘要

在分子系统发育学中,富含参数的替代模型的使用受到了批评,理由是这些模型可能会导致准确性以及区分竞争拓扑结构的能力下降。我们针对六个已知真实关系且具有高度确定性的数据集,探讨了核苷酸替代模型复杂性与最大似然法(ML)下非参数自展支持率之间的关系。我们还进行了等加权最大简约分析,以评估在树选择过程中忽略分支长度信息的影响。我们观察到,除了一个数据集外,相对于ML模型,最大简约法对正确节点集的自展支持率的平均估计值最低。对于几个数据集,我们确定用于模拟位点间速率变化的精确分布对于成功的系统发育分析至关重要。对于几个数据集,位点特异性速率模型相对于伽马和不变位点模型表现非常差,这很可能是因为分支长度被严重低估。对于几个该模型与数据拟合不佳的数据集,不变位点模型的表现也很差,这表明添加伽马分布可能至关重要。相对于等速率模型,在伽马和不变位点模型下,对正确节点的自展支持率估计值通常会增加。我们的观察结果与此类模型会降低对系统发育假设的信心这一预测相反。我们的结果引发了有关模型选择过程的几个问题,并且我们简要讨论了模型选择的不确定性以及敏感性分析在分子系统发育学中的作用。

相似文献

1
The effects of nucleotide substitution model assumptions on estimates of nonparametric bootstrap support.核苷酸替代模型假设对非参数自举支持度估计的影响。
Mol Biol Evol. 2002 Apr;19(4):394-405. doi: 10.1093/oxfordjournals.molbev.a004094.
2
Exploring among-site rate variation models in a maximum likelihood framework using empirical data: effects of model assumptions on estimates of topology, branch lengths, and bootstrap support.在最大似然框架下使用经验数据探索位点间速率变化模型:模型假设对拓扑结构、分支长度和自展支持度估计的影响。
Syst Biol. 2001 Feb;50(1):67-86.
3
Comparison of models for nucleotide substitution used in maximum-likelihood phylogenetic estimation.用于最大似然系统发育估计的核苷酸替换模型比较。
Mol Biol Evol. 1994 Mar;11(2):316-24. doi: 10.1093/oxfordjournals.molbev.a040112.
4
The devil in the details: interactions between the branch-length prior and likelihood model affect node support and branch lengths in the phylogeny of the Psoraceae.细节中的魔鬼:分支长度先验和似然模型之间的相互作用影响了 Psoraceae 系统发育中的节点支持和分支长度。
Syst Biol. 2011 Jul;60(4):541-61. doi: 10.1093/sysbio/syr022. Epub 2011 Mar 24.
5
Relative efficiencies of the maximum-likelihood, neighbor-joining, and maximum-parsimony methods when substitution rate varies with site.当替换率随位点变化时,最大似然法、邻接法和最大简约法的相对效率。
Mol Biol Evol. 1994 Mar;11(2):261-77. doi: 10.1093/oxfordjournals.molbev.a040108.
6
Bayes or bootstrap? A simulation study comparing the performance of Bayesian Markov chain Monte Carlo sampling and bootstrapping in assessing phylogenetic confidence.贝叶斯法还是自助法?一项比较贝叶斯马尔可夫链蒙特卡罗抽样和自助法在评估系统发育置信度时性能的模拟研究。
Mol Biol Evol. 2003 Feb;20(2):255-66. doi: 10.1093/molbev/msg028.
7
Phylogenetic analysis using parsimony and likelihood methods.使用简约法和似然法进行系统发育分析。
J Mol Evol. 1996 Feb;42(2):294-307. doi: 10.1007/BF02198856.
8
Phylogeny of Bembidion and related ground beetles (Coleoptera: Carabidae: Trechinae: Bembidiini: Bembidiina).Bembidion 及相关步甲(鞘翅目:步甲科:步甲亚科:拟步甲族:拟步甲亚族)的系统发育。
Mol Phylogenet Evol. 2012 Jun;63(3):533-76. doi: 10.1016/j.ympev.2012.01.015. Epub 2012 Mar 13.
9
Does choice in model selection affect maximum likelihood analysis?模型选择中的选择会影响最大似然分析吗?
Syst Biol. 2008 Feb;57(1):76-85. doi: 10.1080/10635150801898920.
10
Tail paradox, partial identifiability, and influential priors in Bayesian branch length inference.贝叶斯分支长度推断中的尾部悖论、部分可识别性和有影响的先验。
Mol Biol Evol. 2012 Jan;29(1):325-35. doi: 10.1093/molbev/msr210. Epub 2011 Sep 2.

引用本文的文献

1
The impact of software and criteria on the selection of best-fit nucleotide substitution models for molecular evolutionary genetic analysis.软件和标准对分子进化遗传分析中最佳拟合核苷酸替换模型选择的影响。
PLoS One. 2025 Mar 26;20(3):e0319774. doi: 10.1371/journal.pone.0319774. eCollection 2025.
2
Predicting Phylogenetic Bootstrap Values via Machine Learning.基于机器学习的系统发育自举值预测。
Mol Biol Evol. 2024 Oct 4;41(10). doi: 10.1093/molbev/msae215.
3
Molecular occurrence and genetic diversity of Ehrlichia canis in naturally infected dogs from Thailand.
泰国自然感染犬埃立克体的分子发生和遗传多样性。
Sci Rep. 2023 Nov 21;13(1):20394. doi: 10.1038/s41598-023-47784-4.
4
Fast-Evolving Alignment Sites Are Highly Informative for Reconstructions of Deep Tree of Life Phylogenies.快速进化的比对位点对重建生命之树的深层系统发育具有高度信息价值。
Microorganisms. 2023 Oct 5;11(10):2499. doi: 10.3390/microorganisms11102499.
5
Viral genome sequence datasets display pervasive evidence of strand-specific substitution biases that are best described using non-reversible nucleotide substitution models.病毒基因组序列数据集显示出普遍存在的链特异性替代偏差证据,使用不可逆核苷酸替代模型能对其进行最佳描述。
Res Sq. 2022 Dec 29:rs.3.rs-2407778. doi: 10.21203/rs.3.rs-2407778/v1.
6
Molecular characterization and genetic diversity of and of cattle in Thailand.泰国牛和牛的分子特征和遗传多样性。
Front Cell Infect Microbiol. 2022 Nov 29;12:1065963. doi: 10.3389/fcimb.2022.1065963. eCollection 2022.
7
Taming the Selection of Optimal Substitution Models in Phylogenomics by Site Subsampling and Upsampling.通过位点抽样和上采样来驯服系统发育基因组学中最优替代模型的选择。
Mol Biol Evol. 2022 Nov 3;39(11). doi: 10.1093/molbev/msac236.
8
Complete chloroplast genomes of two medicinal Swertia species: the comparative evolutionary analysis of Swertia genus in the Gentianaceae family.两种药用獐牙菜属植物的完整叶绿体基因组:龙胆科獐牙菜属的比较进化分析。
Planta. 2022 Sep 9;256(4):73. doi: 10.1007/s00425-022-03987-z.
9
Analysis of lineage-specific protein family variability in prokaryotes combined with evolutionary reconstructions.分析原核生物中谱系特异性蛋白家族的变异性,并结合进化重建。
Biol Direct. 2022 Aug 30;17(1):22. doi: 10.1186/s13062-022-00337-7.
10
The effect of alignment uncertainty, substitution models and priors in building and dating the mammal tree of life.在构建和定时代哺乳动物系统发育树时,配准不确定性、替代模型和先验概率的影响。
BMC Evol Biol. 2019 Nov 6;19(1):203. doi: 10.1186/s12862-019-1534-9.