• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

系统发育推断中处理异速现象模型的评估。

Evaluation of the models handling heterotachy in phylogenetic inference.

作者信息

Zhou Yan, Rodrigue Nicolas, Lartillot Nicolas, Philippe Hervé

机构信息

Canadian Institute for Advanced Research, Département de Biochimie, Université de Montréal, Succursale Centre-Ville, Montréal, Québec H3C3J7, Canada.

出版信息

BMC Evol Biol. 2007 Nov 1;7:206. doi: 10.1186/1471-2148-7-206.

DOI:10.1186/1471-2148-7-206
PMID:17974035
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC2248194/
Abstract

BACKGROUND

The evolutionary rate at a given homologous position varies across time. When sufficiently pronounced, this phenomenon - called heterotachy - may produce artefactual phylogenetic reconstructions under the commonly used models of sequence evolution. These observations have motivated the development of models that explicitly recognize heterotachy, with research directions proposed along two main axes: 1) the covarion approach, where sites switch from variable to invariable states; and 2) the mixture of branch lengths (MBL) approach, where alignment patterns are assumed to arise from one of several sets of branch lengths, under a given phylogeny.

RESULTS

Here, we report the first statistical comparisons contrasting the performance of covarion and MBL modeling strategies. Using simulations under heterotachous conditions, we explore the properties of three model comparison methods: the Akaike information criterion, the Bayesian information criterion, and cross validation. Although more time consuming, cross validation appears more reliable than AIC and BIC as it directly measures the predictive power of a model on 'future' data. We also analyze three large datasets (nuclear proteins of animals, mitochondrial proteins of mammals, and plastid proteins of plants), and find the optimal number of components of the MBL model to be two for all datasets, indicating that this model is preferred over the standard homogeneous model. However, the covarion model is always favored over the optimal MBL model.

CONCLUSION

We demonstrated, using three large datasets, that the covarion model is more efficient at handling heterotachy than the MBL model. This is probably due to the fact that the MBL model requires a serious increase in the number of parameters, as compared to two supplementary parameters of the covarion approach. Further improvements of the both the mixture and the covarion approaches might be obtained by modeling heterogeneous behavior both along time and across sites.

摘要

背景

给定同源位置的进化速率会随时间变化。当这种现象足够明显时——称为异速进化——在常用的序列进化模型下可能会产生人为的系统发育重建。这些观察结果推动了明确识别异速进化的模型的发展,研究方向主要沿着两个主轴提出:1)协变位点模型,其中位点从可变状态转变为不变状态;2)分支长度混合(MBL)模型,在给定的系统发育下,比对模式被假定为由几组分支长度中的一组产生。

结果

在此,我们报告了首次对协变位点模型和MBL建模策略的性能进行对比的统计分析。利用异速进化条件下的模拟,我们探索了三种模型比较方法的特性:赤池信息准则、贝叶斯信息准则和交叉验证。虽然交叉验证耗时更多,但它似乎比AIC和BIC更可靠,因为它直接测量模型对“未来”数据的预测能力。我们还分析了三个大型数据集(动物的核蛋白、哺乳动物的线粒体蛋白和植物的质体蛋白),并发现所有数据集的MBL模型的最佳成分数量均为两个,这表明该模型优于标准的齐次模型。然而,协变位点模型总是比最优的MBL模型更受青睐。

结论

我们利用三个大型数据集证明,协变位点模型在处理异速进化方面比MBL模型更有效。这可能是由于与协变位点模型的两个补充参数相比,MBL模型需要大幅增加参数数量。通过对沿时间和跨位点的异质性行为进行建模,可能会进一步改进混合模型和协变位点模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/024692057e14/1471-2148-7-206-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/211e156b40f2/1471-2148-7-206-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/14121a9bd1b2/1471-2148-7-206-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/0e978c9f52b5/1471-2148-7-206-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/024692057e14/1471-2148-7-206-4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/211e156b40f2/1471-2148-7-206-1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/14121a9bd1b2/1471-2148-7-206-2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/0e978c9f52b5/1471-2148-7-206-3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c9e9/2248194/024692057e14/1471-2148-7-206-4.jpg

相似文献

1
Evaluation of the models handling heterotachy in phylogenetic inference.系统发育推断中处理异速现象模型的评估。
BMC Evol Biol. 2007 Nov 1;7:206. doi: 10.1186/1471-2148-7-206.
2
Modelling heterotachy in phylogenetic inference by reversible-jump Markov chain Monte Carlo.通过可逆跳跃马尔可夫链蒙特卡罗方法在系统发育推断中对异速进行建模。
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3955-64. doi: 10.1098/rstb.2008.0178.
3
A dirichlet process covarion mixture model and its assessments using posterior predictive discrepancy tests.Dirichlet 过程协变量混合模型及其使用后验预测差异检验的评估。
Mol Biol Evol. 2010 Feb;27(2):371-84. doi: 10.1093/molbev/msp248. Epub 2009 Oct 12.
4
A mixed branch length model of heterotachy improves phylogenetic accuracy.一种异速进化的混合分支长度模型提高了系统发育准确性。
Mol Biol Evol. 2008 Jun;25(6):1054-66. doi: 10.1093/molbev/msn042. Epub 2008 Mar 3.
5
Heterotachy and long-branch attraction in phylogenetics.系统发育学中的异速进化和长枝吸引
BMC Evol Biol. 2005 Oct 6;5:50. doi: 10.1186/1471-2148-5-50.
6
A test for heterotachy using multiple pairs of sequences.使用多对序列进行异序测试。
Mol Biol Evol. 2011 May;28(5):1661-73. doi: 10.1093/molbev/msq346. Epub 2010 Dec 24.
7
PROCOV: maximum likelihood estimation of protein phylogeny under covarion models and site-specific covarion pattern analysis.PROCOV:共变模型下蛋白质系统发育的最大似然估计及位点特异性共变模式分析
BMC Evol Biol. 2009 Sep 8;9:225. doi: 10.1186/1471-2148-9-225.
8
Markov-Modulated Continuous-Time Markov Chains to Identify Site- and Branch-Specific Evolutionary Variation in BEAST.马科夫调制连续时间马科夫链在 BEAST 中识别位点和分支特异性进化变化。
Syst Biol. 2021 Jan 1;70(1):181-189. doi: 10.1093/sysbio/syaa037.
9
Phylogenetic substitution models for detecting heterotachy during plastid evolution.检测质体进化中异时发生的系统发育替代模型。
Mol Biol Evol. 2011 Jan;28(1):449-58. doi: 10.1093/molbev/msq215. Epub 2010 Aug 19.
10
Testing for covarion-like evolution in protein sequences.检测蛋白质序列中的类共变进化。
Mol Biol Evol. 2007 Jan;24(1):294-305. doi: 10.1093/molbev/msl155. Epub 2006 Oct 20.

引用本文的文献

1
Performance of tree-building methods using a morphological dataset and a well-supported Hexapoda phylogeny.基于形态数据集和支持良好的六足动物系统发育构建树的方法的性能。
PeerJ. 2024 Jan 8;12:e16706. doi: 10.7717/peerj.16706. eCollection 2024.
2
Is Over-parameterization a Problem for Profile Mixture Models?过参数化对轮廓混合模型是一个问题吗?
Syst Biol. 2024 May 27;73(1):53-75. doi: 10.1093/sysbio/syad063.
3
Compositionally Constrained Sites Drive Long-Branch Attraction.组成受限的位点驱动长枝吸引。

本文引用的文献

1
Among-site rate variation and its impact on phylogenetic analyses.种间变异率及其对系统发育分析的影响。
Trends Ecol Evol. 1996 Sep;11(9):367-72. doi: 10.1016/0169-5347(96)10041-0.
2
Optimization by simulated annealing.模拟退火优化。
Science. 1983 May 13;220(4598):671-80. doi: 10.1126/science.220.4598.671.
3
Identifying dramatic selection shifts in phylogenetic trees.识别系统发育树中显著的选择转移。
Syst Biol. 2023 Aug 7;72(4):767-780. doi: 10.1093/sysbio/syad013.
4
Performance of Akaike Information Criterion and Bayesian Information Criterion in Selecting Partition Models and Mixture Models.Akaike 信息准则和贝叶斯信息准则在选择划分模型和混合模型中的性能。
Syst Biol. 2023 May 19;72(1):92-105. doi: 10.1093/sysbio/syac081.
5
Phylogenetic tree building in the genomic age.基因组时代的系统发育树构建。
Nat Rev Genet. 2020 Jul;21(7):428-444. doi: 10.1038/s41576-020-0233-0. Epub 2020 May 18.
6
Trichostomatid Ciliates (Alveolata, Ciliophora, Trichostomatia) Systematics and Diversity: Past, Present, and Future.毛口目纤毛虫(囊泡虫类、纤毛门、毛口目)的系统学与多样性:过去、现在与未来
Front Microbiol. 2020 Jan 15;10:2967. doi: 10.3389/fmicb.2019.02967. eCollection 2019.
7
Evaluating Statistical Multiple Sequence Alignment in Comparison to Other Alignment Methods on Protein Data Sets.评估统计多重序列比对与蛋白质数据集上其他比对方法的比较。
Syst Biol. 2019 May 1;68(3):396-411. doi: 10.1093/sysbio/syy068.
8
The prevalence of terraced treescapes in analyses of phylogenetic data sets.分析系统发育数据集时,阶地式树景的出现频率。
BMC Evol Biol. 2018 Apr 4;18(1):46. doi: 10.1186/s12862-018-1162-9.
9
Selecting optimal partitioning schemes for phylogenomic datasets.选择基因组数据集的最佳分区方案。
BMC Evol Biol. 2014 Apr 17;14:82. doi: 10.1186/1471-2148-14-82.
10
Evolutionary Modeling of Genotype-Phenotype Associations, and Application to Primate Coding and Non-coding mtDNA Rate Variation.基因型-表型关联的进化建模,及其在灵长类编码和非编码 mtDNA 速率变异中的应用。
Evol Bioinform Online. 2013 Jul 28;9:301-16. doi: 10.4137/EBO.S11600. Print 2013.
BMC Evol Biol. 2007 Feb 8;7 Suppl 1(Suppl 1):S10. doi: 10.1186/1471-2148-7-S1-S10.
4
Phylogenetic analyses of nuclear, mitochondrial, and plastid multigene data sets support the placement of Mesostigma in the Streptophyta.对核基因、线粒体基因和质体多基因数据集的系统发育分析支持将中带藻置于链形植物门中。
Mol Biol Evol. 2007 Mar;24(3):723-31. doi: 10.1093/molbev/msl200. Epub 2006 Dec 16.
5
Testing for covarion-like evolution in protein sequences.检测蛋白质序列中的类共变进化。
Mol Biol Evol. 2007 Jan;24(1):294-305. doi: 10.1093/molbev/msl155. Epub 2006 Oct 20.
6
Lack of resolution in the animal phylogeny: closely spaced cladogeneses or undetected systematic errors?动物系统发育中分辨率的缺失:是紧密间隔的分支发生还是未被检测到的系统误差?
Mol Biol Evol. 2007 Jan;24(1):6-9. doi: 10.1093/molbev/msl137. Epub 2006 Sep 29.
7
A Bayesian compound stochastic process for modeling nonstationary and nonhomogeneous sequence evolution.一种用于对非平稳和非齐次序列进化进行建模的贝叶斯复合随机过程。
Mol Biol Evol. 2006 Nov;23(11):2058-71. doi: 10.1093/molbev/msl091. Epub 2006 Aug 24.
8
Heterotachy processes in rhodophyte-derived secondhand plastid genes: Implications for addressing the origin and evolution of dinoflagellate plastids.红藻衍生的二手质体基因中的异速过程:对解决甲藻质体起源与进化问题的启示
Mol Biol Evol. 2006 Aug;23(8):1504-15. doi: 10.1093/molbev/msl011. Epub 2006 May 12.
9
Heterotachy in mammalian promoter evolution.哺乳动物启动子进化中的异速变化。
PLoS Genet. 2006 Apr;2(4):e30. doi: 10.1371/journal.pgen.0020030. Epub 2006 Apr 28.
10
An improved statistical method for detecting heterotachy in nucleotide sequences.一种用于检测核苷酸序列中异速现象的改进统计方法。
Mol Biol Evol. 2006 Jul;23(7):1397-405. doi: 10.1093/molbev/msl006. Epub 2006 May 3.