A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process.

Author information

Lartillot Nicolas, Philippe Hervé

Affiliations

Canadian Institute for Advanced Research, Département de Biochimie, Université de Montréal, Montréal, Québec, Canada.

Publication information

Mol Biol Evol. 2004 Jun;21(6):1095-109. doi: 10.1093/molbev/msh112. Epub 2004 Mar 10.

DOI:10.1093/molbev/msh112
PMID:15014145
Abstract

Most current models of sequence evolution assume that all sites of a protein evolve under the same substitution process, characterized by a 20 x 20 substitution matrix. Here, we propose to relax this assumption by developing a Bayesian mixture model that allows the amino-acid replacement pattern at different sites of a protein alignment to be described by distinct substitution processes. Our model, named CAT, assumes the existence of distinct processes (or classes) differing by their equilibrium frequencies over the 20 residues. Through the use of a Dirichlet process prior, the total number of classes and their respective amino-acid profiles, as well as the affiliations of each site to a given class, are all free variables of the model. In this way, the CAT model is able to adapt to the complexity actually present in the data, and it yields an estimate of the substitutional heterogeneity through the posterior mean number of classes. We show that a significant level of heterogeneity is present in the substitution patterns of proteins, and that the standard one-matrix model fails to account for this heterogeneity. By evaluating the Bayes factor, we demonstrate that the standard model is outperformed by CAT on all of the data sets which we analyzed. Altogether, these results suggest that the complexity of the pattern of substitution of real sequences is better captured by the CAT model, offering the possibility of studying its impact on phylogenetic reconstruction and its connections with structure-function determinants.
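
The prior structure described in the abstract (an open-ended number of classes, one amino-acid profile per class, and one class affiliation per site) can be illustrated with a draw from a Dirichlet process in its Chinese restaurant form. The following is a minimal sketch of the prior only, not the authors' inference procedure: the function names, the concentration value `alpha`, and the symmetric Dirichlet used as the base distribution over profiles are all assumptions made for illustration.

```python
import random

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 residues


def sample_crp_allocations(n_sites, alpha, rng):
    """Assign each site to a class with the Chinese restaurant process.

    Hypothetical helper: `alpha` is the Dirichlet process concentration.
    """
    allocations = []
    class_sizes = []
    for _ in range(n_sites):
        # An existing class k has weight class_sizes[k];
        # opening a new class has weight alpha.
        weights = class_sizes + [alpha]
        k = rng.choices(range(len(weights)), weights=weights)[0]
        if k == len(class_sizes):
            class_sizes.append(1)
        else:
            class_sizes[k] += 1
        allocations.append(k)
    return allocations, class_sizes


def sample_profile(rng, concentration=1.0):
    """Draw a 20-dimensional frequency profile from a symmetric Dirichlet.

    The symmetric Dirichlet base distribution is an assumption of this sketch.
    """
    draws = [rng.gammavariate(concentration, 1.0) for _ in AMINO_ACIDS]
    total = sum(draws)
    return [d / total for d in draws]


def sample_cat_prior(n_sites, alpha, seed=0):
    """Return (site allocations, class profiles) drawn from the prior."""
    rng = random.Random(seed)
    allocations, class_sizes = sample_crp_allocations(n_sites, alpha, rng)
    profiles = [sample_profile(rng) for _ in class_sizes]
    return allocations, profiles
```

Calling `sample_cat_prior(100, 2.0)` returns one class index per site and one equilibrium-frequency profile per class. The number of classes is not fixed in advance, which is the property that lets a model of this kind adapt to the heterogeneity in the data.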

Similar articles

1
A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process.
Mol Biol Evol. 2004 Jun;21(6):1095-109. doi: 10.1093/molbev/msh112. Epub 2004 Mar 10.
2
A site- and time-heterogeneous model of amino acid replacement.
Mol Biol Evol. 2008 May;25(5):842-58. doi: 10.1093/molbev/msn018. Epub 2008 Jan 29.
3
A class frequency mixture model that adjusts for site-specific amino acid frequencies and improves inference of protein phylogeny.
BMC Evol Biol. 2008 Dec 16;8:331. doi: 10.1186/1471-2148-8-331.
4
The impact of single substitutions on multiple sequence alignments.
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4041-7. doi: 10.1098/rstb.2008.0140.
5
Phylogenetic mixture models for proteins.
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3965-76. doi: 10.1098/rstb.2008.0180.
6
Bayesian analysis of amino acid substitution models.
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):3941-53. doi: 10.1098/rstb.2008.0175.
7
An amino acid substitution-selection model adjusts residue fitness to improve phylogenetic estimation.
Mol Biol Evol. 2014 Apr;31(4):779-92. doi: 10.1093/molbev/msu044. Epub 2014 Jan 16.
8
Site-specific time heterogeneity of the substitution process and its impact on phylogenetic inference.
BMC Evol Biol. 2011 Jan 14;11:17. doi: 10.1186/1471-2148-11-17.
9
A dirichlet process covarion mixture model and its assessments using posterior predictive discrepancy tests.
Mol Biol Evol. 2010 Feb;27(2):371-84. doi: 10.1093/molbev/msp248. Epub 2009 Oct 12.
10
Evaluating the robustness of phylogenetic methods to among-site variability in substitution processes.
Philos Trans R Soc Lond B Biol Sci. 2008 Dec 27;363(1512):4013-21. doi: 10.1098/rstb.2008.0162.

Cited by

1
Chromosome-scale genome assembly and gene annotation of the hydrothermal vent annelid Alvinella pompejana yield insight into animal evolution in extreme environments.
BMC Biol. 2025 Sep 2;23(1):274. doi: 10.1186/s12915-025-02369-7.
2
Protein Structural Phylogenetics.
Genome Biol Evol. 2025 Jul 30;17(8). doi: 10.1093/gbe/evaf139.
3
Insect Phylogenomics: From Experiment Planning to Post-phylogenetic Analyses.
Methods Mol Biol. 2025;2935:211-235. doi: 10.1007/978-1-0716-4583-3_9.
4
Phylogenomic Analyses Reveal that Panguiarchaeum Is a Clade of Genome-Reduced Asgard Archaea Within the Njordarchaeia.
Mol Biol Evol. 2025 Sep 1;42(9). doi: 10.1093/molbev/msaf201.
5
Infinite Mixture Models for Improved Modeling of Across-Site Evolutionary Variation.
Mol Biol Evol. 2025 Jul 30;42(8). doi: 10.1093/molbev/msaf199.
6
Stochastic Character Mapping: An Under-Exploited Approach to the Study of Molecular Evolution.
J Mol Evol. 2025 Aug;93(4):465-473. doi: 10.1007/s00239-025-10257-5. Epub 2025 Jul 8.
7
Comparative and Phylogenetic Analyses of Mitochondrial Genomes in Carabidae (Coleoptera: Adephaga).
Ecol Evol. 2025 Jul 2;15(7):e71707. doi: 10.1002/ece3.71707. eCollection 2025 Jul.
8
Polyphasic and phylogenomic reevaluation of Zhongshania and Marortus with the description of Zhongshania aquatica sp. nov.
Sci Rep. 2025 Jul 1;15(1):20417. doi: 10.1038/s41598-025-08302-w.
9
n. sp.: a novel predatory flagellate illuminates the character evolution within the eukaryotic clade CRuMs.
Open Biol. 2025 Jun;15(6):250057. doi: 10.1098/rsob.250057. Epub 2025 Jun 4.
10
Robustness of Ancestral Sequence Reconstruction to Among-site and Among-lineage Evolutionary Heterogeneity.
Mol Biol Evol. 2025 Apr 1;42(4). doi: 10.1093/molbev/msaf084.