Suppr超能文献

用于评估密码子间存在依赖性的编码序列进化系统发育模型的计算方法。

Computational methods for evaluating phylogenetic models of coding sequence evolution with dependence between codons.

作者信息

Rodrigue Nicolas, Kleinman Claudia L, Philippe Hervé, Lartillot Nicolas

机构信息

Department of Biology, Center for Advanced Research in Environmental Genomics, University of Ottawa, Ottawa, Ontario, Canada.

出版信息

Mol Biol Evol. 2009 Jul;26(7):1663-76. doi: 10.1093/molbev/msp078. Epub 2009 Apr 21.

Abstract

In recent years, molecular evolutionary models formulated as site-interdependent Markovian codon substitution processes have been proposed as means of mechanistically accounting for selective features over long-range evolutionary scales. Under such models, site interdependencies are reflected in the use of a simplified protein tertiary structure representation and predefined statistical potential, which, along with mutational parameters, mediate nonsynonymous rates of substitution; rates of synonymous events are solely mediated by mutational parameters. Although theoretically attractive, the models are computationally challenging, and the methods used to manipulate them still do not allow for quantitative model evaluations in a multiple-sequence context. Here, we describe Markov chain Monte Carlo computational methodologies for sampling parameters from their posterior distribution under site-interdependent codon substitution models within a phylogenetic context and allowing for Bayesian model assessment and ranking. Specifically, the techniques we expound here can form the basis of posterior predictive checking under these models and can be embedded within thermodynamic integration algorithms for computing Bayes factors. We illustrate the methods using two data sets and find that although current forms of site-interdependent models of codon substitution provide an improved fit, they are outperformed by the extended site-independent versions. Altogether, the methodologies described here should enable a quantified contrasting of alternative ways of modeling structural constraints, or other site-interdependent criteria, and establish if such formulations can match (or supplant) site-independent model extensions.

摘要

近年来,已提出将作为位点依赖的马尔可夫密码子替换过程构建的分子进化模型作为在长期进化尺度上从机制上解释选择特征的手段。在这类模型中,位点依赖性体现在使用简化的蛋白质三级结构表示和预定义的统计势上,它们与突变参数一起介导非同义替换率;同义事件的发生率仅由突变参数介导。尽管这些模型在理论上具有吸引力,但计算上具有挑战性,并且用于处理它们的方法仍然不允许在多序列背景下进行定量模型评估。在这里,我们描述了马尔可夫链蒙特卡罗计算方法,用于在系统发育背景下从位点依赖的密码子替换模型下的后验分布中采样参数,并进行贝叶斯模型评估和排名。具体来说,我们在此阐述的技术可以构成这些模型下后验预测检验的基础,并且可以嵌入到用于计算贝叶斯因子的热力学积分算法中。我们使用两个数据集说明了这些方法,发现尽管当前形式的位点依赖密码子替换模型提供了更好的拟合,但它们的表现不如扩展的位点独立版本。总之,这里描述的方法应该能够对建模结构约束或其他位点依赖标准的替代方法进行定量对比,并确定这些公式是否能够匹配(或取代)位点独立模型扩展。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验