Benchmarking multi-rate codon models.

Affiliation

Department of Pathology, University of California San Diego, San Diego, California, United States of America.

Publication information

PLoS One. 2010 Jul 21;5(7):e11587. doi: 10.1371/journal.pone.0011587.

DOI:10.1371/journal.pone.0011587

PMID:20657773

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC2908124/

Abstract

The single rate codon model of non-synonymous substitution is ubiquitous in phylogenetic modeling. Indeed, the use of a non-synonymous to synonymous substitution rate ratio parameter has facilitated the interpretation of selection pressure on genomes. Although the single rate model has achieved wide acceptance, we argue that the assumption of a single rate of non-synonymous substitution is biologically unreasonable, given observed differences in substitution rates evident from empirical amino acid models. Some have attempted to incorporate amino acid substitution biases into models of codon evolution and have shown improved model performance versus the single rate model. Here, we show that the single rate model of non-synonymous substitution is easily outperformed by a model with multiple non-synonymous rate classes, yet in which amino acid substitution pairs are assigned randomly to these classes. We argue that, since the single rate model is so easy to improve upon, new codon models should not be validated entirely on the basis of improved model fit over this model. Rather, we should strive to both improve on the single rate model and to approximate the general time-reversible model of codon substitution, with as few parameters as possible, so as to reduce model over-fitting. We hint at how this can be achieved with a Genetic Algorithm approach in which rate classes are assigned on the basis of sequence information content.
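The abstract describes the multi-rate idea only in words, so the following is a minimal sketch of one way it could be set up, not the authors' implementation. It assumes the standard genetic code, equal codon frequencies, single-nucleotide changes only, and a transition/transversion ratio `kappa`; the function names `random_class_assignment` and `rate_matrix` are hypothetical. Each unordered amino acid pair is assigned at random to one of several non-synonymous rate classes, and with a single class the matrix reduces to a single rate model with one non-synonymous to synonymous ratio.

```python
import itertools
import random

# Standard genetic code, codons ordered with bases T, C, A, G.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODONS = ["".join(c) for c in itertools.product(BASES, repeat=3)]
CODE = dict(zip(CODONS, AMINO))
SENSE = [c for c in CODONS if CODE[c] != "*"]  # 61 sense codons

TRANSITIONS = ({"A", "G"}, {"C", "T"})


def random_class_assignment(n_classes, seed=0):
    """Assign every unordered amino acid pair to a random rate class."""
    rng = random.Random(seed)
    amino_acids = sorted(set(AMINO) - {"*"})
    return {
        frozenset(pair): rng.randrange(n_classes)
        for pair in itertools.combinations(amino_acids, 2)
    }


def rate_matrix(class_of_pair, class_rates, kappa=2.0):
    """Build a 61x61 codon rate matrix (assumption: equal codon frequencies).

    Synonymous changes have relative rate 1 (times kappa for transitions);
    non-synonymous changes are scaled by the rate of their amino acid
    pair's class. Multi-nucleotide changes have rate 0.
    """
    n = len(SENSE)
    q = [[0.0] * n for _ in range(n)]
    for i, a in enumerate(SENSE):
        for j, b in enumerate(SENSE):
            diffs = [(x, y) for x, y in zip(a, b) if x != y]
            if len(diffs) != 1:
                continue  # identical codons or more than one change
            x, y = diffs[0]
            rate = kappa if {x, y} in TRANSITIONS else 1.0
            if CODE[a] != CODE[b]:
                pair = frozenset((CODE[a], CODE[b]))
                rate *= class_rates[class_of_pair[pair]]
            q[i][j] = rate
        q[i][i] = -sum(q[i])  # diagonal makes each row sum to zero
    return q


# Single rate model: one class whose rate plays the role of omega.
single = rate_matrix(random_class_assignment(1), [0.3])

# Multi-rate model: three classes with randomly assigned amino acid pairs.
multi = rate_matrix(random_class_assignment(3, seed=1), [0.05, 0.3, 1.2])
```

In a real analysis the class rates would be estimated by maximum likelihood on an alignment and a phylogeny; the values above are placeholders chosen only to make the sketch concrete.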

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a150/2908124/89785a1ba818/pone.0011587.g001.jpg
