Yang Ziheng, Nielsen Rasmus
Department of Biology, Galton Laboratory, University College London, London, United Kingdom.
Mol Biol Evol. 2008 Mar;25(3):568-79. doi: 10.1093/molbev/msm284. Epub 2008 Jan 3.
Current models of codon substitution are formulated at the levels of nucleotide substitution and do not explicitly consider the separate effects of mutation and selection. They are thus incapable of inferring whether mutation or selection is responsible for evolution at silent sites. Here we implement a few population genetics models of codon substitution that explicitly consider mutation bias and natural selection at the DNA level. Selection on codon usage is modeled by introducing codon-fitness parameters, which together with mutation-bias parameters, predict optimal codon frequencies for the gene. The selective pressure may be for translational efficiency and accuracy or for fine-tuning translational kinetics to produce correct protein folding. We apply the models to compare mitochondrial and nuclear genes from several mammalian species. Model assumptions concerning codon usage are found to affect the estimation of sequence distances (such as the synonymous rate d(S), the nonsynonymous rate d(N), and the rate at the 4-fold degenerate sites d(4)), as found in previous studies, but the new models produced very similar estimates to some old ones. We also develop a likelihood ratio test to examine the null hypothesis that codon usage is due to mutation bias alone, not influenced by natural selection. Application of the test to the mammalian data led to rejection of the null hypothesis in most genes, suggesting that natural selection may be a driving force in the evolution of synonymous codon usage in mammals. Estimates of selection coefficients nevertheless suggest that selection on codon usage is weak and most mutations are nearly neutral. The sensitivity of the analysis on the assumed mutation model is discussed.
当前的密码子替换模型是在核苷酸替换水平上构建的,并未明确考虑突变和选择的单独作用。因此,它们无法推断出是突变还是选择导致了沉默位点的进化。在这里,我们实现了一些密码子替换的群体遗传学模型,这些模型明确考虑了DNA水平上的突变偏向和自然选择。通过引入密码子适应度参数来模拟对密码子使用的选择,这些参数与突变偏向参数一起,预测基因的最优密码子频率。选择压力可能是为了提高翻译效率和准确性,或者是为了微调翻译动力学以产生正确的蛋白质折叠。我们应用这些模型来比较几种哺乳动物物种的线粒体基因和核基因。正如先前研究所发现的,关于密码子使用的模型假设会影响序列距离的估计(如同义替换率d(S)、非同义替换率d(N)以及四倍简并位点的替换率d(4)),但新模型产生的估计值与一些旧模型非常相似。我们还开发了一种似然比检验,以检验密码子使用仅由突变偏向导致、不受自然选择影响这一零假设。将该检验应用于哺乳动物数据时,大多数基因的零假设被拒绝,这表明自然选择可能是哺乳动物同义密码子使用进化的驱动力。然而,选择系数的估计表明,对密码子使用的选择较弱,大多数突变近乎中性。文中还讨论了分析对假定突变模型的敏感性。