Department of Bioinformatics and Genomics, Faculty of Biotechnology, University of Wroclaw, ul. Joliot-Curie 14a, 50-383 Wroclaw, Poland.
Int J Mol Sci. 2023 Jan 7;24(2):1185. doi: 10.3390/ijms24021185.
Synonymous codon usage can be influenced by mutations and/or selection, e.g., for speed of protein translation and correct folding. However, this codon bias can also be affected by a general selection at the amino acid level due to differences in the acceptance of the loss and generation of these codons. To assess the importance of this effect, we constructed a mutation-selection model model, in which we generated almost 90,000 stationary nucleotide distributions produced by mutational processes and applied a selection based on differences in physicochemical properties of amino acids. Under these conditions, we calculated the usage of fourfold degenerated (4FD) codons and compared it with the usage characteristic of the pure mutations. We considered both the standard genetic code (SGC) and alternative genetic codes (AGCs). The analyses showed that a majority of AGCs produced a greater 4FD codon bias than the SGC. The mutations producing more thymine or adenine than guanine and cytosine increased the differences in usage. On the other hand, the mutational pressures generating a lot of cytosine or guanine with a low content of adenine and thymine decreased this bias because the nucleotide content of most 4FD codons stayed in the compositional equilibrium with these pressures. The comparison of the theoretical results with those for real protein coding sequences showed that the influence of selection at the amino acid level on the synonymous codon usage cannot be neglected. The analyses indicate that the effect of amino acid selection cannot be disregarded and that it can interfere with other selection factors influencing codon usage, especially in AT-rich genomes, in which AGCs are usually used.
同义密码子的使用可能受到突变和/或选择的影响,例如,为了蛋白质翻译和正确折叠的速度。然而,由于这些密码子的丢失和产生的接受程度的差异,这种密码子偏向也可能受到氨基酸水平的一般选择的影响。为了评估这种效应的重要性,我们构建了一个突变-选择模型,在该模型中,我们生成了近 90000 个由突变过程产生的静止核苷酸分布,并基于氨基酸理化性质的差异应用了选择。在这些条件下,我们计算了四联体退化(4FD)密码子的使用情况,并将其与纯突变的使用特征进行了比较。我们考虑了标准遗传密码(SGC)和替代遗传密码(AGC)。分析表明,大多数 AGC 产生的 4FD 密码子偏向大于 SGC。产生比鸟嘌呤和胞嘧啶更多的胸腺嘧啶或腺嘌呤的突变增加了使用差异。另一方面,产生大量胞嘧啶或鸟嘌呤、腺嘌呤和胸腺嘧啶含量低的突变压力降低了这种偏向,因为大多数 4FD 密码子的核苷酸含量与这些压力保持组成平衡。将理论结果与真实蛋白质编码序列的结果进行比较表明,氨基酸水平选择对同义密码子使用的影响不容忽视。分析表明,氨基酸选择的影响不容忽视,它可以干扰影响密码子使用的其他选择因素,特别是在富含 AT 的基因组中,通常使用 AGC。