Department of General Chemistry, Belarussian State Medical University, Communisticheskaya 7-24, Dzerzinskogo 83, Minsk 220029, Belarus.
J Theor Biol. 2010 Sep 7;266(1):88-98. doi: 10.1016/j.jtbi.2010.06.023. Epub 2010 Jun 19.
We studied cytosine and guanine usage in 914 genes from the completely sequenced genomes of five Simplexviruses and seven Varicelloviruses. In genes with a total GC-content above 50%, cytosine usage is usually higher than guanine usage (the average difference reaches 4.0% for genes with G+C above 70%). This difference is caused mostly by elevated usage of cytosine, relative to guanine, in two-fold degenerate sites in third codon positions (an average difference of 28.2% for genes with G+C above 70%). Amino acids encoded by codons with cytosine in two-fold degenerate third-position sites (AA2TC) are used much more often than amino acids encoded by codons with guanine in such sites (AA2AG). As GC-content grows, the usage of AA2AG declines much more steeply than the usage of AA2TC. This effect is a consequence of the nature of the genetic code and of negative selection. In GC-rich genes, cytosine usage in four-fold degenerate sites is only slightly, but significantly, higher than guanine usage (an average difference of 4.3% in genes with G+C above 70%). This difference may be caused by transcription-associated mutational pressure.
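The site classes named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' pipeline: the function names `classify_third_site` and `gc_site_usage` are hypothetical, the standard genetic code is assumed, and the treatment of six-fold amino acids (Leu, Ser, Arg are split by codon family) is an assumption. The sketch returns raw counts only, because the abstract does not state how usage percentages were normalized.

```python
from itertools import product

# Standard genetic code, with codons enumerated in TCAG order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa
    for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def classify_third_site(codon):
    """Return '4f', '2f_TC', '2f_AG', or None for the codon's third position."""
    prefix = codon[:2]
    t, c, a, g = (CODON_TABLE[prefix + base] for base in "TCAG")
    if t == c == a == g:
        return "4f"  # any third base gives the same amino acid
    last = codon[2]
    # Pyrimidine two-fold site: T and C are synonymous, A and G are not.
    if last in "TC" and t == c and a != t and g != t:
        return "2f_TC"
    # Purine two-fold site: A and G are synonymous (stop codons excluded).
    if last in "AG" and a == g and a != "*" and t != a and c != a:
        return "2f_AG"
    return None  # Met, Trp, Ile, stop codons


def gc_site_usage(cds):
    """Count C and G in two-fold and four-fold degenerate third positions."""
    cds = cds.upper()
    counts = {"C_2f": 0, "G_2f": 0, "C_4f": 0, "G_4f": 0}
    n_codons = 0
    for i in range(0, len(cds) - 2, 3):
        codon = cds[i:i + 3]
        if codon not in CODON_TABLE:
            continue  # skip codons with ambiguous bases
        n_codons += 1
        site = classify_third_site(codon)
        last = codon[2]
        if site == "2f_TC" and last == "C":
            counts["C_2f"] += 1
        elif site == "2f_AG" and last == "G":
            counts["G_2f"] += 1
        elif site == "4f" and last in "CG":
            counts[last + "_4f"] += 1
    return counts, n_codons
```

For a toy coding sequence such as `"TTCAAGGCCGGGATG"`, `gc_site_usage` counts one C in a two-fold site (TTC), one G in a two-fold site (AAG), and one C and one G in four-fold sites (GCC, GGG) across five codons. Dividing these counts by a chosen denominator (for example, the number of codons) would give usage percentages under that particular assumption.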