Wan Xiu-Feng, Xu Dong, Kleinhofs Andris, Zhou Jizhong
Environmental Sciences Division, Oak Ridge National Laboratory, Oak Ridge, TN 37831, USA.
BMC Evol Biol. 2004 Jun 28;4:19. doi: 10.1186/1471-2148-4-19.
Codon usage bias has been widely reported to correlate with GC composition. However, the quantitative relationship between codon usage bias and GC composition across species has not been reported.
Using SCUO, an informatics method we developed previously from Shannon information theory and maximum entropy theory, we investigated the quantitative relationship between codon usage bias and GC composition. Regression over 70 bacterial and 16 archaeal genomes gave SCUO = -2.06 * GC3 + 2.05 * (GC3)^2 + 0.65 (r = 0.91) in bacteria and SCUO = -1.79 * GC3 + 1.85 * (GC3)^2 + 0.56 (r = 0.89) in archaea, where GC3 is the GC content at the third codon position. We developed an analytical model, based on SCUO, that quantifies synonymous codon usage bias from GC composition. The model parameters were inferred from the relationship between codon usage bias and GC composition across the 70 bacterial and 16 archaeal genomes. We then simplified this relationship so that it depends only on GC3. Computational simulation supported the simplified model.
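As an illustration of the two fitted curves reported above, the following minimal Python sketch evaluates them for a given GC3 value. The function name `scuo_regression`, the dictionary `REGRESSION_COEFFS`, and the domain keys are hypothetical names chosen for this example; only the coefficients come from the text. It is not part of the codonO software.

```python
# Coefficients (a, b, c) of SCUO = a * GC3 + b * GC3**2 + c, as reported in the text.
# The dictionary and function names are illustrative assumptions.
REGRESSION_COEFFS = {
    "bacteria": (-2.06, 2.05, 0.65),
    "archaea": (-1.79, 1.85, 0.56),
}


def scuo_regression(gc3: float, domain: str = "bacteria") -> float:
    """Evaluate the reported quadratic fit of SCUO against GC3 (a fraction in [0, 1])."""
    if not 0.0 <= gc3 <= 1.0:
        raise ValueError("gc3 must be a fraction between 0 and 1")
    a, b, c = REGRESSION_COEFFS[domain]
    return a * gc3 + b * gc3 ** 2 + c


if __name__ == "__main__":
    for domain in REGRESSION_COEFFS:
        for gc3 in (0.2, 0.5, 0.8):
            print(f"{domain:8s} GC3={gc3:.1f} SCUO={scuo_regression(gc3, domain):.4f}")
```

Both fitted curves are upward-opening parabolas with a minimum near GC3 = 0.5, so the predicted bias is lowest for balanced third-position composition and rises toward either GC-poor or GC-rich extremes.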
Synonymous codon usage bias can be expressed simply as 1 + (p/2)log2(p/2) + ((1-p)/2)log2((1-p)/2), where p = GC3. The software we developed for measuring SCUO (codonO) is available at http://digbio.missouri.edu/~wanx/cu/codonO.
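The closed-form expression in terms of p = GC3 can be evaluated directly. The sketch below is a minimal Python version; the names `scuo_from_gc3` and `_xlog2x` are hypothetical, and the handling of p = 0 and p = 1 (treating x*log2(x) as 0 at x = 0, its mathematical limit) is an assumption of this example rather than something stated in the text. Algebraically, the expression equals 1 - H/2, where H is the Shannon entropy in bits of the distribution (p/2, p/2, (1-p)/2, (1-p)/2); reading that distribution as third-position G, C, A and T frequencies is an interpretation offered here, not a claim from the source.

```python
import math


def _xlog2x(x: float) -> float:
    """Return x * log2(x), using the limiting value 0 at x = 0."""
    return 0.0 if x == 0.0 else x * math.log2(x)


def scuo_from_gc3(p: float) -> float:
    """Evaluate 1 + (p/2)log2(p/2) + ((1-p)/2)log2((1-p)/2) for p = GC3 in [0, 1]."""
    if not 0.0 <= p <= 1.0:
        raise ValueError("p must be a fraction between 0 and 1")
    return 1.0 + _xlog2x(p / 2.0) + _xlog2x((1.0 - p) / 2.0)


if __name__ == "__main__":
    print(scuo_from_gc3(0.5))  # 0.0: balanced GC3, no predicted bias
    print(scuo_from_gc3(0.0))  # 0.5: third position entirely A/T
    print(scuo_from_gc3(1.0))  # 0.5: third position entirely G/C
```

Under this expression the predicted bias is symmetric about p = 0.5 and ranges from 0 to 0.5.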