Blake R D, Hinds P W
Department of Biochemistry, University of Maine, Orono.
J Biomol Struct Dyn. 1984 Dec;2(3):593-606. doi: 10.1080/07391102.1984.10507593.
Fifty-three gene sequences from E. coli containing 18,288 reading frame triplets have been characterized according to the nature and level of average codon preference. The distribution of average preferences is bimodal, with approximately half the genes using an average of only 36 codons, and the remainder just 42 codons. There is a high correlation between the level of codon bias, the tRNA population and the abundance of protein product, indicating biased patterns are exploited by the cell for the production of widely different levels of gene product. This relationship is especially striking in genes involved in the production of components for transcription and translation. Overall, the genes for these processes generate some five-fold more protein than the average in the genome, and use about five fewer codons. The very high codon bias found in the RNA polymerase gene thus provides a simple, autogenous mechanism for the coordinate synthesis of these components and RNA polymerase. A surprisingly high level of codon probability is also found in triplets of the complement of coding sequences. This is apparently due to the evolutionary dispersion of coding sequences and/or the requirement for increased levels of secondary structure in messenger RNAs.
根据平均密码子偏好的性质和程度,对来自大肠杆菌的53个基因序列(包含18288个阅读框三联体)进行了特征分析。平均偏好的分布是双峰的,大约一半的基因平均仅使用36个密码子,其余的则仅使用42个密码子。密码子偏倚程度、tRNA群体与蛋白质产物丰度之间存在高度相关性,这表明细胞利用偏倚模式来产生差异很大的基因产物水平。这种关系在参与转录和翻译组件生产的基因中尤为显著。总体而言,这些过程的基因产生的蛋白质比基因组中的平均水平多约五倍,并且使用的密码子少约五个。因此,在RNA聚合酶基因中发现的非常高的密码子偏倚为这些组件和RNA聚合酶的协同合成提供了一种简单的自身调节机制。在编码序列互补链的三联体中也发现了惊人高的密码子概率水平。这显然是由于编码序列的进化分散和/或信使RNA中二级结构水平增加的要求。