Wright F, Bibb M J
Scottish Agricultural Statistics Service, University of Edinburgh, U.K.
Gene. 1992 Apr 1;113(1):55-65. doi: 10.1016/0378-1119(92)90669-g.
The codon usage (CU) patterns of 64 genes from the Gram-positive prokaryotic genus Streptomyces were analysed. Despite the extremely high overall G+C content of the Streptomyces genome (estimated at 0.74), individual genes varied in G+C content from 0.610 to 0.797, and had third codon position G+C contents (GC3s) that varied from 0.764 to 0.983. The variation in GC3s explains a significant proportion of the variation in CU patterns. This is consistent with an evolutionary model of the Streptomyces genome where biased mutation pressure has led to a high average G+C content with random variation about the mean, although the variation observed is greater than that expected from a simple binomial model. The only gene in the sample that can be confidently predicted to be highly expressed, EF-Tu of Streptomyces coelicolor A3(2) (GC3s = 0.927), shows a preference for a third position C in several of the four-codon families, and for CGY and GGY for Arg and Gly codons, respectively (Y = pyrimidine); similar CU patterns are found in highly expressed genes of the G+C-rich Micrococcus luteus genome. It thus appears that codon usage in Streptomyces is determined predominantly by mutation bias, with weak translational selection operating only in highly expressed genes. We discuss the possible consequences of the extreme codon bias of Streptomyces and consider how it may have evolved. A set of CU tables is provided for use with computer programs that locate protein-coding regions.
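The quantities named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' program. It assumes the commonly used definition of GC3s (the fraction of G or C at the third position of synonymous codons, so Met, Trp, and stop codons are excluded) and the standard codon assignments. The function names and the toy sequence are hypothetical, and `binomial_sd` is only one simple reading of the "simple binomial model" mentioned above.

```python
import math

# Codons skipped when computing GC3s: Met (ATG) and Trp (TGG) have no
# synonymous alternatives, and the three stop codons encode no amino acid.
# Assumption: standard codon assignments.
EXCLUDED_CODONS = {"ATG", "TGG", "TAA", "TAG", "TGA"}


def split_codons(seq):
    """Split an in-frame coding sequence into codons, dropping a trailing partial codon."""
    seq = seq.upper().replace("U", "T")
    usable = len(seq) - len(seq) % 3
    return [seq[i:i + 3] for i in range(0, usable, 3)]


def gc_content(seq):
    """Fraction of G and C over all positions of the sequence."""
    seq = seq.upper()
    if not seq:
        raise ValueError("empty sequence")
    return sum(base in "GC" for base in seq) / len(seq)


def gc3s(seq):
    """Fraction of G or C at the third position of synonymous codons."""
    synonymous = [c for c in split_codons(seq) if c not in EXCLUDED_CODONS]
    if not synonymous:
        raise ValueError("no synonymous codons in sequence")
    return sum(codon[2] in "GC" for codon in synonymous) / len(synonymous)


def binomial_sd(p, n_codons):
    """Standard deviation of an observed proportion under a binomial model
    with underlying probability p and n_codons independent codons."""
    return math.sqrt(p * (1 - p) / n_codons)


# Hypothetical toy coding sequence (not taken from the paper):
# ATG GCC GAC CGC GGC AAG TAA
toy_gene = "ATGGCCGACCGCGGCAAGTAA"
print(round(gc_content(toy_gene), 3))
print(gc3s(toy_gene))
print(binomial_sd(0.5, 100))
```

Under this reading, a gene's GC3s would be compared with the mean GC3s of the sample, and a spread wider than `binomial_sd` predicts for genes of that length would correspond to the excess variation the abstract reports.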