Gouy M, Gautier C
Nucleic Acids Res. 1982 Nov 25;10(22):7055-74. doi: 10.1093/nar/10.22.7055.
The nucleic acid sequence bank now contains over 600 protein-coding genes, of which 107 are from prokaryotic organisms. Codon frequencies in each new prokaryotic gene are given. Analysis of genetic code usage in the 83 sequenced genes of the Escherichia coli genome (chromosome, transposons and plasmids) is presented, taking into account new data on gene expressivity and regulation as well as iso-tRNA specificity and cellular concentration. The codon composition of each gene is summarized using two indexes: one is based on the differential usage of iso-tRNA species during gene translation, the other on the choice between cytosine and uracil for the third base. A strong relationship between codon composition and mRNA expressivity is confirmed, even for genes transcribed in the same operon. The influence of codon use on peptide elongation rate and protein yield is discussed. Finally, the evolutionary aspect of codon selection in mRNA sequences is studied.
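As a rough illustration of the kind of per-gene summary the abstract describes, the Python sketch below counts codons in a coding sequence and computes the fraction of C among codons ending in C or U. The abstract does not give the exact definition of either index, so the function names and the formula here are assumptions made for illustration, not the authors' method. The paper's index may, for example, be restricted to particular codon pairs.

```python
from collections import Counter


def codon_counts(cds: str) -> Counter:
    """Count codons in a coding sequence read in frame from position 0.

    DNA input is converted to RNA letters (T -> U) so codons match the
    C/U wording used in the abstract.
    """
    seq = cds.upper().replace("T", "U")
    if len(seq) % 3 != 0:
        raise ValueError("coding sequence length must be a multiple of 3")
    return Counter(seq[i:i + 3] for i in range(0, len(seq), 3))


def c_vs_u_third_base(counts: Counter) -> float:
    """Fraction of C among codons whose third base is C or U.

    Hypothetical index: an assumed, simplified stand-in for the
    paper's C-versus-U third-base measure.
    """
    c = sum(n for codon, n in counts.items() if codon[2] == "C")
    u = sum(n for codon, n in counts.items() if codon[2] == "U")
    if c + u == 0:
        raise ValueError("no codons ending in C or U")
    return c / (c + u)


# Toy sequence (not taken from the paper): AUG UUC UUU GGC GGU UAA
counts = codon_counts("ATGTTCTTTGGCGGTTAA")
index = c_vs_u_third_base(counts)
```

For the toy sequence, two codons end in C (UUC, GGC) and two end in U (UUU, GGU), so `index` is 0.5. The iso-tRNA-based index would need tRNA abundance and specificity data that the abstract does not list, so it is not sketched here.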