Lobry J R, Gautier C
Laboratoire de Biométrie, CNRS URA 243, Université Claude Bernard, Villeurbanne, France.
Nucleic Acids Res. 1994 Aug 11;22(15):3174-80. doi: 10.1093/nar/22.15.3174.
Multivariate analysis of the amino-acid compositions of 999 chromosome-encoded proteins from Escherichia coli showed that three main factors influence the variability of amino-acid composition. The first factor was correlated with the global hydrophobicity of proteins, and it discriminated integral membrane proteins from the others. The second factor was correlated with gene expressivity, showing a bias in highly expressed genes towards amino-acids having abundant major tRNAs. Just as highly expressed genes have reduced codon diversity in protein coding sequences, so do they have a reduced diversity of amino-acid choice. This showed that translational constraints are important enough to affect the global amino-acid composition of proteins. The third factor was correlated with the aromaticity of proteins, showing that aromatic amino-acid content is highly variable.
对来自大肠杆菌的999种染色体编码蛋白质的氨基酸组成进行多变量分析表明,有三个主要因素影响氨基酸组成的变异性。第一个因素与蛋白质的整体疏水性相关,它将整合膜蛋白与其他蛋白区分开来。第二个因素与基因表达能力相关,表明高表达基因偏向于使用具有丰富主要tRNA的氨基酸。正如高表达基因在蛋白质编码序列中的密码子多样性降低一样,它们在氨基酸选择上的多样性也降低。这表明翻译限制对影响蛋白质的整体氨基酸组成具有足够的重要性。第三个因素与蛋白质的芳香性相关,表明芳香族氨基酸含量变化很大。