Banerjee T, Gupta S K, Ghosh T C
Bioinformatics Centre, Bose Institute, P 1/12, C.I.T. Scheme VII M, Kolkata 700 054, India.
Biosystems. 2005 Jul;81(1):11-8. doi: 10.1016/j.biosystems.2005.01.002.
Correlations between genomic GC contents and amino acid frequencies were studied in the homologous sequences of 12 eubacterial genomes. Results show that amino acids encoded by GC-rich codons increases significantly with genomic GC contents, whereas opposite trend was observed in case of amino acids encoded by GC-poor codons. Further studies show all the amino acids do not change in the predicted direction according to their genomic GC pressure, suggesting that protein evolution is not entirely dictated by their nucleotide frequencies. Amino acid substitution matrix calculated among hydrophobic, amphipathic and hydrophilic amino acid groups' shows that amphipathic and hydrophilic amino acids are more frequently substituted by hydrophobic amino acids than from hydrophobic to hydrophilic or amphipathic amino acids. This indicates that nucleotide bias induces a directional changes in proteome composition in such a way that underwent strong changes in hydropathy values. In fact, significant increases in hydrophobicity values have also been observed with the increase of genomic GC contents. Correlations between GC contents and amino acid compositions in three different predicted protein secondary structures show that hydropathy values increases significantly with GC contents in aperiodic and helix structures whereas strand structure remains insensitive with the genomic GC levels. The relative importance of mutation and selection on the evolution of proteins have been discussed on the basis of these results.
在12种真细菌基因组的同源序列中研究了基因组GC含量与氨基酸频率之间的相关性。结果表明,由富含GC的密码子编码的氨基酸随着基因组GC含量的增加而显著增加,而由富含AT的密码子编码的氨基酸则呈现相反的趋势。进一步的研究表明,并非所有氨基酸都按照其基因组GC压力的预测方向变化,这表明蛋白质进化并不完全由其核苷酸频率决定。在疏水、两亲性和亲水性氨基酸组之间计算的氨基酸替换矩阵表明,两亲性和亲水性氨基酸被疏水性氨基酸取代的频率高于从疏水性氨基酸转变为亲水性或两亲性氨基酸的频率。这表明核苷酸偏倚以蛋白质组组成发生显著亲水性值变化的方式诱导了方向性变化。事实上,随着基因组GC含量的增加,疏水性值也显著增加。在三种不同预测的蛋白质二级结构中GC含量与氨基酸组成之间的相关性表明,在无规卷曲和螺旋结构中,亲水性值随GC含量显著增加,而β-折叠结构对基因组GC水平不敏感。基于这些结果,讨论了突变和选择在蛋白质进化中的相对重要性。