Diaz-Lazcoz Y, Hénaut A, Vigier P, Risler J L
Centre de Génétique Moléculaire du CNRS, Laboratoire associé à l'Université P. et M. Curie, Gif sur Yvette, France.
J Mol Biol. 1995 Jul 7;250(2):123-7. doi: 10.1006/jmbi.1995.0363.
The availability of specialized sequence databanks for Escherichia coli, Saccharomyces cerevisiae and Bacillus subtilis made it possible to build a set of 105 protein-coding genes that are homologous in these three species. An analysis of the triplets at both the nucleotide and amino acid levels revealed that the codon bias of some amino acids is significantly higher at conserved than at non-conserved positions. Comparisons of homologous genes in E. coli and Salmonella typhimurium, and in S. cerevisiae and Drosophila melanogaster, led to the same conclusion. Serine in E. coli is a special case: its major codon is AGC at non-conserved residues and TCC at conserved residues. We interpret this observation as evidence that the primordial codons for serine were TCN, while the AGY codons appeared later. This conclusion is substantiated by an analysis of the codon usage of catalytic serine residues in ancient, ubiquitous and essential proteins (ATP synthases and topoisomerases). In these proteins, the proportion of catalytic serine residues encoded by TCN is significantly higher than that expected from the overall codon usage of serine residues.
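The comparison of serine codon usage at conserved and non-conserved positions can be illustrated with a minimal Python sketch. It is not the authors' procedure: the abstract does not say how conservation was scored, so the sketch assumes a position is conserved when every aligned homolog also carries serine there. The function names and the toy codon alignment are invented for illustration and are not data from the paper.

```python
from collections import Counter

# The six serine codons of the standard genetic code: TCN plus AGY.
SER_CODONS = {"TCT", "TCC", "TCA", "TCG", "AGT", "AGC"}


def serine_codon_counts(ref_codons, other_codon_lists):
    """Tally the reference species' serine codons, split by conservation.

    Inputs are codon-aligned lists of equal length, with gaps written as '---'.
    Assumption: a serine position is 'conserved' when all other species
    also have a serine codon at the same aligned position.
    """
    conserved, variable = Counter(), Counter()
    for i, codon in enumerate(ref_codons):
        if codon not in SER_CODONS:
            continue
        is_conserved = all(other[i] in SER_CODONS for other in other_codon_lists)
        (conserved if is_conserved else variable)[codon] += 1
    return conserved, variable


def tcn_fraction(counts):
    """Fraction of tallied serine codons that belong to the TCN family."""
    total = sum(counts.values())
    if total == 0:
        return None
    tcn = sum(n for codon, n in counts.items() if codon.startswith("TC"))
    return tcn / total


# Toy alignment (invented): one reference gene and two homologs.
ref = ["ATG", "TCC", "AGC", "TCC", "AGC", "AAA"]
sp2 = ["ATG", "TCT", "GCA", "AGT", "---", "AAA"]
sp3 = ["ATG", "TCA", "AGC", "TCG", "AGC", "AAG"]

conserved, variable = serine_codon_counts(ref, [sp2, sp3])
print("conserved:", dict(conserved), "TCN fraction:", tcn_fraction(conserved))
print("variable: ", dict(variable), "TCN fraction:", tcn_fraction(variable))
```

On real data, the two tallies would be accumulated over all homologous gene sets and the difference in TCN proportion checked with a suitable statistical test; the abstract does not specify which test was used.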