Niimura Yoshihito, Terabe Mahito, Gojobori Takashi, Miura Kin-ichiro
Center for Information Biology and DNA Data Bank of Japan, National Institute of Genetics, 1111, Yata, Mishima, Shizuoka 411-8540, Japan.
Nucleic Acids Res. 2003 Sep 1;31(17):5195-201. doi: 10.1093/nar/gkg701.
Adenine nucleotides have been found to appear preferentially in the regions after the initiation codons or before the termination codons of bacterial genes. Our previous experiments showed that AAA and AAT, the two most frequent second codons in Escherichia coli, significantly enhance translation efficiency. To determine whether such a characteristic feature of base frequencies exists in eukaryotic genes, we performed a comparative analysis of the base biases at the gene terminal portions using the proteomes of seven eukaryotes. Here we show that base usage at the codon third positions of gene terminal regions is highly biased in eukaryotic genomes, although the codon third positions are almost free from amino acid preference. The bias changes depending on its position in a gene, and is characteristic of each species. We also found that the bias is most pronounced at the second codon, the codon immediately after the initiation codon. NCN is preferred in every genome; in particular, GCG is strongly favored in human and plant genes. The presence of the bias implies that the base sequence at the second codon affects translation efficiency in eukaryotes as well as in bacteria.
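The kind of position-dependent count described in the abstract can be illustrated with a minimal sketch. It is not the authors' pipeline: the function names `second_codon_counts` and `third_position_frequencies`, the toy sequences, and the filtering rules (sequence must start with ATG and have a length divisible by three) are all assumptions made for illustration.

```python
from collections import Counter

BASES = "ACGT"


def _usable(seq):
    """Assumed filter: coding sequence starts with ATG and is made of whole codons."""
    return seq.startswith("ATG") and len(seq) % 3 == 0


def second_codon_counts(cds_sequences):
    """Count the codon that immediately follows the initiation codon."""
    counts = Counter()
    for seq in cds_sequences:
        seq = seq.upper()
        if _usable(seq) and len(seq) >= 6:
            counts[seq[3:6]] += 1
    return counts


def third_position_frequencies(cds_sequences, n_codons=10):
    """Base frequencies at the codon third position for the first n_codons
    codons after the initiation codon (index 0 is the second codon).
    Note: for very short sequences the stop codon is counted as well."""
    counts = [Counter() for _ in range(n_codons)]
    for seq in cds_sequences:
        seq = seq.upper()
        if not _usable(seq):
            continue
        for k in range(n_codons):
            start = 3 * (k + 1)
            codon = seq[start:start + 3]
            # Stop at the end of the sequence or at an ambiguous base.
            if len(codon) < 3 or any(b not in BASES for b in codon):
                break
            counts[k][codon[2]] += 1
    freqs = []
    for c in counts:
        total = sum(c.values())
        freqs.append({b: (c[b] / total if total else 0.0) for b in BASES})
    return freqs


# Toy input (hypothetical sequences, not real genes).
toy_cds = ["ATGGCGAAATAA", "ATGGCGTCTTGA", "ATGAAAGCCTAA"]
print(second_codon_counts(toy_cds).most_common(3))
print(third_position_frequencies(toy_cds, n_codons=2))
```

Comparing such per-position frequencies with the genome-wide third-position frequencies of the same species would be one way to express the bias; the abstract does not specify the statistic the authors used.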