Itzkovitz Shalev, Alon Uri
Department of Molecular Cell Biology, Weizmann Institute of Science, Rehovot, Israel.
Genome Res. 2007 Apr;17(4):405-12. doi: 10.1101/gr.5987307. Epub 2007 Feb 9.
DNA sequences that code for proteins need to convey, in addition to the protein-coding information, several different signals at the same time. These "parallel codes" include binding sequences for regulatory and structural proteins, signals for splicing, and RNA secondary structure. Here, we show that the universal genetic code can efficiently carry arbitrary parallel codes much better than the vast majority of other possible genetic codes. This property is related to the identity of the stop codons. We find that the ability to support parallel codes is strongly tied to another useful property of the genetic code--minimization of the effects of frame-shift translation errors. Whereas many of the known regulatory codes reside in nontranslated regions of the genome, the present findings suggest that protein-coding regions can readily carry abundant additional information.
编码蛋白质的DNA序列除了要传达蛋白质编码信息外,还需要同时传递几种不同的信号。这些“并行密码”包括调控蛋白和结构蛋白的结合序列、剪接信号以及RNA二级结构。在此,我们表明通用遗传密码能够比绝大多数其他可能的遗传密码更有效地携带任意并行密码。这一特性与终止密码子的特性有关。我们发现,支持并行密码的能力与遗传密码的另一个有用特性紧密相关——使移码翻译错误的影响最小化。虽然许多已知的调控密码存在于基因组的非翻译区域,但目前的研究结果表明,蛋白质编码区域可以轻松携带大量额外信息。