Kryukov Gregory V, Gladyshev Vadim N
Department of Biochemistry, University of Nebraska, Lincoln, Nebraska 68588-0664, USA.
EMBO Rep. 2004 May;5(5):538-43. doi: 10.1038/sj.embor.7400126. Epub 2004 Apr 23.
In the genetic code, the UGA codon has a dual function as it encodes selenocysteine (Sec) and serves as a stop signal. However, only the translation terminator function is used in gene annotation programs, resulting in misannotation of selenoprotein genes. Here, we applied two independent bioinformatics approaches to characterize a selenoprotein set in prokaryotic genomes. One method searched for selenoprotein genes by identifying RNA stem-loop structures, selenocysteine insertion sequence elements; the second approach identified Sec/Cys pairs in homologous sequences. These analyses identified all or almost all selenoproteins in completely sequenced bacterial and archaeal genomes and provided a view on the distribution and composition of prokaryotic selenoproteomes. In addition, lineage-specific and core selenoproteins were detected, which provided insights into the mechanisms of selenoprotein evolution. Characterization of selenoproteomes allows interpretation of other UGA codons in completed genomes of prokaryotes as terminators, addressing the UGA dual-function problem.
在遗传密码中,UGA密码子具有双重功能,它既编码硒代半胱氨酸(Sec),又作为终止信号。然而,在基因注释程序中仅使用了翻译终止子功能,导致硒蛋白基因注释错误。在此,我们应用两种独立的生物信息学方法来鉴定原核生物基因组中的一组硒蛋白。一种方法是通过识别RNA茎环结构、硒代半胱氨酸插入序列元件来搜索硒蛋白基因;第二种方法是在同源序列中鉴定Sec/ Cys对。这些分析鉴定出了完全测序的细菌和古细菌基因组中的所有或几乎所有硒蛋白,并对原核生物硒蛋白组的分布和组成有了一定认识。此外,还检测到了特定谱系和核心硒蛋白,这为硒蛋白进化机制提供了见解。对硒蛋白组的鉴定有助于将原核生物完整基因组中的其他UGA密码子解释为终止子,从而解决UGA的双重功能问题。