State Key Laboratory for Turbulence and Complex Systems and Department of Biomedical Engineering, College of Engineering, Peking University, Beijing 100871, China.
BMC Genomics. 2011 Jul 12;12:361. doi: 10.1186/1471-2164-12-361.
Shine-Dalgarno (SD) signal has long been viewed as the dominant translation initiation signal in prokaryotes. Recently, leaderless genes, which lack 5'-untranslated regions (5'-UTR) on their mRNAs, have been shown abundant in archaea. However, current large-scale in silico analyses on initiation mechanisms in bacteria are mainly based on the SD-led initiation way, other than the leaderless one. The study of leaderless genes in bacteria remains open, which causes uncertain understanding of translation initiation mechanisms for prokaryotes.
Here, we study signals in translation initiation regions of all genes over 953 bacterial and 72 archaeal genomes, then make an effort to construct an evolutionary scenario in view of leaderless genes in bacteria. With an algorithm designed to identify multi-signal in upstream regions of genes for a genome, we classify all genes into SD-led, TA-led and atypical genes according to the category of the most probable signal in their upstream sequences. Particularly, occurrence of TA-like signals about 10 bp upstream to translation initiation site (TIS) in bacteria most probably means leaderless genes.
Our analysis reveals that leaderless genes are totally widespread, although not dominant, in a variety of bacteria. Especially for Actinobacteria and Deinococcus-Thermus, more than twenty percent of genes are leaderless. Analyzed in closely related bacterial genomes, our results imply that the change of translation initiation mechanisms, which happens between the genes deriving from a common ancestor, is linearly dependent on the phylogenetic relationship. Analysis on the macroevolution of leaderless genes further shows that the proportion of leaderless genes in bacteria has a decreasing trend in evolution.
Shine-Dalgarno (SD) 信号长期以来一直被视为原核生物中主要的翻译起始信号。最近,缺乏 mRNA 5'-非翻译区(5'-UTR)的无领导者基因在古菌中大量存在。然而,目前基于 SD 引导起始方式的细菌中启动机制的大规模计算分析主要是基于无领导者基因。细菌中无领导者基因的研究仍然是开放的,这导致对原核生物翻译起始机制的理解不确定。
在这里,我们研究了超过 953 个细菌和 72 个古菌基因组中所有基因的翻译起始区信号,然后努力构建一个基于细菌中无领导者基因的进化情景。我们设计了一种算法来识别基因组中基因上游区域的多信号,根据其上游序列中最可能信号的类别,将所有基因分为 SD 引导、TA 引导和非典型基因。特别是,在细菌翻译起始位点(TIS)上游约 10 个碱基处出现 TA 样信号很可能意味着无领导者基因。
我们的分析表明,无领导者基因在各种细菌中虽然不是主要的,但却是广泛存在的。特别是对于放线菌和 Deinococcus-Thermus,超过 20%的基因是无领导者基因。在密切相关的细菌基因组中进行分析,我们的结果表明,发生在共同祖先衍生的基因之间的翻译起始机制的变化与系统发育关系呈线性相关。对无领导者基因的宏观进化分析进一步表明,在进化过程中,细菌中无领导者基因的比例呈下降趋势。