Chang Bill, Halgamuge Saman, Tang Sen-Lin
Bioengineering Group, DoMME, Faculty of Engineering, University of Melbourne, Parkville, Victoria, 3010, Australia.
Gene. 2006 May 24;373:90-9. doi: 10.1016/j.gene.2006.01.033. Epub 2006 Mar 30.
The Shine-Dalgarno (SD) sequence has long been considered one of the common features of the 5' untranslated region (5'UTR) of prokaryotic transcripts. However, leaderless bacterial and archaeal mRNAs have been increasingly reported in recent years. To understand the distribution of SD-led and non-SD-led genes, we analyzed 162 completed prokaryotic genomes, which led to several new conclusions and validated earlier, smaller-scale studies. The proportion of SD-led genes among those genomes varies from 11.6% to 90.8%, which implies that the populations of non-SD-led genes, as well as leaderless genes, are significant. We found a strongly conserved SD region in genomes with a high proportion of SD-led genes. Using a t-test, we showed that SD sequence content (SDSC) is not correlated with GC content. We observed that phylogenetically closely related microbes mostly have similar SDSC values, and that archaeal nonleading genes have a higher SDSC. This study shows that the 5'UTRs of prokaryotic genes are highly diverse, particularly when genomes of distantly related organisms are compared, suggesting that more flexible mechanisms are used for translation initiation in various prokaryotes.
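As a rough illustration of what a per-genome SD sequence content measure might look like, the Python sketch below computes the fraction of genes whose upstream region contains an SD-like motif. The abstract does not give the detection criteria, so everything here is an assumption made for illustration: the `GGAGG` core motif, the 20-nt upstream window, exact string matching, and the function names `has_sd_motif` and `sd_content` are hypothetical and are not the authors' method.

```python
from typing import Iterable


def has_sd_motif(upstream: str, motif: str = "GGAGG", window: int = 20) -> bool:
    """Return True if `motif` occurs in the last `window` nt before the start codon.

    `upstream` is the sequence immediately 5' of the start codon, written 5'->3'.
    The motif and window size are illustrative assumptions, not published values.
    """
    if window <= 0:
        raise ValueError("window must be positive")
    region = upstream.upper()[-window:]  # nucleotides closest to the start codon
    return motif.upper() in region


def sd_content(upstreams: Iterable[str], motif: str = "GGAGG", window: int = 20) -> float:
    """Fraction of genes (0.0 to 1.0) whose upstream window contains the motif."""
    seqs = list(upstreams)
    if not seqs:
        raise ValueError("at least one upstream sequence is required")
    hits = sum(has_sd_motif(s, motif, window) for s in seqs)
    return hits / len(seqs)


# Toy upstream regions (made up for this example, not real genomic data).
toy_upstreams = [
    "TTTAAGGAGGTTTAAC",                # motif inside the window
    "ATATATATATATATAT",                # no motif
    "GGAGG" + "T" * 25,                # motif present, but outside the 20-nt window
    "ccaggaggcc",                      # lower-case input, motif inside the window
]

toy_sdsc = sd_content(toy_upstreams)
print(f"SD-led fraction: {toy_sdsc:.1%}")
```

A real analysis would more likely score complementarity to the 3' end of the 16S rRNA than match a fixed string, so this sketch only conveys the shape of the calculation: one proportion per genome, which can then be compared across genomes.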