Rasmussen Simon, Nielsen Henrik Bjørn, Jarmer Hanne
Center for Biological Sequence Analysis, Department of Systems Biology, Technical University of Denmark, Lyngby, Denmark.
Mol Microbiol. 2009 Sep;73(6):1043-57. doi: 10.1111/j.1365-2958.2009.06830.x. Epub 2009 Aug 4.
The majority of all genes have so far been identified and annotated systematically through in silico gene finding. Here we report the finding of 3662 strand-specific transcriptionally active regions (TARs) in the genome of Bacillus subtilis by the use of tiling arrays. We have measured the genome-wide expression during mid-exponential growth on rich (LB) and minimal (M9) medium. The identified TARs account for 77.3% of the genes as they are currently annotated and additionally we find 84 putative non-coding RNAs (ncRNAs) and 127 antisense transcripts. One ncRNA, ncr22, is predicted to act as a translational control on cstA and an antisense transcript was observed opposite the housekeeping sigma factor sigA. Through this work we have discovered a long conserved 3' untranslated region (UTR) in a group of membrane-associated genes that is predicted to fold into a large and highly stable secondary structure. One of the genes having this tail is efeN, which encodes a target of the twin-arginine translocase (Tat) protein translocation system.
到目前为止,大多数基因已通过电子基因预测被系统地识别和注释。在此,我们报告利用平铺阵列在枯草芽孢杆菌基因组中发现了3662个链特异性转录活性区域(TAR)。我们测定了在丰富(LB)和基本(M9)培养基中指数中期生长期间的全基因组表达。已识别的TAR占当前注释基因的77.3%,此外,我们还发现了84个假定的非编码RNA(ncRNA)和127个反义转录本。预测一个ncRNA,即ncr22,对cstA起翻译调控作用,并且在管家σ因子sigA的对面观察到一个反义转录本。通过这项工作,我们在一组与膜相关的基因中发现了一个长的保守3'非翻译区(UTR),预测其可折叠成一个大的且高度稳定的二级结构。具有这种尾部的基因之一是efeN,它编码双精氨酸转运酶(Tat)蛋白转运系统的一个靶标。