Fliss E R, Setlow P
J Bacteriol. 1984 Jun;158(3):809-13. doi: 10.1128/jb.158.3.809-813.1984.
The nucleotide sequence of the Bacillus megaterium protein C gene, encompassing the coding region and 341 base pairs of flanking regions, has been determined. The gene codes for a 72-residue protein whose predicted amino acid sequence is identical to that previously determined for protein C with the exception of an amino-terminal methionine predicted from the gene sequence, but not found in the mature protein. The translational initiation codon is preceded by an 11-base pair sequence highly complementary to the 3' terminus of B. megaterium 16S rRNA. Protection against S1 nuclease digestion by hybridization of a protein C gene fragment to RNA containing high levels of protein C mRNA localized the transcription initiation site 108 base pairs upstream from the translation start site. Upstream from the transcription initiation site there are no obvious homologies with conserved regions of promoters for previously described B. subtilis vegetative or sporulation genes.
已确定巨大芽孢杆菌蛋白C基因的核苷酸序列,该序列涵盖编码区及侧翼区域的341个碱基对。该基因编码一种72个氨基酸残基的蛋白质,其预测的氨基酸序列与先前确定的蛋白C相同,只是从基因序列预测的氨基端甲硫氨酸在成熟蛋白中未发现。翻译起始密码子之前有一段11个碱基对的序列,与巨大芽孢杆菌16S rRNA的3'末端高度互补。通过将蛋白C基因片段与含有高水平蛋白C mRNA的RNA杂交来防止S1核酸酶消化,从而确定转录起始位点位于翻译起始位点上游108个碱基对处。在转录起始位点上游,与先前描述的枯草芽孢杆菌营养或芽孢形成基因的启动子保守区域没有明显的同源性。