Grépinet O, Béguin P
Nucleic Acids Res. 1986 Feb 25;14(4):1791-9. doi: 10.1093/nar/14.4.1791.
The nucleotide sequence of the CelB gene, encoding the extracellular endoglucanase B of Clostridium thermocellum, is reported. The putative start of the 1689 bp coding sequence was assigned to an ATG codon which is preceded by an AGGAGG sequence typical of ribosomal binding sites in Gram-positive bacteria. The amino-terminal end of the deduced protein sequence is similar to signal peptides described for other bacterial secretory proteins. The carboxy-terminal ends of endoglucanases A and B appear to be remarkably homologous. A striking feature of the conserved region is that both proteins contain two reiterated stretches of 23 aminoacids each, separated by 9 residues.
报道了编码嗜热栖热放线菌胞外内切葡聚糖酶B的CelB基因的核苷酸序列。1689bp编码序列的推定起始位点被指定为一个ATG密码子,其前面是革兰氏阳性细菌核糖体结合位点典型的AGGAGG序列。推导的蛋白质序列的氨基末端类似于其他细菌分泌蛋白所描述的信号肽。内切葡聚糖酶A和B的羧基末端似乎具有显著的同源性。保守区域的一个显著特征是两种蛋白质都各自含有两个23个氨基酸的重复片段,中间间隔9个残基。