Béguin P, Cornet P, Aubert J P
J Bacteriol. 1985 Apr;162(1):102-5. doi: 10.1128/jb.162.1.102-105.1985.
The nucleotide sequence of the celA gene, encoding the extracellular endoglucanase A of Clostridium thermocellum, was determined and compared with the NH2-terminal amino acid sequence of the purified enzyme. The mature protein appeared to be extended by a signal sequence of 32 amino acids. A segment of 23 amino acids was duplicated at the COOH-terminal end of the protein. The putative GUG initiation codon was preceded by an AGGAGG sequence, typical of procaryotic ribosomal binding sites. The segment of DNA presumably specifying transcriptional initiation contained a high percentage of adenine and thymine residues, including an adenine-thymine tract extending over 54 base pairs.