Jung E D, Lao G, Irwin D, Barr B K, Benjamin A, Wilson D B
Section of Biochemistry, Molecular and Cell Biology, Cornell University, Ithaca, New York 14853.
Appl Environ Microbiol. 1993 Sep;59(9):3032-43. doi: 10.1128/aem.59.9.3032-3043.1993.
Two genes encoding cellulases E1 and E4 from Thermomonospora fusca have been cloned in Escherichia coli, and their DNA sequences have been determined. Both genes were introduced into Streptomyces lividans, and the enzymes were purified from the culture supernatants of transformants. E1 and E4 were expressed 18- and 4-fold higher, respectively, in S. lividans than in E. coli. Thin-layer chromatography of digestion products showed that E1 digests cellotriose, cellotetraose, and cellopentaose to cellobiose and a trace of glucose. E4 is poor at degrading cellotriose and cleaves cellopentaose to cellotetraose and glucose or cellotriose and cellobiose. It readily cleaves cellotetraose to cellobiose. E1 shows 59% identity to Cellulomonas fumi CenC in a 689-amino-acid overlap, and E4 shows 80% identity to the N terminus of C. fimi CenB in a 441-amino-acid overlap; all of these proteins are members of cellulase family E. Alignment of the amino acid sequences of Clostridium thermocellum celD, E1, E4, and four other members of family E demonstrates a clear relationship between their catalytic domains, although there is as little as 25% identity between some of them. Residues in celD that have been identified by site-directed mutagenesis and chemical modification to be important for catalytic activity are conserved in all seven proteins. The catalytic domains of E1 and E4 are not similar to those of T. fusca E2 or E5, but all four enzymes share similar cellulose-binding domains and have the same 14-bp inverted repeat upstream of their initiation codons. This sequence has been identified previously as the binding site for a protein that regulates induction.
来自嗜热栖热放线菌的两个编码纤维素酶E1和E4的基因已在大肠杆菌中克隆,并测定了它们的DNA序列。这两个基因都被导入了变铅青链霉菌,酶从转化体的培养上清液中纯化出来。E1和E4在变铅青链霉菌中的表达分别比在大肠杆菌中高18倍和4倍。消化产物的薄层色谱显示,E1将纤维三糖、纤维四糖和纤维五糖消化为纤维二糖和微量葡萄糖。E4降解纤维三糖的能力较差,将纤维五糖裂解为纤维四糖和葡萄糖或纤维三糖和纤维二糖。它很容易将纤维四糖裂解为纤维二糖。E1在689个氨基酸的重叠区域与烟纤维单胞菌CenC有59%的同一性,E4在441个氨基酸的重叠区域与嗜热栖热放线菌CenB的N端有80%的同一性;所有这些蛋白质都是纤维素酶E家族的成员。嗜热栖热放线菌celD、E1、E4和E家族的其他四个成员的氨基酸序列比对表明,尽管它们之间的同一性低至25%,但它们的催化结构域之间存在明显的关系。通过定点诱变和化学修饰确定对催化活性重要的celD中的残基在所有七种蛋白质中都是保守的。E1和E4的催化结构域与嗜热栖热放线菌E2或E5的催化结构域不相似,但所有四种酶都具有相似的纤维素结合结构域,并且在其起始密码子上游具有相同的14bp反向重复序列。该序列先前已被确定为一种调节诱导的蛋白质的结合位点。