Machovic Martin, Svensson Birte, MacGregor E Ann, Janecek Stefan
Institute of Molecular Biology, Slovak Academy of Sciences, Bratislava, Slovakia.
FEBS J. 2005 Nov;272(21):5497-513. doi: 10.1111/j.1742-4658.2005.04942.x.
Approximately 10% of amylolytic enzymes are able to bind and degrade raw starch. Usually a distinct domain, the starch-binding domain (SBD), is responsible for this property. These domains have been classified into families of carbohydrate-binding modules (CBM). At present, there are six SBD families: CBM20, CBM21, CBM25, CBM26, CBM34, and CBM41. This work is concentrated on CBM20 and CBM21. The CBM20 module was believed to be located almost exclusively at the C-terminal end of various amylases. The CBM21 module was known as the N-terminally positioned SBD of Rhizopus glucoamylase. Nowadays many nonamylolytic proteins have been recognized as possessing sequence segments that exhibit similarities with the experimentally observed CBM20 and CBM21. These facts have stimulated interest in carrying out a rigorous bioinformatics analysis of the two CBM families. The present analysis showed that the original idea of the CBM20 module being at the C-terminus and the CBM21 module at the N-terminus of a protein should be modified. Although the CBM20 functionally important tryptophans were found to be substituted in several cases, these aromatics and the regions around them belong to the best conserved parts of the CBM20 module. They were therefore used as templates for revealing the corresponding regions in the CBM21 family. Secondary structure prediction together with fold recognition indicated that the CBM21 module structure should be similar to that of CBM20. The evolutionary tree based on a common alignment of sequences of both modules showed that the CBM21 SBDs from alpha-amylases and glucoamylases are the closest relatives to the CBM20 counterparts, with the CBM20 modules from the glycoside hydrolase family GH13 amylopullulanases being possible candidates for the intermediate between the two CBM families.
约10%的淀粉分解酶能够结合并降解生淀粉。通常,一个独特的结构域,即淀粉结合结构域(SBD),负责这一特性。这些结构域已被归类为碳水化合物结合模块(CBM)家族。目前,有六个SBD家族:CBM20、CBM21、CBM25、CBM26、CBM34和CBM41。本研究集中于CBM20和CBM21。人们认为CBM20模块几乎只位于各种淀粉酶的C末端。CBM21模块是根霉葡糖淀粉酶N末端定位的SBD。如今,许多非淀粉分解蛋白已被认为具有与实验观察到的CBM20和CBM21表现出相似性的序列片段。这些事实激发了对这两个CBM家族进行严格生物信息学分析的兴趣。目前的分析表明,关于CBM20模块位于蛋白质C末端而CBM21模块位于N末端的最初观点应予以修正。尽管在一些情况下发现CBM20功能重要的色氨酸被取代,但这些芳香族氨基酸及其周围区域属于CBM20模块中保守性最好的部分。因此,它们被用作揭示CBM21家族相应区域的模板。二级结构预测与折叠识别表明,CBM21模块结构应与CBM20相似。基于两个模块序列共同比对的进化树表明,来自α-淀粉酶和葡糖淀粉酶的CBM21 SBD与CBM20对应物关系最为密切,来自糖苷水解酶家族GH13支链淀粉酶的CBM20模块可能是这两个CBM家族之间的中间候选者。