Department of Biotechnology and Biosciences, University of Milano-Bicocca, Milan, Italy.
Institute of Biochemistry and Technical Biochemistry, University of Stuttgart, Stuttgart, Germany.
PLoS One. 2021 Oct 26;16(10):e0256817. doi: 10.1371/journal.pone.0256817. eCollection 2021.
The glycoside hydrolase 19 (GH19) is a bifunctional family of chitinases and endolysins, which have been studied for the control of plant fungal pests, the recycle of chitin biomass, and the treatment of multi-drug resistant bacteria. The GH19 domain-containing sequences (22,461) were divided into a chitinase and an endolysin subfamily by analyzing sequence networks, guided by taxonomy and the substrate specificity of characterized enzymes. The chitinase subfamily was split into seventeen groups, thus extending the previous classification. The endolysin subfamily is more diverse and consists of thirty-four groups. Despite their sequence diversity, twenty-six residues are conserved in chitinases and endolysins, which can be distinguished by two specific sequence patterns at six and four positions, respectively. Their location outside the catalytic cleft suggests a possible mechanism for substrate specificity that goes beyond the direct interaction with the substrate. The evolution of the GH19 catalytic domain was investigated by large-scale phylogeny. The inferred evolutionary history and putative horizontal gene transfer events differ from previous works. While no clear patterns were detected in endolysins, chitinases varied in sequence length by up to four loop insertions, causing at least eight distinct presence/absence loop combinations. The annotated GH19 sequences and structures are accessible via the GH19 Engineering Database (GH19ED, https://gh19ed.biocatnet.de). The GH19ED has been developed to support the prediction of substrate specificity and the search for novel GH19 enzymes from neglected taxonomic groups or in regions of the sequence space where few sequences have been described yet.
糖苷水解酶 19(GH19)是一个具有几丁质酶和内溶素功能的双功能家族,已被用于控制植物真菌害虫、几丁质生物质的回收以及治疗多药耐药细菌。通过分析序列网络,根据分类学和特征酶的底物特异性,将含有 GH19 结构域的序列(22461)分为几丁质酶和内溶素亚家族。几丁质酶亚家族分为十七组,从而扩展了以前的分类。内溶素亚家族更加多样化,由三十四个组组成。尽管它们的序列多样性很大,但在几丁质酶和内溶素中仍有二十六个残基保守,这可以通过在六个和四个位置的两个特定序列模式来区分。它们位于催化裂缝之外,这表明存在一种可能超越与底物直接相互作用的底物特异性的机制。通过大规模系统发育研究了 GH19 催化结构域的进化。推断的进化历史和可能的水平基因转移事件与以前的工作不同。虽然在内溶素中没有检测到明显的模式,但几丁质酶的序列长度变化最多可达四个环插入,导致至少有八个不同的存在/缺失环组合。GH19 序列和结构的注释可通过 GH19 工程数据库(GH19ED,https://gh19ed.biocatnet.de)获得。GH19ED 的开发旨在支持底物特异性的预测,并从被忽视的分类群或在描述序列较少的序列空间区域中寻找新的 GH19 酶。