Zhu Qingsong, Arakane Yasuyuki, Banerjee Debarshi, Beeman Richard W, Kramer Karl J, Muthukrishnan Subbaratnam
Department of Biochemistry, Kansas State University, Manhattan, KS 66506, USA.
Insect Biochem Mol Biol. 2008 Apr;38(4):452-66. doi: 10.1016/j.ibmb.2007.06.010. Epub 2007 Jul 5.
A bioinformatics-based investigation of three insect species with completed genome sequences has revealed that insect chitinase-like proteins (glycosylhydrolase family 18) are encoded by a rather large and diverse group of genes. We identified 16, 16 and 13 putative chitinase-like genes in the genomic databases of the red flour beetle, Tribolium castaneum, the fruit fly, Drosophila melanogaster, and the malaria mosquito, Anopheles gambiae, respectively. Chitinase-like proteins encoded by this gene family were classified into five groups based on phylogenetic analyses. Group I chitinases are secreted proteins that are the most abundant such enzymes in molting fluid and/or integument, and represent the prototype enzyme of the family, with a single copy each of the catalytic domain and chitin-binding domain (ChBD) connected by an S/T-rich linker polypeptide. Group II chitinases are unusually larger-sized secreted proteins that contain multiple catalytic domains and ChBDs. Group III chitinases contain two catalytic domains and are predicted to be membrane-anchored proteins. Group IV chitinases are the most divergent. They usually lack a ChBD and/or an S/T-rich linker domain, and are known or predicted to be secreted proteins found in gut or fat body. Group V proteins include the putative chitinase-like imaginal disc growth factors (IDGFs). In each of the three insect genomes, multiple genes encode group IV and group V chitinase-like proteins. In contrast, groups I-III are each represented by only a singe gene in each species.
一项基于生物信息学对三种已完成基因组测序的昆虫物种的研究表明,昆虫几丁质酶样蛋白(糖基水解酶家族18)由相当大且多样的一组基因编码。我们分别在赤拟谷盗、黑腹果蝇和冈比亚按蚊的基因组数据库中鉴定出16个、16个和13个假定的几丁质酶样基因。基于系统发育分析,该基因家族编码的几丁质酶样蛋白被分为五组。第一组几丁质酶是分泌蛋白,是蜕皮液和/或体表中最丰富的此类酶,代表了该家族的原型酶,其催化结构域和几丁质结合结构域(ChBD)各有一个拷贝,由富含S/T的连接多肽连接。第二组几丁质酶是异常大的分泌蛋白,含有多个催化结构域和ChBD。第三组几丁质酶含有两个催化结构域,预计为膜锚定蛋白。第四组几丁质酶差异最大。它们通常缺乏ChBD和/或富含S/T的连接结构域,已知或预计为在肠道或脂肪体中发现的分泌蛋白。第五组蛋白包括假定类似几丁质酶的成虫盘生长因子(IDGF)。在这三种昆虫的每个基因组中,多个基因编码第四组和第五组几丁质酶样蛋白。相比之下,第一至三组在每个物种中各仅由一个基因代表。