Cantarel Brandi L, Coutinho Pedro M, Rancurel Corinne, Bernard Thomas, Lombard Vincent, Henrissat Bernard
Architecture et Fonction des Macromolécules Biologiques, UMR6098, CNRS, Universités Aix-Marseille I & II, 163 Avenue de Luminy, 13288 Marseille, France.
Nucleic Acids Res. 2009 Jan;37(Database issue):D233-8. doi: 10.1093/nar/gkn663. Epub 2008 Oct 5.
The Carbohydrate-Active Enzyme (CAZy) database is a knowledge-based resource specialized in the enzymes that build and breakdown complex carbohydrates and glycoconjugates. As of September 2008, the database describes the present knowledge on 113 glycoside hydrolase, 91 glycosyltransferase, 19 polysaccharide lyase, 15 carbohydrate esterase and 52 carbohydrate-binding module families. These families are created based on experimentally characterized proteins and are populated by sequences from public databases with significant similarity. Protein biochemical information is continuously curated based on the available literature and structural information. Over 6400 proteins have assigned EC numbers and 700 proteins have a PDB structure. The classification (i) reflects the structural features of these enzymes better than their sole substrate specificity, (ii) helps to reveal the evolutionary relationships between these enzymes and (iii) provides a convenient framework to understand mechanistic properties. This resource has been available for over 10 years to the scientific community, contributing to information dissemination and providing a transversal nomenclature to glycobiologists. More recently, this resource has been used to improve the quality of functional predictions of a number genome projects by providing expert annotation. The CAZy resource resides at URL: http://www.cazy.org/.
碳水化合物活性酶(CAZy)数据库是一个基于知识的资源库,专门收录参与构建和分解复杂碳水化合物及糖缀合物的酶。截至2008年9月,该数据库描述了目前关于113个糖苷水解酶家族、91个糖基转移酶家族、19个多糖裂解酶家族、15个碳水化合物酯酶家族和52个碳水化合物结合模块家族的知识。这些家族是基于经过实验表征的蛋白质创建的,并由来自公共数据库的具有显著相似性的序列填充。蛋白质生化信息会根据现有文献和结构信息不断进行整理。超过6400种蛋白质已被指定酶委员会(EC)编号,700种蛋白质具有蛋白质数据银行(PDB)结构。这种分类方法(i)比单纯的底物特异性更能反映这些酶的结构特征,(ii)有助于揭示这些酶之间的进化关系,(iii)提供了一个便于理解其机制特性的框架。该资源已向科学界开放超过10年,有助于信息传播,并为糖生物学研究人员提供了一个通用的命名法。最近,通过提供专业注释,该资源已被用于提高多个基因组计划功能预测的质量。CAZy资源位于网址:http://www.cazy.org/ 。