CAZymes 分析工具包 (CAT)：一种网络服务，可使用 CAZy 数据库搜索和分析新测序生物中的碳水化合物活性酶。

CAZymes Analysis Toolkit (CAT): web service for searching and analyzing carbohydrate-active enzymes in a newly sequenced organism using CAZy database.

机构信息

Computer Science and Mathematics Division.

出版信息

Glycobiology. 2010 Dec;20(12):1574-84. doi: 10.1093/glycob/cwq106. Epub 2010 Aug 9.

PMID:20696711

Abstract

The Carbohydrate-Active Enzyme (CAZy) database provides a rich set of manually annotated enzymes that degrade, modify, or create glycosidic bonds. Despite rich and invaluable information stored in the database, software tools utilizing this information for annotation of newly sequenced genomes by CAZy families are limited. We have employed two annotation approaches to fill the gap between manually curated high-quality protein sequences collected in the CAZy database and the growing number of other protein sequences produced by genome or metagenome sequencing projects. The first approach is based on a similarity search against the entire nonredundant sequences of the CAZy database. The second approach performs annotation using links or correspondences between the CAZy families and protein family domains. The links were discovered using the association rule learning algorithm applied to sequences from the CAZy database. The approaches complement each other and in combination achieved high specificity and sensitivity when cross-evaluated with the manually curated genomes of Clostridium thermocellum ATCC 27405 and Saccharophagus degradans 2-40. The capability of the proposed framework to predict the function of unknown protein domains and of hypothetical proteins in the genome of Neurospora crassa is demonstrated. The framework is implemented as a Web service, the CAZymes Analysis Toolkit, and is available at http://cricket.ornl.gov/cgi-bin/cat.cgi.

摘要

碳水化合物活性酶（CAZy）数据库提供了丰富的手动注释酶，这些酶可降解、修饰或生成糖苷键。尽管数据库中存储了丰富而有价值的信息，但利用这些信息通过 CAZy 家族对新测序基因组进行注释的软件工具仍然有限。我们采用了两种注释方法来填补 CAZy 数据库中精心收集的高质量蛋白质序列与基因组或宏基因组测序项目产生的其他蛋白质序列数量不断增加之间的空白。第一种方法是基于与 CAZy 数据库中所有非冗余序列的相似性搜索。第二种方法使用 CAZy 家族和蛋白质家族结构域之间的链接或对应关系进行注释。这些链接是使用关联规则学习算法在 CAZy 数据库中的序列上发现的。这两种方法相互补充，当与经过精心编辑的梭菌属热纤梭菌 ATCC 27405 和降解球腔菌 2-40 的基因组进行交叉评估时，它们实现了高特异性和灵敏度。该框架预测未知蛋白质结构域和粗糙脉孢菌基因组中假设蛋白质功能的能力得到了证明。该框架实现为 Web 服务，即 CAZymes 分析工具包，可在 http://cricket.ornl.gov/cgi-bin/cat.cgi 上获得。

相似文献

CAZymes Analysis Toolkit (CAT): web service for searching and analyzing carbohydrate-active enzymes in a newly sequenced organism using CAZy database.

Glycobiology. 2010 Dec;20(12):1574-84. doi: 10.1093/glycob/cwq106. Epub 2010 Aug 9.

The carbohydrate-active enzymes database (CAZy) in 2013.

Nucleic Acids Res. 2014 Jan;42(Database issue):D490-5. doi: 10.1093/nar/gkt1178. Epub 2013 Nov 21.

MIPS: analysis and annotation of genome information in 2007.

Nucleic Acids Res. 2008 Jan;36(Database issue):D196-201. doi: 10.1093/nar/gkm980. Epub 2007 Dec 23.

Association algorithm to mine the rules that govern enzyme definition and to classify protein sequences.

BMC Bioinformatics. 2006 Jun 15;7:304. doi: 10.1186/1471-2105-7-304.

Homology to peptide pattern for annotation of carbohydrate-active enzymes and prediction of function.

BMC Bioinformatics. 2017 Apr 12;18(1):214. doi: 10.1186/s12859-017-1625-9.

Predicting functional family of novel enzymes irrespective of sequence similarity: a statistical learning approach.

Nucleic Acids Res. 2004 Dec 7;32(21):6437-44. doi: 10.1093/nar/gkh984. Print 2004.

A database of phylogenetically atypical genes in archaeal and bacterial genomes, identified using the DarkHorse algorithm.

BMC Bioinformatics. 2008 Oct 7;9:419. doi: 10.1186/1471-2105-9-419.

dbCAN2: a meta server for automated carbohydrate-active enzyme annotation.

Nucleic Acids Res. 2018 Jul 2;46(W1):W95-W101. doi: 10.1093/nar/gky418.

MIPS bacterial genomes functional annotation benchmark dataset.

Bioinformatics. 2005 May 15;21(10):2520-1. doi: 10.1093/bioinformatics/bti380. Epub 2005 Mar 15.

The relationship between protein structure and function: a comprehensive survey with application to the yeast genome.

J Mol Biol. 1999 Apr 23;288(1):147-64. doi: 10.1006/jmbi.1999.2661.

引用本文的文献

Galactofuranosidases: From the Initial Activity Detection to the First Crystalline Structure.

ACS Omega. 2025 Jul 11;10(28):29969-29979. doi: 10.1021/acsomega.5c04674. eCollection 2025 Jul 22.

Arginine-Enhanced Mycelia: Improvement in Growth and Lignocellulose Degradation Capabilities.

Foods. 2025 Jan 23;14(3):361. doi: 10.3390/foods14030361.

A comprehensive review on probiotics and their use in aquaculture: Biological control, efficacy, and safety through the genomics and wet methods.

Heliyon. 2024 Dec 4;10(24):e40892. doi: 10.1016/j.heliyon.2024.e40892. eCollection 2024 Dec 30.

Isolation of Sphingopyxis kveilinensis sp. nov., a Potential Antibiotic-Degrading Bacterium, from a Karst Wetland.

Curr Microbiol. 2024 Oct 17;81(12):414. doi: 10.1007/s00284-024-03941-0.

Dual-RNA-sequencing to elucidate the interactions between sorghum and .

Front Fungal Biol. 2024 Aug 16;5:1437344. doi: 10.3389/ffunb.2024.1437344. eCollection 2024.

Characteristics of Corynespora cassiicola, the causal agent of tobacco Corynespora leaf spot, revealed by genomic and metabolic phenomic analysis.

Sci Rep. 2024 Aug 7;14(1):18326. doi: 10.1038/s41598-024-67510-y.

The Yin and Yang of pathogens and probiotics: interplay between sv. Typhimurium and during co-infection.

Front Microbiol. 2024 May 15;15:1387498. doi: 10.3389/fmicb.2024.1387498. eCollection 2024.

Exploring potential polysaccharide utilization loci involved in the degradation of typical marine seaweed polysaccharides by .

Front Microbiol. 2024 May 9;15:1332105. doi: 10.3389/fmicb.2024.1332105. eCollection 2024.

Transcriptomic analysis and carbohydrate metabolism-related enzyme expression across different pH values in .

Front Microbiol. 2024 Mar 6;15:1359830. doi: 10.3389/fmicb.2024.1359830. eCollection 2024.

Hydrated lime promoted the polysaccharide content and affected the transcriptomes of during brown film formation.

Front Microbiol. 2023 Dec 4;14:1290180. doi: 10.3389/fmicb.2023.1290180. eCollection 2023.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

CAZymes 分析工具包 (CAT)：一种网络服务，可使用 CAZy 数据库搜索和分析新测序生物中的碳水化合物活性酶。

CAZymes Analysis Toolkit (CAT): web service for searching and analyzing carbohydrate-active enzymes in a newly sequenced organism using CAZy database.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献