mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support.

Author information

Strasser Kimchi, McDonnell Erin, Nyaga Carol, Wu Min, Wu Sherry, Almeida Hayda, Meurs Marie-Jean, Kosseim Leila, Powlowski Justin, Butler Greg, Tsang Adrian

Affiliations

Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology, Concordia University, Montréal, QC, Canada.

Publication information

Database (Oxford). 2015 Mar 8;2015. doi: 10.1093/database/bav008. Print 2015.

Abstract

Enzymes active on components of lignocellulosic biomass are used for industrial applications ranging from food processing to biofuels production. These include a diverse array of glycoside hydrolases, carbohydrate esterases, polysaccharide lyases and oxidoreductases. Fungi are prolific producers of these enzymes, spurring fungal genome sequencing efforts to identify and catalogue the genes that encode them. To facilitate the functional annotation of these genes, biochemical data on over 800 fungal lignocellulose-degrading enzymes have been collected from the literature and organized into the searchable database, mycoCLAP (http://mycoclap.fungalgenomics.ca). First implemented in 2011, and updated as described here, mycoCLAP is capable of ranking search results according to closest biochemically characterized homologues: this improves the quality of the annotation, and significantly decreases the time required to annotate novel sequences. The database is freely available to the scientific community, as are the open source applications based on natural language processing developed to support the manual curation of mycoCLAP. Database URL: http://mycoclap.fungalgenomics.ca.
