Suppr超能文献

mycoCLAP,真菌来源的木质纤维素活性蛋白特征数据库:资源与文本挖掘管理支持

mycoCLAP, the database for characterized lignocellulose-active proteins of fungal origin: resource and text mining curation support.

作者信息

Strasser Kimchi, McDonnell Erin, Nyaga Carol, Wu Min, Wu Sherry, Almeida Hayda, Meurs Marie-Jean, Kosseim Leila, Powlowski Justin, Butler Greg, Tsang Adrian

机构信息

Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA.

Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA.

出版信息

Database (Oxford). 2015 Mar 8;2015. doi: 10.1093/database/bav008. Print 2015.

Abstract

Enzymes active on components of lignocellulosic biomass are used for industrial applications ranging from food processing to biofuels production. These include a diverse array of glycoside hydrolases, carbohydrate esterases, polysaccharide lyases and oxidoreductases. Fungi are prolific producers of these enzymes, spurring fungal genome sequencing efforts to identify and catalogue the genes that encode them. To facilitate the functional annotation of these genes, biochemical data on over 800 fungal lignocellulose-degrading enzymes have been collected from the literature and organized into the searchable database, mycoCLAP (http://mycoclap.fungalgenomics.ca). First implemented in 2011, and updated as described here, mycoCLAP is capable of ranking search results according to closest biochemically characterized homologues: this improves the quality of the annotation, and significantly decreases the time required to annotate novel sequences. The database is freely available to the scientific community, as are the open source applications based on natural language processing developed to support the manual curation of mycoCLAP. Database URL: http://mycoclap.fungalgenomics.ca.

摘要

作用于木质纤维素生物质成分的酶被用于从食品加工到生物燃料生产等一系列工业应用中。这些酶包括各种各样的糖苷水解酶、碳水化合物酯酶、多糖裂解酶和氧化还原酶。真菌是这些酶的丰富生产者,这促使人们对真菌基因组进行测序,以识别和编目编码这些酶的基因。为了便于对这些基因进行功能注释,已从文献中收集了800多种真菌木质纤维素降解酶的生化数据,并将其整理到可搜索的数据库mycoCLAP(http://mycoclap.fungalgenomics.ca)中。mycoCLAP于2011年首次推出,并按此处所述进行了更新,它能够根据最接近的具有生化特征的同源物对搜索结果进行排名:这提高了注释的质量,并显著减少了注释新序列所需的时间。该数据库可供科学界免费使用,基于自然语言处理开发的支持mycoCLAP人工管理的开源应用程序也是如此。数据库网址:http://mycoclap.fungalgenomics.ca。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f916/4352688/803a57c37907/bav008f1p.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验