Strasser Kimchi, McDonnell Erin, Nyaga Carol, Wu Min, Wu Sherry, Almeida Hayda, Meurs Marie-Jean, Kosseim Leila, Powlowski Justin, Butler Greg, Tsang Adrian
Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA.
Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA Centre for Structural and Functional Genomics, Department of Computer Science and Software Engineering, Department of Chemistry and Biochemistry, and Department of Biology Concordia University, Montréal, QC, USA.
Database (Oxford). 2015 Mar 8;2015. doi: 10.1093/database/bav008. Print 2015.
Enzymes active on components of lignocellulosic biomass are used for industrial applications ranging from food processing to biofuels production. These include a diverse array of glycoside hydrolases, carbohydrate esterases, polysaccharide lyases and oxidoreductases. Fungi are prolific producers of these enzymes, spurring fungal genome sequencing efforts to identify and catalogue the genes that encode them. To facilitate the functional annotation of these genes, biochemical data on over 800 fungal lignocellulose-degrading enzymes have been collected from the literature and organized into the searchable database, mycoCLAP (http://mycoclap.fungalgenomics.ca). First implemented in 2011, and updated as described here, mycoCLAP is capable of ranking search results according to closest biochemically characterized homologues: this improves the quality of the annotation, and significantly decreases the time required to annotate novel sequences. The database is freely available to the scientific community, as are the open source applications based on natural language processing developed to support the manual curation of mycoCLAP. Database URL: http://mycoclap.fungalgenomics.ca.
作用于木质纤维素生物质成分的酶被用于从食品加工到生物燃料生产等一系列工业应用中。这些酶包括各种各样的糖苷水解酶、碳水化合物酯酶、多糖裂解酶和氧化还原酶。真菌是这些酶的丰富生产者,这促使人们对真菌基因组进行测序,以识别和编目编码这些酶的基因。为了便于对这些基因进行功能注释,已从文献中收集了800多种真菌木质纤维素降解酶的生化数据,并将其整理到可搜索的数据库mycoCLAP(http://mycoclap.fungalgenomics.ca)中。mycoCLAP于2011年首次推出,并按此处所述进行了更新,它能够根据最接近的具有生化特征的同源物对搜索结果进行排名:这提高了注释的质量,并显著减少了注释新序列所需的时间。该数据库可供科学界免费使用,基于自然语言处理开发的支持mycoCLAP人工管理的开源应用程序也是如此。数据库网址:http://mycoclap.fungalgenomics.ca。