Instituto Federal de Educação, Ciência e Tecnologia do Sudeste de Minas Gerais, Muriaé, Brazil.
Department of Bioprocess and Biotechnology, School of Agriculture, São Paulo State University (Unesp), Botucatu, Brazil.
Plant Genome. 2020 Nov;13(3):e20043. doi: 10.1002/tpg2.20043. Epub 2020 Aug 28.
Most of the bioinformatics tools for enzyme annotation focus on enzymatic function assignments. Sequence similarity to well-characterized enzymes is often used for functional annotation and to assign metabolic pathways. However, these approaches are not feasible for all sequences leading to inaccurate annotations or lack of metabolic pathway information. Here we present the mApLe (metabolic pathway predictor of plant enzymes), a high-performance machine learning-based tool with models to label the metabolic pathway of enzymes rather than specifying enzymes' reactions. The mApLe uses molecular descriptors of the enzyme sequences to perform predictions without considering sequence similarities with reference sequences. Hence, mApLe can classify a diversity of enzymes, even the ones without any homolog or with incomplete EC numbers. This tool can be used to improve the quality of genomic annotation of plants or to narrow down the number of candidate genes for metabolic engineering researches. The mApLe tool is available online, and the GUI can be locally installed.
大多数用于酶注释的生物信息学工具都侧重于酶的功能分配。序列与经过充分研究的酶的相似性通常用于功能注释和分配代谢途径。然而,这些方法对于所有序列都不可行,导致注释不准确或缺乏代谢途径信息。在这里,我们介绍了 mApLe(植物酶的代谢途径预测器),这是一种基于高性能机器学习的工具,具有对酶的代谢途径进行标记的模型,而不是指定酶的反应。mApLe 使用酶序列的分子描述符进行预测,而不考虑与参考序列的序列相似性。因此,mApLe 可以对各种酶进行分类,即使是没有任何同源物或 EC 编号不完整的酶。该工具可用于提高植物基因组注释的质量,或缩小代谢工程研究的候选基因数量。mApLe 工具可在线使用,并且可以本地安装 GUI。