State Key Laboratory of NBC Protection for Civilian, No. 37 South Central Street, Yangfang Town, Changping District, Beijing 102205, China.
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae387.
Enzymatic reaction kinetics are central in analyzing enzymatic reaction mechanisms and target-enzyme optimization, and thus in biomanufacturing and other industries. The enzyme turnover number (kcat) and Michaelis constant (Km), key kinetic parameters for measuring enzyme catalytic efficiency, are crucial for analyzing enzymatic reaction mechanisms and the directed evolution of target enzymes. Experimental determination of kcat and Km is costly in terms of time, labor, and cost. To consider the intrinsic connection between kcat and Km and further improve the prediction performance, we propose a universal pretrained multitask deep learning model, MPEK, to predict these parameters simultaneously while considering pH, temperature, and organismal information. Through testing on the same kcat and Km test datasets, MPEK demonstrated superior prediction performance over the previous models. Specifically, MPEK achieved the Pearson coefficient of 0.808 for predicting kcat, improving ca. 14.6% and 7.6% compared to the DLKcat and UniKP models, and it achieved the Pearson coefficient of 0.777 for predicting Km, improving ca. 34.9% and 53.3% compared to the Kroll_model and UniKP models. More importantly, MPEK was able to reveal enzyme promiscuity and was sensitive to slight changes in the mutant enzyme sequence. In addition, in three case studies, it was shown that MPEK has the potential for assisted enzyme mining and directed evolution. To facilitate in silico evaluation of enzyme catalytic efficiency, we have established a web server implementing this model, which can be accessed at http://mathtc.nscc-tj.cn/mpek.
酶反应动力学在分析酶反应机制和靶酶优化方面至关重要,因此在生物制造和其他行业中也具有重要意义。酶转换数(kcat)和米氏常数(Km)是衡量酶催化效率的关键动力学参数,对于分析酶反应机制和定向进化靶酶至关重要。kcat 和 Km 的实验测定在时间、劳动力和成本方面都非常昂贵。为了考虑 kcat 和 Km 之间的内在联系,并进一步提高预测性能,我们提出了一种通用的预训练多任务深度学习模型 MPEK,该模型可以同时预测这些参数,同时考虑 pH 值、温度和生物体信息。通过在相同的 kcat 和 Km 测试数据集上进行测试,MPEK 展示了优于先前模型的预测性能。具体来说,MPEK 在预测 kcat 方面的 Pearson 系数为 0.808,与 DLKcat 和 UniKP 模型相比,提高了约 14.6%和 7.6%;在预测 Km 方面的 Pearson 系数为 0.777,与 Kroll_model 和 UniKP 模型相比,提高了约 34.9%和 53.3%。更重要的是,MPEK 能够揭示酶的多功能性,并且对突变酶序列的微小变化敏感。此外,在三个案例研究中,表明 MPEK 具有辅助酶挖掘和定向进化的潜力。为了方便酶催化效率的计算机评估,我们建立了一个实现该模型的网络服务器,可在 http://mathtc.nscc-tj.cn/mpek 访问。