Wang Zechen, Xie Dongqi, Wu Dong, Luo Xiaozhou, Wang Sheng, Li Yangyang, Yang Yanmei, Li Weifeng, Zheng Liangzhen
School of Physics, Shandong University, Jinan, 250100, Shandong, China.
Shanghai Zelixir Biotech Co. Ltd, Shanghai, 201210, Shanghai, China.
Nat Commun. 2025 Mar 20;16(1):2736. doi: 10.1038/s41467-025-58038-4.
Accurate prediction of enzyme kinetic parameters is crucial for enzyme exploration and modification. Existing models face the problem of either low accuracy or poor generalization ability due to overfitting. In this work, we first developed unbiased datasets to evaluate the actual performance of these methods and proposed a deep learning model, CataPro, based on pre-trained models and molecular fingerprints to predict turnover number (k), Michaelis constant (K), and catalytic efficiency (k/K). Compared with previous baseline models, CataPro demonstrates clearly enhanced accuracy and generalization ability on the unbiased datasets. In a representational enzyme mining project, by combining CataPro with traditional methods, we identified an enzyme (SsCSO) with 19.53 times increased activity compared to the initial enzyme (CSO2) and then successfully engineered it to improve its activity by 3.34 times. This reveals the high potential of CataPro as an effective tool for future enzyme discovery and modification.
准确预测酶动力学参数对于酶的探索和改造至关重要。由于过拟合,现有模型面临准确率低或泛化能力差的问题。在这项工作中,我们首先开发了无偏数据集来评估这些方法的实际性能,并基于预训练模型和分子指纹提出了一种深度学习模型CataPro,用于预测周转数(k)、米氏常数(K)和催化效率(k/K)。与之前的基线模型相比,CataPro在无偏数据集上表现出明显提高的准确率和泛化能力。在一个具有代表性的酶挖掘项目中,通过将CataPro与传统方法相结合,我们鉴定出一种酶(SsCSO),其活性比初始酶(CSO2)提高了19.53倍,然后成功对其进行工程改造,使其活性提高了3.34倍。这揭示了CataPro作为未来酶发现和改造的有效工具的巨大潜力。