• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

MPEK:基于预训练语言模型的酶反应动力学参数预测的多任务深度学习框架。

MPEK: a multitask deep learning framework based on pretrained language models for enzymatic reaction kinetic parameters prediction.

机构信息

State Key Laboratory of NBC Protection for Civilian, No. 37 South Central Street, Yangfang Town, Changping District, Beijing 102205, China.

出版信息

Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae387.

DOI:10.1093/bib/bbae387
PMID:39129365
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11317537/
Abstract

Enzymatic reaction kinetics are central in analyzing enzymatic reaction mechanisms and target-enzyme optimization, and thus in biomanufacturing and other industries. The enzyme turnover number (kcat) and Michaelis constant (Km), key kinetic parameters for measuring enzyme catalytic efficiency, are crucial for analyzing enzymatic reaction mechanisms and the directed evolution of target enzymes. Experimental determination of kcat and Km is costly in terms of time, labor, and cost. To consider the intrinsic connection between kcat and Km and further improve the prediction performance, we propose a universal pretrained multitask deep learning model, MPEK, to predict these parameters simultaneously while considering pH, temperature, and organismal information. Through testing on the same kcat and Km test datasets, MPEK demonstrated superior prediction performance over the previous models. Specifically, MPEK achieved the Pearson coefficient of 0.808 for predicting kcat, improving ca. 14.6% and 7.6% compared to the DLKcat and UniKP models, and it achieved the Pearson coefficient of 0.777 for predicting Km, improving ca. 34.9% and 53.3% compared to the Kroll_model and UniKP models. More importantly, MPEK was able to reveal enzyme promiscuity and was sensitive to slight changes in the mutant enzyme sequence. In addition, in three case studies, it was shown that MPEK has the potential for assisted enzyme mining and directed evolution. To facilitate in silico evaluation of enzyme catalytic efficiency, we have established a web server implementing this model, which can be accessed at http://mathtc.nscc-tj.cn/mpek.

摘要

酶反应动力学在分析酶反应机制和靶酶优化方面至关重要,因此在生物制造和其他行业中也具有重要意义。酶转换数(kcat)和米氏常数(Km)是衡量酶催化效率的关键动力学参数,对于分析酶反应机制和定向进化靶酶至关重要。kcat 和 Km 的实验测定在时间、劳动力和成本方面都非常昂贵。为了考虑 kcat 和 Km 之间的内在联系,并进一步提高预测性能,我们提出了一种通用的预训练多任务深度学习模型 MPEK,该模型可以同时预测这些参数,同时考虑 pH 值、温度和生物体信息。通过在相同的 kcat 和 Km 测试数据集上进行测试,MPEK 展示了优于先前模型的预测性能。具体来说,MPEK 在预测 kcat 方面的 Pearson 系数为 0.808,与 DLKcat 和 UniKP 模型相比,提高了约 14.6%和 7.6%;在预测 Km 方面的 Pearson 系数为 0.777,与 Kroll_model 和 UniKP 模型相比,提高了约 34.9%和 53.3%。更重要的是,MPEK 能够揭示酶的多功能性,并且对突变酶序列的微小变化敏感。此外,在三个案例研究中,表明 MPEK 具有辅助酶挖掘和定向进化的潜力。为了方便酶催化效率的计算机评估,我们建立了一个实现该模型的网络服务器,可在 http://mathtc.nscc-tj.cn/mpek 访问。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/ca304236da84/bbae387f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/d83dbce07109/bbae387f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/1201eb9f68d8/bbae387f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/ef79cecc165a/bbae387f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/f02f22411588/bbae387f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/0addbb9fabb7/bbae387f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/ca304236da84/bbae387f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/d83dbce07109/bbae387f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/1201eb9f68d8/bbae387f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/ef79cecc165a/bbae387f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/f02f22411588/bbae387f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/0addbb9fabb7/bbae387f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b5b/11317537/ca304236da84/bbae387f6.jpg

相似文献

1
MPEK: a multitask deep learning framework based on pretrained language models for enzymatic reaction kinetic parameters prediction.MPEK:基于预训练语言模型的酶反应动力学参数预测的多任务深度学习框架。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae387.
2
UniKP: a unified framework for the prediction of enzyme kinetic parameters.UniKP:一种用于预测酶动力学参数的统一框架。
Nat Commun. 2023 Dec 11;14(1):8211. doi: 10.1038/s41467-023-44113-1.
3
DeepEnzyme: a robust deep learning model for improved enzyme turnover number prediction by utilizing features of protein 3D-structures.DeepEnzyme:一种强大的深度学习模型,通过利用蛋白质 3D 结构的特征,提高酶转化数预测的准确性。
Brief Bioinform. 2024 Jul 25;25(5). doi: 10.1093/bib/bbae409.
4
GraphKM: machine and deep learning for K prediction of wildtype and mutant enzymes.GraphKM:用于野生型和突变酶 K 预测的机器学习和深度学习。
BMC Bioinformatics. 2024 Mar 28;25(1):135. doi: 10.1186/s12859-024-05746-1.
5
Turnover number predictions for kinetically uncharacterized enzymes using machine and deep learning.使用机器学习和深度学习预测动力学特征未知的酶的周转率。
Nat Commun. 2023 Jul 12;14(1):4139. doi: 10.1038/s41467-023-39840-4.
6
DLKcat cannot predict meaningful values for mutants and unfamiliar enzymes.DLKcat无法预测突变体和不熟悉的酶的有意义的值。
Biol Methods Protoc. 2024 Aug 24;9(1):bpae061. doi: 10.1093/biomethods/bpae061. eCollection 2024.
7
Deep Large-Scale Multitask Learning Network for Gene Expression Inference.深度大规模多任务学习网络用于基因表达推断。
J Comput Biol. 2021 May;28(5):485-500. doi: 10.1089/cmb.2020.0438.
8
deepNEC: a novel alignment-free tool for the identification and classification of nitrogen biochemical network-related enzymes using deep learning.深度 NEC:一种新颖的无对齐工具,用于使用深度学习识别和分类与氮生化网络相关的酶。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac071.
9
A hierarchical deep learning based approach for multi-functional enzyme classification.基于深度学习的多层次方法用于多功能酶分类。
Protein Sci. 2021 Sep;30(9):1935-1945. doi: 10.1002/pro.4146. Epub 2021 Jun 28.
10
Construction of an enzyme-constrained metabolic network model for Myceliophthora thermophila using machine learning-based k data.基于机器学习的 k 数据构建嗜热毁丝霉酶约束代谢网络模型
Microb Cell Fact. 2024 May 15;23(1):138. doi: 10.1186/s12934-024-02415-z.

引用本文的文献

1
Mapping multi-substrate specificity of Arabidopsis aminotransferases.解析拟南芥转氨酶的多底物特异性
Nat Plants. 2025 Sep 12. doi: 10.1038/s41477-025-02095-6.
2
A structure-oriented kinetics dataset of enzyme-substrate interactions.一个面向结构的酶-底物相互作用动力学数据集。
Sci Data. 2025 Aug 26;12(1):1489. doi: 10.1038/s41597-025-05829-5.
3
IECata: interpretable bilinear attention network and evidential deep learning improve the catalytic efficiency prediction of enzymes.IECata:可解释的双线性注意力网络和证据深度学习改进了酶的催化效率预测

本文引用的文献

1
UniKP: a unified framework for the prediction of enzyme kinetic parameters.UniKP:一种用于预测酶动力学参数的统一框架。
Nat Commun. 2023 Dec 11;14(1):8211. doi: 10.1038/s41467-023-44113-1.
2
ADMET property prediction via multi-task graph learning under adaptive auxiliary task selection.基于自适应辅助任务选择的多任务图学习进行ADMET性质预测
iScience. 2023 Oct 24;26(11):108285. doi: 10.1016/j.isci.2023.108285. eCollection 2023 Nov 17.
3
Turnover number predictions for kinetically uncharacterized enzymes using machine and deep learning.
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf283.
4
NNKcat: deep neural network to predict catalytic constants (Kcat) by integrating protein sequence and substrate structure with enhanced data imbalance handling.NNKcat:通过整合蛋白质序列和底物结构并增强数据不平衡处理来预测催化常数(Kcat)的深度神经网络。
Brief Bioinform. 2025 May 1;26(3). doi: 10.1093/bib/bbaf212.
5
Fermi calculations enable quick downselection of target genes and process optimization in photosynthetic systems.费米计算能够在光合系统中快速筛选目标基因并优化过程。
Plant Physiol. 2025 Jul 3;198(3). doi: 10.1093/plphys/kiaf103.
6
Robust enzyme discovery and engineering with deep learning using CataPro.使用CataPro通过深度学习进行强大的酶发现与工程设计。
Nat Commun. 2025 Mar 20;16(1):2736. doi: 10.1038/s41467-025-58038-4.
7
SELFprot: Effective and Efficient Multitask Finetuning Methods for Protein Parameter Prediction.SELFprot:用于蛋白质参数预测的高效多任务微调方法
J Chem Inf Model. 2025 Apr 14;65(7):3226-3238. doi: 10.1021/acs.jcim.4c02230. Epub 2025 Mar 17.
8
Seq2Topt: a sequence-based deep learning predictor of enzyme optimal temperature.Seq2Topt:一种基于序列的酶最佳温度深度学习预测器。
Brief Bioinform. 2025 Mar 4;26(2). doi: 10.1093/bib/bbaf114.
9
RBC-GEM: A genome-scale metabolic model for systems biology of the human red blood cell.红细胞基因组规模代谢模型(RBC-GEM):用于人类红细胞系统生物学的基因组规模代谢模型。
PLoS Comput Biol. 2025 Mar 12;21(3):e1012109. doi: 10.1371/journal.pcbi.1012109. eCollection 2025 Mar.
10
CatPred: a comprehensive framework for deep learning in vitro enzyme kinetic parameters.CatPred:用于深度学习体外酶动力学参数的综合框架。
Nat Commun. 2025 Feb 28;16(1):2072. doi: 10.1038/s41467-025-57215-9.
使用机器学习和深度学习预测动力学特征未知的酶的周转率。
Nat Commun. 2023 Jul 12;14(1):4139. doi: 10.1038/s41467-023-39840-4.
4
Analytical Bounds on the Local Lipschitz Constants of ReLU Networks.ReLU网络局部Lipschitz常数的解析界
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):13902-13913. doi: 10.1109/TNNLS.2023.3273228. Epub 2024 Oct 7.
5
A general model to predict small molecule substrates of enzymes based on machine and deep learning.基于机器学习和深度学习的酶小分子底物通用预测模型。
Nat Commun. 2023 May 15;14(1):2787. doi: 10.1038/s41467-023-38347-2.
6
Fast Multiview Clustering by Optimal Graph Mining.通过最优图挖掘实现快速多视图聚类
IEEE Trans Neural Netw Learn Syst. 2024 Sep;35(9):13071-13077. doi: 10.1109/TNNLS.2023.3256066. Epub 2024 Sep 3.
7
Evolutionary-scale prediction of atomic-level protein structure with a language model.用语言模型进行原子级蛋白质结构的进化尺度预测。
Science. 2023 Mar 17;379(6637):1123-1130. doi: 10.1126/science.ade2574. Epub 2023 Mar 16.
8
A novel, genetically encoded whole-cell biosensor for directed evolution of myrcene synthase in Escherichia coli.一种用于在大肠杆菌中对月桂烯合酶进行定向进化的新型基因编码全细胞生物传感器。
Biosens Bioelectron. 2023 May 15;228:115176. doi: 10.1016/j.bios.2023.115176. Epub 2023 Mar 8.
9
DEML: Drug Synergy and Interaction Prediction Using Ensemble-Based Multi-Task Learning.DEML:基于集成的多任务学习的药物协同和相互作用预测。
Molecules. 2023 Jan 14;28(2):844. doi: 10.3390/molecules28020844.
10
Learning the protein language of proteome-wide protein-protein binding sites via explainable ensemble deep learning.通过可解释的集成深度学习学习蛋白质组范围内蛋白质-蛋白质结合位点的蛋白质语言。
Commun Biol. 2023 Jan 19;6(1):73. doi: 10.1038/s42003-023-04462-5.