• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于酶促反应预测酶功能的通用模型。

A general model for predicting enzyme functions based on enzymatic reactions.

作者信息

Qian Wenjia, Wang Xiaorui, Kang Yu, Pan Peichen, Hou Tingjun, Hsieh Chang-Yu

机构信息

College of Pharmaceutical Sciences, Zhejiang University, Hangzhou, 310058, Zhejiang, China.

Dr. Neher's Biophysics Laboratory for Innovative Drug Discovery, State Key Laboratory of Quality Research in Chinese Medicine, Macau Institute for Applied Research in Medicine and Health, Macau University of Science and Technology, Macao, 999078, China.

出版信息

J Cheminform. 2024 Mar 31;16(1):38. doi: 10.1186/s13321-024-00827-y.

DOI:10.1186/s13321-024-00827-y
PMID:38556873
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10983695/
Abstract

Accurate prediction of the enzyme comission (EC) numbers for chemical reactions is essential for the understanding and manipulation of enzyme functions, biocatalytic processes and biosynthetic planning. A number of machine leanring (ML)-based models have been developed to classify enzymatic reactions, showing great advantages over costly and long-winded experimental verifications. However, the prediction accuracy for most available models trained on the records of chemical reactions without specifying the enzymatic catalysts is rather limited. In this study, we introduced BEC-Pred, a BERT-based multiclassification model, for predicting EC numbers associated with reactions. Leveraging transfer learning, our approach achieves precise forecasting across a wide variety of Enzyme Commission (EC) numbers solely through analysis of the SMILES sequences of substrates and products. BEC-Pred model outperformed other sequence and graph-based ML methods, attaining a higher accuracy of 91.6%, surpassing them by 5.5%, and exhibiting superior F1 scores with improvements of 6.6% and 6.0%, respectively. The enhanced performance highlights the potential of BEC-Pred to serve as a reliable foundational tool to accelerate the cutting-edge research in synthetic biology and drug metabolism. Moreover, we discussed a few examples on how BEC-Pred could accurately predict the enzymatic classification for the Novozym 435-induced hydrolysis and lipase efficient catalytic synthesis. We anticipate that BEC-Pred will have a positive impact on the progression of enzymatic research.

摘要

准确预测化学反应的酶委员会(EC)编号对于理解和操控酶的功能、生物催化过程及生物合成规划至关重要。已经开发了许多基于机器学习(ML)的模型来对酶促反应进行分类,相较于成本高昂且冗长的实验验证展现出了巨大优势。然而,大多数基于未指定酶催化剂的化学反应记录训练的现有模型的预测准确性相当有限。在本研究中,我们引入了BEC-Pred,一种基于BERT的多分类模型,用于预测与反应相关的EC编号。利用迁移学习,我们的方法仅通过分析底物和产物的SMILES序列就能在各种酶委员会(EC)编号上实现精确预测。BEC-Pred模型优于其他基于序列和图的ML方法,达到了91.6%的更高准确率,比它们高出5.5%,并且分别在F1分数上有6.6%和6.0%的提升,展现出卓越性能。性能的提升凸显了BEC-Pred作为可靠基础工具加速合成生物学和药物代谢前沿研究的潜力。此外,我们讨论了一些关于BEC-Pred如何能够准确预测诺维信435诱导的水解和脂肪酶高效催化合成的酶分类的例子。我们预计BEC-Pred将对酶学研究的进展产生积极影响。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/8177da39a2a5/13321_2024_827_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/918826f27d3a/13321_2024_827_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/7ecfc7a14afd/13321_2024_827_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/9eb49e3d90f5/13321_2024_827_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/c02076060b08/13321_2024_827_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/be23c72bb615/13321_2024_827_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/8177da39a2a5/13321_2024_827_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/918826f27d3a/13321_2024_827_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/7ecfc7a14afd/13321_2024_827_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/9eb49e3d90f5/13321_2024_827_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/c02076060b08/13321_2024_827_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/be23c72bb615/13321_2024_827_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c2af/10983695/8177da39a2a5/13321_2024_827_Fig6_HTML.jpg

相似文献

1
A general model for predicting enzyme functions based on enzymatic reactions.一种基于酶促反应预测酶功能的通用模型。
J Cheminform. 2024 Mar 31;16(1):38. doi: 10.1186/s13321-024-00827-y.
2
Exploration and Evaluation of Machine Learning-Based Models for Predicting Enzymatic Reactions.基于机器学习的酶反应预测模型的探索与评估。
J Chem Inf Model. 2020 Mar 23;60(3):1833-1843. doi: 10.1021/acs.jcim.9b00877. Epub 2020 Feb 27.
3
Accurately predicting enzyme functions through geometric graph learning on ESMFold-predicted structures.通过在 ESMFold 预测结构上进行几何图形学习,准确预测酶功能。
Nat Commun. 2024 Sep 18;15(1):8180. doi: 10.1038/s41467-024-52533-w.
4
5
Reaction graph kernels predict EC numbers of unknown enzymatic reactions in plant secondary metabolism.反应图核函数可预测植物次生代谢中未知酶反应的 EC 编号。
BMC Bioinformatics. 2010 Jan 18;11 Suppl 1(Suppl 1):S31. doi: 10.1186/1471-2105-11-S1-S31.
6
ECPred: a tool for the prediction of the enzymatic functions of protein sequences based on the EC nomenclature.ECPred:一种基于 EC 命名法预测蛋白质序列酶功能的工具。
BMC Bioinformatics. 2018 Sep 21;19(1):334. doi: 10.1186/s12859-018-2368-y.
7
Comprehensive Machine Learning Prediction of Extensive Enzymatic Reactions.全面的机器学习预测广泛的酶反应。
J Phys Chem B. 2022 Sep 15;126(36):6762-6770. doi: 10.1021/acs.jpcb.2c03287. Epub 2022 Sep 2.
8
Enzyme Commission Number Prediction and Benchmarking with Hierarchical Dual-core Multitask Learning Framework.基于分层双核多任务学习框架的酶委员会编号预测与基准测试
Research (Wash D C). 2023 May 31;6:0153. doi: 10.34133/research.0153. eCollection 2023.
9
BERT-5mC: an interpretable model for predicting 5-methylcytosine sites of DNA based on BERT.BERT-5mC:一种基于 BERT 的可解释模型,用于预测 DNA 的 5-甲基胞嘧啶位点。
PeerJ. 2023 Dec 8;11:e16600. doi: 10.7717/peerj.16600. eCollection 2023.
10
Deep learning enables high-quality and high-throughput prediction of enzyme commission numbers.深度学习可实现酶委员会编号的高质量和高通量预测。
Proc Natl Acad Sci U S A. 2019 Jul 9;116(28):13996-14001. doi: 10.1073/pnas.1821905116. Epub 2019 Jun 20.

引用本文的文献

1
RC-GNN: A predictive model of enzyme-reaction pairs.RC-GNN:酶-反应对的预测模型。
bioRxiv. 2025 Jun 27:2025.06.22.660952. doi: 10.1101/2025.06.22.660952.

本文引用的文献

1
Enzyme function prediction using contrastive learning.使用对比学习进行酶功能预测。
Science. 2023 Mar 31;379(6639):1358-1363. doi: 10.1126/science.adf2465. Epub 2023 Mar 30.
2
Graph neural networks for materials science and chemistry.用于材料科学与化学的图神经网络
Commun Mater. 2022;3(1):93. doi: 10.1038/s43246-022-00315-6. Epub 2022 Nov 26.
3
Comprehensive Machine Learning Prediction of Extensive Enzymatic Reactions.全面的机器学习预测广泛的酶反应。
J Phys Chem B. 2022 Sep 15;126(36):6762-6770. doi: 10.1021/acs.jpcb.2c03287. Epub 2022 Sep 2.
4
Deep learning driven biosynthetic pathways navigation for natural products with BioNavi-NP.深度学习驱动的生物合成途径导航用于天然产物的 BioNavi-NP。
Nat Commun. 2022 Jun 10;13(1):3342. doi: 10.1038/s41467-022-30970-9.
5
Reaction classification and yield prediction using the differential reaction fingerprint DRFP.使用微分反应指纹DRFP进行反应分类和产率预测。
Digit Discov. 2022 Jan 21;1(2):91-97. doi: 10.1039/d1dd00006c. eCollection 2022 Apr 11.
6
Biocatalysed synthesis planning using data-driven learning.基于数据驱动学习的生物催化合成规划。
Nat Commun. 2022 Feb 18;13(1):964. doi: 10.1038/s41467-022-28536-w.
7
Quantum Mechanics and Machine Learning Synergies: Graph Attention Neural Networks to Predict Chemical Reactivity.量子力学与机器学习协同作用:图注意神经网络预测化学反应性。
J Chem Inf Model. 2022 May 9;62(9):2121-2132. doi: 10.1021/acs.jcim.1c01400. Epub 2022 Jan 12.
8
Machine Learning of Reaction Properties via Learned Representations of the Condensed Graph of Reaction.通过反应凝聚图的学习表示来进行反应性质的机器学习。
J Chem Inf Model. 2022 May 9;62(9):2101-2110. doi: 10.1021/acs.jcim.1c00975. Epub 2021 Nov 4.
9
Novel oleate hydratases and potential biotechnological applications.新型油酸盐水解酶及其潜在的生物技术应用。
Appl Microbiol Biotechnol. 2021 Aug;105(16-17):6159-6172. doi: 10.1007/s00253-021-11465-x. Epub 2021 Aug 5.
10
One-Step Synthesis of 4-Octyl Itaconate through the Structure Control of Lipase.通过脂肪酶的结构控制一步合成 4-辛烯酸正丁酯。
J Org Chem. 2021 Jun 18;86(12):7895-7903. doi: 10.1021/acs.joc.0c02995. Epub 2021 Jun 4.