• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

生物结构网络:基于结构的迁移学习网络用于预测生物催化剂功能。

BioStructNet: Structure-Based Network with Transfer Learning for Predicting Biocatalyst Functions.

作者信息

Wang Xiangwen, Zhou Jiahui, Mueller Jane, Quinn Derek, Carvalho Alexandra, Moody Thomas S, Huang Meilan

机构信息

School of Chemistry and Chemical Engineering, Queen's University Belfast, BT9 5AG Belfast, Northern Ireland, U.K.

Department of Biocatalysis and Isotope Chemistry, Almac Sciences, BT63 5QD Craigavon, Northern Ireland, U.K.

出版信息

J Chem Theory Comput. 2025 Jan 14;21(1):474-490. doi: 10.1021/acs.jctc.4c01391. Epub 2024 Dec 20.

DOI:10.1021/acs.jctc.4c01391
PMID:39705058
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11736791/
Abstract

Enzyme-substrate interactions are essential to both biological processes and industrial applications. Advanced machine learning techniques have significantly accelerated biocatalysis research, revolutionizing the prediction of biocatalytic activities and facilitating the discovery of novel biocatalysts. However, the limited availability of data for specific enzyme functions, such as conversion efficiency and stereoselectivity, presents challenges for prediction accuracy. In this study, we developed BioStructNet, a structure-based deep learning network that integrates both protein and ligand structural data to capture the complexity of enzyme-substrate interactions. Benchmarking studies with different algorithms showed the enhanced predictive accuracy of BioStructNet. To further optimize the prediction accuracy for the small data set, we implemented transfer learning in the framework, training a source model on a large data set and fine-tuning it on a small, function-specific data set, using the CalB data set as a case study. The model performance was validated by comparing the attention heat maps generated by the BioStructNet interaction module with the enzyme-substrate interactions revealed from molecular dynamics simulations of enzyme-substrate complexes. BioStructNet would accelerate the discovery of functional enzymes for industrial use, particularly in cases where the training data sets for machine learning are small.

摘要

酶 - 底物相互作用对生物过程和工业应用都至关重要。先进的机器学习技术显著加速了生物催化研究,彻底改变了生物催化活性的预测,并促进了新型生物催化剂的发现。然而,特定酶功能的数据(如转化效率和立体选择性)可用性有限,这给预测准确性带来了挑战。在本研究中,我们开发了BioStructNet,这是一种基于结构的深度学习网络,它整合了蛋白质和配体结构数据,以捕捉酶 - 底物相互作用的复杂性。使用不同算法进行的基准研究表明BioStructNet具有更高的预测准确性。为了进一步优化小数据集的预测准确性,我们在该框架中实施了迁移学习,以一个大数据集训练源模型,并以功能特定的小数据集对其进行微调,以卡尔酵母脂肪酶(CalB)数据集为例进行研究。通过将BioStructNet相互作用模块生成的注意力热图与酶 - 底物复合物分子动力学模拟揭示的酶 - 底物相互作用进行比较,验证了模型性能。BioStructNet将加速工业用功能酶的发现,特别是在机器学习训练数据集较小的情况下。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/570e40437b0c/ct4c01391_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/4750882d580b/ct4c01391_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/2cf5c7e9c372/ct4c01391_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/2631d2726cd6/ct4c01391_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/8eb79c953308/ct4c01391_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/e78e981d98a6/ct4c01391_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/570e40437b0c/ct4c01391_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/4750882d580b/ct4c01391_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/2cf5c7e9c372/ct4c01391_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/2631d2726cd6/ct4c01391_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/8eb79c953308/ct4c01391_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/e78e981d98a6/ct4c01391_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/da1d/11736791/570e40437b0c/ct4c01391_0006.jpg

相似文献

1
BioStructNet: Structure-Based Network with Transfer Learning for Predicting Biocatalyst Functions.生物结构网络:基于结构的迁移学习网络用于预测生物催化剂功能。
J Chem Theory Comput. 2025 Jan 14;21(1):474-490. doi: 10.1021/acs.jctc.4c01391. Epub 2024 Dec 20.
2
ALDELE: All-Purpose Deep Learning Toolkits for Predicting the Biocatalytic Activities of Enzymes.ALDELE:用于预测酶生物催化活性的通用深度学习工具包。
J Chem Inf Model. 2024 Apr 22;64(8):3123-3139. doi: 10.1021/acs.jcim.4c00058. Epub 2024 Apr 4.
3
Navigating the landscape of enzyme design: from molecular simulations to machine learning.探索酶设计的领域:从分子模拟到机器学习。
Chem Soc Rev. 2024 Aug 12;53(16):8202-8239. doi: 10.1039/d4cs00196f.
4
deepNEC: a novel alignment-free tool for the identification and classification of nitrogen biochemical network-related enzymes using deep learning.深度 NEC:一种新颖的无对齐工具,用于使用深度学习识别和分类与氮生化网络相关的酶。
Brief Bioinform. 2022 May 13;23(3). doi: 10.1093/bib/bbac071.
5
Deep Learning-Driven Insights into Enzyme-Substrate Interaction Discovery.深度学习驱动的酶-底物相互作用发现洞察
J Chem Inf Model. 2025 Jan 13;65(1):187-200. doi: 10.1021/acs.jcim.4c01801. Epub 2024 Dec 25.
6
Protein representations: Encoding biological information for machine learning in biocatalysis.蛋白质表示:生物催化机器学习中的生物信息编码。
Biotechnol Adv. 2024 Dec;77:108459. doi: 10.1016/j.biotechadv.2024.108459. Epub 2024 Oct 2.
7
Parallel Convolutional Contrastive Learning Method for Enzyme Function Prediction.用于酶功能预测的并行卷积对比学习方法
IEEE/ACM Trans Comput Biol Bioinform. 2024 Nov-Dec;21(6):2604-2609. doi: 10.1109/TCBB.2024.3447037. Epub 2024 Dec 10.
8
[Application of bioinformatics in researches of industrial biocatalysis].生物信息学在工业生物催化研究中的应用
Sheng Wu Gong Cheng Xue Bao. 2004 May;20(3):325-31.
9
Artificial intelligence to deep learning: machine intelligence approach for drug discovery.人工智能到深度学习:药物发现的机器智能方法。
Mol Divers. 2021 Aug;25(3):1315-1360. doi: 10.1007/s11030-021-10217-3. Epub 2021 Apr 12.
10
Leveraging conformal prediction to annotate enzyme function space with limited false positives.利用共形预测来注释酶功能空间,以减少假阳性。
PLoS Comput Biol. 2024 May 29;20(5):e1012135. doi: 10.1371/journal.pcbi.1012135. eCollection 2024 May.

本文引用的文献

1
Navigating the landscape of enzyme design: from molecular simulations to machine learning.探索酶设计的领域:从分子模拟到机器学习。
Chem Soc Rev. 2024 Aug 12;53(16):8202-8239. doi: 10.1039/d4cs00196f.
2
Docking Score ML: Target-Specific Machine Learning Models Improving Docking-Based Virtual Screening in 155 Targets.对接评分 ML:针对 155 个靶标,基于对接的虚拟筛选的目标特异性机器学习模型的改进。
J Chem Inf Model. 2024 Jul 22;64(14):5413-5426. doi: 10.1021/acs.jcim.4c00072. Epub 2024 Jul 3.
3
Exploring the Conformational Ensembles of Protein-Protein Complex with Transformer-Based Generative Model.
基于生成式模型的蛋白质-蛋白质复合物构象集合研究。
J Chem Theory Comput. 2024 Jun 11;20(11):4469-4480. doi: 10.1021/acs.jctc.4c00255. Epub 2024 May 30.
4
ALDELE: All-Purpose Deep Learning Toolkits for Predicting the Biocatalytic Activities of Enzymes.ALDELE:用于预测酶生物催化活性的通用深度学习工具包。
J Chem Inf Model. 2024 Apr 22;64(8):3123-3139. doi: 10.1021/acs.jcim.4c00058. Epub 2024 Apr 4.
5
An overview of recent advances and challenges in predicting compound-protein interaction (CPI).预测化合物-蛋白质相互作用(CPI)的最新进展与挑战概述。
Med Rev (2021). 2023 Oct 6;3(6):465-486. doi: 10.1515/mr-2023-0030. eCollection 2023 Dec.
6
Sequence-based drug design as a concept in computational drug design.基于序列的药物设计作为计算药物设计中的一个概念。
Nat Commun. 2023 Jul 14;14(1):4217. doi: 10.1038/s41467-023-39856-w.
7
A Thermostable Lipase Isolated from Strain 7 Degrades Ɛ-Polycaprolactone.从菌株7中分离出的一种耐热脂肪酶可降解ε-聚己内酯。
BioTech (Basel). 2023 Mar 14;12(1):23. doi: 10.3390/biotech12010023.
8
Drug-target affinity prediction using graph neural network and contact maps.使用图神经网络和接触图进行药物-靶点亲和力预测。
RSC Adv. 2020 Jun 1;10(35):20701-20712. doi: 10.1039/d0ra02297g. eCollection 2020 May 27.
9
Compound-protein interaction prediction by deep learning: Databases, descriptors and models.基于深度学习的复合蛋白相互作用预测:数据库、描述符与模型。
Drug Discov Today. 2022 May;27(5):1350-1366. doi: 10.1016/j.drudis.2022.02.023. Epub 2022 Mar 3.
10
MGraphDTA: deep multiscale graph neural network for explainable drug-target binding affinity prediction.MGraphDTA:用于可解释药物-靶点结合亲和力预测的深度多尺度图神经网络
Chem Sci. 2022 Jan 5;13(3):816-833. doi: 10.1039/d1sc05180f. eCollection 2022 Jan 19.