• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于分子动力学模拟的时空学习用于蛋白质-配体结合亲和力预测。

Spatio-temporal learning from molecular dynamics simulations for protein-ligand binding affinity prediction.

作者信息

Libouban Pierre-Yves, Parisel Camille, Song Maxime, Aci-Sèche Samia, Gómez-Tamayo Jose C, Tresadern Gary, Bonnet Pascal

机构信息

Institute of Organic and Analytical Chemistry (ICOA), UMR7311, Université d'Orléans, CNRS, Pôle de chimie rue de Chartres, 45067 Orléans Cedex 2, France.

Institute for Development and Resources in Intensive Scientific Computing (IDRIS), CNRS, Rue John Von Neumann, 91403 Orsay Cedex, France.

出版信息

Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf429.

DOI:10.1093/bioinformatics/btaf429
PMID:40828893
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12371333/
Abstract

MOTIVATION

The field of protein-ligand binding affinity prediction continues to face significant challenges. While deep learning (DL) models can leverage 3D structural information of protein-ligand complexes, they perform well only on heavily biased test sets containing information leaked from training sets. This lack of generalization arises from the limited availability of training data and the models' inability to effectively learn from protein-ligand interactions. Since these interactions are inherently time-dependent, molecular dynamics (MD) simulations offer a potential solution by incorporating conformational sampling and providing interaction rich information.

RESULTS

We have developed MDbind, a dataset comprising 63 000 simulations of protein-ligand interactions, along with novel neural networks capable of learning from these simulations to predict binding affinity. By utilizing MD as data augmentation, our models achieved state-of-the-art performance on the PDBbind v.2016 core set and an external test set, the free energy perturbation (FEP) dataset. Additionally, when trained on the full MD simulations, the models demonstrated less biased predictions.

AVAILABILITY AND IMPLEMENTATION

The code for neural networks is available at https://github.com/ICOA-SBC/MD_DL_BA. The models, the results and the training/validation/test sets are available for download at https://zenodo.org/records/10390550. The MDbind trajectories are being transferred to the MDDB: https://mmb-dev.mddbr.eu/#/browse? option=mdbind.

摘要

动机

蛋白质-配体结合亲和力预测领域仍然面临重大挑战。虽然深度学习(DL)模型可以利用蛋白质-配体复合物的3D结构信息,但它们仅在包含从训练集中泄露信息的高度有偏测试集上表现良好。这种缺乏泛化性的情况源于训练数据的有限可用性以及模型无法有效地从蛋白质-配体相互作用中学习。由于这些相互作用本质上是时间依赖性的,分子动力学(MD)模拟通过纳入构象采样并提供丰富的相互作用信息提供了一种潜在的解决方案。

结果

我们开发了MDbind,这是一个包含63000个蛋白质-配体相互作用模拟的数据集,以及能够从这些模拟中学习以预测结合亲和力的新型神经网络。通过将MD用作数据增强,我们的模型在PDBbind v.2016核心集和外部测试集自由能扰动(FEP)数据集上取得了领先的性能。此外,当在完整的MD模拟上进行训练时,模型表现出偏差较小的预测。

可用性和实现

神经网络的代码可在https://github.com/ICOA-SBC/MD_DL_BA获取。模型、结果以及训练/验证/测试集可在https://zenodo.org/records/10390550下载。MDbind轨迹正在转移到MDDB:https://mmb-dev.mddbr.eu/#/browse?option=mdbind。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/ace13fae8754/btaf429f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/a427a9c80796/btaf429f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/92e48eeffd65/btaf429f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/ef1bca8f4a32/btaf429f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/3eaac162c7ac/btaf429f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/e76cc2956e04/btaf429f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/acf0efcd455e/btaf429f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/d51dfc335cf7/btaf429f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/ace13fae8754/btaf429f7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/a427a9c80796/btaf429f8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/92e48eeffd65/btaf429f1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/ef1bca8f4a32/btaf429f2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/3eaac162c7ac/btaf429f3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/e76cc2956e04/btaf429f4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/acf0efcd455e/btaf429f5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/d51dfc335cf7/btaf429f6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8489/12371333/ace13fae8754/btaf429f7.jpg

相似文献

1
Spatio-temporal learning from molecular dynamics simulations for protein-ligand binding affinity prediction.基于分子动力学模拟的时空学习用于蛋白质-配体结合亲和力预测。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf429.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
A 4D tensor-enhanced multi-dimensional convolutional neural network for accurate prediction of protein-ligand binding affinity.一种用于准确预测蛋白质-配体结合亲和力的4D张量增强多维卷积神经网络。
Mol Divers. 2024 Dec 23. doi: 10.1007/s11030-024-11044-y.
4
Hybrid protein-ligand binding residue prediction with protein language models: does the structure matter?利用蛋白质语言模型进行混合蛋白质-配体结合残基预测:结构重要吗?
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf431.
5
EM-PLA: environment-aware heterogeneous graph-based multimodal protein-ligand binding affinity prediction.EM-PLA:基于环境感知异构图的多模态蛋白质-配体结合亲和力预测
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf298.
6
Neural Network Models for Prediction of Biological Activity using Molecular Dynamics Data: A Case of Photoswitchable Peptides.使用分子动力学数据预测生物活性的神经网络模型:以光开关肽为例。
Mol Inform. 2025 Jul;44(7):e70001. doi: 10.1002/minf.70001.
7
Accurate PROTAC-targeted degradation prediction with DegradeMaster.使用DegradeMaster进行准确的PROTAC靶向降解预测。
Bioinformatics. 2025 Jul 1;41(Supplement_1):i342-i351. doi: 10.1093/bioinformatics/btaf191.
8
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
9
Development and Validation of a Convolutional Neural Network Model to Predict a Pathologic Fracture in the Proximal Femur Using Abdomen and Pelvis CT Images of Patients With Advanced Cancer.利用晚期癌症患者腹部和骨盆 CT 图像建立卷积神经网络模型预测股骨近端病理性骨折的研究
Clin Orthop Relat Res. 2023 Nov 1;481(11):2247-2256. doi: 10.1097/CORR.0000000000002771. Epub 2023 Aug 23.
10
Healthcare workers' informal uses of mobile phones and other mobile devices to support their work: a qualitative evidence synthesis.医护人员非正规使用手机和其他移动设备来支持工作:定性证据综合评价。
Cochrane Database Syst Rev. 2024 Aug 27;8(8):CD015705. doi: 10.1002/14651858.CD015705.pub2.

本文引用的文献

1
From Static to Dynamic Structures: Improving Binding Affinity Prediction with Graph-Based Deep Learning.从静态结构到动态结构:基于图的深度学习提高结合亲和力预测。
Adv Sci (Weinh). 2024 Oct;11(40):e2405404. doi: 10.1002/advs.202405404. Epub 2024 Aug 29.
2
MISATO: machine learning dataset of protein-ligand complexes for structure-based drug discovery.MISATO:基于结构的药物发现的蛋白质-配体复合物的机器学习数据集。
Nat Comput Sci. 2024 May;4(5):367-378. doi: 10.1038/s43588-024-00627-2. Epub 2024 May 10.
3
PoseBusters: AI-based docking methods fail to generate physically valid poses or generalise to novel sequences.
PoseBusters:基于人工智能的对接方法无法生成符合物理原理的构象,也无法推广到新序列。
Chem Sci. 2023 Dec 13;15(9):3130-3139. doi: 10.1039/d3sc04185a. eCollection 2024 Feb 28.
4
The Impact of Data on Structure-Based Binding Affinity Predictions Using Deep Neural Networks.基于深度神经网络的数据对结构结合亲和力预测的影响。
Int J Mol Sci. 2023 Nov 9;24(22):16120. doi: 10.3390/ijms242216120.
5
Enhancing Hit Discovery in Virtual Screening through Absolute Protein-Ligand Binding Free-Energy Calculations.通过绝对蛋白-配体结合自由能计算提高虚拟筛选中的命中发现。
J Chem Inf Model. 2023 May 22;63(10):3171-3185. doi: 10.1021/acs.jcim.3c00013. Epub 2023 May 11.
6
Can molecular dynamics simulations improve predictions of protein-ligand binding affinity with machine learning?分子动力学模拟能否通过机器学习改进对蛋白质-配体结合亲和力的预测?
Brief Bioinform. 2023 Mar 19;24(2). doi: 10.1093/bib/bbad008.
7
Pre-Training of Equivariant Graph Matching Networks with Conformation Flexibility for Drug Binding.具有构象灵活性的同伦图匹配网络的预训练用于药物结合。
Adv Sci (Weinh). 2022 Nov;9(33):e2203796. doi: 10.1002/advs.202203796. Epub 2022 Oct 6.
8
Scoring Functions for Protein-Ligand Binding Affinity Prediction using Structure-Based Deep Learning: A Review.基于结构的深度学习预测蛋白质-配体结合亲和力的评分函数综述
Front Bioinform. 2022 Jun 17;2. doi: 10.3389/fbinf.2022.885983.
9
PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications.PLAS-5k:用于机器学习应用的分子动力学中蛋白质-配体亲和力的数据集。
Sci Data. 2022 Sep 7;9(1):548. doi: 10.1038/s41597-022-01631-9.
10
On the Frustration to Predict Binding Affinities from Protein-Ligand Structures with Deep Neural Networks.从蛋白质-配体结构用深度神经网络预测结合亲和力的挫折。
J Med Chem. 2022 Jun 9;65(11):7946-7958. doi: 10.1021/acs.jmedchem.2c00487. Epub 2022 May 24.