利用对接结构的多实例学习进行蛋白质-配体结合亲和力预测。

Protein-ligand binding affinity prediction using multi-instance learning with docking structures.

作者信息

Kim Hyojin, Shim Heesung, Ranganath Aditya, He Stewart, Stevenson Garrett, Allen Jonathan E

机构信息

Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, Livermore, CA, United States.

Biosciences and Biotechnology Division, Lawrence Livermore National Laboratory, Livermore, CA, United States.

出版信息

Front Pharmacol. 2025 Jan 3;15:1518875. doi: 10.3389/fphar.2024.1518875. eCollection 2024.

DOI:10.3389/fphar.2024.1518875

PMID:39830331

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11738626/

Abstract

INTRODUCTION

Recent advances in 3D structure-based deep learning approaches demonstrate improved accuracy in predicting protein-ligand binding affinity in drug discovery. These methods complement physics-based computational modeling such as molecular docking for virtual high-throughput screening. Despite recent advances and improved predictive performance, most methods in this category primarily rely on utilizing co-crystal complex structures and experimentally measured binding affinities as both input and output data for model training. Nevertheless, co-crystal complex structures are not readily available and the inaccurate predicted structures from molecular docking can degrade the accuracy of the machine learning methods.

METHODS

We introduce a novel structure-based inference method utilizing multiple molecular docking poses for each complex entity. Our proposed method employs multi-instance learning with an attention network to predict binding affinity from a collection of docking poses.

RESULTS

We validate our method using multiple datasets, including PDBbind and compounds targeting the main protease of SARS-CoV-2. The results demonstrate that our method leveraging docking poses is competitive with other state-of-the-art inference models that depend on co-crystal structures.

DISCUSSION

This method offers binding affinity prediction without requiring co-crystal structures, thereby increasing its applicability to protein targets lacking such data.

摘要

引言

基于3D结构的深度学习方法的最新进展表明，在药物发现中预测蛋白质-配体结合亲和力的准确性有所提高。这些方法补充了基于物理的计算建模，如用于虚拟高通量筛选的分子对接。尽管有最新进展且预测性能有所改善，但此类方法中的大多数主要依赖于利用共晶复合物结构和实验测量的结合亲和力作为模型训练的输入和输出数据。然而，共晶复合物结构并不容易获得，并且分子对接产生的预测结构不准确会降低机器学习方法的准确性。

方法

我们引入了一种新颖的基于结构的推理方法，为每个复合物实体利用多个分子对接姿态。我们提出的方法采用多实例学习和注意力网络，从对接姿态集合中预测结合亲和力。

结果

我们使用多个数据集验证了我们的方法，包括PDBbind和针对SARS-CoV-2主要蛋白酶的化合物。结果表明，我们利用对接姿态的方法与其他依赖共晶结构的先进推理模型具有竞争力。

讨论

该方法无需共晶结构即可进行结合亲和力预测，从而提高了其对缺乏此类数据的蛋白质靶点的适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4039/11738626/aa6081c8f114/fphar-15-1518875-g001.jpg

相似文献

Protein-ligand binding affinity prediction using multi-instance learning with docking structures.利用对接结构的多实例学习进行蛋白质-配体结合亲和力预测。

Front Pharmacol. 2025 Jan 3;15:1518875. doi: 10.3389/fphar.2024.1518875. eCollection 2024.

Machine learning in computational docking.计算对接中的机器学习。

Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.

DENVIS: Scalable and High-Throughput Virtual Screening Using Graph Neural Networks with Atomic and Surface Protein Pocket Features.DENVIS：使用具有原子和表面蛋白口袋特征的图神经网络进行可扩展的高通量虚拟筛选。

J Chem Inf Model. 2022 Oct 10;62(19):4642-4659. doi: 10.1021/acs.jcim.2c01057. Epub 2022 Sep 26.

SG-ML-PLAP: A structure-guided machine learning-based scoring function for protein-ligand binding affinity prediction.SG-ML-PLAP：一种基于结构引导的机器学习蛋白质-配体结合亲和力预测评分函数。

Protein Sci. 2025 Jan;34(1):e5257. doi: 10.1002/pro.5257.

Accurate prediction of protein-ligand interactions by combining physical energy functions and graph-neural networks.通过结合物理能量函数和图神经网络准确预测蛋白质-配体相互作用。

J Cheminform. 2024 Nov 4;16(1):121. doi: 10.1186/s13321-024-00912-2.

PLANET: A Multi-objective Graph Neural Network Model for Protein-Ligand Binding Affinity Prediction.PLANET：一种用于蛋白质-配体结合亲和力预测的多目标图神经网络模型。

J Chem Inf Model. 2024 Apr 8;64(7):2205-2220. doi: 10.1021/acs.jcim.3c00253. Epub 2023 Jun 15.

The Impact of Crystallographic Data for the Development of Machine Learning Models to Predict Protein-Ligand Binding Affinity.晶体学数据对开发用于预测蛋白质-配体结合亲和力的机器学习模型的影响。

Curr Med Chem. 2021 Oct 27;28(34):7006-7022. doi: 10.2174/0929867328666210210121320.

Improved Protein-Ligand Binding Affinity Prediction with Structure-Based Deep Fusion Inference.基于结构的深度融合推理提高蛋白-配体结合亲和力预测。

J Chem Inf Model. 2021 Apr 26;61(4):1583-1592. doi: 10.1021/acs.jcim.0c01306. Epub 2021 Mar 23.

A New Hybrid Neural Network Deep Learning Method for Protein-Ligand Binding Affinity Prediction and De Novo Drug Design.一种用于蛋白质-配体结合亲和力预测和从头药物设计的新型混合神经网络深度学习方法。

Int J Mol Sci. 2022 Nov 11;23(22):13912. doi: 10.3390/ijms232213912.

SCORCH: Improving structure-based virtual screening with machine learning classifiers, data augmentation, and uncertainty estimation.SCORCH：利用机器学习分类器、数据增强和不确定性估计改进基于结构的虚拟筛选。

J Adv Res. 2023 Apr;46:135-147. doi: 10.1016/j.jare.2022.07.001. Epub 2022 Jul 25.

引用本文的文献

In search of a photoswitchable drug for serotonin receptors: a molecular dynamics simulation study.寻找用于血清素受体的光开关药物：分子动力学模拟研究

RSC Adv. 2025 Jun 23;15(26):21077-21088. doi: 10.1039/d5ra03789a. eCollection 2025 Jun 16.

本文引用的文献

Guided Docking as a Data Generation Approach Facilitates Structure-Based Machine Learning on Kinases.引导对接作为一种数据生成方法，可促进激酶的基于结构的机器学习。

J Chem Inf Model. 2024 May 27;64(10):4009-4020. doi: 10.1021/acs.jcim.4c00055. Epub 2024 May 15.

graphLambda: Fusion Graph Neural Networks for Binding Affinity Prediction.图拉宾：用于结合亲和力预测的融合图神经网络。

J Chem Inf Model. 2024 Apr 8;64(7):2323-2330. doi: 10.1021/acs.jcim.3c00771. Epub 2024 Feb 17.

Pretrained transformer models for predicting the withdrawal of drugs from the market.用于预测药物退出市场的预训练转换器模型。

Bioinformatics. 2023 Aug 1;39(8). doi: 10.1093/bioinformatics/btad519.

SS-GNN: A Simple-Structured Graph Neural Network for Affinity Prediction.SS-GNN：一种用于亲和力预测的结构简单的图神经网络。

ACS Omega. 2023 Jun 15;8(25):22496-22507. doi: 10.1021/acsomega.3c00085. eCollection 2023 Jun 27.

A Small Step Toward Generalizability: Training a Machine Learning Scoring Function for Structure-Based Virtual Screening.迈向可泛化性的一小步：基于结构的虚拟筛选的机器学习打分函数的训练。

J Chem Inf Model. 2023 May 22;63(10):2960-2974. doi: 10.1021/acs.jcim.3c00322. Epub 2023 May 11.

HAC-Net: A Hybrid Attention-Based Convolutional Neural Network for Highly Accurate Protein-Ligand Binding Affinity Prediction.HAC-Net：一种基于混合注意力的卷积神经网络，用于高精度蛋白质-配体结合亲和力预测。

J Chem Inf Model. 2023 Apr 10;63(7):1947-1960. doi: 10.1021/acs.jcim.3c00251. Epub 2023 Mar 29.

Pose Classification Using Three-Dimensional Atomic Structure-Based Neural Networks Applied to Ion Channel-Ligand Docking.基于三维原子结构的神经网络在离子通道配体对接中的姿势分类应用。

J Chem Inf Model. 2022 May 23;62(10):2301-2315. doi: 10.1021/acs.jcim.1c01510. Epub 2022 Apr 21.

InteractionGraphNet: A Novel and Efficient Deep Graph Representation Learning Framework for Accurate Protein-Ligand Interaction Predictions.InteractionGraphNet：一种新颖高效的深度图表示学习框架，用于准确预测蛋白质-配体相互作用。

J Med Chem. 2021 Dec 23;64(24):18209-18232. doi: 10.1021/acs.jmedchem.1c01830. Epub 2021 Dec 8.

AutoDock Vina 1.2.0: New Docking Methods, Expanded Force Field, and Python Bindings.AutoDock Vina 1.2.0：新的对接方法、扩展的力场及Python绑定

J Chem Inf Model. 2021 Aug 23;61(8):3891-3898. doi: 10.1021/acs.jcim.1c00203. Epub 2021 Jul 19.

GNINA 1.0: molecular docking with deep learning.GNINA 1.0：基于深度学习的分子对接

J Cheminform. 2021 Jun 9;13(1):43. doi: 10.1186/s13321-021-00522-2.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用对接结构的多实例学习进行蛋白质-配体结合亲和力预测。

Protein-ligand binding affinity prediction using multi-instance learning with docking structures.

作者信息

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

DISCUSSION

引言

方法

结果

讨论

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献