SMPLIP评分：通过简单且可解释的实时相互作用指纹模式描述符预测配体结合亲和力。

SMPLIP-Score: predicting ligand binding affinity from simple and interpretable on-the-fly interaction fingerprint pattern descriptors.

作者信息

Kumar Surendra, Kim Mi-Hyun

机构信息

Gachon Institute of Pharmaceutical Science & Department of Pharmacy, College of Pharmacy, Gachon University, 191 Hambakmoeiro, Yeonsu-gu, Incheon, Republic of Korea.

出版信息

J Cheminform. 2021 Mar 25;13(1):28. doi: 10.1186/s13321-021-00507-1.

DOI:10.1186/s13321-021-00507-1

PMID:33766140

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7993508/

Abstract

In drug discovery, rapid and accurate prediction of protein-ligand binding affinities is a pivotal task for lead optimization with acceptable on-target potency as well as pharmacological efficacy. Furthermore, researchers hope for a high correlation between docking score and pose with key interactive residues, although scoring functions as free energy surrogates of protein-ligand complexes have failed to provide collinearity. Recently, various machine learning or deep learning methods have been proposed to overcome the drawbacks of scoring functions. Despite being highly accurate, their featurization process is complex and the meaning of the embedded features cannot directly be interpreted by human recognition without an additional feature analysis. Here, we propose SMPLIP-Score (Substructural Molecular and Protein-Ligand Interaction Pattern Score), a direct interpretable predictor of absolute binding affinity. Our simple featurization embeds the interaction fingerprint pattern on the ligand-binding site environment and molecular fragments of ligands into an input vectorized matrix for learning layers (random forest or deep neural network). Despite their less complex features than other state-of-the-art models, SMPLIP-Score achieved comparable performance, a Pearson's correlation coefficient up to 0.80, and a root mean square error up to 1.18 in pK units with several benchmark datasets (PDBbind v.2015, Astex Diverse Set, CSAR NRC HiQ, FEP, PDBbind NMR, and CASF-2016). For this model, generality, predictive power, ranking power, and robustness were examined using direct interpretation of feature matrices for specific targets.

摘要

在药物研发中，快速准确地预测蛋白质-配体结合亲和力是先导化合物优化的关键任务，以确保具有可接受的靶标活性和药理疗效。此外，研究人员希望对接分数与关键相互作用残基的构象之间具有高度相关性，尽管作为蛋白质-配体复合物自由能替代物的评分函数未能提供共线性关系。最近，人们提出了各种机器学习或深度学习方法来克服评分函数的缺点。尽管这些方法非常准确，但其特征化过程复杂，而且在没有额外特征分析的情况下，嵌入特征的含义无法直接通过人类识别来解释。在此，我们提出了SMPLIP-Score（亚结构分子与蛋白质-配体相互作用模式评分），一种绝对结合亲和力的直接可解释预测器。我们简单的特征化方法将配体结合位点环境上的相互作用指纹模式和配体的分子片段嵌入到一个输入向量化矩阵中，用于学习层（随机森林或深度神经网络）。尽管SMPLIP-Score的特征比其他现有模型简单，但在几个基准数据集（PDBbind v.2015、Astex多样集、CSAR NRC HiQ、FEP、PDBbind NMR和CASF-2016）上，它取得了相当的性能，皮尔逊相关系数高达0.80，以pK单位计的均方根误差高达1.18。对于该模型，通过对特定靶标的特征矩阵进行直接解释，检验了其通用性、预测能力、排序能力和稳健性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c5a0/7993508/e36bf33ca11e/13321_2021_507_Fig1_HTML.jpg

相似文献

SMPLIP-Score: predicting ligand binding affinity from simple and interpretable on-the-fly interaction fingerprint pattern descriptors.SMPLIP评分：通过简单且可解释的实时相互作用指纹模式描述符预测配体结合亲和力。

J Cheminform. 2021 Mar 25;13(1):28. doi: 10.1186/s13321-021-00507-1.

Sfcnn: a novel scoring function based on 3D convolutional neural network for accurate and stable protein-ligand affinity prediction.Sfcnn：一种基于 3D 卷积神经网络的新型评分函数，用于准确稳定的蛋白质-配体亲和力预测。

BMC Bioinformatics. 2022 Jun 8;23(1):222. doi: 10.1186/s12859-022-04762-3.

Boosted neural networks scoring functions for accurate ligand docking and ranking.用于精确配体对接和排序的增强神经网络评分函数。

J Bioinform Comput Biol. 2018 Apr;16(2):1850004. doi: 10.1142/S021972001850004X. Epub 2018 Feb 4.

Machine learning in computational docking.计算对接中的机器学习。

Artif Intell Med. 2015 Mar;63(3):135-52. doi: 10.1016/j.artmed.2015.02.002. Epub 2015 Feb 16.

BgN-Score and BsN-Score: bagging and boosting based ensemble neural networks scoring functions for accurate binding affinity prediction of protein-ligand complexes.BgN分数和BsN分数：基于装袋法和提升法的集成神经网络评分函数，用于准确预测蛋白质-配体复合物的结合亲和力。

BMC Bioinformatics. 2015;16 Suppl 4(Suppl 4):S8. doi: 10.1186/1471-2105-16-S4-S8. Epub 2015 Feb 23.

A New Hybrid Neural Network Deep Learning Method for Protein-Ligand Binding Affinity Prediction and De Novo Drug Design.一种用于蛋白质-配体结合亲和力预测和从头药物设计的新型混合神经网络深度学习方法。

Int J Mol Sci. 2022 Nov 11;23(22):13912. doi: 10.3390/ijms232213912.

GB-score: Minimally designed machine learning scoring function based on distance-weighted interatomic contact features.GB评分：基于距离加权原子间接触特征的最小化设计机器学习评分函数。

Mol Inform. 2023 Mar;42(3):e2200135. doi: 10.1002/minf.202200135. Epub 2023 Feb 1.

Forging the Basis for Developing Protein-Ligand Interaction Scoring Functions.为开发蛋白质-配体相互作用评分函数奠定基础。

Acc Chem Res. 2017 Feb 21;50(2):302-309. doi: 10.1021/acs.accounts.6b00491. Epub 2017 Feb 9.

Comparative assessment of scoring functions on an updated benchmark: 2. Evaluation methods and general results.更新后的基准上评分函数的比较评估：2. 评估方法与总体结果。

J Chem Inf Model. 2014 Jun 23;54(6):1717-36. doi: 10.1021/ci500081m. Epub 2014 Jun 2.

Deep Learning in Drug Design: Protein-Ligand Binding Affinity Prediction.药物设计中的深度学习：蛋白质-配体结合亲和力预测

IEEE/ACM Trans Comput Biol Bioinform. 2022 Jan-Feb;19(1):407-417. doi: 10.1109/TCBB.2020.3046945. Epub 2022 Feb 3.

引用本文的文献

A beginner's approach to deep learning applied to VS and MD techniques.深度学习应用于VS和MD技术的初学者方法。

J Cheminform. 2025 Apr 8;17(1):47. doi: 10.1186/s13321-025-00985-7.

Exploring the potential of compound-protein complex structure-free models in virtual screening using BlendNet.利用BlendNet探索无复合蛋白复合物结构模型在虚拟筛选中的潜力。

Brief Bioinform. 2024 Nov 22;26(1). doi: 10.1093/bib/bbae712.

The role of artificial intelligence in drug screening, drug design, and clinical trials.人工智能在药物筛选、药物设计和临床试验中的作用。

Front Pharmacol. 2024 Nov 29;15:1459954. doi: 10.3389/fphar.2024.1459954. eCollection 2024.

GPCR-IPL score: multilevel featurization of GPCR-ligand interaction patterns and prediction of ligand functions from selectivity to biased activation.GPCR-IPL 评分：从选择性到偏激活的配体功能预测，对 GPCR-配体相互作用模式进行多层次特征化。

Brief Bioinform. 2024 Jan 22;25(2). doi: 10.1093/bib/bbae105.

Systematic analysis, aggregation and visualisation of interaction fingerprints for molecular dynamics simulation data.分子动力学模拟数据相互作用指纹的系统分析、汇总与可视化

J Cheminform. 2024 Mar 12;16(1):28. doi: 10.1186/s13321-024-00822-3.

Integrating Artificial Intelligence for Drug Discovery in the Context of Revolutionizing Drug Delivery.在药物递送变革的背景下整合人工智能用于药物发现。

Life (Basel). 2024 Feb 7;14(2):233. doi: 10.3390/life14020233.

Prediction of chemical warfare agents based on cholinergic array type meta-predictors.基于胆碱能阵列型元预测因子预测化学战剂。

Sci Rep. 2022 Oct 6;12(1):16709. doi: 10.1038/s41598-022-21150-2.

PLAS-5k: Dataset of Protein-Ligand Affinities from Molecular Dynamics for Machine Learning Applications.PLAS-5k：用于机器学习应用的分子动力学中蛋白质-配体亲和力的数据集。

Sci Data. 2022 Sep 7;9(1):548. doi: 10.1038/s41597-022-01631-9.

Explainable deep drug-target representations for binding affinity prediction.可解释的深度药物靶标表示用于结合亲和力预测。

BMC Bioinformatics. 2022 Jun 17;23(1):237. doi: 10.1186/s12859-022-04767-y.

fingeRNAt-A novel tool for high-throughput analysis of nucleic acid-ligand interactions.fingERNAt—一种用于高通量分析核酸-配体相互作用的新工具。

PLoS Comput Biol. 2022 Jun 2;18(6):e1009783. doi: 10.1371/journal.pcbi.1009783. eCollection 2022 Jun.

本文引用的文献

RosENet: Improving Binding Affinity Prediction by Leveraging Molecular Mechanics Energies with an Ensemble of 3D Convolutional Neural Networks.RosENet：利用 3D 卷积神经网络集成提高结合亲和力预测的分子力学能量。

J Chem Inf Model. 2020 Jun 22;60(6):2791-2802. doi: 10.1021/acs.jcim.0c00075. Epub 2020 May 26.

Machine learning and ligand binding predictions: A review of data, methods, and obstacles.机器学习和配体结合预测：数据、方法和障碍的综述。

Biochim Biophys Acta Gen Subj. 2020 Jun;1864(6):129545. doi: 10.1016/j.bbagen.2020.129545. Epub 2020 Feb 10.

Learning from the ligand: using ligand-based features to improve binding affinity prediction.从配体中学习：利用基于配体的特征来提高结合亲和力预测。

Bioinformatics. 2020 Feb 1;36(3):758-764. doi: 10.1093/bioinformatics/btz665.

OnionNet: a Multiple-Layer Intermolecular-Contact-Based Convolutional Neural Network for Protein-Ligand Binding Affinity Prediction.洋葱网络：一种基于多层分子间接触的卷积神经网络，用于蛋白质-配体结合亲和力预测。

ACS Omega. 2019 Sep 16;4(14):15956-15965. doi: 10.1021/acsomega.9b01997. eCollection 2019 Oct 1.

AGL-Score: Algebraic Graph Learning Score for Protein-Ligand Binding Scoring, Ranking, Docking, and Screening.AGL-Score：用于蛋白质-配体结合评分、排序、对接和筛选的代数图学习评分。

J Chem Inf Model. 2019 Jul 22;59(7):3291-3304. doi: 10.1021/acs.jcim.9b00334. Epub 2019 Jul 1.

Comparative Assessment of Scoring Functions: The CASF-2016 Update.评分函数的比较评估：CASF-2016 更新。

J Chem Inf Model. 2019 Feb 25;59(2):895-913. doi: 10.1021/acs.jcim.8b00545. Epub 2018 Dec 11.

Development of a protein-ligand extended connectivity (PLEC) fingerprint and its application for binding affinity predictions.开发一种蛋白质配体扩展连接性（PLEC）指纹及其在结合亲和力预测中的应用。

Bioinformatics. 2019 Apr 15;35(8):1334-1341. doi: 10.1093/bioinformatics/bty757.

Evaluation of AutoDock and AutoDock Vina on the CASF-2013 Benchmark.评价 AutoDock 和 AutoDock Vina 在 CASF-2013 基准测试中的表现。

J Chem Inf Model. 2018 Aug 27;58(8):1697-1706. doi: 10.1021/acs.jcim.8b00312. Epub 2018 Jul 25.

Development and evaluation of a deep learning model for protein-ligand binding affinity prediction.开发和评估用于预测蛋白质-配体结合亲和力的深度学习模型。

Bioinformatics. 2018 Nov 1;34(21):3666-3674. doi: 10.1093/bioinformatics/bty374.

Deep Learning for Drug Design: an Artificial Intelligence Paradigm for Drug Discovery in the Big Data Era.深度学习在药物设计中的应用：大数据时代药物发现的人工智能范例。

AAPS J. 2018 Mar 30;20(3):58. doi: 10.1208/s12248-018-0210-0.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

SMPLIP评分：通过简单且可解释的实时相互作用指纹模式描述符预测配体结合亲和力。

SMPLIP-Score: predicting ligand binding affinity from simple and interpretable on-the-fly interaction fingerprint pattern descriptors.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献