评估用于蛋白质-配体结合预测的神经网络中的点预测不确定性。

Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction.

作者信息

Fan Ya Ju, Allen Jonathan E, McLoughlin Kevin S, Shi Da, Bennion Brian J, Zhang Xiaohua, Lightstone Felice C

机构信息

Center for Applied Scientific Computing, Lawrence Livermore National Laboratory, 7000 East Ave., Livermore, CA, USA.

Biological Science and Security Center, Lawrence Livermore National Laboratory, Livermore, CA, USA.

出版信息

Artif Intell Chem. 2023 Jun;1(1). doi: 10.1016/j.aichem.2023.100004. Epub 2023 Jun 3.

DOI:10.1016/j.aichem.2023.100004

PMID:37583465

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10426331/

Abstract

Neural Network (NN) models provide potential to speed up the drug discovery process and reduce its failure rates. The success of NN models requires uncertainty quantification (UQ) as drug discovery explores chemical space beyond the training data distribution. Standard NN models do not provide uncertainty information. Some methods require changing the NN architecture or training procedure, limiting the selection of NN models. Moreover, predictive uncertainty can come from different sources. It is important to have the ability to separately model different types of predictive uncertainty, as the model can take assorted actions depending on the source of uncertainty. In this paper, we examine UQ methods that estimate different sources of predictive uncertainty for NN models aiming at protein-ligand binding prediction. We use our prior knowledge on chemical compounds to design the experiments. By utilizing a visualization method we create non-overlapping and chemically diverse partitions from a collection of chemical compounds. These partitions are used as training and test set splits to explore NN model uncertainty. We demonstrate how the uncertainties estimated by the selected methods describe different sources of uncertainty under different partitions and featurization schemes and the relationship to prediction error.

摘要

神经网络（NN）模型为加速药物发现过程和降低失败率提供了潜力。由于药物发现探索的化学空间超出了训练数据分布范围，NN模型的成功需要不确定性量化（UQ）。标准的NN模型不提供不确定性信息。一些方法需要改变NN架构或训练过程，限制了NN模型的选择。此外，预测不确定性可能来自不同来源。能够分别对不同类型的预测不确定性进行建模很重要，因为模型可以根据不确定性的来源采取不同的行动。在本文中，我们研究了针对蛋白质-配体结合预测的NN模型估计不同预测不确定性来源的UQ方法。我们利用对化合物的先验知识来设计实验。通过使用一种可视化方法，我们从一组化合物中创建了不重叠且化学性质不同的分区。这些分区用作训练集和测试集划分，以探索NN模型的不确定性。我们展示了所选方法估计的不确定性如何描述不同分区和特征化方案下的不同不确定性来源以及与预测误差的关系。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f204/10426331/af03a7a4f410/nihms-1912151-f0001.jpg

相似文献

Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction.评估用于蛋白质-配体结合预测的神经网络中的点预测不确定性。

Artif Intell Chem. 2023 Jun;1(1). doi: 10.1016/j.aichem.2023.100004. Epub 2023 Jun 3.

Scalable Bayesian Uncertainty Quantification for Neural Network Potentials: Promise and Pitfalls.神经网络势的可扩展贝叶斯不确定性量化：前景与陷阱

J Chem Theory Comput. 2023 Jul 25;19(14):4520-4532. doi: 10.1021/acs.jctc.2c01267. Epub 2023 Apr 4.

An Optimized Uncertainty-Aware Training Framework for Neural Networks.一种针对神经网络的优化不确定性感知训练框架。

IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6928-6935. doi: 10.1109/TNNLS.2022.3213315. Epub 2024 May 2.

Aleatory-aware deep uncertainty quantification for transfer learning.用于迁移学习的随机感知深度不确定性量化

Comput Biol Med. 2022 Apr;143:105246. doi: 10.1016/j.compbiomed.2022.105246. Epub 2022 Jan 24.

Bayesian uncertainty calculation in neural network inference of ion and electron temperature profiles at W7-X.W7-X装置中离子和电子温度剖面神经网络推断的贝叶斯不确定性计算

Rev Sci Instrum. 2018 Oct;89(10):10K102. doi: 10.1063/1.5039286.

Neural network uncertainty assessment using Bayesian statistics: a remote sensing application.使用贝叶斯统计的神经网络不确定性评估：遥感应用

Neural Comput. 2004 Nov;16(11):2415-58. doi: 10.1162/0899766041941925.

Reducing uncertainties in neural network Jacobians and improving accuracy of neural network emulations with NN ensemble approaches.使用神经网络集成方法减少神经网络雅可比矩阵中的不确定性并提高神经网络仿真的准确性。

Neural Netw. 2007 May;20(4):454-61. doi: 10.1016/j.neunet.2007.04.008. Epub 2007 May 1.

Uncertainty quantification for predictions of atomistic neural networks.原子神经网络预测的不确定性量化。

Chem Sci. 2022 Oct 17;13(44):13068-13084. doi: 10.1039/d2sc04056e. eCollection 2022 Nov 16.

Relationship between prediction accuracy and uncertainty in compound potency prediction using deep neural networks and control models.使用深度神经网络和控制模型进行化合物效力预测时预测准确性与不确定性之间的关系

Sci Rep. 2024 Mar 19;14(1):6536. doi: 10.1038/s41598-024-57135-6.

Folic acid supplementation and malaria susceptibility and severity among people taking antifolate antimalarial drugs in endemic areas.在流行地区，服用抗叶酸抗疟药物的人群中，叶酸补充剂与疟疾易感性和严重程度的关系。

Cochrane Database Syst Rev. 2022 Feb 1;2(2022):CD014217. doi: 10.1002/14651858.CD014217.

引用本文的文献

Achieving well-informed decision-making in drug discovery: a comprehensive calibration study using neural network-based structure-activity models.在药物发现中实现明智的决策：一项使用基于神经网络的构效模型的全面校准研究。

J Cheminform. 2025 Mar 5;17(1):29. doi: 10.1186/s13321-025-00964-y.

Reducing overconfident errors in molecular property classification using Posterior Network.使用后验网络减少分子性质分类中的过度自信错误。

Patterns (N Y). 2024 May 8;5(6):100991. doi: 10.1016/j.patter.2024.100991. eCollection 2024 Jun 14.

本文引用的文献

A hybrid framework for improving uncertainty quantification in deep learning-based QSAR regression modeling.一种用于改进基于深度学习的定量构效关系回归建模中不确定性量化的混合框架。

J Cheminform. 2021 Sep 20;13(1):69. doi: 10.1186/s13321-021-00551-x.

Machine Learning Models to Predict Inhibition of the Bile Salt Export Pump.机器学习模型预测胆汁盐输出泵抑制剂。

J Chem Inf Model. 2021 Feb 22;61(2):587-602. doi: 10.1021/acs.jcim.0c00950. Epub 2021 Jan 27.

Towards reproducible computational drug discovery.迈向可重复的计算药物发现。

J Cheminform. 2020 Jan 28;12(1):9. doi: 10.1186/s13321-020-0408-x.

Uncertainty quantification in drug design.药物设计中的不确定性量化。

Drug Discov Today. 2021 Feb;26(2):474-489. doi: 10.1016/j.drudis.2020.11.027. Epub 2020 Nov 27.

Leveraging Uncertainty in Machine Learning Accelerates Biological Discovery and Design.利用机器学习中的不确定性加速生物学发现和设计。

Cell Syst. 2020 Nov 18;11(5):461-477.e9. doi: 10.1016/j.cels.2020.09.007. Epub 2020 Oct 15.

Uncertainty Quantification Using Neural Networks for Molecular Property Prediction.使用神经网络进行分子性质预测的不确定性量化。

J Chem Inf Model. 2020 Aug 24;60(8):3770-3780. doi: 10.1021/acs.jcim.0c00502. Epub 2020 Aug 4.

Evaluating Scalable Uncertainty Estimation Methods for Deep Learning-Based Molecular Property Prediction.评估基于深度学习的分子性质预测的可扩展不确定性估计方法。

J Chem Inf Model. 2020 Jun 22;60(6):2697-2717. doi: 10.1021/acs.jcim.9b00975. Epub 2020 Apr 24.

AMPL: A Data-Driven Modeling Pipeline for Drug Discovery.AMPL：一个用于药物发现的数据驱动建模管道。

J Chem Inf Model. 2020 Apr 27;60(4):1955-1968. doi: 10.1021/acs.jcim.9b01053. Epub 2020 Apr 16.

Analyzing Learned Molecular Representations for Property Prediction.分析用于性质预测的学习分子表示。

J Chem Inf Model. 2019 Aug 26;59(8):3370-3388. doi: 10.1021/acs.jcim.9b00237. Epub 2019 Aug 13.

Applications of machine learning in drug discovery and development.机器学习在药物发现和开发中的应用。

Nat Rev Drug Discov. 2019 Jun;18(6):463-477. doi: 10.1038/s41573-019-0024-5.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

评估用于蛋白质-配体结合预测的神经网络中的点预测不确定性。

Evaluating point-prediction uncertainties in neural networks for protein-ligand binding prediction.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献