使用测试时随机失活技术对深度神经网络进行可靠的预测误差估计。

Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout.

机构信息

Centre for Molecular Informatics, Department of Chemistry , University of Cambridge , Lensfield Road , Cambridge CB2 1EW , United Kingdom.

出版信息

J Chem Inf Model. 2019 Jul 22;59(7):3330-3339. doi: 10.1021/acs.jcim.9b00297. Epub 2019 Jun 26.

DOI:10.1021/acs.jcim.9b00297

PMID:31241929

Abstract

While the use of deep learning in drug discovery is gaining increasing attention, the lack of methods to compute reliable errors in prediction for Neural Networks prevents their application to guide decision making in domains where identifying unreliable predictions is essential, e.g., precision medicine. Here, we present a framework to compute reliable errors in prediction for Neural Networks using Test-Time Dropout and Conformal Prediction. Specifically, the algorithm consists of training a Neural Network using dropout, and then to both the validation and test sets, also employing dropout in this step. Therefore, for each instance in the validation and test sets an ensemble of predictions are generated. The residuals and absolute errors in prediction for the validation set are then used to compute prediction errors for the test set instances using Conformal Prediction. We show using 24 bioactivity data sets from ChEMBL 23 that Dropout Conformal Predictors are valid (i.e., the fraction of instances whose true value lies within the predicted interval strongly correlates with the confidence level) and efficient, as the predicted confidence intervals span a narrower set of values than those computed with Conformal Predictors generated using Random Forest (RF) models. Lastly, we show in retrospective virtual screening experiments that dropout and RF-based Conformal Predictors lead to comparable retrieval rates of active compounds. Overall, we propose a computationally efficient framework (as only extra forward passes are required in addition to training a single network) to harness Test-Time Dropout and the Conformal Prediction framework, which is generally applicable to generate reliable prediction errors for Deep Neural Networks in drug discovery and beyond.

摘要

虽然深度学习在药物发现中的应用越来越受到关注，但缺乏计算神经网络预测可靠误差的方法，这阻碍了它们在需要识别不可靠预测的领域（如精准医学）中应用于指导决策。在这里，我们提出了一种使用测试时随机失活和一致性预测来计算神经网络预测可靠误差的框架。具体来说，该算法包括使用随机失活训练神经网络，然后对验证集和测试集都使用随机失活。因此，对于验证集和测试集中的每个实例，都会生成一组预测结果。然后，使用验证集的残差和预测绝对误差来使用一致性预测计算测试集实例的预测误差。我们使用来自 ChEMBL 23 的 24 个生物活性数据集表明，随机失活一致性预测器是有效的（即，真实值位于预测区间内的实例的分数与置信水平强烈相关）和高效的，因为预测置信区间比使用随机森林 (RF) 模型生成的一致性预测器计算的置信区间范围更窄。最后，我们在回顾性虚拟筛选实验中表明，随机失活和基于 RF 的一致性预测器可以导致活性化合物的检索率相当。总的来说，我们提出了一种计算效率高的框架（除了训练单个网络外，仅需要额外进行次前向传递）来利用测试时随机失活和一致性预测框架，该框架通常适用于在药物发现及其他领域生成可靠的深度神经网络预测误差。

相似文献

Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout.使用测试时随机失活技术对深度神经网络进行可靠的预测误差估计。

J Chem Inf Model. 2019 Jul 22;59(7):3330-3339. doi: 10.1021/acs.jcim.9b00297. Epub 2019 Jun 26.

Deep Confidence: A Computationally Efficient Framework for Calculating Reliable Prediction Errors for Deep Neural Networks.深度置信度：一种用于计算深度神经网络可靠预测误差的计算效率高的框架。

J Chem Inf Model. 2019 Mar 25;59(3):1269-1281. doi: 10.1021/acs.jcim.8b00542. Epub 2018 Oct 30.

Deep Learning-Based Conformal Prediction of Toxicity.基于深度学习的毒性保形预测。

J Chem Inf Model. 2021 Jun 28;61(6):2648-2657. doi: 10.1021/acs.jcim.1c00208. Epub 2021 May 27.

KekuleScope: prediction of cancer cell line sensitivity and compound potency using convolutional neural networks trained on compound images.凯库勒镜：利用在化合物图像上训练的卷积神经网络预测癌细胞系敏感性和化合物效力。

J Cheminform. 2019 Jun 19;11(1):41. doi: 10.1186/s13321-019-0364-5.

J Chem Inf Model. 2019 Jan 28;59(1):181-189. doi: 10.1021/acs.jcim.8b00597. Epub 2018 Nov 19.

How Sure Can We Be about ML Methods-Based Evaluation of Compound Activity: Incorporation of Information about Prediction Uncertainty Using Deep Learning Techniques.基于机器学习方法的化合物活性评估有多大把握：利用深度学习技术纳入预测不确定性信息。

Molecules. 2020 Mar 23;25(6):1452. doi: 10.3390/molecules25061452.

Predicting With Confidence: Using Conformal Prediction in Drug Discovery.有信心的预测：在药物发现中使用一致性预测。

J Pharm Sci. 2021 Jan;110(1):42-49. doi: 10.1016/j.xphs.2020.09.055. Epub 2020 Oct 17.

Conformal Regression for Quantitative Structure-Activity Relationship Modeling-Quantifying Prediction Uncertainty.定量构效关系建模的保形回归——量化预测不确定性。

J Chem Inf Model. 2018 May 29;58(5):1132-1140. doi: 10.1021/acs.jcim.8b00054. Epub 2018 May 10.

Optimizing neural networks for medical data sets: A case study on neonatal apnea prediction.优化神经网络在医学数据集上的应用：以新生儿呼吸暂停预测为例的研究

Artif Intell Med. 2019 Jul;98:59-76. doi: 10.1016/j.artmed.2019.07.008. Epub 2019 Jul 25.

Comparison of Deep Learning With Multiple Machine Learning Methods and Metrics Using Diverse Drug Discovery Data Sets.使用多种药物发现数据集比较深度学习与多种机器学习方法和指标。

Mol Pharm. 2017 Dec 4;14(12):4462-4475. doi: 10.1021/acs.molpharmaceut.7b00578. Epub 2017 Nov 13.

引用本文的文献

State-of-the-art learning COVID-19 vaccine effectiveness using LSTM.利用长短期记忆网络（LSTM）了解新冠病毒疫苗有效性的最新技术。

Inform Med Unlocked. 2024;49. doi: 10.1016/j.imu.2024.101561. Epub 2024 Jul 30.

Implicit versus explicit Bayesian priors for epistemic uncertainty estimation in clinical decision support.临床决策支持中用于认知不确定性估计的隐式与显式贝叶斯先验

PLOS Digit Health. 2025 Jul 29;4(7):e0000801. doi: 10.1371/journal.pdig.0000801. eCollection 2025 Jul.

Spatially Resolved Uncertainties for Machine Learning Potentials.机器学习势的空间分辨不确定性。

J Chem Inf Model. 2024 Aug 26;64(16):6377-6387. doi: 10.1021/acs.jcim.4c00904. Epub 2024 Aug 7.

Predicting the Hallucinogenic Potential of Molecules Using Artificial Intelligence.利用人工智能预测分子的致幻潜力。

ACS Chem Neurosci. 2024 Aug 21;15(16):3078-3089. doi: 10.1021/acschemneuro.4c00405. Epub 2024 Aug 2.

Development and Evaluation of Conformal Prediction Methods for Quantitative Structure-Activity Relationship.定量构效关系的共形预测方法的开发与评估

ACS Omega. 2024 Jun 27;9(27):29478-29490. doi: 10.1021/acsomega.4c02017. eCollection 2024 Jul 9.

Relationship between prediction accuracy and uncertainty in compound potency prediction using deep neural networks and control models.使用深度神经网络和控制模型进行化合物效力预测时预测准确性与不确定性之间的关系

Sci Rep. 2024 Mar 19;14(1):6536. doi: 10.1038/s41598-024-57135-6.

Characterizing Uncertainty in Machine Learning for Chemistry.机器学习在化学中的不确定性描述。

J Chem Inf Model. 2023 Jul 10;63(13):4012-4029. doi: 10.1021/acs.jcim.3c00373. Epub 2023 Jun 20.

Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation.用于不确定性估计的k折交叉验证集成的大规模评估。

J Cheminform. 2023 Apr 28;15(1):49. doi: 10.1186/s13321-023-00709-9.

Uncertainty-aware deep learning in healthcare: A scoping review.医疗保健领域中具有不确定性意识的深度学习：一项范围综述。

PLOS Digit Health. 2022;1(8). doi: 10.1371/journal.pdig.0000085. Epub 2022 Aug 10.

Evaluating High-Variance Leaves as Uncertainty Measure for Random Forest Regression.评估高方差叶子作为随机森林回归的不确定性度量。

Molecules. 2021 Oct 28;26(21):6514. doi: 10.3390/molecules26216514.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用测试时随机失活技术对深度神经网络进行可靠的预测误差估计。

Reliable Prediction Errors for Deep Neural Networks Using Test-Time Dropout.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献