• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

独特性:不确定性量化基准测试框架。

UNIQUE: A Framework for Uncertainty Quantification Benchmarking.

机构信息

Novartis Biomedical Research, Novartis Campus, 4002 Basel, Switzerland.

出版信息

J Chem Inf Model. 2024 Nov 25;64(22):8379-8386. doi: 10.1021/acs.jcim.4c01578. Epub 2024 Nov 14.

DOI:10.1021/acs.jcim.4c01578
PMID:39542432
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11600502/
Abstract

Machine learning (ML) models have become key in decision-making for many disciplines, including drug discovery and medicinal chemistry. ML models are generally evaluated prior to their usage in high-stakes decisions, such as compound synthesis or experimental testing. However, no ML model is robust or predictive in all real-world scenarios. Therefore, uncertainty quantification (UQ) in ML predictions has gained importance in recent years. Many investigations have focused on developing methodologies that provide accurate uncertainty estimates for ML-based predictions. Unfortunately, there is no UQ strategy that consistently provides robust estimates about model's applicability on new samples. Depending on the dataset, prediction task, and algorithm, accurate uncertainty estimations might be unfeasible to obtain. Moreover, the optimum UQ metric also varies across applications, and previous investigations have shown a lack of consistency across benchmarks. Herein, the UNIQUE (UNcertaInty QUantification bEnchmarking) framework is introduced to facilitate a comparison of UQ strategies in ML-based predictions. This Python library unifies the benchmarking of multiple UQ metrics, including the calculation of nonstandard UQ metrics (combining information from the dataset and model), and provides a comprehensive evaluation. In this framework, UQ metrics are evaluated for different application scenarios, e.g., eliminating the predictions with the lowest confidence or obtaining a reliable uncertainty estimate for an acquisition function. Taken together, this library will help to standardize UQ investigations and evaluate new methodologies.

摘要

机器学习 (ML) 模型已成为许多学科(包括药物发现和药物化学)决策的关键。在做出高风险决策(如化合物合成或实验测试)之前,通常会对 ML 模型进行评估。然而,没有一个 ML 模型在所有实际场景中都是稳健或可预测的。因此,近年来,机器学习预测中的不确定性量化 (UQ) 变得越来越重要。许多研究都集中在开发能够为基于 ML 的预测提供准确不确定性估计的方法上。不幸的是,没有一种 UQ 策略能够始终为模型在新样本上的适用性提供稳健的估计。根据数据集、预测任务和算法的不同,准确的不确定性估计可能难以获得。此外,最佳 UQ 指标在不同应用中也有所不同,之前的研究表明,基准测试之间缺乏一致性。在此,引入 UNIQUE(UNcertaInty QUantification bEnchmarking)框架,以促进基于 ML 的预测中 UQ 策略的比较。这个 Python 库统一了多个 UQ 指标的基准测试,包括计算非标准 UQ 指标(结合数据集和模型的信息),并提供全面的评估。在这个框架中,UQ 指标针对不同的应用场景进行评估,例如,消除置信度最低的预测,或者为获取函数获得可靠的不确定性估计。总之,这个库将有助于标准化 UQ 研究并评估新的方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/0ab0d224741b/ci4c01578_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/14f914abda80/ci4c01578_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/eda5dfafd22d/ci4c01578_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/69a94d050463/ci4c01578_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/0ab0d224741b/ci4c01578_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/14f914abda80/ci4c01578_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/eda5dfafd22d/ci4c01578_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/69a94d050463/ci4c01578_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8236/11600502/0ab0d224741b/ci4c01578_0004.jpg

相似文献

1
UNIQUE: A Framework for Uncertainty Quantification Benchmarking.独特性:不确定性量化基准测试框架。
J Chem Inf Model. 2024 Nov 25;64(22):8379-8386. doi: 10.1021/acs.jcim.4c01578. Epub 2024 Nov 14.
2
Benchmarking uncertainty quantification for protein engineering.蛋白质工程中基准不确定性量化
PLoS Comput Biol. 2025 Jan 7;21(1):e1012639. doi: 10.1371/journal.pcbi.1012639. eCollection 2025 Jan.
3
Uncertainty quantification via localized gradients for deep learning-based medical image assessments.基于深度学习的医学图像评估的局部梯度不确定性量化。
Phys Med Biol. 2024 Jul 19;69(15). doi: 10.1088/1361-6560/ad611d.
4
Quantifying Uncertainty in Deep Learning of Radiologic Images.深度学习在放射影像中的不确定性量化。
Radiology. 2023 Aug;308(2):e222217. doi: 10.1148/radiol.222217.
5
Comprehensive Uncertainty Quantification and Sensitivity Analysis for Cardiac Action Potential Models.心脏动作电位模型的综合不确定性量化与敏感性分析
Front Physiol. 2019 Jun 26;10:721. doi: 10.3389/fphys.2019.00721. eCollection 2019.
6
An Optimized Uncertainty-Aware Training Framework for Neural Networks.一种针对神经网络的优化不确定性感知训练框架。
IEEE Trans Neural Netw Learn Syst. 2024 May;35(5):6928-6935. doi: 10.1109/TNNLS.2022.3213315. Epub 2024 May 2.
7
A universal similarity based approach for predictive uncertainty quantification in materials science.基于通用相似性的材料科学预测不确定性量化方法。
Sci Rep. 2022 Sep 2;12(1):14931. doi: 10.1038/s41598-022-19205-5.
8
Uncertainty quantification in multivariable regression for material property prediction with Bayesian neural networks.基于贝叶斯神经网络的材料性能预测多变量回归中的不确定性量化
Sci Rep. 2024 May 8;14(1):10543. doi: 10.1038/s41598-024-61189-x.
9
Uncertainty Quantification Reveals the Importance of Data Variability and Experimental Design Considerations for Proarrhythmia Risk Assessment.不确定性量化揭示了数据变异性和实验设计考量在致心律失常风险评估中的重要性。
Front Physiol. 2017 Nov 21;8:917. doi: 10.3389/fphys.2017.00917. eCollection 2017.
10
Data-efficient Bayesian learning for radial dynamic MR reconstruction.用于径向动态磁共振成像重建的数据高效贝叶斯学习
Med Phys. 2023 Nov;50(11):6955-6977. doi: 10.1002/mp.16543. Epub 2023 Jun 27.

本文引用的文献

1
Uncertain of uncertainties? A comparison of uncertainty quantification metrics for chemical data sets.对不确定性不确定?化学数据集不确定性量化指标的比较。
J Cheminform. 2023 Dec 18;15(1):121. doi: 10.1186/s13321-023-00790-0.
2
Prediction Accuracy of Production ADMET Models as a Function of Version: Activity Cliffs Rule.生产 ADMET 模型版本预测准确性:活性悬崖规则。
J Chem Inf Model. 2022 Jul 25;62(14):3275-3280. doi: 10.1021/acs.jcim.2c00699. Epub 2022 Jul 7.
3
Stop Explaining Black Box Machine Learning Models for High Stakes Decisions and Use Interpretable Models Instead.
停止为高风险决策解释黑箱机器学习模型,转而使用可解释模型。
Nat Mach Intell. 2019 May;1(5):206-215. doi: 10.1038/s42256-019-0048-x. Epub 2019 May 13.
4
Uncertainty Prediction for Machine Learning Models of Material Properties.材料性能机器学习模型的不确定性预测
ACS Omega. 2021 Nov 23;6(48):32431-32440. doi: 10.1021/acsomega.1c03752. eCollection 2021 Dec 7.
5
Machine Learning in Drug Discovery: A Review.药物发现中的机器学习:综述
Artif Intell Rev. 2022;55(3):1947-1999. doi: 10.1007/s10462-021-10058-4. Epub 2021 Aug 11.
6
Identifying domains of applicability of machine learning models for materials science.识别机器学习模型在材料科学中的适用领域。
Nat Commun. 2020 Sep 4;11(1):4428. doi: 10.1038/s41467-020-17112-9.
7
Uncertainty Quantification Using Neural Networks for Molecular Property Prediction.使用神经网络进行分子性质预测的不确定性量化。
J Chem Inf Model. 2020 Aug 24;60(8):3770-3780. doi: 10.1021/acs.jcim.0c00502. Epub 2020 Aug 4.
8
Chemoinformatic Classification Methods and their Applicability Domain.化学信息学分类方法及其适用领域。
Mol Inform. 2016 May;35(5):160-80. doi: 10.1002/minf.201501019. Epub 2016 Feb 19.
9
Chemoinformatics as a Theoretical Chemistry Discipline.化学信息学作为理论化学的一个分支。
Mol Inform. 2011 Jan 17;30(1):20-32. doi: 10.1002/minf.201000100. Epub 2011 Jan 24.
10
ChEMBL: a large-scale bioactivity database for drug discovery.ChEMBL:用于药物发现的大型生物活性数据库。
Nucleic Acids Res. 2012 Jan;40(Database issue):D1100-7. doi: 10.1093/nar/gkr777. Epub 2011 Sep 23.