• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于通用相似性的材料科学预测不确定性量化方法。

A universal similarity based approach for predictive uncertainty quantification in materials science.

机构信息

Department of Chemistry, Lomonosov Moscow State University, Moscow, 119991, Russia.

Frumkin Institute of Physical Chemistry and Electrochemistry, Russian Academy of Sciences, Moscow, 119071, Russia.

出版信息

Sci Rep. 2022 Sep 2;12(1):14931. doi: 10.1038/s41598-022-19205-5.

DOI:10.1038/s41598-022-19205-5
PMID:36056050
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9440040/
Abstract

Immense effort has been exerted in the materials informatics community towards enhancing the accuracy of machine learning (ML) models; however, the uncertainty quantification (UQ) of state-of-the-art algorithms also demands further development. Most prominent UQ methods are model-specific or are related to the ensembles of models; therefore, there is a need to develop a universal technique that can be readily applied to a single model from a diverse set of ML algorithms. In this study, we suggest a new UQ measure known as the Δ-metric to address this issue. The presented quantitative criterion was inspired by the k-nearest neighbor approach adopted for applicability domain estimation in chemoinformatics. It surpasses several UQ methods in accurately ranking the predictive errors and could be considered a low-cost option for a more advanced deep ensemble strategy. We also evaluated the performance of the presented UQ measure on various classes of materials, ML algorithms, and types of input features, thus demonstrating its universality.

摘要

在材料信息学领域,人们付出了巨大的努力来提高机器学习 (ML) 模型的准确性;然而,最先进算法的不确定性量化 (UQ) 也需要进一步发展。大多数突出的 UQ 方法是特定于模型的,或者与模型的集合有关;因此,需要开发一种通用技术,可以很容易地应用于来自不同 ML 算法的单个模型。在这项研究中,我们提出了一种新的 UQ 度量,称为 Δ-度量,以解决这个问题。所提出的定量标准受到了化学信息学中采用的 k-最近邻方法的启发,用于适用性域估计。它在准确地对预测误差进行排序方面优于几种 UQ 方法,并且可以被认为是一种更先进的深度集成策略的低成本选择。我们还评估了所提出的 UQ 度量在各种材料、ML 算法和输入特征类型上的性能,从而证明了它的通用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/83ac9f19c369/41598_2022_19205_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/055a94978423/41598_2022_19205_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/9c903ff9decf/41598_2022_19205_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/bd2ffd9c95ba/41598_2022_19205_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/838df521c12e/41598_2022_19205_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/6f49600f5203/41598_2022_19205_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/a93e744373f3/41598_2022_19205_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/83ac9f19c369/41598_2022_19205_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/055a94978423/41598_2022_19205_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/9c903ff9decf/41598_2022_19205_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/bd2ffd9c95ba/41598_2022_19205_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/838df521c12e/41598_2022_19205_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/6f49600f5203/41598_2022_19205_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/a93e744373f3/41598_2022_19205_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/c806/9440040/83ac9f19c369/41598_2022_19205_Fig7_HTML.jpg

相似文献

1
A universal similarity based approach for predictive uncertainty quantification in materials science.基于通用相似性的材料科学预测不确定性量化方法。
Sci Rep. 2022 Sep 2;12(1):14931. doi: 10.1038/s41598-022-19205-5.
2
Leveraging Bayesian deep learning and ensemble methods for uncertainty quantification in image classification: A ranking-based approach.利用贝叶斯深度学习和集成方法进行图像分类中的不确定性量化:一种基于排序的方法。
Heliyon. 2024 Jan 8;10(2):e24188. doi: 10.1016/j.heliyon.2024.e24188. eCollection 2024 Jan 30.
3
Uncertainty quantification in skin cancer classification using three-way decision-based Bayesian deep learning.基于三向决策贝叶斯深度学习的皮肤癌分类中的不确定性量化。
Comput Biol Med. 2021 Aug;135:104418. doi: 10.1016/j.compbiomed.2021.104418. Epub 2021 Apr 28.
4
Large-scale evaluation of k-fold cross-validation ensembles for uncertainty estimation.用于不确定性估计的k折交叉验证集成的大规模评估。
J Cheminform. 2023 Apr 28;15(1):49. doi: 10.1186/s13321-023-00709-9.
5
Evolutionary Strategies Enable Systematic and Reliable Uncertainty Quantification: A Proof-of-Concept Pilot Study on Resting-State Functional MRI Language Lateralization.进化策略实现系统且可靠的不确定性量化:一项关于静息态功能磁共振成像语言偏侧化的概念验证性初步研究
J Imaging Inform Med. 2025 Feb;38(1):576-586. doi: 10.1007/s10278-024-01188-6. Epub 2024 Jul 9.
6
General Approach to Estimate Error Bars for Quantitative Structure-Activity Relationship Predictions of Molecular Activity.定量构效关系预测分子活性的误差估计的一般方法。
J Chem Inf Model. 2018 Aug 27;58(8):1561-1575. doi: 10.1021/acs.jcim.8b00114. Epub 2018 Jul 17.
7
Comprehensive Uncertainty Quantification and Sensitivity Analysis for Cardiac Action Potential Models.心脏动作电位模型的综合不确定性量化与敏感性分析
Front Physiol. 2019 Jun 26;10:721. doi: 10.3389/fphys.2019.00721. eCollection 2019.
8
Enabling forward uncertainty quantification and sensitivity analysis in cardiac electrophysiology by reduced order modeling and machine learning.通过降阶建模和机器学习实现心脏电生理学中的正向不确定性量化和敏感性分析。
Int J Numer Method Biomed Eng. 2021 Jun;37(6):e3450. doi: 10.1002/cnm.3450. Epub 2021 May 7.
9
Novel learning framework (knockoff technique) to evaluate metric ranking algorithms to describe human response to injury.用于评估度量排序算法以描述人类对损伤反应的新型学习框架(仿冒技术)。
Traffic Inj Prev. 2018;19(sup2):S121-S126. doi: 10.1080/15389588.2018.1519805. Epub 2018 Dec 20.
10
Aleatory-aware deep uncertainty quantification for transfer learning.用于迁移学习的随机感知深度不确定性量化
Comput Biol Med. 2022 Apr;143:105246. doi: 10.1016/j.compbiomed.2022.105246. Epub 2022 Jan 24.

引用本文的文献

1
Interpret Gaussian Process Models by Using Integrated Gradients.使用积分梯度解释高斯过程模型。
Mol Inform. 2025 Jan;44(1):e202400051. doi: 10.1002/minf.202400051. Epub 2024 Nov 26.
2
Evaluation and Optimization Methods for Applicability Domain Methods and Their Hyperparameters, Considering the Prediction Performance of Machine Learning Models.考虑机器学习模型预测性能的适用域方法及其超参数的评估与优化方法
ACS Omega. 2024 Feb 26;9(10):11453-11458. doi: 10.1021/acsomega.3c08036. eCollection 2024 Mar 12.

本文引用的文献

1
Feature Blending: An Approach toward Generalized Machine Learning Models for Property Prediction.特征融合:一种用于属性预测的广义机器学习模型的方法。
ACS Phys Chem Au. 2021 Sep 17;2(1):16-22. doi: 10.1021/acsphyschemau.1c00017. eCollection 2022 Jan 26.
2
Uncertainty Prediction for Machine Learning Models of Material Properties.材料性能机器学习模型的不确定性预测
ACS Omega. 2021 Nov 23;6(48):32431-32440. doi: 10.1021/acsomega.1c03752. eCollection 2021 Dec 7.
3
High-Throughput Discovery of Novel Cubic Crystal Materials Using Deep Generative Neural Networks.
高通量利用深度生成神经网络发现新型立方晶体材料
Adv Sci (Weinh). 2021 Oct;8(20):e2100566. doi: 10.1002/advs.202100566. Epub 2021 Aug 5.
4
Preview of machine learning the quantum-chemical properties of metal-organic frameworks for accelerated materials discovery.用于加速材料发现的机器学习预测金属有机框架的量子化学性质
Patterns (N Y). 2021 Apr 9;2(4):100239. doi: 10.1016/j.patter.2021.100239.
5
Predicting materials properties without crystal structure: deep representation learning from stoichiometry.无需晶体结构预测材料属性:基于化学计量学的深度表征学习
Nat Commun. 2020 Dec 8;11(1):6280. doi: 10.1038/s41467-020-19964-7.
6
Uncertainty quantification in drug design.药物设计中的不确定性量化。
Drug Discov Today. 2021 Feb;26(2):474-489. doi: 10.1016/j.drudis.2020.11.027. Epub 2020 Nov 27.
7
A band-gap database for semiconducting inorganic materials calculated with hybrid functional.使用杂化泛函计算的半导体无机材料带隙数据库。
Sci Data. 2020 Nov 11;7(1):387. doi: 10.1038/s41597-020-00723-8.
8
Identifying domains of applicability of machine learning models for materials science.识别机器学习模型在材料科学中的适用领域。
Nat Commun. 2020 Sep 4;11(1):4428. doi: 10.1038/s41467-020-17112-9.
9
Comparison of Scaling Methods to Obtain Calibrated Probabilities of Activity for Protein-Ligand Predictions.比较用于获得蛋白质-配体预测的校准活性概率的定标方法。
J Chem Inf Model. 2020 Oct 26;60(10):4546-4559. doi: 10.1021/acs.jcim.0c00476. Epub 2020 Sep 21.
10
Big-Data Science in Porous Materials: Materials Genomics and Machine Learning.多孔材料中的大数据科学:材料基因组学与机器学习。
Chem Rev. 2020 Aug 26;120(16):8066-8129. doi: 10.1021/acs.chemrev.0c00004. Epub 2020 Jun 10.