• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算化学方法的概率性能估计器:系统改进概率和排名概率矩阵。I. 理论。

Probabilistic performance estimators for computational chemistry methods: Systematic improvement probability and ranking probability matrix. I. Theory.

作者信息

Pernot Pascal, Savin Andreas

机构信息

Institut de Chimie Physique, UMR8000, CNRS, Université Paris-Saclay, 91405 Orsay, France.

Laboratoire de Chimie Théorique, CNRS and UPMC Université Paris 06, Sorbonne Universités, 75252 Paris, France.

出版信息

J Chem Phys. 2020 Apr 30;152(16):164108. doi: 10.1063/5.0006202.

DOI:10.1063/5.0006202
PMID:32357773
Abstract

The comparison of benchmark error sets is an essential tool for the evaluation of theories in computational chemistry. The standard ranking of methods by their mean unsigned error is unsatisfactory for several reasons linked to the non-normality of the error distributions and the presence of underlying trends. Complementary statistics have recently been proposed to palliate such deficiencies, such as quantiles of the absolute error distribution or the mean prediction uncertainty. We introduce here a new score, the systematic improvement probability, based on the direct system-wise comparison of absolute errors. Independent of the chosen scoring rule, the uncertainty of the statistics due to the incompleteness of the benchmark datasets is also generally overlooked. However, this uncertainty is essential to appreciate the robustness of rankings. In the present article, we develop two indicators based on robust statistics to address this problem: P, the inversion probability between two values of a statistic, and P, the ranking probability matrix. We demonstrate also the essential contribution of the correlations between error sets in these scores comparisons.

摘要

基准误差集的比较是评估计算化学理论的重要工具。按平均绝对误差对方法进行标准排名存在若干不足之处,原因与误差分布的非正态性以及潜在趋势的存在有关。最近有人提出了补充统计量来缓解这些不足,例如绝对误差分布的分位数或平均预测不确定性。我们在此引入一种新的分数,即系统改进概率,它基于绝对误差的直接系统间比较。无论选择何种评分规则,由于基准数据集不完整导致的统计量不确定性通常也被忽视。然而,这种不确定性对于评估排名的稳健性至关重要。在本文中,我们基于稳健统计量开发了两个指标来解决这个问题:P,即统计量两个值之间的反转概率;以及P,即排名概率矩阵。我们还证明了误差集之间的相关性在这些分数比较中的重要作用。

相似文献

1
Probabilistic performance estimators for computational chemistry methods: Systematic improvement probability and ranking probability matrix. I. Theory.计算化学方法的概率性能估计器:系统改进概率和排名概率矩阵。I. 理论。
J Chem Phys. 2020 Apr 30;152(16):164108. doi: 10.1063/5.0006202.
2
Probabilistic performance estimators for computational chemistry methods: Systematic improvement probability and ranking probability matrix. II. Applications.计算化学方法的概率性能估计器:系统改进概率和排名概率矩阵。II. 应用。
J Chem Phys. 2020 Apr 30;152(16):164109. doi: 10.1063/5.0006204.
3
Probabilistic performance estimators for computational chemistry methods: The empirical cumulative distribution function of absolute errors.计算化学方法的概率性能估计器:绝对误差的经验累积分布函数。
J Chem Phys. 2018 Jun 28;148(24):241707. doi: 10.1063/1.5016248.
4
Erratum: Probabilistic performance estimators for computational chemistry methods: Systematic improvement probability and ranking probability matrix. I. Theory [J. Chem. Phys. 152, 164108 (2020)].勘误:计算化学方法的概率性能估计器:系统改进概率和排名概率矩阵。I. 理论 [《化学物理杂志》152, 164108 (2020)]。
J Chem Phys. 2020 Oct 28;153(16):169901. doi: 10.1063/5.0031156.
5
Error Assessment of Computational Models in Chemistry.化学计算模型的误差评估
Chimia (Aarau). 2017 Apr 26;71(4):202-208. doi: 10.2533/chimia.2017.202.
6
Isosurface Visualization of Data with Nonparametric Models for Uncertainty.基于非参数模型的不确定数据的等表面可视化。
IEEE Trans Vis Comput Graph. 2016 Jan;22(1):777-86. doi: 10.1109/TVCG.2015.2467958.
7
Efficiency of analytical and sampling-based uncertainty propagation in intensity-modulated proton therapy.调强质子治疗中基于分析和采样的不确定性传播的效率
Phys Med Biol. 2017 Jun 26;62(14):5790-5807. doi: 10.1088/1361-6560/aa6ec5.
8
Epistemic uncertainty in the ranking and categorization of probabilistic safety assessment model elements: issues and findings.概率安全评估模型要素排名和分类中的认知不确定性:问题与发现。
Risk Anal. 2008 Aug;28(4):983-1001. doi: 10.1111/j.1539-6924.2008.01064.x. Epub 2008 Jun 28.
9
Evaluation of the ranking probabilities for partial orders based on random linear extensions.基于随机线性扩展对偏序的排序概率进行评估。
Chemosphere. 2003 Dec;53(8):981-92. doi: 10.1016/S0045-6535(03)00558-7.
10
Kernel density estimation-based real-time prediction for respiratory motion.基于核密度估计的呼吸运动实时预测。
Phys Med Biol. 2010 Mar 7;55(5):1311-26. doi: 10.1088/0031-9155/55/5/004. Epub 2010 Feb 4.

引用本文的文献

1
Use of Factorial Design for Calculation of Second Hyperpolarizabilities.用于计算二阶超极化率的因子设计法。
Nanomaterials (Basel). 2025 Aug 23;15(17):1302. doi: 10.3390/nano15171302.
2
DFT exchange: sharing perspectives on the workhorse of quantum chemistry and materials science.DFT 交换:对量子化学和材料科学的主力的观点分享。
Phys Chem Chem Phys. 2022 Dec 7;24(47):28700-28781. doi: 10.1039/d2cp02827a.
3
A Generalized Regression Neural Network Model for Predicting the Curing Characteristics of Carbon Black-Filled Rubber Blends.
一种用于预测炭黑填充橡胶共混物固化特性的广义回归神经网络模型。
Polymers (Basel). 2022 Feb 9;14(4):653. doi: 10.3390/polym14040653.
4
Uncertainty quantification in classical molecular dynamics.经典分子动力学中的不确定性量化。
Philos Trans A Math Phys Eng Sci. 2021 May 17;379(2197):20200082. doi: 10.1098/rsta.2020.0082. Epub 2021 Mar 29.