• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

在一个通过概率修剪得到的生成焓基准数据集上,对流行的复合热化学模型和密度泛函近似进行关键基准测试。

Critical benchmarking of popular composite thermochemistry models and density functional approximations on a probabilistically pruned benchmark dataset of formation enthalpies.

作者信息

Das Sambit Kumar, Chakraborty Sabyasachi, Ramakrishnan Raghunathan

机构信息

Tata Institute of Fundamental Research, Centre for Interdisciplinary Sciences, Hyderabad 500107, India.

出版信息

J Chem Phys. 2021 Jan 28;154(4):044113. doi: 10.1063/5.0032713.

DOI:10.1063/5.0032713
PMID:33514111
Abstract

First-principles calculation of the standard formation enthalpy, ΔH° (298 K), in such a large scale as required by chemical space explorations, is amenable only with density functional approximations (DFAs) and certain composite wave function theories (cWFTs). Unfortunately, the accuracies of popular range-separated hybrid, "rung-4" DFAs, and cWFTs that offer the best accuracy-vs-cost trade-off have until now been established only for datasets predominantly comprising small molecules; their transferability to larger systems remains vague. In this study, we present an extended benchmark dataset of ΔH° for structurally and electronically diverse molecules. We apply quartile-ranking based on boundary-corrected kernel density estimation to filter outliers and arrive at probabilistically pruned enthalpies of 1694 compounds (PPE1694). For this dataset, we rank the prediction accuracies of G4, G4(MP2), ccCA, CBS-QB3, and 23 popular DFAs using conventional and probabilistic error metrics. We discuss systematic prediction errors and highlight the role an empirical higher-level correction plays in the G4(MP2) model. Furthermore, we comment on uncertainties associated with the reference empirical data for atoms and the systematic errors stemming from these that grow with the molecular size. We believe that these findings will aid in identifying meaningful application domains for quantum thermochemical methods.

摘要

在化学空间探索所需的如此大规模下,标准生成焓ΔH°(298 K)的第一性原理计算仅适用于密度泛函近似(DFA)和某些复合波函数理论(cWFT)。不幸的是,迄今为止,流行的范围分离杂化、“第4级”DFA以及提供最佳精度与成本权衡的cWFT的准确性仅针对主要由小分子组成的数据集得以确立;它们对更大体系的可转移性仍不明确。在本研究中,我们展示了一个针对结构和电子性质多样的分子的ΔH°扩展基准数据集。我们应用基于边界校正核密度估计的四分位数排序来过滤异常值,并得出1694种化合物的概率修剪焓(PPE1694)。对于该数据集,我们使用传统和概率误差度量对G4、G4(MP2)、ccCA、CBS - QB3以及23种流行的DFA的预测准确性进行排名。我们讨论了系统预测误差,并强调了经验性高级校正在G4(MP2)模型中所起的作用。此外,我们评论了与原子参考经验数据相关的不确定性以及由这些不确定性随分子大小增长而产生的系统误差。我们相信这些发现将有助于确定量子热化学方法有意义的应用领域。

相似文献

1
Critical benchmarking of popular composite thermochemistry models and density functional approximations on a probabilistically pruned benchmark dataset of formation enthalpies.在一个通过概率修剪得到的生成焓基准数据集上,对流行的复合热化学模型和密度泛函近似进行关键基准测试。
J Chem Phys. 2021 Jan 28;154(4):044113. doi: 10.1063/5.0032713.
2
Benchmarking Compound Methods (CBS-QB3, CBS-APNO, G3, G4, W1BD) against the Active Thermochemical Tables: Formation Enthalpies of Radicals.将复合方法(CBS-QB3、CBS-APNO、G3、G4、W1BD)与活性热化学表进行基准测试:自由基的生成焓。
J Phys Chem A. 2015 Aug 20;119(33):8922-33. doi: 10.1021/acs.jpca.5b05448. Epub 2015 Aug 6.
3
Benchmarking Compound Methods (CBS-QB3, CBS-APNO, G3, G4, W1BD) against the Active Thermochemical Tables: A Litmus Test for Cost-Effective Molecular Formation Enthalpies.将复合方法(CBS-QB3、CBS-APNO、G3、G4、W1BD)与活性热化学表进行基准测试:对具有成本效益的分子生成焓的石蕊试验。
J Phys Chem A. 2015 Jul 16;119(28):7235-46. doi: 10.1021/jp511403a. Epub 2015 Jan 27.
4
Canonical and DLPNO-Based G4(MP2)XK-Inspired Composite Wave Function Methods Parametrized against Large and Chemically Diverse Training Sets: Are They More Accurate and/or Robust than Double-Hybrid DFT?基于正则化和 DLPNO 的 G4(MP2)XK 启发式复合波函数方法,针对大型和化学多样化的训练集进行参数化:它们比双杂化 DFT 更准确和/或稳健吗?
J Chem Theory Comput. 2020 Jul 14;16(7):4238-4255. doi: 10.1021/acs.jctc.0c00189. Epub 2020 Jun 5.
5
Comparative Study of Nonhybrid Density Functional Approximations for the Prediction of 3d Transition Metal Thermochemistry.用于预测3d过渡金属热化学的非杂化密度泛函近似的比较研究
J Chem Theory Comput. 2017 Oct 10;13(10):4907-4913. doi: 10.1021/acs.jctc.7b00809. Epub 2017 Sep 20.
6
W4-17: A diverse and high-confidence dataset of atomization energies for benchmarking high-level electronic structure methods.W4 - 17:用于基准测试高级电子结构方法的多样化且高置信度的雾化能数据集。
J Comput Chem. 2017 Sep 15;38(24):2063-2075. doi: 10.1002/jcc.24854. Epub 2017 Jul 4.
7
Estimating Systematic Error and Uncertainty in Thermochemistry: II. ATOMIC(hc) Enthalpies of Formation for a Large Set of Hydrocarbons.估算热化学中的系统误差和不确定性:II. 大量烃类的原子(hc)生成焓。
J Chem Theory Comput. 2020 Jan 14;16(1):399-426. doi: 10.1021/acs.jctc.9b00974. Epub 2019 Dec 20.
8
Accurate reaction barrier heights of pericyclic reactions: Surprisingly large deviations for the CBS-QB3 composite method and their consequences in DFT benchmark studies.周环反应的精确反应势垒高度:CBS-QB3复合方法存在惊人的大偏差及其在密度泛函理论基准研究中的后果。
J Comput Chem. 2015 Apr 5;36(9):622-32. doi: 10.1002/jcc.23837. Epub 2015 Feb 3.
9
The correlation consistent composite approach (ccCA): an alternative to the Gaussian-n methods.相关一致性复合方法(ccCA):高斯-n 方法的替代方法。
J Chem Phys. 2006 Mar 21;124(11):114104. doi: 10.1063/1.2173988.
10
Investigation of Gaussian4 theory for transition metal thermochemistry.高斯4理论在过渡金属热化学中的研究。
J Phys Chem A. 2009 Apr 30;113(17):5170-5. doi: 10.1021/jp809179q.

引用本文的文献

1
COMPAS-2: a dataset of cata-condensed hetero-polycyclic aromatic systems.COMPAS-2:一个包含缩合杂多环芳烃体系的数据集。
Sci Data. 2024 Jan 19;11(1):97. doi: 10.1038/s41597-024-02927-8.
2
Prognostic Value of Early Fluorodeoxyglucose-Positron Emission Tomography Response Imaging and Peripheral Immunologic Biomarkers: Substudy of a Phase II Trial of Risk-Adaptive Chemoradiation for Unresectable Non-Small Cell Lung Cancer.早期氟脱氧葡萄糖正电子发射断层扫描反应成像和外周免疫生物标志物的预后价值:不可切除非小细胞肺癌风险适应性放化疗II期试验的子研究
Adv Radiat Oncol. 2021 Nov 21;7(2):100857. doi: 10.1016/j.adro.2021.100857. eCollection 2022 Mar-Apr.