大数据基准测试：在包含122k个CCSD(T)总原子化能的数据集上，雅各布天梯各层级的密度泛函理论（DFT）方法表现如何？

Big data benchmarking: how do DFT methods across the rungs of Jacob's ladder perform for a dataset of 122k CCSD(T) total atomization energies?

作者信息

Karton Amir

机构信息

School of Science and Technology, University of New England, Armidale, NSW 2351, Australia.

出版信息

Phys Chem Chem Phys. 2024 May 22;26(20):14594-14606. doi: 10.1039/d4cp00387j.

DOI:10.1039/d4cp00387j

PMID:38738470

Abstract

Total atomization energies (TAEs) are a central quantity in density functional theory (DFT) benchmark studies. However, so far TAE databases obtained from experiment or high-level wavefunction theory included up to hundreds of TAEs. Here, we use the GDB-9 database of 133k CCSD(T) TAEs generated by Curtiss and co-workers [B. Narayanan, P. C. Redfern, R. S. Assary and L. A. Curtiss, , 2019, , 7449] to evaluate the performance of 14 representative DFT methods across the rungs of Jacob's ladder (namely, PBE, BLYP, B97-D, M06-L, τ-HCTH, PBE0, B3LYP, B3PW91, ωB97X-D, τ-HCTHh, PW6B95, M06, M06-2X, and MN15). We first use the [PBE] diagnostic for nondynamical correlation to eliminate systems that potentially include significant multireference effects, for which the CCSD(T) TAEs might not be sufficiently reliable. The resulting database (denoted by GDB9-nonMR) includes 122k species. Of the considered functionals, B3LYP attains the best performance relative to the G4(MP2) reference TAEs, with a mean absolute deviation (MAD) of 4.09 kcal mol. This first-generation hybrid functional, in which the three mixing coefficients were fitted against a small set of TAEs, is one of the few functionals that are not systematically biased towards overestimating the G4(MP2) TAEs, as demonstrated by a mean-signed deviation (MSD) of 0.45 kcal mol. The relatively good performance of B3LYP is followed by the heavily parameterized M06-L -GGA functional, which attains a MAD of 6.24 kcal mol. The PW6B95, M06, M06-2X, and MN15 functionals tend to systematically overestimate the G4(MP2) TAEs and attain MADs ranging between 18.69 (M06) and 28.54 (MN15) kcal mol. However, PW6B95 and M06-2X exhibit particularly narrow error distributions. Thus, scaling their TAEs by an empirical scaling factor reduces their MADs to merely 3.38 (PW6B95) and 2.85 (M06-2X) kcal mol. Empirical dispersion corrections (, D3 and D4) are attractive, and therefore, their inclusion worsens the performance of methods that systematically overestimate the TAEs.

摘要

总原子化能（TAEs）是密度泛函理论（DFT）基准研究中的核心量。然而，到目前为止，从实验或高水平波函数理论获得的TAE数据库包含多达数百个TAEs。在这里，我们使用由柯蒂斯及其同事生成的133k个CCSD(T) TAE的GDB - 9数据库[B. 纳拉亚南、P. C. 雷德费恩、R. S. 阿萨里和L. A. 柯蒂斯，，2019，，7449]来评估雅各布天梯各层级上14种代表性DFT方法的性能（即PBE、BLYP、B97 - D、M06 - L、τ - HCTH、PBE0、B3LYP、B3PW91、ωB97X - D、τ - HCTHh、PW6B95、M06、M06 - 2X和MN15）。我们首先使用[PBE]非动态相关诊断来排除可能包含显著多参考效应的体系，对于这些体系，CCSD(T) TAE可能不够可靠。所得数据库（记为GDB9 - nonMR）包含122k个物种。在所考虑的泛函中，相对于G4(MP2)参考TAE，B3LYP表现最佳，平均绝对偏差（MAD）为4.09 kcal/mol。这种第一代杂化泛函，其三个混合系数是针对一小部分TAE拟合的，是少数几种不会系统性地偏向高估G4(MP2) TAE的泛函之一，平均符号偏差（MSD）为0.45 kcal/mol就证明了这一点。B3LYP相对较好的性能之后是参数化程度很高的M06 - L - GGA泛函，其MAD为6.24 kcal/mol。PW6B95、M06、M06 - 2X和MN15泛函往往会系统性地高估G4(MP2) TAE，MAD在介于18.69（M06）和28.54（MN15）kcal/mol之间。然而，PW6B95和M06 - 2X表现出特别窄的误差分布。因此，通过经验缩放因子对它们的TAE进行缩放，可将其MAD分别降至仅3.38（PW6B95）和2.85（M06 - 2X）kcal/mol。经验色散校正（如D3和D4）很有吸引力，因此，将它们包含在内会使系统性高估TAE的方法的性能变差。

相似文献

Big data benchmarking: how do DFT methods across the rungs of Jacob's ladder perform for a dataset of 122k CCSD(T) total atomization energies?大数据基准测试：在包含122k个CCSD(T)总原子化能的数据集上，雅各布天梯各层级的密度泛函理论（DFT）方法表现如何？

Phys Chem Chem Phys. 2024 May 22;26(20):14594-14606. doi: 10.1039/d4cp00387j.

Performance of DFT for C Isomerization Energies: A Noticeable Exception to Jacob's Ladder.用于C异构化能量的密度泛函理论（DFT）性能：雅各布天梯的一个显著例外。

J Phys Chem A. 2019 Jan 10;123(1):257-266. doi: 10.1021/acs.jpca.8b10240. Epub 2018 Dec 21.

"Mindless" DFT Benchmarking.“盲目”离散傅里叶变换基准测试

J Chem Theory Comput. 2009 Apr 14;5(4):993-1003. doi: 10.1021/ct800511q. Epub 2009 Mar 4.

How reliable is DFT in predicting relative energies of polycyclic aromatic hydrocarbon isomers? comparison of functionals from different rungs of jacob's ladder.DFT 在预测多环芳烃异构体的相对能量时有多可靠？雅各布天梯不同梯级的泛函比较。

J Comput Chem. 2017 Mar 5;38(6):370-382. doi: 10.1002/jcc.24669. Epub 2016 Nov 17.

Heats of Formation of Medium-Sized Organic Compounds from Contemporary Electronic Structure Methods.基于当代电子结构方法的中等尺寸有机化合物的生成热

J Chem Theory Comput. 2017 Aug 8;13(8):3537-3560. doi: 10.1021/acs.jctc.7b00335. Epub 2017 Jul 10.

A thorough benchmark of density functional methods for general main group thermochemistry, kinetics, and noncovalent interactions.全面基准测试密度泛函方法在一般主族热化学、动力学和非共价相互作用中的应用。

Phys Chem Chem Phys. 2011 Apr 14;13(14):6670-88. doi: 10.1039/c0cp02984j. Epub 2011 Mar 7.

Appropriate description of intermolecular interactions in the methane hydrates: an assessment of DFT methods.甲烷水合物中分子间相互作用的恰当描述：DFT 方法的评估。

J Comput Chem. 2013 Jan 15;34(2):121-31. doi: 10.1002/jcc.23112. Epub 2012 Sep 5.

A critical comparison of CH⋯π π⋯π interactions in the benzene dimer: obtaining benchmarks at the CCSD(T) level and assessing the accuracy of lower scaling methods.苯二聚体中CH⋯π和π⋯π相互作用的关键比较：在CCSD(T)水平获得基准并评估低标度方法的准确性。

Phys Chem Chem Phys. 2023 Feb 8;25(6):4824-4838. doi: 10.1039/d2cp04335a.

Structural and energetic properties of cluster models of anatase-supported single late transition metal atoms: a density functional theory benchmark study.锐钛矿负载的单个晚期过渡金属原子团簇模型的结构和能量性质：一项密度泛函理论基准研究。

J Mol Model. 2024 Oct 22;30(11):380. doi: 10.1007/s00894-024-06173-y.

A comprehensive benchmark investigation of quantum chemical methods for carbocations.碳正离子量子化学方法的全面基准研究。

Phys Chem Chem Phys. 2023 Jan 18;25(3):1903-1922. doi: 10.1039/d2cp04603b.

引用本文的文献

Big-Data Analysis of Geometric Descriptors as Efficient Predictors of Energetic Stability in Nonplanar Polycyclic Aromatic Hydrocarbons.作为非平面多环芳烃能量稳定性有效预测指标的几何描述符的大数据分析

J Comput Chem. 2025 Aug 5;46(21):e70198. doi: 10.1002/jcc.70198.

Tests of the DFT Ladder for the Fulminic Acid Challenge.针对雷酸挑战的密度泛函理论阶梯测试

J Am Chem Soc. 2025 Apr 30;147(17):14088-14104. doi: 10.1021/jacs.4c13823. Epub 2025 Apr 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

大数据基准测试：在包含122k个CCSD(T)总原子化能的数据集上，雅各布天梯各层级的密度泛函理论（DFT）方法表现如何？

Big data benchmarking: how do DFT methods across the rungs of Jacob's ladder perform for a dataset of 122k CCSD(T) total atomization energies?

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献