• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于高斯过程的弥散校正改进。

Gaussian Process-Based Refinement of Dispersion Corrections.

机构信息

Department of Chemistry , and Department of Computer Science , University of Toronto , Toronto , Ontario M5S , Canada.

Laboratory of Physical Chemistry , ETH Zurich , Vladimir-Prelog-Weg 2 , 8093 Zurich , Switzerland.

出版信息

J Chem Theory Comput. 2019 Nov 12;15(11):6046-6060. doi: 10.1021/acs.jctc.9b00627. Epub 2019 Oct 11.

DOI:10.1021/acs.jctc.9b00627
PMID:31603673
Abstract

We employ Gaussian process (GP) regression to adjust for systematic errors in D3-type dispersion corrections. We refer to the associated, statistically improved model as D3-GP. It is trained on differences between interaction energies obtained from PBE-D3(BJ)/ma-def2-QZVPP and DLPNO-CCSD(T)/CBS calculations. We generated a data set containing interaction energies for 1248 molecular dimers, which resemble the dispersion-dominated systems contained in the S66 data set. Our systems represent not only equilibrium structures but also dimers with various relative orientations and conformations at both shorter and longer distances. A reparametrization of the D3(BJ) model based on 66 of these dimers suggests that two of its three empirical parameters, and , are zero, whereas = 5.6841 bohr. For the remaining 1182 dimers, we find that this new set of parameters is superior to all previously published D3(BJ) parameter sets. To train our D3-GP model, we engineered two different vectorial representations of (supra-)molecular systems, both derived from the matrix of atom-pairwise D3(BJ) interaction terms: (a) a distance-resolved interaction energy histogram, histD3(BJ), and (b) eigenvalues of the interaction matrix ordered according to their decreasing absolute value, eigD3(BJ). Hence, the GP learns a mapping from D3(BJ) information only, which renders D3-GP-type dispersion corrections comparable to those obtained with the original D3 approach. They improve systematically if the underlying training set is selected carefully. Here, we harness the prediction variance obtained from GP regression to select optimal training sets in an automated fashion. The larger the variance, the more information the corresponding data point may add to the training set. For a given set of molecular systems, variance-based sampling can approximately determine the smallest subset being subjected to reference calculations such that all dispersion corrections for the remaining systems fall below a predefined accuracy threshold. To render the entire D3-GP workflow as efficient as possible, we present an improvement over our variance-based, sequential active-learning scheme [ 2018 , 14 , 5238 ]. Our refined learning algorithm selects multiple (instead of single) systems that can be subjected to reference calculations simultaneously. We refer to the underlying selection strategy as batchwise variance-based sampling (BVS). BVS-guided active learning is an essential component of our D3-GP workflow, which is implemented in a black-box fashion. Once provided with reference data for new molecular systems, the underlying GP model automatically learns to adapt to these and similar systems. This approach leads overall to a self-improving model (D3-GP) that predicts system-focused and GP-refined D3-type dispersion corrections for any given system of reference data.

摘要

我们采用高斯过程(GP)回归来调整 D3 型色散校正中的系统误差。我们将相关的、经过统计学改进的模型称为 D3-GP。它是基于 PBE-D3(BJ)/ma-def2-QZVPP 和 DLPNO-CCSD(T)/CBS 计算得到的相互作用能之间的差异进行训练的。我们生成了一个包含 1248 个分子二聚体相互作用能的数据集,这些二聚体类似于 S66 数据集中包含的色散主导系统。我们的系统不仅代表了平衡结构,还代表了在较短和较长距离处具有各种相对取向和构象的二聚体。基于其中 66 个二聚体对 D3(BJ)模型进行的重新参数化表明,其三个经验参数中的两个, 和 ,为零,而 = 5.6841 bohr。对于其余的 1182 个二聚体,我们发现这个新的参数集优于所有以前发表的 D3(BJ)参数集。为了训练我们的 D3-GP 模型,我们设计了两种不同的超分子系统的向量表示形式,都源自原子间 D3(BJ)相互作用项的矩阵:(a)距离分辨的相互作用能直方图 histD3(BJ),以及(b)根据其绝对值递减的顺序排列的相互作用矩阵的特征值 eigD3(BJ)。因此,GP 仅从 D3(BJ)信息中学习映射,这使得 D3-GP 类型的色散校正与原始 D3 方法获得的校正相当。如果仔细选择基础训练集,它们会系统地改进。在这里,我们利用 GP 回归获得的预测方差以自动化的方式选择最佳训练集。方差越大,相应的数据点可能为训练集添加的信息就越多。对于给定的分子系统集,基于方差的采样可以大致确定要进行参考计算的最小子集,以便所有剩余系统的色散校正都低于预定义的精度阈值。为了使整个 D3-GP 工作流程尽可能高效,我们对基于方差的顺序主动学习方案[2018,14,5238]进行了改进。我们的改进学习算法选择多个(而不是单个)可以同时进行参考计算的系统。我们将这种基本的选择策略称为基于方差的批处理采样(BVS)。基于方差的引导主动学习是我们 D3-GP 工作流程的一个基本组成部分,它以黑盒方式实现。一旦为新的分子系统提供了参考数据,底层 GP 模型就会自动学习适应这些系统和类似系统。这种方法导致整体上的自我改进模型(D3-GP),可以为任何给定的参考数据系统预测聚焦于系统和经过 GP 改进的 D3 型色散校正。

相似文献

1
Gaussian Process-Based Refinement of Dispersion Corrections.基于高斯过程的弥散校正改进。
J Chem Theory Comput. 2019 Nov 12;15(11):6046-6060. doi: 10.1021/acs.jctc.9b00627. Epub 2019 Oct 11.
2
Comprehensive Thermochemical Benchmark Set of Realistic Closed-Shell Metal Organic Reactions.真实闭壳金属有机反应的综合热化学基准集。
J Chem Theory Comput. 2018 May 8;14(5):2596-2608. doi: 10.1021/acs.jctc.7b01183. Epub 2018 Apr 4.
3
Benchmarking density functional methods against the S66 and S66x8 datasets for non-covalent interactions.对非共价相互作用的 S66 和 S66x8 数据集进行密度泛函方法的基准测试。
Chemphyschem. 2011 Dec 9;12(17):3421-33. doi: 10.1002/cphc.201100826. Epub 2011 Nov 23.
4
A geometrical correction for the inter- and intra-molecular basis set superposition error in Hartree-Fock and density functional theory calculations for large systems.一种用于大分子体系 Hartree-Fock 和密度泛函理论计算中分子内和分子间基组叠加误差的几何修正方法。
J Chem Phys. 2012 Apr 21;136(15):154101. doi: 10.1063/1.3700154.
5
A look at the density functional theory zoo with the advanced GMTKN55 database for general main group thermochemistry, kinetics and noncovalent interactions.借助用于一般主族热化学、动力学和非共价相互作用的先进GMTKN55数据库审视密度泛函理论体系。
Phys Chem Chem Phys. 2017 Dec 13;19(48):32184-32215. doi: 10.1039/c7cp04913g.
6
Implementation of empirical dispersion corrections to density functional theory for periodic systems.周期性体系中密度泛函理论的经验弥散修正的实现。
J Comput Chem. 2012 Sep 30;33(25):2023-31. doi: 10.1002/jcc.23037. Epub 2012 Jun 8.
7
Calculations on noncovalent interactions and databases of benchmark interaction energies.非共价相互作用的计算和基准相互作用能数据库。
Acc Chem Res. 2012 Apr 17;45(4):663-72. doi: 10.1021/ar200255p. Epub 2012 Jan 6.
8
DFT Functionals for Modeling of Polyethylene Chains Cross-Linked by Metal Atoms. DLPNO-CCSD(T) Benchmark Calculations.用于金属原子交联聚乙烯链建模的密度泛函理论(DFT)泛函。DLPNO-CCSD(T)基准计算。
J Phys Chem A. 2021 Sep 2;125(34):7382-7395. doi: 10.1021/acs.jpca.1c04793. Epub 2021 Aug 24.
9
Accurate Diels-Alder reaction energies from efficient density functional calculations.通过高效密度泛函计算得到的精确狄尔斯-阿尔德反应能量。
J Chem Theory Comput. 2015 Jun 9;11(6):2879-88. doi: 10.1021/acs.jctc.5b00223. Epub 2015 May 26.
10
Assessing the Performance of CASPT2 and DFT Methods for the Description of Long, Multicenter Bonding in Dimers between Radical Ions.评估耦合簇多参考二级微扰理论(CASPT2)和密度泛函理论(DFT)方法对自由基离子二聚体中长程多中心键合的描述性能。
J Chem Theory Comput. 2014 Feb 11;10(2):650-8. doi: 10.1021/ct4010257.

引用本文的文献

1
Ultrafast solvent migration in an iron complex revealed by nonadiabatic dynamics simulations.非绝热动力学模拟揭示铁配合物中的超快溶剂迁移
Chem Sci. 2025 May 13. doi: 10.1039/d5sc01174d.
2
Possible NLO response and electrical/charge transfer capabilities of natural anthraquinones as p-type organic semiconductors: a DFT approach.天然蒽醌作为p型有机半导体的潜在非线性光学响应及电学/电荷转移能力:一种密度泛函理论方法
J Mol Model. 2024 Feb 1;30(2):57. doi: 10.1007/s00894-024-05848-w.
3
Quantitative Structure-Reactivity Relationships for Synthesis Planning: The Benzhydrylium Case.
用于合成规划的定量结构-反应性关系:二苯甲基正离子的情况。
J Phys Chem A. 2024 Jan 11;128(1):343-354. doi: 10.1021/acs.jpca.3c07289. Epub 2023 Dec 19.
4
Open-Source Machine Learning in Computational Chemistry.开源机器学习在计算化学中的应用。
J Chem Inf Model. 2023 Aug 14;63(15):4505-4532. doi: 10.1021/acs.jcim.3c00643. Epub 2023 Jul 19.
5
The transferability limits of static benchmarks.静态基准测试的可转移性限制。
Phys Chem Chem Phys. 2022 Jun 22;24(24):14692-14698. doi: 10.1039/d2cp01725c.
6
Uncertainty Quantification of Reactivity Scales.反应性尺度的不确定性量化。
Chemphyschem. 2022 Apr 20;23(8):e202200061. doi: 10.1002/cphc.202200061. Epub 2022 Mar 18.
7
Gabor Dictionary of Sparse Image Patches Selected in Prior Boundaries for 3D Liver Segmentation in CT Images.基于先验边界的 3D CT 肝脏图像分割中稀疏图像块的 Gabor 字典选择。
J Healthc Eng. 2021 Dec 9;2021:5552864. doi: 10.1155/2021/5552864. eCollection 2021.
8
Gaussian Process Regression for Materials and Molecules.用于材料和分子的高斯过程回归
Chem Rev. 2021 Aug 25;121(16):10073-10141. doi: 10.1021/acs.chemrev.1c00022. Epub 2021 Aug 16.
9
Ab Initio Machine Learning in Chemical Compound Space.从头开始的化合物空间中的机器学习。
Chem Rev. 2021 Aug 25;121(16):10001-10036. doi: 10.1021/acs.chemrev.0c01303. Epub 2021 Aug 13.
10
Uncertainty quantification in classical molecular dynamics.经典分子动力学中的不确定性量化。
Philos Trans A Math Phys Eng Sci. 2021 May 17;379(2197):20200082. doi: 10.1098/rsta.2020.0082. Epub 2021 Mar 29.