• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

金标准二聚体相互作用能的量子化学基准数据库。

Quantum chemical benchmark databases of gold-standard dimer interaction energies.

机构信息

D. E. Shaw Research, New York, NY, 10036, USA.

Department of Biochemistry and Molecular Biophysics, Columbia University, New York, NY, 10032, USA.

出版信息

Sci Data. 2021 Feb 10;8(1):55. doi: 10.1038/s41597-021-00833-x.

DOI:10.1038/s41597-021-00833-x
PMID:33568655
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7876112/
Abstract

Advances in computational chemistry create an ongoing need for larger and higher-quality datasets that characterize noncovalent molecular interactions. We present three benchmark collections of quantum mechanical data, covering approximately 3,700 distinct types of interacting molecule pairs. The first collection, which we refer to as DES370K, contains interaction energies for more than 370,000 dimer geometries. These were computed using the coupled-cluster method with single, double, and perturbative triple excitations [CCSD(T)], which is widely regarded as the gold-standard method in electronic structure theory. Our second benchmark collection, a core representative subset of DES370K called DES15K, is intended for more computationally demanding applications of the data. Finally, DES5M, our third collection, comprises interaction energies for nearly 5,000,000 dimer geometries; these were calculated using SNS-MP2, a machine learning approach that provides results with accuracy comparable to that of our coupled-cluster training data. These datasets may prove useful in the development of density functionals, empirically corrected wavefunction-based approaches, semi-empirical methods, force fields, and models trained using machine learning methods.

摘要

计算化学的进展不断需要更大和更高质量的数据集来描述非共价分子相互作用。我们提出了三个量子力学数据集的基准集,涵盖了大约 3700 种不同类型的相互作用分子对。第一个数据集,我们称之为 DES370K,包含了超过 370000 个二聚体结构的相互作用能。这些是使用包含单、双和微扰三重激发的耦合簇方法[CCSD(T)]计算得到的,该方法被广泛认为是电子结构理论中的黄金标准方法。我们的第二个基准数据集,DES370K 的一个核心代表子集,称为 DES15K,是为了更具挑战性的数据集应用而设计的。最后,我们的第三个数据集 DES5M,包含了近 5000000 个二聚体结构的相互作用能,这些是使用 SNS-MP2 计算得到的,SNS-MP2 是一种机器学习方法,其结果的准确性与我们的耦合簇训练数据相当。这些数据集可能在密度泛函、经验修正波函数方法、半经验方法、力场和使用机器学习方法训练的模型的开发中证明是有用的。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/c40127a31ce2/41597_2021_833_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/38959914ff81/41597_2021_833_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/e52643f15fc5/41597_2021_833_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/c40127a31ce2/41597_2021_833_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/38959914ff81/41597_2021_833_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/e52643f15fc5/41597_2021_833_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/b246/7876112/c40127a31ce2/41597_2021_833_Fig3_HTML.jpg

相似文献

1
Quantum chemical benchmark databases of gold-standard dimer interaction energies.金标准二聚体相互作用能的量子化学基准数据库。
Sci Data. 2021 Feb 10;8(1):55. doi: 10.1038/s41597-021-00833-x.
2
Calculations on noncovalent interactions and databases of benchmark interaction energies.非共价相互作用的计算和基准相互作用能数据库。
Acc Chem Res. 2012 Apr 17;45(4):663-72. doi: 10.1021/ar200255p. Epub 2012 Jan 6.
3
Improving the accuracy of Møller-Plesset perturbation theory with neural networks.利用神经网络提高 Møller-Plesset 微扰理论的准确性。
J Chem Phys. 2017 Oct 28;147(16):161725. doi: 10.1063/1.4986081.
4
MP2.5 and MP2.X: approaching CCSD(T) quality description of noncovalent interaction at the cost of a single CCSD iteration.MP2.5 和 MP2.X:以单次 CCSD 迭代为代价,逼近 CC 衔接密度泛函理论(CCSD(T))质量对非共价相互作用的描述。
Chemphyschem. 2013 Mar 18;14(4):698-707. doi: 10.1002/cphc.201200850. Epub 2013 Jan 11.
5
On geometries of stacked and H-bonded nucleic acid base pairs determined at various DFT, MP2, and CCSD(T) levels up to the CCSD(T)/complete basis set limit level.关于在各种密度泛函理论(DFT)、二阶微扰理论(MP2)和耦合簇单双激发理论(CCSD(T))水平下直至CCSD(T)/完全基组极限水平所确定的堆积和氢键结合的核酸碱基对的几何结构。
J Chem Phys. 2005 May 22;122(20):204322. doi: 10.1063/1.1906205.
6
Is It Possible To Obtain Coupled Cluster Quality Energies at near Density Functional Theory Cost? Domain-Based Local Pair Natural Orbital Coupled Cluster vs Modern Density Functional Theory.能否以接近密度泛函理论的成本获得耦合簇质量能量?基于域的定域对自然轨道耦合簇与现代密度泛函理论的比较
J Chem Theory Comput. 2015 Sep 8;11(9):4054-63. doi: 10.1021/acs.jctc.5b00359. Epub 2015 Aug 4.
7
Density functional theory based study of molecular interactions, recognition, engineering, and quantum transport in π molecular systems.基于密度泛函理论的π分子体系中分子相互作用、识别、工程和量子输运的研究。
Acc Chem Res. 2014 Nov 18;47(11):3321-30. doi: 10.1021/ar400326q. Epub 2014 Oct 22.
8
Platinum, gold, and silver standards of intermolecular interaction energy calculations.分子间相互作用能计算的铂、金和银标准。
J Chem Phys. 2019 Aug 21;151(7):070901. doi: 10.1063/1.5116151.
9
Interactions between large molecules pose a puzzle for reference quantum mechanical methods.大分子之间的相互作用给参考量子力学方法带来了难题。
Nat Commun. 2021 Jun 24;12(1):3927. doi: 10.1038/s41467-021-24119-3.
10
Coupled cluster benchmarks of large noncovalent complexes: The L7 dataset as well as DNA-ellipticine and buckycatcher-fullerene.大非共价复合物的耦合簇基准测试:L7 数据集以及 DNA-椭圆碱和 buckycatcher-富勒烯。
J Chem Phys. 2021 Apr 21;154(15):154104. doi: 10.1063/5.0042906.

引用本文的文献

1
Studying Noncovalent Interactions in Molecular Systems with Machine Learning.利用机器学习研究分子系统中的非共价相互作用。
Chem Rev. 2025 Jun 25;125(12):5776-5829. doi: 10.1021/acs.chemrev.4c00893. Epub 2025 Jun 9.
2
MACE-OFF: Short-Range Transferable Machine Learning Force Fields for Organic Molecules.MACE-OFF:用于有机分子的短程可转移机器学习力场
J Am Chem Soc. 2025 May 28;147(21):17598-17611. doi: 10.1021/jacs.4c07099. Epub 2025 May 19.
3
Non-Covalent Molecular Interaction Rules to Define Internal Dimer Coordinates for Quantum Mechanical Potential Energy Scans.

本文引用的文献

1
Improving the accuracy of Møller-Plesset perturbation theory with neural networks.利用神经网络提高 Møller-Plesset 微扰理论的准确性。
J Chem Phys. 2017 Oct 28;147(16):161725. doi: 10.1063/1.4986081.
2
The BioFragment Database (BFDb): An open-data platform for computational chemistry analysis of noncovalent interactions.生物片段数据库(BFDb):一个用于非共价相互作用计算化学分析的开放数据平台。
J Chem Phys. 2017 Oct 28;147(16):161727. doi: 10.1063/1.5001028.
3
Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error.
用于定义量子力学势能扫描内部二聚体坐标的非共价分子相互作用规则。
J Comput Chem. 2025 May 30;46(14):e70136. doi: 10.1002/jcc.70136.
4
The QDπ dataset, training data for drug-like molecules and biopolymer fragments and their interactions.QDπ数据集,用于类药物分子、生物聚合物片段及其相互作用的训练数据。
Sci Data. 2025 Apr 25;12(1):693. doi: 10.1038/s41597-025-04972-3.
5
MORE-Q, a dataset for molecular olfactorial receptor engineering by quantum mechanics.MORE-Q,一个用于通过量子力学进行分子嗅觉受体工程的数据集。
Sci Data. 2025 Feb 22;12(1):324. doi: 10.1038/s41597-025-04616-6.
6
Refinement of Atomic Polarizabilities for a Polarizable Gaussian Multipole Force Field with Simultaneous Considerations of Both Molecular Polarizability Tensors and In-Solution Electrostatic Potentials.考虑分子极化率张量和溶液中静电势的同时,对可极化高斯多极子力场的原子极化率进行优化。
J Chem Inf Model. 2025 Feb 10;65(3):1428-1440. doi: 10.1021/acs.jcim.4c02175. Epub 2025 Jan 26.
7
Beyond chemical structures: lessons and guiding principles for the next generation of molecular databases.超越化学结构:面向下一代分子数据库的经验教训与指导原则
Chem Sci. 2024 Nov 28;16(3):1002-1016. doi: 10.1039/d4sc04064c. eCollection 2025 Jan 15.
8
Data Generation for Machine Learning Interatomic Potentials and Beyond.用于机器学习原子间势及其他方面的数据生成。
Chem Rev. 2024 Dec 25;124(24):13681-13714. doi: 10.1021/acs.chemrev.4c00572. Epub 2024 Nov 21.
9
Origin of the intermolecular forces that produce donor-acceptor stacks in π-conjugated charge-transfer complexes.在π共轭电荷转移复合物中产生供体-受体堆叠的分子间力的起源。
Commun Chem. 2024 Nov 6;7(1):253. doi: 10.1038/s42004-024-01329-6.
10
On the Potential Energy Surface of the Pyrene Dimer.芘二聚体的势能面。
Int J Mol Sci. 2024 Oct 6;25(19):10762. doi: 10.3390/ijms251910762.
分子机器学习模型的预测误差低于混合密度泛函理论误差。
J Chem Theory Comput. 2017 Nov 14;13(11):5255-5264. doi: 10.1021/acs.jctc.7b00577. Epub 2017 Oct 10.
4
ANI-1: an extensible neural network potential with DFT accuracy at force field computational cost.ANI-1:一种具有密度泛函理论(DFT)精度且计算成本仅为力场级别的可扩展神经网络势。
Chem Sci. 2017 Apr 1;8(4):3192-3203. doi: 10.1039/c6sc05720a. Epub 2017 Feb 8.
5
Building a More Predictive Protein Force Field: A Systematic and Reproducible Route to AMBER-FB15.构建更具预测性的蛋白质力场:通往AMBER-FB15的系统且可重复的途径。
J Phys Chem B. 2017 Apr 27;121(16):4023-4039. doi: 10.1021/acs.jpcb.7b02320. Epub 2017 Apr 6.
6
Machine-learned approximations to Density Functional Theory Hamiltonians.机器学习近似密度泛函理论哈密顿量。
Sci Rep. 2017 Feb 15;7:42669. doi: 10.1038/srep42669.
7
ωB97M-V: A combinatorially optimized, range-separated hybrid, meta-GGA density functional with VV10 nonlocal correlation.ωB97M-V:一种经过组合优化的、具有范围分离的杂化元广义梯度近似密度泛函,带有VV10非局域相关。
J Chem Phys. 2016 Jun 7;144(21):214110. doi: 10.1063/1.4952647.
8
Parallel and Low-Order Scaling Implementation of Hartree-Fock Exchange Using Local Density Fitting.使用局域密度拟合的哈特里-福克交换的并行和低阶缩放实现
J Chem Theory Comput. 2016 Jul 12;12(7):3122-34. doi: 10.1021/acs.jctc.6b00251. Epub 2016 Jun 27.
9
Revised Damping Parameters for the D3 Dispersion Correction to Density Functional Theory.用于密度泛函理论中D3色散校正的修正阻尼参数
J Phys Chem Lett. 2016 Jun 16;7(12):2197-203. doi: 10.1021/acs.jpclett.6b00780. Epub 2016 May 27.
10
A machine learning correction for DFT non-covalent interactions based on the S22, S66 and X40 benchmark databases.基于S22、S66和X40基准数据库的用于密度泛函理论非共价相互作用的机器学习校正。
J Cheminform. 2016 May 3;8:24. doi: 10.1186/s13321-016-0133-7. eCollection 2016.