• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从气相“小分子”势能的数据集角度来看 MD17 数据集。

The MD17 datasets from the perspective of datasets for gas-phase "small" molecule potentials.

机构信息

Department of Chemistry and Cherry L. Emerson Center for Scientific Computation, Emory University, Atlanta, Georgia 30322, USA.

Independent Researcher, Toronto, Canada.

出版信息

J Chem Phys. 2022 Jun 28;156(24):240901. doi: 10.1063/5.0089200.

DOI:10.1063/5.0089200
PMID:35778068
Abstract

There has been great progress in developing methods for machine-learned potential energy surfaces. There have also been important assessments of these methods by comparing so-called learning curves on datasets of electronic energies and forces, notably the MD17 database. The dataset for each molecule in this database generally consists of tens of thousands of energies and forces obtained from DFT direct dynamics at 500 K. We contrast the datasets from this database for three "small" molecules, ethanol, malonaldehyde, and glycine, with datasets we have generated with specific targets for the potential energy surfaces (PESs) in mind: a rigorous calculation of the zero-point energy and wavefunction, the tunneling splitting in malonaldehyde, and, in the case of glycine, a description of all eight low-lying conformers. We found that the MD17 datasets are too limited for these targets. We also examine recent datasets for several PESs that describe small-molecule but complex chemical reactions. Finally, we introduce a new database, "QM-22," which contains datasets of molecules ranging from 4 to 15 atoms that extend to high energies and a large span of configurations.

摘要

在开发机器学习势能面的方法方面已经取得了很大进展。通过比较电子能和力数据集(尤其是 MD17 数据库)上的所谓学习曲线,这些方法也得到了重要的评估。该数据库中每个分子的数据集通常由从 500 K 的 DFT 直接动力学获得的数万能量和力组成。我们对比了来自该数据库的三个“小分子”(乙醇、丙二醛和甘氨酸)的数据集,以及我们为特定的势能面(PES)数据集生成的数据集:零能和波函数的严格计算、丙二醛的隧道分裂,以及甘氨酸的情况下,对所有八个低能构象的描述。我们发现 MD17 数据集对于这些目标来说太有限了。我们还研究了描述小分子但复杂化学反应的几个 PES 的最新数据集。最后,我们引入了一个新的数据库“QM-22”,其中包含从 4 到 15 个原子的分子数据集,这些数据集延伸到高能和大的构象范围。

相似文献

1
The MD17 datasets from the perspective of datasets for gas-phase "small" molecule potentials.从气相“小分子”势能的数据集角度来看 MD17 数据集。
J Chem Phys. 2022 Jun 28;156(24):240901. doi: 10.1063/5.0089200.
2
Δ-Machine Learned Potential Energy Surfaces and Force Fields.Δ-机器学习势能面与力场
J Chem Theory Comput. 2023 Jan 10;19(1):1-17. doi: 10.1021/acs.jctc.2c01034. Epub 2022 Dec 17.
3
Beyond MD17: the reactive xxMD dataset.超越 MD17:反应性 xxMD 数据集。
Sci Data. 2024 Feb 20;11(1):222. doi: 10.1038/s41597-024-03019-3.
4
Permutationally invariant polynomial regression for energies and gradients, using reverse differentiation, achieves orders of magnitude speed-up with high precision compared to other machine learning methods.通过使用反向微分,能量和梯度的置换不变多项式回归,与其他机器学习方法相比,实现了数量级的速度提升和高精度。
J Chem Phys. 2022 Jan 28;156(4):044120. doi: 10.1063/5.0080506.
5
Ab initio molecular dynamics approach to tunneling splitting in polyatomic molecules.从头算分子动力学方法研究多原子分子中的隧穿劈裂。
J Comput Chem. 2012 Jan 5;33(1):60-5. doi: 10.1002/jcc.21943. Epub 2011 Sep 29.
6
Development and Evaluation of Geometry Optimization Algorithms in Conjunction with ANI Potentials.发展和评估与ANI 势能相结合的几何优化算法。
J Chem Theory Comput. 2022 Feb 8;18(2):978-991. doi: 10.1021/acs.jctc.1c01043. Epub 2022 Jan 12.
7
FCHL revisited: Faster and more accurate quantum machine learning.重新审视FCHL:更快、更准确的量子机器学习。
J Chem Phys. 2020 Jan 31;152(4):044107. doi: 10.1063/1.5126701.
8
The ground state tunneling splitting and the zero point energy of malonaldehyde: a quantum Monte Carlo determination.丙二醛的基态隧穿分裂和零点能:量子蒙特卡罗测定
J Chem Phys. 2007 Jan 14;126(2):024308. doi: 10.1063/1.2406074.
9
Excited-State Potential Energy Surfaces, Conical Intersections, and Analytical Gradients from Ground-State Density Functional Theory.基于基态密度泛函理论的激发态势能面、锥形交叉点及解析梯度
J Phys Chem Lett. 2019 May 16;10(10):2538-2545. doi: 10.1021/acs.jpclett.9b00712. Epub 2019 May 6.
10
WS22 database, Wigner Sampling and geometry interpolation for configurationally diverse molecular datasets.WS22 数据库,用于构象多样的分子数据集的维格纳采样和几何插补。
Sci Data. 2023 Feb 15;10(1):95. doi: 10.1038/s41597-023-01998-3.

引用本文的文献

1
Automated and Efficient Sampling of Chemical Reaction Space.化学反应空间的自动化高效采样
Adv Sci (Weinh). 2025 Mar;12(9):e2409009. doi: 10.1002/advs.202409009. Epub 2025 Jan 13.
2
Data Generation for Machine Learning Interatomic Potentials and Beyond.用于机器学习原子间势及其他方面的数据生成。
Chem Rev. 2024 Dec 25;124(24):13681-13714. doi: 10.1021/acs.chemrev.4c00572. Epub 2024 Nov 21.
3
DFT-Based Permutationally Invariant Polynomial Potentials Capture the Twists and Turns of CH.基于密度泛函理论的排列不变多项式势捕捉了碳氢化合物的曲折结构。
J Chem Theory Comput. 2024 Nov 12;20(21):9339-9353. doi: 10.1021/acs.jctc.4c00932. Epub 2024 Oct 21.
4
Uncertainty-driven dynamics for active learning of interatomic potentials.基于不确定性的原子间相互作用势主动学习动力学。
Nat Comput Sci. 2023 Mar;3(3):230-239. doi: 10.1038/s43588-023-00406-5. Epub 2023 Mar 6.
5
A Status Report on "Gold Standard" Machine-Learned Potentials for Water.关于水的“金标准”机器学习势的现状报告。
J Phys Chem Lett. 2023 Sep 14;14(36):8077-8087. doi: 10.1021/acs.jpclett.3c01791. Epub 2023 Sep 1.
6
Neural network potentials for chemistry: concepts, applications and prospects.化学中的神经网络势:概念、应用与展望。
Digit Discov. 2022 Dec 21;2(1):28-58. doi: 10.1039/d2dd00102k. eCollection 2023 Feb 13.
7
WS22 database, Wigner Sampling and geometry interpolation for configurationally diverse molecular datasets.WS22 数据库,用于构象多样的分子数据集的维格纳采样和几何插补。
Sci Data. 2023 Feb 15;10(1):95. doi: 10.1038/s41597-023-01998-3.