• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于准确预测原子化能的键型受限物性加权径向分布函数的机器学习方法。

Bond Type Restricted Property Weighted Radial Distribution Functions for Accurate Machine Learning Prediction of Atomization Energies.

机构信息

Department of Chemistry and Biomolecular Science , University of Ottawa , Ottawa K1N 6N5 , Canada.

出版信息

J Chem Theory Comput. 2018 Oct 9;14(10):5229-5237. doi: 10.1021/acs.jctc.8b00788. Epub 2018 Sep 10.

DOI:10.1021/acs.jctc.8b00788
PMID:30148628
Abstract

Understanding the performance of machine learning algorithms is essential for designing more accurate and efficient statistical models. It is not always possible to unravel the reasoning of neural networks. Here, we propose a method for calculating machine learning kernels in closed and analytic form by combining atomic property weighted radial distribution function (AP-RDF) descriptor with a Gaussian kernel. This allowed us to analyze and improve the performance of the Bag-of-Bonds descriptor when the bond type restriction is included in AP-RDF. The improvement is achieved for the prediction of molecular atomization energies (MAE = 1.7 kcal/mol for QM7 data set) and is due to the incorporation of a tensor product into the kernel, which captures the multidimensional representation of the AP-RDF. On the other hand, the numerical version of the AP-RDF is a constant size descriptor, making it more computationally efficient than Bag-of-Bonds. We have also discussed a connection between molecular quantum similarity and machine learning kernels with first-principles kinds of descriptors.

摘要

理解机器学习算法的性能对于设计更准确和高效的统计模型至关重要。神经网络的推理并不总是能够被揭示。在这里,我们提出了一种通过将原子特性加权径向分布函数(AP-RDF)描述符与高斯核相结合来计算机器学习核的封闭和解析形式的方法。这使得我们能够分析和改进当在 AP-RDF 中包含键类型限制时的键合描述符的性能。这种改进是通过在核中引入张量积来实现的,该张量积捕获了 AP-RDF 的多维表示,从而实现了对分子原子化能的预测(对于 QM7 数据集,MAE=1.7 kcal/mol)。另一方面,AP-RDF 的数值版本是一个固定大小的描述符,使其比键合描述符更具计算效率。我们还讨论了分子量子相似性与基于第一性原理的描述符的机器学习核之间的联系。

相似文献

1
Bond Type Restricted Property Weighted Radial Distribution Functions for Accurate Machine Learning Prediction of Atomization Energies.用于准确预测原子化能的键型受限物性加权径向分布函数的机器学习方法。
J Chem Theory Comput. 2018 Oct 9;14(10):5229-5237. doi: 10.1021/acs.jctc.8b00788. Epub 2018 Sep 10.
2
The limitations of Slater's element-dependent exchange functional from analytic density-functional theory.解析密度泛函理论中斯莱特元素相关交换泛函的局限性。
J Chem Phys. 2006 Jan 28;124(4):044107. doi: 10.1063/1.2161176.
3
Constant size descriptors for accurate machine learning models of molecular properties.用于分子性质的精确机器学习模型的常量大小描述符。
J Chem Phys. 2018 Jun 28;148(24):241718. doi: 10.1063/1.5020441.
4
Assessment and Validation of Machine Learning Methods for Predicting Molecular Atomization Energies.用于预测分子原子化能量的机器学习方法的评估与验证
J Chem Theory Comput. 2013 Aug 13;9(8):3404-19. doi: 10.1021/ct400195d. Epub 2013 Jul 30.
5
Prediction Errors of Molecular Machine Learning Models Lower than Hybrid DFT Error.分子机器学习模型的预测误差低于混合密度泛函理论误差。
J Chem Theory Comput. 2017 Nov 14;13(11):5255-5264. doi: 10.1021/acs.jctc.7b00577. Epub 2017 Oct 10.
6
Machine Learning Predictions of Molecular Properties: Accurate Many-Body Potentials and Nonlocality in Chemical Space.机器学习对分子性质的预测:化学空间中的精确多体势与非局域性
J Phys Chem Lett. 2015 Jun 18;6(12):2326-31. doi: 10.1021/acs.jpclett.5b00831.
7
MultiDK: A Multiple Descriptor Multiple Kernel Approach for Molecular Discovery and Its Application to Organic Flow Battery Electrolytes.多描述符多核方法(MultiDK)在分子发现中的应用及其在有机流电池电解质中的应用
J Chem Inf Model. 2017 Apr 24;57(4):657-668. doi: 10.1021/acs.jcim.6b00332. Epub 2017 Apr 10.
8
Resolving Transition Metal Chemical Space: Feature Selection for Machine Learning and Structure-Property Relationships.解析过渡金属化学空间:机器学习的特征选择与结构-性质关系
J Phys Chem A. 2017 Nov 22;121(46):8939-8954. doi: 10.1021/acs.jpca.7b08750. Epub 2017 Nov 15.
9
Machine Learning to Predict Homolytic Dissociation Energies of C-H Bonds: Calibration of DFT-based Models with Experimental Data.机器学习预测 C-H 键均裂解离能:基于实验数据的 DFT 模型校准。
Mol Inform. 2023 Jan;42(1):e2200193. doi: 10.1002/minf.202200193. Epub 2022 Oct 19.
10
Gaussian Process Regression Models for the Prediction of Hydrogen Bond Acceptor Strengths.高斯过程回归模型在预测氢键受体强度中的应用。
Mol Inform. 2019 Apr;38(4):e1800115. doi: 10.1002/minf.201800115. Epub 2018 Nov 25.

引用本文的文献

1
The Role of Structural Representation in the Performance of a Deep Neural Network for X-Ray Spectroscopy.结构表示在用于 X 射线光谱学的深度神经网络性能中的作用。
Molecules. 2020 Jun 11;25(11):2715. doi: 10.3390/molecules25112715.