• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过机器学习相对于实验的剩余误差来提高基于物理的水合自由能预测的准确性。

Improving the Accuracy of Physics-Based Hydration-Free Energy Predictions by Machine Learning the Remaining Error Relative to the Experiment.

作者信息

Bass Lewis, Elder Luke H, Folescu Dan E, Forouzesh Negin, Tolokh Igor S, Karpatne Anuj, Onufriev Alexey V

机构信息

Department of Computer Engineering, Virginia Tech, Blacksburg, Virginia 24061, United States.

Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States.

出版信息

J Chem Theory Comput. 2024 Jan 9;20(1):396-410. doi: 10.1021/acs.jctc.3c00981. Epub 2023 Dec 27.

DOI:10.1021/acs.jctc.3c00981
PMID:38149593
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10950260/
Abstract

The accuracy of computational models of water is key to atomistic simulations of biomolecules. We propose a computationally efficient way to improve the accuracy of the prediction of hydration-free energies (HFEs) of small molecules: the remaining errors of the physics-based models relative to the experiment are predicted and mitigated by machine learning (ML) as a postprocessing step. Specifically, the trained graph convolutional neural network attempts to identify the "blind spots" in the physics-based model predictions, where the complex physics of aqueous solvation is poorly accounted for, and partially corrects for them. The strategy is explored for five classical solvent models representing various accuracy/speed trade-offs, from the fast analytical generalized Born (GB) to the popular TIP3P explicit solvent model; experimental HFEs of small neutral molecules from the FreeSolv set are used for the training and testing. For all of the models, the ML correction reduces the resulting root-mean-square error relative to the experiment for HFEs of small molecules, without significant overfitting and with negligible computational overhead. For example, on the test set, the relative accuracy improvement is 47% for the fast analytical GB, making it, after the ML correction, almost as accurate as uncorrected TIP3P. For the TIP3P model, the accuracy improvement is about 39%, bringing the ML-corrected model's accuracy below the 1 kcal/mol threshold. In general, the relative benefit of the ML corrections is smaller for more accurate physics-based models, reaching the lower limit of about 20% relative accuracy gain compared with that of the physics-based treatment alone. The proposed strategy of using ML to learn the remaining error of physics-based models offers a distinct advantage over training ML alone directly on reference HFEs: it preserves the correct overall trend, even well outside of the training set.

摘要

水的计算模型的准确性是生物分子原子模拟的关键。我们提出了一种计算效率高的方法来提高小分子水化自由能(HFE)预测的准确性:通过机器学习(ML)作为后处理步骤来预测和减轻基于物理的模型相对于实验的剩余误差。具体来说,经过训练的图卷积神经网络试图识别基于物理的模型预测中的“盲点”,即水溶剂化的复杂物理过程未得到充分考虑的地方,并对其进行部分校正。我们针对代表各种准确性/速度权衡的五种经典溶剂模型探索了该策略,从快速解析广义玻恩(GB)模型到流行的TIP3P显式溶剂模型;使用来自FreeSolv集的小中性分子的实验HFE进行训练和测试。对于所有模型,ML校正降低了小分子HFE相对于实验的均方根误差,没有明显的过拟合,并且计算开销可以忽略不计。例如,在测试集上,快速解析GB模型的相对准确性提高了47%,经过ML校正后,其准确性几乎与未校正的TIP3P模型一样。对于TIP3P模型,准确性提高了约39%,使ML校正后的模型准确性低于1 kcal/mol阈值。一般来说,对于更准确的基于物理的模型,ML校正的相对益处较小,与仅基于物理的处理相比,相对准确性增益的下限约为20%。所提出的使用ML来学习基于物理的模型的剩余误差的策略相对于直接在参考HFE上单独训练ML具有明显优势:即使在训练集之外,它也能保持正确的总体趋势。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/4df5915f30b1/nihms-1971370-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/5adbffdda365/nihms-1971370-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/ac18b8afb86a/nihms-1971370-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/2f2f4fe57b0b/nihms-1971370-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/2471052ad478/nihms-1971370-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/26b429883a23/nihms-1971370-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/cce7e0131471/nihms-1971370-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/535a14a6e06b/nihms-1971370-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/4df5915f30b1/nihms-1971370-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/5adbffdda365/nihms-1971370-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/ac18b8afb86a/nihms-1971370-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/2f2f4fe57b0b/nihms-1971370-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/2471052ad478/nihms-1971370-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/26b429883a23/nihms-1971370-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/cce7e0131471/nihms-1971370-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/535a14a6e06b/nihms-1971370-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0580/10950260/4df5915f30b1/nihms-1971370-f0009.jpg

相似文献

1
Improving the Accuracy of Physics-Based Hydration-Free Energy Predictions by Machine Learning the Remaining Error Relative to the Experiment.通过机器学习相对于实验的剩余误差来提高基于物理的水合自由能预测的准确性。
J Chem Theory Comput. 2024 Jan 9;20(1):396-410. doi: 10.1021/acs.jctc.3c00981. Epub 2023 Dec 27.
2
Introducing Charge Hydration Asymmetry into the Generalized Born Model.将电荷水化不对称性引入广义玻恩模型。
J Chem Theory Comput. 2014 Apr 8;10(4):1788-1794. doi: 10.1021/ct4010917. Epub 2014 Feb 18.
3
Comparative Assessment of Computational Methods for Free Energy Calculations of Ionic Hydration.离子水合自由能计算方法的比较评估
J Chem Inf Model. 2017 Nov 27;57(11):2763-2775. doi: 10.1021/acs.jcim.7b00485. Epub 2017 Nov 1.
4
Inclusion of Water Multipoles into the Implicit Solvation Framework Leads to Accuracy Gains.将水多极矩纳入隐式溶剂化框架可提高精度。
J Phys Chem B. 2024 Jun 20;128(24):5855-5873. doi: 10.1021/acs.jpcb.4c00254. Epub 2024 Jun 11.
5
Force Field Benchmark of Amino Acids: I. Hydration and Diffusion in Different Water Models.氨基酸力场基准测试:I. 不同水模型中的水合和扩散。
J Chem Inf Model. 2018 May 29;58(5):1037-1052. doi: 10.1021/acs.jcim.8b00026. Epub 2018 Apr 18.
6
Efficient Computation of the Total Solvation Energy of Small Molecules via the R6 Generalized Born Model.通过R6广义玻恩模型高效计算小分子的总溶剂化能
J Chem Theory Comput. 2012 Jul 10;8(7):2404-11. doi: 10.1021/ct200786m. Epub 2012 Jun 8.
7
Generalized Born implicit solvent models for small molecule hydration free energies.用于小分子水合自由能的广义玻恩隐式溶剂模型。
Phys Chem Chem Phys. 2017 Jan 4;19(2):1677-1685. doi: 10.1039/c6cp07347f.
8
Accuracy comparison of several common implicit solvent models and their implementations in the context of protein-ligand binding.几种常见隐式溶剂模型在蛋白质-配体结合背景下的准确性比较及其实现方式
J Mol Graph Model. 2017 Mar;72:70-80. doi: 10.1016/j.jmgm.2016.12.011. Epub 2016 Dec 21.
9
Tuning Potential Functions to Host-Guest Binding Data.调整势能函数以适应主客体结合数据。
J Chem Theory Comput. 2024 Jan 9;20(1):239-252. doi: 10.1021/acs.jctc.3c01050. Epub 2023 Dec 26.
10
Hybrid Modelling by Machine Learning Corrections of Analytical Model Predictions towards High-Fidelity Simulation Solutions.通过机器学习对分析模型预测进行校正以实现高保真模拟解决方案的混合建模
Materials (Basel). 2021 Apr 10;14(8):1883. doi: 10.3390/ma14081883.

引用本文的文献

1
Using Deep Graph Neural Networks Improves Physics-Based Hydration Free Energy Predictions Even for Molecules Outside of the Training Set Distribution.使用深度图神经网络可改善基于物理的水合自由能预测,即使对于训练集分布之外的分子也是如此。
J Phys Chem B. 2025 Jul 24;129(29):7483-7498. doi: 10.1021/acs.jpcb.5c02263. Epub 2025 Jul 11.
2
Inclusion of Water Multipoles into the Implicit Solvation Framework Leads to Accuracy Gains.将水多极矩纳入隐式溶剂化框架可提高精度。
J Phys Chem B. 2024 Jun 20;128(24):5855-5873. doi: 10.1021/acs.jpcb.4c00254. Epub 2024 Jun 11.

本文引用的文献

1
Machine Learning Methods for Small Data Challenges in Molecular Science.机器学习方法在分子科学中小数据挑战中的应用。
Chem Rev. 2023 Jul 12;123(13):8736-8780. doi: 10.1021/acs.chemrev.3c00189. Epub 2023 Jun 29.
2
Extending machine learning beyond interatomic potentials for predicting molecular properties.将机器学习应用于超越原子间势的领域,以预测分子性质。
Nat Rev Chem. 2022 Sep;6(9):653-672. doi: 10.1038/s41570-022-00416-3. Epub 2022 Aug 25.
3
A "short blanket" dilemma for a state-of-the-art neural network potential for water: Reproducing experimental properties or the physics of the underlying many-body interactions?
为最先进的神经网络水势模型制造“短毛毯”困境:是重现实验性质还是潜在多体相互作用的物理性质?
J Chem Phys. 2023 Feb 28;158(8):084111. doi: 10.1063/5.0142843.
4
Fast Polarizable Water Model for Atomistic Simulations.快速极化水分子模型用于原子模拟。
J Chem Theory Comput. 2022 Oct 11;18(10):6324-6333. doi: 10.1021/acs.jctc.2c00378. Epub 2022 Oct 3.
5
A Closed-Form, Analytical Approximation for Apparent Surface Charge and Electric Field of Molecules.分子表观表面电荷与电场的闭式解析近似
ACS Omega. 2022 Jul 19;7(30):26123-26136. doi: 10.1021/acsomega.2c01484. eCollection 2022 Aug 2.
6
A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database.基于物理的神经网络预测蛋白质-配体结合自由能:从主客体体系到 PDBbind 数据库。
Biomolecules. 2022 Jun 29;12(7):919. doi: 10.3390/biom12070919.
7
Generating 3D molecules conditional on receptor binding sites with deep generative models.利用深度生成模型根据受体结合位点生成3D分子。
Chem Sci. 2022 Feb 7;13(9):2701-2713. doi: 10.1039/d1sc05976a. eCollection 2022 Mar 2.
8
How good are polarizable and flexible models for water: Insights from a many-body perspective.极化可变形水模型的性能如何:多体视角的见解
J Chem Phys. 2020 Aug 14;153(6):060901. doi: 10.1063/5.0017590.
9
Accurate determination of solvation free energies of neutral organic compounds from first principles.从第一性原理准确确定中性有机化合物的溶剂化自由能。
Nat Commun. 2022 Jan 20;13(1):414. doi: 10.1038/s41467-022-28041-0.
10
Choosing the right molecular machine learning potential.选择合适的分子机器学习势函数。
Chem Sci. 2021 Sep 15;12(43):14396-14413. doi: 10.1039/d1sc03564a. eCollection 2021 Nov 10.