• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用深度图神经网络可改善基于物理的水合自由能预测,即使对于训练集分布之外的分子也是如此。

Using Deep Graph Neural Networks Improves Physics-Based Hydration Free Energy Predictions Even for Molecules Outside of the Training Set Distribution.

作者信息

Elder Luke H, Onufriev Alexey V

机构信息

Department of Computer Science, Virginia Tech, Blacksburg, Virginia 24061, United States.

Department of Physics, Virginia Tech, Blacksburg, Virginia 24061, United States.

出版信息

J Phys Chem B. 2025 Jul 24;129(29):7483-7498. doi: 10.1021/acs.jpcb.5c02263. Epub 2025 Jul 11.

DOI:10.1021/acs.jpcb.5c02263
PMID:40641374
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12302205/
Abstract

The accuracy of computational water models is crucial to atomistic simulations of biomolecules. Here we explore a decoupled framework that combines classical physics-based models with deep neural networks (DNNs) to correct residual error in hydration free energy (HFE) prediction. Our main goal is to evaluate this framework on out-of-distribution data (molecules that differ significantly from those used in training), where DNNs are known to struggle. Several common physics-based solvation models are used in the evaluation. Graph neural network architectures are tested for their ability to generalize using multiple data set splits, including out-of-distribution HFEs and unseen molecular scaffolds. Our most important finding is that for out-of-distribution data, where DNNs alone often struggle, the physics + DNN models consistently improve physics model predictions. For in-distribution data, the DNN corrections significantly improve the accuracy of physics-based models, with a final RMSE below 1 kcal/mol and a relative improvement between 40% and 65% in most cases. The accuracy of physics + DNN models tends to improve when the 6% of molecules with the highest experimental uncertainty are removed. This study provides insights into the potential and limitations of combining physics and machine learning for molecular modeling, offering a practical and generalizable strategy of using DNN as independent postprocessing correction.

摘要

计算水模型的准确性对于生物分子的原子模拟至关重要。在这里,我们探索了一种解耦框架,该框架将基于经典物理的模型与深度神经网络(DNN)相结合,以纠正水合自由能(HFE)预测中的残余误差。我们的主要目标是在分布外数据(与训练中使用的分子有显著差异的分子)上评估这个框架,众所周知,DNN在处理这类数据时会遇到困难。评估中使用了几种常见的基于物理的溶剂化模型。通过多种数据集划分,包括分布外HFE和未见分子支架,测试了图神经网络架构的泛化能力。我们最重要的发现是,对于分布外数据,单独使用DNN往往效果不佳,而物理+DNN模型能持续改进物理模型的预测。对于分布内数据,DNN校正显著提高了基于物理模型的准确性,最终均方根误差(RMSE)低于1千卡/摩尔,在大多数情况下相对提高了40%至65%。去除实验不确定性最高的6%的分子后,物理+DNN模型的准确性往往会提高。这项研究深入探讨了将物理和机器学习结合用于分子建模的潜力和局限性,提供了一种将DNN用作独立后处理校正的实用且可推广的策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/a9035db8d056/jp5c02263_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/72ebcd60100a/jp5c02263_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/da2c322594c5/jp5c02263_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/2a25ba566563/jp5c02263_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/500de0455467/jp5c02263_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/4dd094f77803/jp5c02263_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/76733e882fb9/jp5c02263_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/9e1a5e185c19/jp5c02263_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/a9035db8d056/jp5c02263_0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/72ebcd60100a/jp5c02263_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/da2c322594c5/jp5c02263_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/2a25ba566563/jp5c02263_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/500de0455467/jp5c02263_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/4dd094f77803/jp5c02263_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/76733e882fb9/jp5c02263_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/9e1a5e185c19/jp5c02263_0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dfe5/12302205/a9035db8d056/jp5c02263_0008.jpg

相似文献

1
Using Deep Graph Neural Networks Improves Physics-Based Hydration Free Energy Predictions Even for Molecules Outside of the Training Set Distribution.使用深度图神经网络可改善基于物理的水合自由能预测,即使对于训练集分布之外的分子也是如此。
J Phys Chem B. 2025 Jul 24;129(29):7483-7498. doi: 10.1021/acs.jpcb.5c02263. Epub 2025 Jul 11.
2
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
3
Approximating Human-Level 3D Visual Inferences With Deep Neural Networks.利用深度神经网络逼近人类水平的3D视觉推理
Open Mind (Camb). 2025 Feb 16;9:305-324. doi: 10.1162/opmi_a_00189. eCollection 2025.
4
Sexual Harassment and Prevention Training性骚扰与预防培训
5
Systemic Inflammatory Response Syndrome全身炎症反应综合征
6
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
7
Does the Presence of Missing Data Affect the Performance of the SORG Machine-learning Algorithm for Patients With Spinal Metastasis? Development of an Internet Application Algorithm.缺失数据的存在是否会影响 SORG 机器学习算法在脊柱转移瘤患者中的性能?开发一种互联网应用算法。
Clin Orthop Relat Res. 2024 Jan 1;482(1):143-157. doi: 10.1097/CORR.0000000000002706. Epub 2023 Jun 12.
8
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
9
Short-Term Memory Impairment短期记忆障碍
10
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.

本文引用的文献

1
Physics-Based Machine Learning to Predict Hydration Free Energies for Small Molecules with a Minimal Number of Descriptors: Interpretable and Accurate.基于物理的机器学习,以最少的描述符预测小分子的水合自由能:可解释且准确。
J Phys Chem B. 2025 Feb 6;129(5):1640-1647. doi: 10.1021/acs.jpcb.4c07090. Epub 2025 Jan 22.
2
Improving the Accuracy of Physics-Based Hydration-Free Energy Predictions by Machine Learning the Remaining Error Relative to the Experiment.通过机器学习相对于实验的剩余误差来提高基于物理的水合自由能预测的准确性。
J Chem Theory Comput. 2024 Jan 9;20(1):396-410. doi: 10.1021/acs.jctc.3c00981. Epub 2023 Dec 27.
3
A systematic study of key elements underlying molecular property prediction.
对分子性质预测背后关键要素的系统研究。
Nat Commun. 2023 Oct 13;14(1):6395. doi: 10.1038/s41467-023-41948-6.
4
Explainable Supervised Machine Learning Model To Predict Solvation Gibbs Energy.可解释的监督机器学习模型,用于预测溶剂化吉布斯自由能。
J Chem Inf Model. 2024 Apr 8;64(7):2250-2262. doi: 10.1021/acs.jcim.3c00544. Epub 2023 Aug 21.
5
Machine Learning Methods for Small Data Challenges in Molecular Science.机器学习方法在分子科学中小数据挑战中的应用。
Chem Rev. 2023 Jul 12;123(13):8736-8780. doi: 10.1021/acs.chemrev.3c00189. Epub 2023 Jun 29.
6
Extending machine learning beyond interatomic potentials for predicting molecular properties.将机器学习应用于超越原子间势的领域,以预测分子性质。
Nat Rev Chem. 2022 Sep;6(9):653-672. doi: 10.1038/s41570-022-00416-3. Epub 2022 Aug 25.
7
A "short blanket" dilemma for a state-of-the-art neural network potential for water: Reproducing experimental properties or the physics of the underlying many-body interactions?为最先进的神经网络水势模型制造“短毛毯”困境:是重现实验性质还是潜在多体相互作用的物理性质?
J Chem Phys. 2023 Feb 28;158(8):084111. doi: 10.1063/5.0142843.
8
Fast Polarizable Water Model for Atomistic Simulations.快速极化水分子模型用于原子模拟。
J Chem Theory Comput. 2022 Oct 11;18(10):6324-6333. doi: 10.1021/acs.jctc.2c00378. Epub 2022 Oct 3.
9
A Physics-Guided Neural Network for Predicting Protein-Ligand Binding Free Energy: From Host-Guest Systems to the PDBbind Database.基于物理的神经网络预测蛋白质-配体结合自由能:从主客体体系到 PDBbind 数据库。
Biomolecules. 2022 Jun 29;12(7):919. doi: 10.3390/biom12070919.
10
Delta Machine Learning to Improve Scoring-Ranking-Screening Performances of Protein-Ligand Scoring Functions.利用 Delta 机器学习改进蛋白质配体打分函数的评分-排名-筛选性能。
J Chem Inf Model. 2022 Jun 13;62(11):2696-2712. doi: 10.1021/acs.jcim.2c00485. Epub 2022 May 17.