• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

分子动力学指纹 (MDFP):从 MD 数据中进行机器学习以预测自由能差异。

Molecular Dynamics Fingerprints (MDFP): Machine Learning from MD Data To Predict Free-Energy Differences.

机构信息

Laboratory of Physical Chemistry, ETH Zürich , Vladimir-Prelog-Weg 2, 8093 Zürich, Switzerland.

出版信息

J Chem Inf Model. 2017 Apr 24;57(4):726-741. doi: 10.1021/acs.jcim.6b00778. Epub 2017 Apr 12.

DOI:10.1021/acs.jcim.6b00778
PMID:28368113
Abstract

While the use of machine-learning (ML) techniques is well established in cheminformatics for the prediction of physicochemical properties and binding affinities, the training of ML models based on data from molecular dynamics (MD) simulations remains largely unexplored. Here, we present a fingerprint termed MDFP which is constructed from the distributions of properties such as potential-energy components, radius of gyration, and solvent-accessible surface area extracted from MD simulations. The corresponding fingerprint elements are the first two statistical moments of the distributions and the median. By considering not only the average but also the spread of the distribution in the fingerprint, some degree of entropic information is encoded. Short MD simulations of the molecules in water (and in vacuum) are used to generate MDFP. These are further combined with simple counts based on the 2D structure of the molecules into MDFP+. The resulting information-rich MDFP+ is used to train ML models for the prediction of solvation free energies in five different solvents (water, octanol, chloroform, hexadecane, and cyclohexane) as well as partition coefficients in octanol/water, hexadecane/water, and cyclohexane/water. The approach is easy to implement and computationally relatively inexpensive. Yet, it performs similarly well compared to more rigorous MD-based free-energy methods such as free-energy perturbation (FEP) as well as end-state methods such as linear interaction energy (LIE), the conductor-like screening model for realistic solvation (COSMO-RS), and the SMx family of solvation models.

摘要

虽然机器学习 (ML) 技术在化学信息学中被广泛应用于预测物理化学性质和结合亲和力,但基于分子动力学 (MD) 模拟数据训练 ML 模型的方法仍在很大程度上未被探索。在这里,我们提出了一种指纹,称为 MDFP,它是由从 MD 模拟中提取的性质分布构建的,例如势能分量、回转半径和溶剂可及表面积。相应的指纹元素是分布的前两个统计矩和中位数。通过不仅考虑指纹中分布的平均值,而且还考虑分布的分散程度,可以编码一定程度的熵信息。在水中(和真空中)对分子进行短 MD 模拟,以生成 MDFP。然后将这些模拟与基于分子二维结构的简单计数相结合,形成 MDFP+。由此产生的信息丰富的 MDFP+ 被用于训练 ML 模型,以预测五种不同溶剂(水、辛醇、氯仿、十六烷和环己烷)中的溶剂化自由能以及辛醇/水、十六烷/水和环己烷/水中的分配系数。该方法易于实现,计算成本相对较低。然而,与更严格的基于 MD 的自由能方法(如自由能微扰 (FEP))以及终态方法(如线性相互作用能 (LIE))、用于实际溶剂化的导体相似性屏蔽模型 (COSMO-RS) 和 SMx 家族的溶剂化模型相比,它的性能相当好。

相似文献

1
Molecular Dynamics Fingerprints (MDFP): Machine Learning from MD Data To Predict Free-Energy Differences.分子动力学指纹 (MDFP):从 MD 数据中进行机器学习以预测自由能差异。
J Chem Inf Model. 2017 Apr 24;57(4):726-741. doi: 10.1021/acs.jcim.6b00778. Epub 2017 Apr 12.
2
Use of molecular dynamics fingerprints (MDFPs) in SAMPL6 octanol-water log P blind challenge.利用分子动力学指纹(MDFPs)在 SAMPL6 辛醇-水分配系数盲测挑战中的应用。
J Comput Aided Mol Des. 2020 Apr;34(4):393-403. doi: 10.1007/s10822-019-00252-6. Epub 2019 Nov 19.
3
Prediction of micelle/water and liposome/water partition coefficients based on molecular dynamics simulations, COSMO-RS, and COSMOmic.基于分子动力学模拟、COSMO-RS 和 COSMOmic 预测胶束/水和脂质体/水分配系数。
Langmuir. 2013 Mar 19;29(11):3527-37. doi: 10.1021/la305035b. Epub 2013 Mar 8.
4
Assessing the accuracy and performance of implicit solvent models for drug molecules: conformational ensemble approaches.评估药物分子隐溶剂模型的准确性和性能:构象系综方法。
J Phys Chem B. 2013 May 16;117(19):5950-62. doi: 10.1021/jp402117c. Epub 2013 May 6.
5
Combining Molecular Dynamics and Machine Learning to Predict Self-Solvation Free Energies and Limiting Activity Coefficients.结合分子动力学和机器学习预测自溶剂化自由能和极限活度系数。
J Chem Inf Model. 2020 Nov 23;60(11):5319-5330. doi: 10.1021/acs.jcim.0c00479. Epub 2020 Sep 1.
6
Extended solvent-contact model approach to blind SAMPL5 prediction challenge for the distribution coefficients of drug-like molecules.扩展溶剂接触模型方法用于药物类分子分配系数的盲SAMPL5预测挑战
J Comput Aided Mol Des. 2016 Nov;30(11):1019-1033. doi: 10.1007/s10822-016-9928-x. Epub 2016 Jul 23.
7
Direct calculation of 1-octanol-water partition coefficients from adaptive biasing force molecular dynamics simulations.自适应偏置力分子动力学模拟直接计算 1-辛醇-水分配系数。
J Chem Phys. 2012 Jul 7;137(1):014502. doi: 10.1063/1.4730040.
8
Prediction of cyclohexane-water distribution coefficients for the SAMPL5 data set using molecular dynamics simulations with the OPLS-AA force field.使用OPLS-AA力场通过分子动力学模拟预测SAMPL5数据集的环己烷-水分配系数。
J Comput Aided Mol Des. 2016 Nov;30(11):1045-1058. doi: 10.1007/s10822-016-9949-5. Epub 2016 Aug 31.
9
Prediction of cyclohexane-water distribution coefficients with COSMO-RS on the SAMPL5 data set.使用COSMO-RS对SAMPL5数据集进行环己烷-水分配系数的预测。
J Comput Aided Mol Des. 2016 Nov;30(11):959-967. doi: 10.1007/s10822-016-9927-y. Epub 2016 Jul 26.
10
Comparison of two simulation methods to compute solvation free energies and partition coefficients.比较两种模拟方法以计算溶剂化自由能和分配系数。
J Comput Chem. 2013 Feb 5;34(4):284-93. doi: 10.1002/jcc.23127. Epub 2012 Oct 29.

引用本文的文献

1
Spatio-temporal learning from molecular dynamics simulations for protein-ligand binding affinity prediction.基于分子动力学模拟的时空学习用于蛋白质-配体结合亲和力预测。
Bioinformatics. 2025 Aug 2;41(8). doi: 10.1093/bioinformatics/btaf429.
2
Enhancing Accuracy and Feature Insights in Hydration Free Energy Predictions for Small Molecules with Machine Learning.利用机器学习提高小分子水合自由能预测的准确性和特征洞察
ACS Omega. 2025 Jul 2;10(27):29781-29792. doi: 10.1021/acsomega.5c04249. eCollection 2025 Jul 15.
3
Neural Network Models for Prediction of Biological Activity using Molecular Dynamics Data: A Case of Photoswitchable Peptides.
使用分子动力学数据预测生物活性的神经网络模型:以光开关肽为例。
Mol Inform. 2025 Jul;44(7):e70001. doi: 10.1002/minf.70001.
4
Machine learning and data-driven methods in computational surface and interface science.计算表面与界面科学中的机器学习和数据驱动方法。
NPJ Comput Mater. 2025;11(1):196. doi: 10.1038/s41524-025-01691-6. Epub 2025 Jul 1.
5
Perspectives on Molecular Mechanisms of Hydrate Formation and Growth at Interfaces: A Mini-Review.界面水合物形成与生长的分子机制研究进展:一篇综述短文
Energy Fuels. 2025 May 8;39(22):10150-10164. doi: 10.1021/acs.energyfuels.5c00942. eCollection 2025 Jun 5.
6
Nearl: extracting dynamic features from molecular dynamics trajectories for machine learning tasks.Nearl:从分子动力学轨迹中提取用于机器学习任务的动态特征。
Bioinformatics. 2025 Jul 1;41(7). doi: 10.1093/bioinformatics/btaf321.
7
Benchmarking the robustness of the correct identification of flexible 3D objects using common machine learning models.使用常见机器学习模型对灵活3D物体正确识别的稳健性进行基准测试。
Patterns (N Y). 2025 Jan 10;6(1):101147. doi: 10.1016/j.patter.2024.101147.
8
MDFit: automated molecular simulations workflow enables high throughput assessment of ligands-protein dynamics.MDFit:自动化分子模拟工作流程可实现配体-蛋白质动力学的高通量评估。
J Comput Aided Mol Des. 2024 Jul 17;38(1):24. doi: 10.1007/s10822-024-00564-2.
9
POxload: Machine Learning Estimates Drug Loadings of Polymeric Micelles.POxload:用于估算聚合物胶束药物载量的机器学习方法。
Mol Pharm. 2024 Jul 1;21(7):3356-3374. doi: 10.1021/acs.molpharmaceut.4c00086. Epub 2024 May 28.
10
Machine Learning Integrating Protein Structure, Sequence, and Dynamics to Predict the Enzyme Activity of Bovine Enterokinase Variants.机器学习整合蛋白质结构、序列和动力学预测牛肠激酶变体的酶活性。
J Chem Inf Model. 2024 Apr 8;64(7):2681-2694. doi: 10.1021/acs.jcim.3c00999. Epub 2024 Feb 22.