利用细粒度力指标改进分子动力学模拟中的机器学习力场。

Improving machine learning force fields for molecular dynamics simulations with fine-grained force metrics.

机构信息

Microsoft Research AI4Science, Beijing 100084, China.

College of Chemistry and Molecular Engineering, Peking University, Beijing 100871, China.

出版信息

J Chem Phys. 2023 Jul 21;159(3). doi: 10.1063/5.0147023.

DOI:10.1063/5.0147023

PMID:37458355

Abstract

Machine learning force fields (MLFFs) have gained popularity in recent years as they provide a cost-effective alternative to ab initio molecular dynamics (MD) simulations. Despite a small error on the test set, MLFFs inherently suffer from generalization and robustness issues during MD simulations. To alleviate these issues, we propose global force metrics and fine-grained metrics from element and conformation aspects to systematically measure MLFFs for every atom and every conformation of molecules. We selected three state-of-the-art MLFFs (ET, NequIP, and ViSNet) and comprehensively evaluated on aspirin, Ac-Ala3-NHMe, and Chignolin MD datasets with the number of atoms ranging from 21 to 166. Driven by the trained MLFFs on these molecules, we performed MD simulations from different initial conformations, analyzed the relationship between the force metrics and the stability of simulation trajectories, and investigated the reason for collapsed simulations. Finally, the performance of MLFFs and the stability of MD simulations can be further improved guided by the proposed force metrics for model training, specifically training MLFF models with these force metrics as loss functions, fine-tuning by reweighting samples in the original dataset, and continued training by recruiting additional unexplored data.

摘要

机器学习力场（MLFFs）近年来越来越受欢迎，因为它们为从头分子动力学（MD）模拟提供了一种具有成本效益的替代方案。尽管在测试集上存在较小的误差，但 MLFF 在 MD 模拟过程中固有地存在泛化和稳健性问题。为了缓解这些问题，我们从元素和构象方面提出了全局力指标和细粒度指标，以系统地测量每个原子和每个分子构象的 MLFF。我们选择了三种最先进的 MLFF（ET、NequIP 和 ViSNet），并在原子数从 21 到 166 的阿司匹林、Ac-Ala3-NHMe 和 Chignolin MD 数据集上进行了全面评估。受这些分子上训练有素的 MLFF 的驱动，我们从不同的初始构象进行了 MD 模拟，分析了力指标与模拟轨迹稳定性之间的关系，并研究了模拟崩溃的原因。最后，通过提出的力指标指导模型训练，可以进一步提高 MLFF 的性能和 MD 模拟的稳定性，特别是将这些力指标用作损失函数来训练 MLFF 模型，通过重新加权原始数据集中的样本进行微调，以及通过招募额外的未探索数据进行持续训练。

相似文献

Improving machine learning force fields for molecular dynamics simulations with fine-grained force metrics.利用细粒度力指标改进分子动力学模拟中的机器学习力场。

J Chem Phys. 2023 Jul 21;159(3). doi: 10.1063/5.0147023.

The emergence of machine learning force fields in drug design.机器学习力场在药物设计中的应用

Med Res Rev. 2024 May;44(3):1147-1182. doi: 10.1002/med.22008. Epub 2024 Jan 3.

Efficient interatomic descriptors for accurate machine learning force fields of extended molecules.高效的原子间描述符，用于准确的机器学习扩展分子力场。

Nat Commun. 2023 Jun 15;14(1):3562. doi: 10.1038/s41467-023-39214-w.

A Euclidean transformer for fast and stable machine learned force fields.一种用于快速稳定机器学习力场的欧几里得变换器。

Nat Commun. 2024 Aug 6;15(1):6539. doi: 10.1038/s41467-024-50620-6.

Force Field Analysis Software and Tools (FFAST): Assessing Machine Learning Force Fields under the Microscope.力场分析软件与工具（FFAST）：在微观层面评估机器学习力场

J Chem Theory Comput. 2023 Dec 12;19(23):8706-8717. doi: 10.1021/acs.jctc.3c00985. Epub 2023 Nov 27.

A Set of Moment Tensor Potentials for Zirconium with Increasing Complexity.一组复杂度递增的锆的矩张量势

J Chem Theory Comput. 2023 Oct 10;19(19):6848-6856. doi: 10.1021/acs.jctc.3c00488. Epub 2023 Sep 12.

AIMD-Chig: Exploring the conformational space of a 166-atom protein Chignolin with ab initio molecular dynamics.AIMD-Chig：运用从头算分子动力学探索 166 原子蛋白 Chignolin 的构象空间。

Sci Data. 2023 Aug 22;10(1):549. doi: 10.1038/s41597-023-02465-9.

Neural network potential from bispectrum components: A case study on crystalline silicon.基于双谱分量的神经网络势：以多晶硅为例的研究

J Chem Phys. 2020 Aug 7;153(5):054118. doi: 10.1063/5.0014677.

BIGDML-Towards accurate quantum machine learning force fields for materials.BIGDML——迈向精确的材料量子机器学习力场

Nat Commun. 2022 Jun 29;13(1):3733. doi: 10.1038/s41467-022-31093-x.

Top-Down Machine Learning of Coarse-Grained Protein Force Fields.从头开始学习粗粒度蛋白质力场。

J Chem Theory Comput. 2023 Nov 14;19(21):7518-7526. doi: 10.1021/acs.jctc.3c00638. Epub 2023 Oct 24.

引用本文的文献

Scaling Graph Neural Networks to Large Proteins.将图神经网络扩展至大型蛋白质

J Chem Theory Comput. 2025 Feb 25;21(4):2055-2066. doi: 10.1021/acs.jctc.4c01420. Epub 2025 Feb 6.

Crash testing machine learning force fields for molecules, materials, and interfaces: molecular dynamics in the TEA challenge 2023.用于分子、材料和界面的碰撞测试机器学习力场：2023年TEA挑战赛中的分子动力学

Chem Sci. 2025 Feb 3;16(8):3738-3754. doi: 10.1039/d4sc06530a. eCollection 2025 Feb 19.

Analyzing Atomic Interactions in Molecules as Learned by Neural Networks.分析神经网络所学习到的分子中的原子相互作用。

J Chem Theory Comput. 2025 Jan 28;21(2):714-729. doi: 10.1021/acs.jctc.4c01424. Epub 2025 Jan 10.

Ab initio characterization of protein molecular dynamics with AIBMD.使用 AIBMD 进行蛋白质分子动力学的从头分析。

Nature. 2024 Nov;635(8040):1019-1027. doi: 10.1038/s41586-024-08127-z. Epub 2024 Nov 6.

A Euclidean transformer for fast and stable machine learned force fields.一种用于快速稳定机器学习力场的欧几里得变换器。

Nat Commun. 2024 Aug 6;15(1):6539. doi: 10.1038/s41467-024-50620-6.

Enhancing geometric representations for molecules with equivariant vector-scalar interactive message passing.通过等变向量-标量交互消息传递增强分子的几何表示。

Nat Commun. 2024 Jan 5;15(1):313. doi: 10.1038/s41467-023-43720-2.

Unsupervised deep learning for molecular dynamics simulations: a novel analysis of protein-ligand interactions in SARS-CoV-2 M.用于分子动力学模拟的无监督深度学习：对严重急性呼吸综合征冠状病毒2（SARS-CoV-2）中蛋白质-配体相互作用的新分析

RSC Adv. 2023 Nov 22;13(48):34249-34261. doi: 10.1039/d3ra06375e. eCollection 2023 Nov 16.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

利用细粒度力指标改进分子动力学模拟中的机器学习力场。

Improving machine learning force fields for molecular dynamics simulations with fine-grained force metrics.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献