• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

运用统计物理学应对人工智能增强分子动力学的陷阱。

Confronting pitfalls of AI-augmented molecular dynamics using statistical physics.

机构信息

NIH Center for Macromolecular Modeling and Bioinformatics, Beckman Institute for Advanced Science and Technology, Department of Biochemistry, Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, Illinois 61801, USA.

Biophysics Program, University of Maryland, College Park, Maryland 20742, USA.

出版信息

J Chem Phys. 2020 Dec 21;153(23):234118. doi: 10.1063/5.0030931.

DOI:10.1063/5.0030931
PMID:33353347
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7863682/
Abstract

Artificial intelligence (AI)-based approaches have had indubitable impact across the sciences through the ability to extract relevant information from raw data. Recently, AI has also found use in enhancing the efficiency of molecular simulations, wherein AI derived slow modes are used to accelerate the simulation in targeted ways. However, while typical fields where AI is used are characterized by a plethora of data, molecular simulations, per construction, suffer from limited sampling and thus limited data. As such, the use of AI in molecular simulations can suffer from a dangerous situation where the AI-optimization could get stuck in spurious regimes, leading to incorrect characterization of the reaction coordinate (RC) for the problem at hand. When such an incorrect RC is then used to perform additional simulations, one could start to deviate progressively from the ground truth. To deal with this problem of spurious AI-solutions, here, we report a novel and automated algorithm using ideas from statistical mechanics. It is based on the notion that a more reliable AI-solution will be one that maximizes the timescale separation between slow and fast processes. To learn this timescale separation even from limited data, we use a maximum caliber-based framework. We show the applicability of this automatic protocol for three classic benchmark problems, namely, the conformational dynamics of a model peptide, ligand-unbinding from a protein, and folding/unfolding energy landscape of the C-terminal domain of protein G. We believe that our work will lead to increased and robust use of trustworthy AI in molecular simulations of complex systems.

摘要

人工智能(AI)技术在各科学领域中的应用已经产生了不可否认的影响,它能够从原始数据中提取相关信息。最近,人工智能也被用于提高分子模拟的效率,其中通过 AI 推导的慢模态来以有针对性的方式加速模拟。然而,虽然 AI 通常应用于数据丰富的领域,但分子模拟由于受到采样限制,其数据有限。因此,在分子模拟中使用 AI 可能会陷入一种危险的情况,即 AI 优化可能会陷入虚假的状态,从而导致对当前问题的反应坐标(RC)的不正确描述。当使用这样一个不正确的 RC 来执行额外的模拟时,就可能会逐渐偏离真实情况。为了解决这个由虚假 AI 解决方案带来的问题,我们在这里报告了一种新颖的自动化算法,该算法借鉴了统计力学的思想。其基本思想是,更可靠的 AI 解决方案将是最大化慢过程和快过程之间时间尺度分离的解决方案。为了即使在有限的数据中学习这种时间尺度分离,我们使用了基于最大口径的框架。我们展示了该自动协议在三个经典基准问题中的适用性,即模型肽的构象动力学、蛋白质上配体的解吸以及蛋白 G 的 C 末端结构域的折叠/去折叠能量景观。我们相信,我们的工作将促进在复杂系统的分子模拟中更广泛、更可靠地使用可信的 AI。

相似文献

1
Confronting pitfalls of AI-augmented molecular dynamics using statistical physics.运用统计物理学应对人工智能增强分子动力学的陷阱。
J Chem Phys. 2020 Dec 21;153(23):234118. doi: 10.1063/5.0030931.
2
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
3
From data to noise to data for mixing physics across temperatures with generative artificial intelligence.从数据到噪声,再到生成式人工智能,跨越温度混合物理的数据。
Proc Natl Acad Sci U S A. 2022 Aug 9;119(32):e2203656119. doi: 10.1073/pnas.2203656119. Epub 2022 Aug 4.
4
Structures, dynamics, complexes, and functions: From classic computation to artificial intelligence.结构、动态、复合物和功能:从经典计算到人工智能。
Curr Opin Struct Biol. 2024 Aug;87:102835. doi: 10.1016/j.sbi.2024.102835. Epub 2024 May 13.
5
Random walks in a free energy landscape combining augmented molecular dynamics simulations with a dynamic graph neural network model.在自由能景观中进行随机游走,结合增强分子动力学模拟和动态图神经网络模型。
J Mol Graph Model. 2022 Jul;114:108199. doi: 10.1016/j.jmgm.2022.108199. Epub 2022 Apr 19.
6
Multi-dimensional spectral gap optimization of order parameters (SGOOP) through conditional probability factorization.通过条件概率分解实现序参量的多维谱隙优化(SGOOP)。
J Chem Phys. 2018 Dec 21;149(23):234105. doi: 10.1063/1.5064856.
7
Modification and optimization of the united-residue (UNRES) potential energy function for canonical simulations. I. Temperature dependence of the effective energy function and tests of the optimization method with single training proteins.用于正则模拟的联合残基(UNRES)势能函数的修改与优化。I. 有效能量函数的温度依赖性及对单一训练蛋白优化方法的测试
J Phys Chem B. 2007 Jan 11;111(1):260-85. doi: 10.1021/jp065380a.
8
Atomistic Peptide Folding Simulations Reveal Interplay of Entropy and Long-Range Interactions in Folding Cooperativity.原子肽折叠模拟揭示了熵和长程相互作用在折叠协同性中的相互作用。
Sci Rep. 2018 Sep 12;8(1):13668. doi: 10.1038/s41598-018-32028-7.
9
Using molecular dynamics simulations to prioritize and understand AI-generated cell penetrating peptides.利用分子动力学模拟对人工智能生成的细胞穿透肽进行优先级排序并加以理解。
Sci Rep. 2021 May 20;11(1):10630. doi: 10.1038/s41598-021-90245-z.
10
Assessment of AI-Based Protein Structure Prediction for the NLRP3 Target.基于人工智能的 NLRP3 靶点蛋白结构预测评估。
Molecules. 2022 Sep 7;27(18):5797. doi: 10.3390/molecules27185797.

引用本文的文献

1
Dissecting Large-Scale Structural Transitions in Membrane Transporters Using Advanced Simulation Technologies.利用先进模拟技术剖析膜转运蛋白中的大规模结构转变
J Phys Chem B. 2025 Apr 17;129(15):3703-3719. doi: 10.1021/acs.jpcb.5c00104. Epub 2025 Mar 18.
2
Calculating Protein-Ligand Residence Times through State Predictive Information Bottleneck Based Enhanced Sampling.通过基于状态预测信息瓶颈的增强采样来计算蛋白配体驻留时间。
J Chem Theory Comput. 2024 Jul 23;20(14):6341-6349. doi: 10.1021/acs.jctc.4c00503. Epub 2024 Jul 11.
3
Calculating Protein-Ligand Residence Times Through State Predictive Information Bottleneck based Enhanced Sampling.通过基于状态预测信息瓶颈的增强采样计算蛋白质-配体停留时间。
bioRxiv. 2024 Apr 20:2024.04.16.589710. doi: 10.1101/2024.04.16.589710.
4
One Descriptor to Fold Them All: Harnessing Intuition and Machine Learning to Identify Transferable Lasso Peptide Reaction Coordinates.一以贯之:利用直觉和机器学习识别可转移的套索肽反应坐标。
J Phys Chem B. 2024 May 2;128(17):4063-4075. doi: 10.1021/acs.jpcb.3c08492. Epub 2024 Apr 3.
5
Collective variable discovery in the age of machine learning: reality, hype and everything in between.机器学习时代的集体变量发现:现实、炒作及其中的一切。
RSC Adv. 2022 Sep 2;12(38):25010-25024. doi: 10.1039/d2ra03660f. eCollection 2022 Aug 30.
6
Committor-Consistent Variational String Method.位形坐标一致变分弦方法。
J Phys Chem Lett. 2022 Oct 13;13(40):9263-9271. doi: 10.1021/acs.jpclett.2c02529. Epub 2022 Sep 29.
7
Accelerating All-Atom Simulations and Gaining Mechanistic Understanding of Biophysical Systems through State Predictive Information Bottleneck.通过状态预测信息瓶颈加速全原子模拟并获得生物物理系统的机制理解。
J Chem Theory Comput. 2022 May 10;18(5):3231-3238. doi: 10.1021/acs.jctc.2c00058. Epub 2022 Apr 6.
8
A Companion Guide to the String Method with Swarms of Trajectories: Characterization, Performance, and Pitfalls.《与轨迹群相关的字符串方法的伴侣指南:特征描述、性能与缺陷》
J Chem Theory Comput. 2022 Mar 8;18(3):1406-1422. doi: 10.1021/acs.jctc.1c01049. Epub 2022 Feb 9.
9
The Middle Science: Traversing Scale In Complex Many-Body Systems.《中间科学:穿越复杂多体系统中的尺度》
ACS Cent Sci. 2021 Aug 25;7(8):1271-1287. doi: 10.1021/acscentsci.1c00685. Epub 2021 Jul 28.
10
Multiscale Reweighted Stochastic Embedding: Deep Learning of Collective Variables for Enhanced Sampling.多尺度重加权随机嵌入:增强采样的集体变量深度学习。
J Phys Chem A. 2021 Jul 22;125(28):6286-6302. doi: 10.1021/acs.jpca.1c02869. Epub 2021 Jul 2.

本文引用的文献

1
Simulating protein-ligand binding with neural network potentials.用神经网络势模拟蛋白质-配体结合
Chem Sci. 2020 Jan 23;11(9):2362-2368. doi: 10.1039/c9sc06017k.
2
Discovering Protein Conformational Flexibility through Artificial-Intelligence-Aided Molecular Dynamics.通过人工智能辅助分子动力学发现蛋白质构象灵活性。
J Phys Chem B. 2020 Sep 24;124(38):8221-8229. doi: 10.1021/acs.jpcb.0c03985. Epub 2020 Sep 9.
3
Gaussian Mixture-Based Enhanced Sampling for Statics and Dynamics.基于高斯混合的静力学和动力学增强采样
J Phys Chem Lett. 2020 Jul 2;11(13):5076-5080. doi: 10.1021/acs.jpclett.0c01125. Epub 2020 Jun 17.
4
Understanding the role of predictive time delay and biased propagator in RAVE.理解预测时间延迟和有偏传播子在 RAVE 中的作用。
J Chem Phys. 2020 Apr 14;152(14):144102. doi: 10.1063/5.0004838.
5
The Maximum Caliber Variational Principle for Nonequilibria.非平衡态的最大口径变分原理。
Annu Rev Phys Chem. 2020 Apr 20;71:213-238. doi: 10.1146/annurev-physchem-071119-040206. Epub 2020 Feb 19.
6
Machine learning approaches for analyzing and enhancing molecular dynamics simulations.机器学习方法在分析和增强分子动力学模拟中的应用。
Curr Opin Struct Biol. 2020 Apr;61:139-145. doi: 10.1016/j.sbi.2019.12.016. Epub 2020 Jan 20.
7
Machine learning for protein folding and dynamics.机器学习在蛋白质折叠和动力学中的应用。
Curr Opin Struct Biol. 2020 Feb;60:77-84. doi: 10.1016/j.sbi.2019.12.005. Epub 2019 Dec 24.
8
Microscopic Characterization of GRP1 PH Domain Interaction with Anionic Membranes.GRP1 PH 结构域与阴离子膜相互作用的微观特征。
J Comput Chem. 2020 Mar 5;41(6):489-499. doi: 10.1002/jcc.26109. Epub 2019 Nov 25.
9
Machine learning transforms how microstates are sampled.机器学习改变了微状态的采样方式。
Science. 2019 Sep 6;365(6457):982-983. doi: 10.1126/science.aay2568.
10
Boltzmann generators: Sampling equilibrium states of many-body systems with deep learning.玻尔兹曼生成器:用深度学习抽样多体系统的平衡态。
Science. 2019 Sep 6;365(6457). doi: 10.1126/science.aaw1147.