• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过遗传算法和神经网络发现分子转变的集体变量。

Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks.

机构信息

Van 't Hoff Institute for Molecular Sciences, AI4Science Laboratory, and Amsterdam Center for Multiscale Modeling, University of Amsterdam, Science Park 904, 1098 XH Amsterdam, The Netherlands.

出版信息

J Chem Theory Comput. 2021 Apr 13;17(4):2294-2306. doi: 10.1021/acs.jctc.0c00981. Epub 2021 Mar 4.

DOI:10.1021/acs.jctc.0c00981
PMID:33662202
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8047796/
Abstract

With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson-Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data.

摘要

随着计算硬件和算法的不断改进,模拟已成为理解各种(生物)分子过程的强大工具。为了处理大型模拟数据集并加速缓慢的激活跃迁,需要一组浓缩的描述符或集体变量(CVs),以辨别描述感兴趣的分子过程的相关动力学。然而,提出能够捕捉分子跃迁内在反应坐标的合适 CV 集通常非常困难。在这里,我们提出了一种使用人工神经网络和遗传算法组合从候选者中找到最佳 CV 集的框架。该方法有效地用基因替换自动编码器网络的编码器来表示潜在空间,即 CVs。给定一组 CV 作为输入,网络被训练以恢复 CV 值沿跃迁的点处原子坐标。网络性能用作输入 CV 适应度的估计器。两种遗传算法优化 CV 选择和神经网络架构。通过这个框架成功检索到最佳 CV 来说明两个案例研究:丙氨酸二肽分子中众所周知的构象变化和 B-DNA 中碱基对从经典的 Watson-Crick 配对到替代的 Hoogsteen 配对的更复杂的转变。我们框架的主要优点包括以下几点:最佳可解释的 CVs,避免计算关键函数或时间相关函数的成本,以及自动超参数优化。此外,我们还表明,在网络输入和输出之间施加时间延迟可以增强对慢变量的选择。此外,该网络还可用于生成未探索微观状态的分子构型,例如,用于扩充模拟数据。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/210e1b700e01/ct0c00981_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/429682a9ff24/ct0c00981_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/71da16b44ed0/ct0c00981_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/520160aa726a/ct0c00981_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/ea04379c056c/ct0c00981_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/8b4c5de51fa1/ct0c00981_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/210e1b700e01/ct0c00981_0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/429682a9ff24/ct0c00981_0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/71da16b44ed0/ct0c00981_0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/520160aa726a/ct0c00981_0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/ea04379c056c/ct0c00981_0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/8b4c5de51fa1/ct0c00981_0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36ce/8047796/210e1b700e01/ct0c00981_0006.jpg

相似文献

1
Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks.通过遗传算法和神经网络发现分子转变的集体变量。
J Chem Theory Comput. 2021 Apr 13;17(4):2294-2306. doi: 10.1021/acs.jctc.0c00981. Epub 2021 Mar 4.
2
Discovering Reaction Pathways, Slow Variables, and Committor Probabilities with Machine Learning.利用机器学习发现反应路径、慢变量和反应坐标概率
J Chem Theory Comput. 2023 Jul 25;19(14):4414-4426. doi: 10.1021/acs.jctc.3c00028. Epub 2023 May 24.
3
An exploration of machine learning models for the determination of reaction coordinates associated with conformational transitions.探索用于确定与构象转变相关的反应坐标的机器学习模型。
J Chem Phys. 2023 Jul 21;159(3). doi: 10.1063/5.0147597.
4
The Adaptive Path Collective Variable: A Versatile Biasing Approach to Compute the Average Transition Path and Free Energy of Molecular Transitions.自适应路径集体变量:一种计算分子转变的平均过渡路径和自由能的通用偏置方法。
Methods Mol Biol. 2019;2022:255-290. doi: 10.1007/978-1-4939-9608-7_11.
5
Explaining reaction coordinates of alanine dipeptide isomerization obtained from deep neural networks using Explainable Artificial Intelligence (XAI).使用可解释人工智能(XAI)解释从深度神经网络获得的丙氨酸二肽异构化反应坐标。
J Chem Phys. 2022 Apr 21;156(15):154108. doi: 10.1063/5.0087310.
6
DeepCV: A Deep Learning Framework for Blind Search of Collective Variables in Expanded Configurational Space.DeepCV:用于在扩展构型空间中进行集体变量盲目搜索的深度学习框架。
J Chem Inf Model. 2022 Dec 26;62(24):6352-6364. doi: 10.1021/acs.jcim.2c00883. Epub 2022 Nov 29.
7
A deep autoencoder framework for discovery of metastable ensembles in biomacromolecules.用于发现生物大分子中亚稳态集合的深度自动编码器框架。
J Chem Phys. 2021 Sep 21;155(11):114106. doi: 10.1063/5.0059965.
8
MLCV: Bridging Machine-Learning-Based Dimensionality Reduction and Free-Energy Calculation.MLCV:基于机器学习的降维和自由能计算的桥梁。
J Chem Inf Model. 2022 Jan 10;62(1):1-8. doi: 10.1021/acs.jcim.1c01010. Epub 2021 Dec 23.
9
Finding transition pathways using the string method with swarms of trajectories.使用带有轨迹群的弦方法寻找过渡路径。
J Phys Chem B. 2008 Mar 20;112(11):3432-40. doi: 10.1021/jp0777059. Epub 2008 Feb 22.
10
Modeling and enhanced sampling of molecular systems with smooth and nonlinear data-driven collective variables.基于平滑和非线性数据驱动的整体变量对分子系统进行建模和增强采样。
J Chem Phys. 2013 Dec 7;139(21):214101. doi: 10.1063/1.4830403.

引用本文的文献

1
DeepPath: Overcoming data scarcity for protein transition pathway prediction using physics-based deep learning.深度路径:利用基于物理学的深度学习克服蛋白质转变途径预测中的数据稀缺问题。
bioRxiv. 2025 Mar 2:2025.02.27.640693. doi: 10.1101/2025.02.27.640693.
2
Reaction Coordinates Are Optimal Channels of Energy Flow.反应坐标是能量流动的最优通道。
Annu Rev Phys Chem. 2025 Apr;76(1):153-179. doi: 10.1146/annurev-physchem-082423-010652. Epub 2025 Feb 4.
3
Correlating enzymatic reactivity for different substrates using transferable data-driven collective variables.

本文引用的文献

1
Learning reaction coordinates via cross-entropy minimization: Application to alanine dipeptide.通过交叉熵最小化学习反应坐标:在丙氨酸二肽中的应用。
J Chem Phys. 2020 Aug 7;153(5):054115. doi: 10.1063/5.0009066.
2
Neural-Network-Based Path Collective Variables for Enhanced Sampling of Phase Transformations.基于神经网络的路径集体变量增强相转变的抽样。
Phys Rev Lett. 2019 Dec 13;123(24):245701. doi: 10.1103/PhysRevLett.123.245701.
3
Atomistic insight into the kinetic pathways for Watson-Crick to Hoogsteen transitions in DNA.原子水平上对 DNA 中 Watson-Crick 到 Hoogsteen 转变的动力学途径的深入了解。
利用可转移的数据驱动的集体变量来关联不同底物的酶反应性。
Proc Natl Acad Sci U S A. 2024 Dec 3;121(49):e2416621121. doi: 10.1073/pnas.2416621121. Epub 2024 Nov 26.
4
Effective data-driven collective variables for free energy calculations from metadynamics of paths.用于基于路径元动力学进行自由能计算的有效数据驱动集体变量。
PNAS Nexus. 2024 Apr 12;3(4):pgae159. doi: 10.1093/pnasnexus/pgae159. eCollection 2024 Apr.
5
Enhanced Sampling with Machine Learning.利用机器学习的增强采样
Annu Rev Phys Chem. 2024 Jun;75(1):347-370. doi: 10.1146/annurev-physchem-083122-125941. Epub 2024 Jun 14.
6
Approximating Projections of Conformational Boltzmann Distributions with AlphaFold2 Predictions: Opportunities and Limitations.用 AlphaFold2 预测来逼近构象 Boltzmann 分布的投影:机遇与局限。
J Chem Theory Comput. 2024 Feb 13;20(3):1434-1447. doi: 10.1021/acs.jctc.3c01081. Epub 2024 Jan 12.
7
Exploring the hypolipidemic effects of bergenin from Saxifraga melanocentra Franch: mechanistic insights and potential for hyperlipidemia treatment.探讨岩白菜素对高脂血症的作用及其机制:岩白菜素治疗高脂血症的潜力。
Lipids Health Dis. 2023 Nov 24;22(1):203. doi: 10.1186/s12944-023-01973-2.
8
OneOPES, a Combined Enhanced Sampling Method to Rule Them All.OneOPES,一种一统天下的组合增强采样方法。
J Chem Theory Comput. 2023 Sep 12;19(17):5731-5742. doi: 10.1021/acs.jctc.3c00254. Epub 2023 Aug 21.
9
Open-Source Machine Learning in Computational Chemistry.开源机器学习在计算化学中的应用。
J Chem Inf Model. 2023 Aug 14;63(15):4505-4532. doi: 10.1021/acs.jcim.3c00643. Epub 2023 Jul 19.
10
Discovering Reaction Pathways, Slow Variables, and Committor Probabilities with Machine Learning.利用机器学习发现反应路径、慢变量和反应坐标概率
J Chem Theory Comput. 2023 Jul 25;19(14):4414-4426. doi: 10.1021/acs.jctc.3c00028. Epub 2023 May 24.
Nucleic Acids Res. 2019 Dec 2;47(21):11069-11076. doi: 10.1093/nar/gkz837.
4
Promoting transparency and reproducibility in enhanced molecular simulations.提高增强分子模拟中的透明度和可重复性。
Nat Methods. 2019 Aug;16(8):670-673. doi: 10.1038/s41592-019-0506-8.
5
Predictive collective variable discovery with deep Bayesian models.基于深度贝叶斯模型的预测性协变量发现。
J Chem Phys. 2019 Jan 14;150(2):024109. doi: 10.1063/1.5058063.
6
Folding a small protein using harmonic linear discriminant analysis.利用调和线性判别分析折叠小蛋白。
J Chem Phys. 2018 Nov 21;149(19):194113. doi: 10.1063/1.5053566.
7
Molecular enhanced sampling with autoencoders: On-the-fly collective variable discovery and accelerated free energy landscape exploration.基于自动编码器的分子增强采样:在线共变异构体发现和自由能景观加速探索。
J Comput Chem. 2018 Sep 30;39(25):2079-2102. doi: 10.1002/jcc.25520. Epub 2018 Oct 14.
8
OpenPathSampling: A Python Framework for Path Sampling Simulations. 2. Building and Customizing Path Ensembles and Sample Schemes.OpenPathSampling:用于路径采样模拟的 Python 框架。2. 构建和定制路径集合和采样方案。
J Chem Theory Comput. 2019 Feb 12;15(2):837-856. doi: 10.1021/acs.jctc.8b00627. Epub 2018 Dec 31.
9
OpenPathSampling: A Python Framework for Path Sampling Simulations. 1. Basics.OpenPathSampling:用于路径采样模拟的 Python 框架。1. 基础。
J Chem Theory Comput. 2019 Feb 12;15(2):813-836. doi: 10.1021/acs.jctc.8b00626. Epub 2018 Dec 31.
10
Reweighted autoencoded variational Bayes for enhanced sampling (RAVE).重加权自动编码变分贝叶斯增强采样(RAVE)。
J Chem Phys. 2018 Aug 21;149(7):072301. doi: 10.1063/1.5025487.