分子动力学中的维度降低的自动编码器：集体变量维度、偏差和过渡态。

Autoencoders for dimensionality reduction in molecular dynamics: Collective variable dimension, biasing, and transition states.

机构信息

Integrated Drug Discovery, Molecular Design Sciences, Sanofi, Vitry-sur-Seine, France.

CERMICS, Ecole des Ponts, Marne-la-Vallée, France.

出版信息

J Chem Phys. 2023 Jul 14;159(2). doi: 10.1063/5.0151053.

DOI:10.1063/5.0151053

PMID:37431908

Abstract

The heat shock protein 90 (Hsp90) is a molecular chaperone that controls the folding and activation of client proteins using the free energy of ATP hydrolysis. The Hsp90 active site is in its N-terminal domain (NTD). Our goal is to characterize the dynamics of NTD using an autoencoder-learned collective variable (CV) in conjunction with adaptive biasing force Langevin dynamics. Using dihedral analysis, we cluster all available experimental Hsp90 NTD structures into distinct native states. We then perform unbiased molecular dynamics (MD) simulations to construct a dataset that represents each state and use this dataset to train an autoencoder. Two autoencoder architectures are considered, with one and two hidden layers, respectively, and bottlenecks of dimension k ranging from 1 to 10. We demonstrate that the addition of an extra hidden layer does not significantly improve the performance, while it leads to complicated CVs that increase the computational cost of biased MD calculations. In addition, a two-dimensional (2D) bottleneck can provide enough information of the different states, while the optimal bottleneck dimension is five. For the 2D bottleneck, the 2D CV is directly used in biased MD simulations. For the five-dimensional (5D) bottleneck, we perform an analysis of the latent CV space and identify the pair of CV coordinates that best separates the states of Hsp90. Interestingly, selecting a 2D CV out of the 5D CV space leads to better results than directly learning a 2D CV and allows observation of transitions between native states when running free energy biased dynamics.

摘要

热休克蛋白 90（Hsp90）是一种分子伴侣，它利用 ATP 水解的自由能控制客户蛋白的折叠和激活。Hsp90 的活性部位位于其 N 端结构域（NTD）。我们的目标是使用自动编码器学习的集体变量（CV）结合自适应偏置力拉氏动力学来描述 NTD 的动力学。通过二面角分析，我们将所有可用的实验 Hsp90 NTD 结构聚类为不同的天然状态。然后，我们进行无偏分子动力学（MD）模拟，构建一个代表每个状态的数据集，并使用该数据集训练自动编码器。考虑了两种自动编码器架构，分别具有一个和两个隐藏层，以及从 1 到 10 的瓶颈维度 k。我们证明，添加额外的隐藏层并不会显著提高性能，而会导致复杂的 CV，从而增加有偏 MD 计算的计算成本。此外，二维（2D）瓶颈可以提供足够的不同状态信息，而最佳的瓶颈维度为五。对于 2D 瓶颈，直接在有偏 MD 模拟中使用 2D CV。对于五维（5D）瓶颈，我们对潜在 CV 空间进行分析，并确定最佳分离 Hsp90 状态的 CV 坐标对。有趣的是，从 5D CV 空间中选择 2D CV 比直接学习 2D CV 会产生更好的结果，并允许在运行自由能有偏动力学时观察到天然状态之间的转变。

相似文献

Autoencoders for dimensionality reduction in molecular dynamics: Collective variable dimension, biasing, and transition states.分子动力学中的维度降低的自动编码器：集体变量维度、偏差和过渡态。

J Chem Phys. 2023 Jul 14;159(2). doi: 10.1063/5.0151053.

Chasing Collective Variables Using Autoencoders and Biased Trajectories.使用自动编码器和有偏轨迹追踪集体变量。

J Chem Theory Comput. 2022 Jan 11;18(1):59-78. doi: 10.1021/acs.jctc.1c00415. Epub 2021 Dec 29.

The Adaptive Path Collective Variable: A Versatile Biasing Approach to Compute the Average Transition Path and Free Energy of Molecular Transitions.自适应路径集体变量：一种计算分子转变的平均过渡路径和自由能的通用偏置方法。

Methods Mol Biol. 2019;2022:255-290. doi: 10.1007/978-1-4939-9608-7_11.

Molecular enhanced sampling with autoencoders: On-the-fly collective variable discovery and accelerated free energy landscape exploration.基于自动编码器的分子增强采样：在线共变异构体发现和自由能景观加速探索。

J Comput Chem. 2018 Sep 30;39(25):2079-2102. doi: 10.1002/jcc.25520. Epub 2018 Oct 14.

Machine Learning-Assisted Discovery of Hidden States in Expanded Free Energy Space.机器学习辅助拓展自由能空间中隐藏状态的发现。

J Phys Chem Lett. 2022 Feb 24;13(7):1797-1805. doi: 10.1021/acs.jpclett.1c04004. Epub 2022 Feb 16.

Automated collective variable discovery for MFSD2A transporter from molecular dynamics simulations.基于分子动力学模拟的 MFSD2A 转运蛋白的自动集体变量发现。

Biophys J. 2024 Sep 3;123(17):2934-2955. doi: 10.1016/j.bpj.2024.06.024. Epub 2024 Jun 25.

Global Dynamics of Yeast Hsp90 Middle and C-Terminal Dimer Studied by Advanced Sampling Simulations.通过高级采样模拟研究酵母Hsp90中末端二聚体的全局动力学

Front Mol Biosci. 2019 Sep 27;6:93. doi: 10.3389/fmolb.2019.00093. eCollection 2019.

Structural Characterization of Human Heat Shock Protein 90 N-Terminal Domain and Its Variants K112R and K112A in Complex with a Potent 1,2,3-Triazole-Based Inhibitor.人热休克蛋白 90 N 端结构域及其变体 K112R 和 K112A 与强效 1,2,3-三唑基抑制剂复合物的结构特征。

Int J Mol Sci. 2022 Aug 21;23(16):9458. doi: 10.3390/ijms23169458.

Perspective: Identification of collective variables and metastable states of protein dynamics.观点：蛋白质动力学的集体变量和亚稳态的识别。

J Chem Phys. 2018 Oct 21;149(15):150901. doi: 10.1063/1.5049637.

Modeling signal propagation mechanisms and ligand-based conformational dynamics of the Hsp90 molecular chaperone full-length dimer.热休克蛋白90（Hsp90）分子伴侣全长二聚体的信号传导机制及基于配体的构象动力学建模

PLoS Comput Biol. 2009 Mar;5(3):e1000323. doi: 10.1371/journal.pcbi.1000323. Epub 2009 Mar 20.

引用本文的文献

Accurate Characterization of Binding Kinetics and Allosteric Mechanisms for the HSP90 Chaperone Inhibitors Using AI-Augmented Integrative Biophysical Studies.使用人工智能增强的综合生物物理研究准确表征HSP90伴侣蛋白抑制剂的结合动力学和变构机制

JACS Au. 2024 Apr 1;4(4):1632-1645. doi: 10.1021/jacsau.4c00123. eCollection 2024 Apr 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

分子动力学中的维度降低的自动编码器：集体变量维度、偏差和过渡态。

Autoencoders for dimensionality reduction in molecular dynamics: Collective variable dimension, biasing, and transition states.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献