• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于发现生物大分子中亚稳态集合的深度自动编码器框架。

A deep autoencoder framework for discovery of metastable ensembles in biomacromolecules.

机构信息

Tata Institute of Fundamental Research, Center for Interdisciplinary Sciences, Hyderabad 500046, India.

出版信息

J Chem Phys. 2021 Sep 21;155(11):114106. doi: 10.1063/5.0059965.

DOI:10.1063/5.0059965
PMID:34551528
Abstract

Biomacromolecules manifest dynamic conformational fluctuation and involve mutual interconversion among metastable states. A robust mapping of their conformational landscape often requires the low-dimensional projection of the conformational ensemble along optimized collective variables (CVs). However, the traditional choice for the CV is often limited by user-intuition and prior knowledge about the system, and this lacks a rigorous assessment of their optimality over other candidate CVs. To address this issue, we propose an approach in which we first choose the possible combinations of inter-residue Cα-distances within a given macromolecule as a set of input CVs. Subsequently, we derive a non-linear combination of latent space embedded CVs via auto-encoding the unbiased molecular dynamics simulation trajectories within the framework of the feed-forward neural network. We demonstrate the ability of the derived latent space variables in elucidating the conformational landscape in four hierarchically complex systems. The latent space CVs identify key metastable states of a bead-in-a-spring polymer. The combination of the adopted dimensional reduction technique with a Markov state model, built on the derived latent space, reveals multiple spatially and kinetically well-resolved metastable conformations for GB1 β-hairpin. A quantitative comparison based on the variational approach-based scoring of the auto-encoder-derived latent space CVs with the ones obtained via independent component analysis (principal component analysis or time-structured independent component analysis) confirms the optimality of the former. As a practical application, the auto-encoder-derived CVs were found to predict the reinforced folding of a Trp-cage mini-protein in aqueous osmolyte solution. Finally, the protocol was able to decipher the conformational heterogeneities involved in a complex metalloenzyme, namely, cytochrome P450.

摘要

生物大分子表现出动态的构象波动,并涉及亚稳态之间的相互转换。通常需要沿着优化的集体变量(CV)对构象系综进行低维投影,才能对其构象景观进行稳健映射。然而,传统的 CV 选择通常受到用户直觉和对系统的先验知识的限制,并且缺乏对其他候选 CV 进行的最优性的严格评估。为了解决这个问题,我们提出了一种方法,首先选择给定大分子内的残基 Cα-距离的可能组合作为一组输入 CV。随后,我们通过在前馈神经网络框架内对无偏分子动力学模拟轨迹进行自动编码,推导出潜在空间嵌入 CV 的非线性组合。我们在四个层次复杂的系统中展示了推导出的潜在空间变量阐明构象景观的能力。潜在空间 CV 确定了弹簧中珠子聚合物的关键亚稳态。采用降维技术与基于推导出的潜在空间构建的马尔可夫状态模型相结合,揭示了 GB1 β-发夹的多个空间和动力学上分辨率良好的亚稳态构象。基于变分方法的自动编码器推导的潜在空间 CV 的得分与通过独立成分分析(主成分分析或时间结构独立成分分析)获得的得分的定量比较证实了前者的最优性。作为实际应用,发现自动编码器推导的 CV 可以预测色氨酸笼 mini 蛋白在水渗透剂溶液中的增强折叠。最后,该方案能够解析涉及复杂金属酶即细胞色素 P450 的构象异质性。

相似文献

1
A deep autoencoder framework for discovery of metastable ensembles in biomacromolecules.用于发现生物大分子中亚稳态集合的深度自动编码器框架。
J Chem Phys. 2021 Sep 21;155(11):114106. doi: 10.1063/5.0059965.
2
Assessment and optimization of collective variables for protein conformational landscape: GB1 -hairpin as a case study.评估和优化蛋白质构象景观的集体变量:以 GB1-发夹为例。
J Chem Phys. 2018 Sep 7;149(9):094101. doi: 10.1063/1.5041073.
3
An Efficient Path Classification Algorithm Based on Variational Autoencoder to Identify Metastable Path Channels for Complex Conformational Changes.一种基于变分自编码器的高效路径分类算法,用于识别复杂构象变化的亚稳路径通道。
J Chem Theory Comput. 2023 Jul 25;19(14):4728-4742. doi: 10.1021/acs.jctc.3c00318. Epub 2023 Jun 29.
4
A deep encoder-decoder framework for identifying distinct ligand binding pathways.一种用于识别独特配体结合途径的深度编解码器框架。
J Chem Phys. 2023 May 21;158(19). doi: 10.1063/5.0145197.
5
Discovering Collective Variables of Molecular Transitions via Genetic Algorithms and Neural Networks.通过遗传算法和神经网络发现分子转变的集体变量。
J Chem Theory Comput. 2021 Apr 13;17(4):2294-2306. doi: 10.1021/acs.jctc.0c00981. Epub 2021 Mar 4.
6
Automatic discovery of metastable states for the construction of Markov models of macromolecular conformational dynamics.用于构建大分子构象动力学马尔可夫模型的亚稳态自动发现
J Chem Phys. 2007 Apr 21;126(15):155101. doi: 10.1063/1.2714538.
7
Automated collective variable discovery for MFSD2A transporter from molecular dynamics simulations.基于分子动力学模拟的 MFSD2A 转运蛋白的自动集体变量发现。
Biophys J. 2024 Sep 3;123(17):2934-2955. doi: 10.1016/j.bpj.2024.06.024. Epub 2024 Jun 25.
8
Computational methods for exploring protein conformations.计算方法探索蛋白质构象。
Biochem Soc Trans. 2020 Aug 28;48(4):1707-1724. doi: 10.1042/BST20200193.
9
LAST: Latent Space-Assisted Adaptive Sampling for Protein Trajectories.LAST:用于蛋白质轨迹的潜在空间辅助自适应采样。
J Chem Inf Model. 2023 Jan 9;63(1):67-75. doi: 10.1021/acs.jcim.2c01213. Epub 2022 Dec 6.
10
Variational embedding of protein folding simulations using Gaussian mixture variational autoencoders.使用高斯混合变分自动编码器对蛋白质折叠模拟进行变分嵌入。
J Chem Phys. 2021 Nov 21;155(19):194108. doi: 10.1063/5.0069708.

引用本文的文献

1
Structural Plasticity and Functional Dynamics of Pigeon Cryptochrome 4 as Avian Magnetoreceptor.作为鸟类磁受体的鸽子隐花色素4的结构可塑性与功能动力学
J Mol Biol. 2025 May 27:169233. doi: 10.1016/j.jmb.2025.169233.
2
Accurate prediction of the kinetic sequence of physicochemical states using generative artificial intelligence.使用生成式人工智能准确预测物理化学状态的动力学序列。
Chem Sci. 2025 Apr 10. doi: 10.1039/d5sc00108k.
3
Machine Learning-Based Process Optimization in Biopolymer Manufacturing: A Review.基于机器学习的生物聚合物制造过程优化:综述
Polymers (Basel). 2024 Nov 29;16(23):3368. doi: 10.3390/polym16233368.
4
Machine learning approaches for biomolecular, biophysical, and biomaterials research.用于生物分子、生物物理和生物材料研究的机器学习方法。
Biophys Rev (Melville). 2022 Jun 3;3(2):021306. doi: 10.1063/5.0082179. eCollection 2022 Jun.
5
Chasing collective variables using temporal data-driven strategies.使用时间数据驱动策略追踪集体变量。
QRB Discov. 2023 Jan 6;4:e2. doi: 10.1017/qrd.2022.23. eCollection 2023.
6
Long-time-step molecular dynamics can retard simulation of protein-ligand recognition process.长时间步分子动力学可能会延缓蛋白质-配体识别过程的模拟。
Biophys J. 2023 Mar 7;122(5):802-816. doi: 10.1016/j.bpj.2023.01.036. Epub 2023 Feb 1.
7
LAST: Latent Space-Assisted Adaptive Sampling for Protein Trajectories.LAST:用于蛋白质轨迹的潜在空间辅助自适应采样。
J Chem Inf Model. 2023 Jan 9;63(1):67-75. doi: 10.1021/acs.jcim.2c01213. Epub 2022 Dec 6.