• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种最小方差聚类方法可生成稳健且可解释的粗粒度模型。

A Minimum Variance Clustering Approach Produces Robust and Interpretable Coarse-Grained Models.

作者信息

Husic Brooke E, McKiernan Keri A, Wayment-Steele Hannah K, Sultan Mohammad M, Pande Vijay S

机构信息

Department of Chemistry, Stanford University , Stanford, California 94305, United States.

出版信息

J Chem Theory Comput. 2018 Feb 13;14(2):1071-1082. doi: 10.1021/acs.jctc.7b01004. Epub 2018 Jan 24.

DOI:10.1021/acs.jctc.7b01004
PMID:29253336
Abstract

Markov state models (MSMs) are a powerful framework for the analysis of molecular dynamics data sets, such as protein folding simulations, because of their straightforward construction and statistical rigor. The coarse-graining of MSMs into an interpretable number of macrostates is a crucial step for connecting theoretical results with experimental observables. Here we present the minimum variance clustering approach (MVCA) for the coarse-graining of MSMs into macrostate models. The method utilizes agglomerative clustering with Ward's minimum variance objective function, and the similarity of the microstate dynamics is determined using the Jensen-Shannon divergence between the corresponding rows in the MSM transition probability matrix. We first show that MVCA produces intuitive results for a simple tripeptide system and is robust toward long-duration statistical artifacts. MVCA is then applied to two protein folding simulations of the same protein in different force fields to demonstrate that a different number of macrostates is appropriate for each model, revealing a misfolded state present in only one of the simulations. Finally, we show that the same method can be used to analyze a data set containing many MSMs from simulations in different force fields by aggregating them into groups and quantifying their dynamical similarity in the context of force field parameter choices. The minimum variance clustering approach with the Jensen-Shannon divergence provides a powerful tool to group dynamics by similarity, both among model states and among dynamical models themselves.

摘要

马尔可夫状态模型(MSMs)是分析分子动力学数据集(如蛋白质折叠模拟)的强大框架,因其构建简单且统计严谨。将MSMs粗粒化为可解释数量的宏观状态是将理论结果与实验观测联系起来的关键步骤。在此,我们提出了用于将MSMs粗粒化为宏观状态模型的最小方差聚类方法(MVCA)。该方法利用具有沃德最小方差目标函数的凝聚聚类,并且使用MSM转移概率矩阵中对应行之间的 Jensen-Shannon 散度来确定微状态动力学的相似性。我们首先表明,MVCA 对于一个简单的三肽系统产生直观的结果,并且对长时间的统计伪像具有鲁棒性。然后将 MVCA 应用于同一蛋白质在不同力场中的两个蛋白质折叠模拟,以证明每个模型适合不同数量的宏观状态,揭示了仅在其中一个模拟中出现的错误折叠状态。最后,我们表明,通过将包含来自不同力场模拟的许多MSMs的数据集聚集为组,并在力场参数选择的背景下量化它们的动力学相似性,可以使用相同的方法来分析该数据集。具有 Jensen-Shannon 散度的最小方差聚类方法提供了一个强大的工具,可通过相似性对模型状态之间以及动力学模型本身之间的动力学进行分组。

相似文献

1
A Minimum Variance Clustering Approach Produces Robust and Interpretable Coarse-Grained Models.一种最小方差聚类方法可生成稳健且可解释的粗粒度模型。
J Chem Theory Comput. 2018 Feb 13;14(2):1071-1082. doi: 10.1021/acs.jctc.7b01004. Epub 2018 Jan 24.
2
Ward Clustering Improves Cross-Validated Markov State Models of Protein Folding.分域聚类提高蛋白质折叠的交叉验证马氏态模型。
J Chem Theory Comput. 2017 Mar 14;13(3):963-967. doi: 10.1021/acs.jctc.6b01238. Epub 2017 Feb 17.
3
Hierarchical Nyström methods for constructing Markov state models for conformational dynamics.层次 Nyström 方法用于构建构象动力学的 Markov 状态模型。
J Chem Phys. 2013 May 7;138(17):174106. doi: 10.1063/1.4802007.
4
Constructing multi-resolution Markov State Models (MSMs) to elucidate RNA hairpin folding mechanisms.构建多分辨率马尔可夫状态模型(MSMs)以阐明RNA发夹折叠机制。
Pac Symp Biocomput. 2010:228-39. doi: 10.1142/9789814295291_0025.
5
What Markov State Models Can and Cannot Do: Correlation versus Path-Based Observables in Protein-Folding Models.马尔可夫状态模型能做什么与不能做什么:蛋白质折叠模型中基于相关性与基于路径的可观测量
J Chem Theory Comput. 2021 May 11;17(5):3119-3133. doi: 10.1021/acs.jctc.0c01154. Epub 2021 Apr 27.
6
Improved coarse-graining of Markov state models via explicit consideration of statistical uncertainty.通过明确考虑统计不确定性来改进马尔可夫状态模型的粗粒化。
J Chem Phys. 2012 Oct 7;137(13):134111. doi: 10.1063/1.4755751.
7
Markov state models provide insights into dynamic modulation of protein function.马尔可夫状态模型有助于深入了解蛋白质功能的动态调节。
Acc Chem Res. 2015 Feb 17;48(2):414-22. doi: 10.1021/ar5002999. Epub 2015 Jan 3.
8
Markov state models of protein misfolding.蛋白质错误折叠的马尔可夫状态模型。
J Chem Phys. 2016 Feb 21;144(7):075101. doi: 10.1063/1.4941579.
9
Markov State Models: From an Art to a Science.马尔可夫状态模型:从一门艺术到一门科学。
J Am Chem Soc. 2018 Feb 21;140(7):2386-2396. doi: 10.1021/jacs.7b12191. Epub 2018 Feb 2.
10
Mean first passage times in variational coarse graining using Markov state models.使用马尔可夫状态模型的变分粗粒化的平均首次通过时间。
J Chem Phys. 2019 Apr 7;150(13):134107. doi: 10.1063/1.5083924.

引用本文的文献

1
Markov State Models: To Optimize or Not to Optimize.马尔可夫状态模型:优化还是不优化。
J Chem Theory Comput. 2024 Jan 23;20(2):977-988. doi: 10.1021/acs.jctc.3c01134. Epub 2024 Jan 1.
2
The Two-Step Clustering Approach for Metastable States Learning.用于亚稳态学习的两步聚类方法。
Int J Mol Sci. 2021 Jun 19;22(12):6576. doi: 10.3390/ijms22126576.
3
Deep learning the structural determinants of protein biochemical properties by comparing structural ensembles with DiffNets.通过将结构集合与 DiffNets 进行比较,深度学习蛋白质生化性质的结构决定因素。
Nat Commun. 2021 May 21;12(1):3023. doi: 10.1038/s41467-021-23246-1.
4
Galerkin approximation of dynamical quantities using trajectory data.使用轨迹数据对动力学量进行伽辽金近似。
J Chem Phys. 2019 Jun 28;150(24):244111. doi: 10.1063/1.5063730.
5
Simultaneous coherent structure coloring facilitates interpretable clustering of scientific data by amplifying dissimilarity.同时相干结构着色通过放大差异促进科学数据的可解释聚类。
PLoS One. 2019 Mar 13;14(3):e0212442. doi: 10.1371/journal.pone.0212442. eCollection 2019.
6
Conformational analysis of replica exchange MD: Temperature-dependent Markov networks for FF amyloid peptides.复制交换 MD 的构象分析:FF 淀粉样肽的温度依赖马尔可夫网络。
J Chem Phys. 2018 Aug 21;149(7):072323. doi: 10.1063/1.5027580.