分子动力学中慢动力学模式的变分交叉验证

Variational cross-validation of slow dynamical modes in molecular kinetics.

作者信息

McGibbon Robert T, Pande Vijay S

机构信息

Department of Chemistry, Stanford University, Stanford, California 94305, USA.

出版信息

J Chem Phys. 2015 Mar 28;142(12):124105. doi: 10.1063/1.4916292.

DOI:10.1063/1.4916292

PMID:25833563

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4398134/

Abstract

Markov state models are a widely used method for approximating the eigenspectrum of the molecular dynamics propagator, yielding insight into the long-timescale statistical kinetics and slow dynamical modes of biomolecular systems. However, the lack of a unified theoretical framework for choosing between alternative models has hampered progress, especially for non-experts applying these methods to novel biological systems. Here, we consider cross-validation with a new objective function for estimators of these slow dynamical modes, a generalized matrix Rayleigh quotient (GMRQ), which measures the ability of a rank-m projection operator to capture the slow subspace of the system. It is shown that a variational theorem bounds the GMRQ from above by the sum of the first m eigenvalues of the system's propagator, but that this bound can be violated when the requisite matrix elements are estimated subject to statistical uncertainty. This overfitting can be detected and avoided through cross-validation. These result make it possible to construct Markov state models for protein dynamics in a way that appropriately captures the tradeoff between systematic and statistical errors.

摘要

马尔可夫状态模型是一种广泛使用的方法，用于近似分子动力学传播子的本征谱，从而深入了解生物分子系统的长时间尺度统计动力学和慢动力学模式。然而，缺乏一个统一的理论框架来在替代模型之间进行选择阻碍了进展，特别是对于将这些方法应用于新型生物系统的非专家而言。在这里，我们考虑使用一种新的目标函数进行交叉验证，该目标函数用于这些慢动力学模式的估计器，即广义矩阵瑞利商（GMRQ），它衡量秩为m的投影算子捕获系统慢子空间的能力。结果表明，一个变分定理将GMRQ从上方界定为系统传播子的前m个本征值之和，但当所需的矩阵元素在统计不确定性下进行估计时，这个界限可能会被违反。这种过拟合可以通过交叉验证来检测和避免。这些结果使得能够以适当捕捉系统误差和统计误差之间权衡的方式构建蛋白质动力学的马尔可夫状态模型。

相似文献

Variational cross-validation of slow dynamical modes in molecular kinetics.分子动力学中慢动力学模式的变分交叉验证

J Chem Phys. 2015 Mar 28;142(12):124105. doi: 10.1063/1.4916292.

Gaussian Markov transition models of molecular kinetics.分子动力学的高斯马尔可夫转移模型。

J Chem Phys. 2015 Feb 28;142(8):084104. doi: 10.1063/1.4913214.

Optimized parameter selection reveals trends in Markov state models for protein folding.优化的参数选择揭示了蛋白质折叠马尔可夫状态模型的趋势。

J Chem Phys. 2016 Nov 21;145(19):194103. doi: 10.1063/1.4967809.

Markov models of molecular kinetics: generation and validation.分子动力学的马尔可夫模型：生成与验证。

J Chem Phys. 2011 May 7;134(17):174105. doi: 10.1063/1.3565032.

Identification of slow molecular order parameters for Markov model construction.用于马尔可夫模型构建的慢分子序参数的识别。

J Chem Phys. 2013 Jul 7;139(1):015102. doi: 10.1063/1.4811489.

Calculation of the distribution of eigenvalues and eigenvectors in Markovian state models for molecular dynamics.分子动力学马尔可夫状态模型中特征值和特征向量分布的计算。

J Chem Phys. 2007 Jun 28;126(24):244101. doi: 10.1063/1.2740261.

Building Markov state models with solvent dynamics.用溶剂动力学构建马尔可夫状态模型。

BMC Bioinformatics. 2013;14 Suppl 2(Suppl 2):S8. doi: 10.1186/1471-2105-14-S2-S8. Epub 2013 Jan 21.

A Bayesian method for construction of Markov models to describe dynamics on various time-scales.一种构建马尔可夫模型的贝叶斯方法，用于描述各种时间尺度上的动态。

J Chem Phys. 2010 Oct 14;133(14):144113. doi: 10.1063/1.3496438.

Error Bounds for Dynamical Spectral Estimation.动态谱估计的误差界限

SIAM J Math Data Sci. 2021;3(1):225-252. doi: 10.1137/20m1335984. Epub 2021 Feb 11.

Optimal use of data in parallel tempering simulations for the construction of discrete-state Markov models of biomolecular dynamics.在并行温度模拟中优化数据的使用，以构建生物分子动力学的离散状态马尔可夫模型。

J Chem Phys. 2011 Jun 28;134(24):244108. doi: 10.1063/1.3592153.

引用本文的文献

Pathogenic Mutations Disrupt Allosteric Control by .致病突变破坏了由……引起的变构调控。

J Phys Chem B. 2025 Aug 7;129(31):7922-7931. doi: 10.1021/acs.jpcb.5c03653. Epub 2025 Jul 29.

Pathogenic mutations disrupt allosteric control by .致病突变破坏了由……引起的变构调控。

bioRxiv. 2025 Jun 10:2025.06.07.658438. doi: 10.1101/2025.06.07.658438.

Unveiling hidden reaction kinetics of carbon dioxide in supercritical aqueous solutions.揭示超临界水溶液中二氧化碳的隐藏反应动力学。

Proc Natl Acad Sci U S A. 2025 Jan 7;122(1):e2406356121. doi: 10.1073/pnas.2406356121. Epub 2024 Dec 30.

Molecular Mechanisms Underlying the Loop-Closing Dynamics of β-1,4 Galactosyltransferase 1.β-1,4-半乳糖基转移酶1环化动力学的分子机制

J Chem Inf Model. 2025 Jan 13;65(1):390-401. doi: 10.1021/acs.jcim.4c02010. Epub 2024 Dec 31.

Non-Markovian Dynamic Models Identify Non-Canonical KRAS-VHL Encounter Complex Conformations for Novel PROTAC Design.非马尔可夫动态模型识别用于新型PROTAC设计的非规范KRAS-VHL相遇复合物构象

JACS Au. 2024 Sep 24;4(10):3857-3868. doi: 10.1021/jacsau.4c00503. eCollection 2024 Oct 28.

Delineating the stepwise millisecond allosteric activation mechanism of the class C GPCR dimer mGlu5.解析 C 类 G 蛋白偶联受体二聚体 mGlu5 的逐步毫秒变构激活机制。

Nat Commun. 2024 Aug 30;15(1):7519. doi: 10.1038/s41467-024-51999-y.

An Information Bottleneck Approach for Markov Model Construction.一种用于马尔可夫模型构建的信息瓶颈方法。

ArXiv. 2024 Jun 10:arXiv:2404.02856v2.

CoVAMPnet: Comparative Markov State Analysis for Studying Effects of Drug Candidates on Disordered Biomolecules.CoVAMPnet：用于研究候选药物对无序生物分子影响的比较马尔可夫状态分析

JACS Au. 2024 May 28;4(6):2228-2245. doi: 10.1021/jacsau.4c00182. eCollection 2024 Jun 24.

Information Bottleneck Approach for Markov Model Construction.信息瓶颈方法在马尔可夫模型构建中的应用。

J Chem Theory Comput. 2024 Jun 25;20(12):5352-5367. doi: 10.1021/acs.jctc.4c00449. Epub 2024 Jun 10.

Functional protein dynamics in a crystal.晶体中的功能蛋白动力学。

Nat Commun. 2024 Apr 15;15(1):3244. doi: 10.1038/s41467-024-47473-4.

本文引用的文献

GROMACS 4: Algorithms for Highly Efficient, Load-Balanced, and Scalable Molecular Simulation.GROMACS 4：高效、负载均衡和可扩展的分子模拟算法。

J Chem Theory Comput. 2008 Mar;4(3):435-47. doi: 10.1021/ct700301q.

Distribution of Reciprocal of Interatomic Distances: A Fast Structural Metric.原子间距离倒数的分布：一种快速结构度量

J Chem Theory Comput. 2012 Aug 14;8(8):2930-7. doi: 10.1021/ct3003145. Epub 2012 Jul 20.

Systematic Parametrization of Polarizable Force Fields from Quantum Chemistry Data.基于量子化学数据的可极化力场的系统参数化

J Chem Theory Comput. 2013 Jan 8;9(1):452-60. doi: 10.1021/ct300826t. Epub 2012 Nov 29.

EMMA: A Software Package for Markov Model Building and Analysis.EMMA：用于马尔可夫模型构建与分析的软件包。

J Chem Theory Comput. 2012 Jul 10;8(7):2223-38. doi: 10.1021/ct300274u. Epub 2012 Jun 18.

Variational Approach to Molecular Kinetics.分子动力学的变分方法

J Chem Theory Comput. 2014 Apr 8;10(4):1739-52. doi: 10.1021/ct4009156. Epub 2014 Mar 6.

MDTraj: A Modern Open Library for the Analysis of Molecular Dynamics Trajectories.MDTraj：用于分析分子动力学轨迹的现代开放库。

Biophys J. 2015 Oct 20;109(8):1528-32. doi: 10.1016/j.bpj.2015.08.015.

Structure-guided simulations illuminate the mechanism of ATP transport through VDAC1.结构导向模拟阐明了 ATP 通过 VDAC1 运输的机制。

Nat Struct Mol Biol. 2014 Jul;21(7):626-32. doi: 10.1038/nsmb.2841. Epub 2014 Jun 8.

Markov state models of biomolecular conformational dynamics.生物分子构象动力学的马尔可夫状态模型。

Curr Opin Struct Biol. 2014 Apr;25:135-44. doi: 10.1016/j.sbi.2014.04.002. Epub 2014 May 16.

Statistical model selection for Markov models of biomolecular dynamics.生物分子动力学马尔可夫模型的统计模型选择

J Phys Chem B. 2014 Jun 19;118(24):6475-81. doi: 10.1021/jp411822r. Epub 2014 Apr 25.

Activation pathway of Src kinase reveals intermediate states as targets for drug design.Src激酶的激活途径揭示了作为药物设计靶点的中间状态。

Nat Commun. 2014 Mar 3;5:3397. doi: 10.1038/ncomms4397.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验