Konovalov Kirill A, Unarta Ilona Christy, Cao Siqin, Goonetilleke Eshani C, Huang Xuhui
Department of Chemistry, State Key Laboratory of Molecular Neuroscience, The Hong Kong University of Science and Technology, Kowloon, Hong Kong.
Hong Kong Center for Neurodegenerative Diseases, Hong Kong Science Park, Hong Kong.
JACS Au. 2021 Aug 4;1(9):1330-1341. doi: 10.1021/jacsau.1c00254. eCollection 2021 Sep 27.
Markov state models (MSMs) based on molecular dynamics (MD) simulations are routinely employed to study protein folding, however, their application to functional conformational changes of biomolecules is still limited. In the past few years, the field of computational chemistry has experienced a surge of advancements stemming from machine learning algorithms, and MSMs have not been left out. Unlike global processes, such as protein folding, the application of MSMs to functional conformational changes is challenging because they mostly consist of localized structural transitions. Therefore, it is critical to properly select a subset of structural features that can describe the slowest dynamics of these functional conformational changes. To address this challenge, we recommend several automatic feature selection methods such as Spectral-OASIS. To identify states in MSMs, the chosen features can be subject to dimensionality reduction methods such as TICA or deep learning based VAMPNets to project MD conformations onto a few collective variables for subsequent clustering. Another challenge for the application of MSMs to the study of functional conformational changes is the ability to comprehend their biophysical mechanisms, as MSMs built for these processes often require a large number of states. We recommend the recently developed quasi-MSMs (qMSMs) to address this issue. Compared to MSMs, qMSMs encode the non-Markovian dynamics via the generalized master equation and can significantly reduce the number of states. As a result, qMSMs can be built with a handful of states to facilitate the interpretation of functional conformational changes. In the wake of machine learning, we believe that the rapid advancement in the MSM methodology will lead to their wider application in studying functional conformational changes of biomolecules.
基于分子动力学(MD)模拟的马尔可夫状态模型(MSMs)通常用于研究蛋白质折叠,然而,它们在生物分子功能构象变化中的应用仍然有限。在过去几年中,计算化学领域因机器学习算法而取得了突飞猛进的发展,MSMs也不例外。与蛋白质折叠等全局过程不同,将MSMs应用于功能构象变化具有挑战性,因为这些变化大多由局部结构转变组成。因此,正确选择能够描述这些功能构象变化最慢动力学的结构特征子集至关重要。为应对这一挑战,我们推荐了几种自动特征选择方法,如Spectral-OASIS。为了在MSMs中识别状态,可以对所选特征进行降维方法处理,如TICA或基于深度学习的VAMPNets,以便将MD构象投影到几个集体变量上,随后进行聚类。将MSMs应用于功能构象变化研究的另一个挑战是理解其生物物理机制的能力,因为为这些过程构建的MSMs通常需要大量状态。我们推荐最近开发的准MSMs(qMSMs)来解决这个问题。与MSMs相比,qMSMs通过广义主方程对非马尔可夫动力学进行编码,并且可以显著减少状态数量。因此,可以用少数状态构建qMSMs,以促进对功能构象变化的解释。随着机器学习的发展,我们相信MSM方法的快速进步将使其在研究生物分子功能构象变化方面得到更广泛的应用。