Department of Chemistry, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
J Chem Phys. 2017 Jul 28;147(4):044112. doi: 10.1063/1.4995558.
Constructing Markov state models from large-scale molecular dynamics simulation trajectories is a promising approach to dissect the kinetic mechanisms of complex chemical and biological processes. Combined with transition path theory, Markov state models can be applied to identify all pathways connecting any conformational states of interest. However, the identified pathways can be too complex to comprehend, especially for multi-body processes where numerous parallel pathways with comparable flux probability often coexist. Here, we have developed a path lumping method to group these parallel pathways into metastable path channels for analysis. We define the similarity between two pathways as the intercrossing flux between them and then apply the spectral clustering algorithm to lump these pathways into groups. We demonstrate the power of our method by applying it to two systems: a 2D-potential consisting of four metastable energy channels and the hydrophobic collapse process of two hydrophobic molecules. In both cases, our algorithm successfully reveals the metastable path channels. We expect this path lumping algorithm to be a promising tool for revealing unprecedented insights into the kinetic mechanisms of complex multi-body processes.
从大规模分子动力学模拟轨迹构建马尔可夫状态模型是剖析复杂化学和生物过程动力学机制的一种很有前途的方法。结合转移路径理论,马尔可夫状态模型可用于识别连接任何感兴趣构象状态的所有途径。然而,所识别的途径可能过于复杂,难以理解,特别是对于多体过程,其中经常存在许多具有可比通量概率的并行途径。在这里,我们开发了一种路径聚类方法,将这些并行途径分组为亚稳路径通道进行分析。我们将两条途径之间的相似性定义为它们之间的交叉通量,然后应用谱聚类算法将这些途径聚类成组。我们通过将其应用于两个系统来证明我们方法的有效性:一个由四个亚稳能量通道组成的 2D 势和两个疏水分子的疏水塌缩过程。在这两种情况下,我们的算法都成功地揭示了亚稳路径通道。我们期望这种路径聚类算法成为揭示复杂多体过程动力学机制的一种有前途的工具。