Department of Physics, Indian Institute of Technology, Jammu 181221, India.
Department of Chemistry, University of South Florida, Tampa, Florida 33620, United States.
J Chem Theory Comput. 2020 Mar 10;16(3):1816-1826. doi: 10.1021/acs.jctc.9b00955. Epub 2020 Feb 17.
Markov state models can describe ensembles of pathways via kinetic networks but are difficult to create when large free-energy barriers limit unbiased sampling. Chain-of-states simulations allow sampling over large free-energy barriers but are often constructed using a single pathway that is unlikely to thermodynamically average over orthogonal degrees of freedom in complex systems. Here, we combine the advantages of these two approaches in the form of a Markov state model of Markov state models, which we call a Hierarchical Markov state model. In this approach, independent Markov models are constructed in regions of configuration space that are locally well sampled but are separated by large free-energy barriers from other regions. A string method is used to construct an ensemble of pathways connecting the states of these different local Markov models, and the rate through each pathway is then estimated. These rates are then combined with the rate information from the local Markov models in a master equation to predict global rates, fluxes, and populations. By applying this hierarchical approach to tractable systems, a toy potential and dipeptides, we demonstrate that it is more accurate than the conventional single-pathway description. The advantages of this approach are that it (i) is more realistic than the conventional chain-of-states approach, as an ensemble of pathways rather than a single pathway is used to describe processes in high-dimensional systems, and (ii) it resolves the issue of poor sampling in Markov State model building when large free-energy barriers are present. The divide-and-conquer strategy inherent to this approach should make this procedure straightforward to apply to more complex systems.
马科夫状态模型可以通过动力学网络来描述途径的集合,但当大的自由能势垒限制无偏采样时,就很难创建。链状态模拟允许在大的自由能势垒上进行采样,但通常使用一条途径进行构建,而在复杂系统中,这条途径不太可能在热力学上对正交自由度进行平均。在这里,我们以马科夫状态模型的马科夫状态模型的形式结合了这两种方法的优点,我们称之为层次马科夫状态模型。在这种方法中,独立的马科夫模型是在配置空间的局部很好采样的区域中构建的,但与其他区域被大的自由能势垒隔开。字符串方法用于构建连接这些不同局部马科夫模型状态的途径的集合,然后估计每条途径的速率。然后,这些速率与局部马科夫模型的速率信息在主方程中结合起来,以预测全局速率、通量和种群。通过将这种层次方法应用于可处理的系统、一个玩具势能和二肽,我们证明它比传统的单途径描述更准确。这种方法的优点是:(i) 比传统的链状态方法更现实,因为它使用途径的集合而不是单一途径来描述高维系统中的过程;(ii) 当存在大的自由能势垒时,它解决了马科夫状态模型构建中采样不佳的问题。这种方法固有的分而治之的策略应该使这一过程很容易应用于更复杂的系统。