Zhang Yongli, Jiao Junyi, Rebane Aleksander A
Department of Cell Biology, School of Medicine, Yale University, New Haven, Connecticut.
Department of Cell Biology, School of Medicine, Yale University, New Haven, Connecticut; Integrated Graduate Program in Physical and Engineering Biology, Yale University, New Haven, Connecticut.
Biophys J. 2016 Nov 15;111(10):2110-2124. doi: 10.1016/j.bpj.2016.09.045.
Hidden Markov modeling (HMM) has revolutionized kinetic studies of macromolecules. However, results from HMM often violate detailed balance when applied to the transitions under thermodynamic equilibrium, and the consequence of such violation has not been well understood. Here, to our knowledge, we developed a new HMM method that satisfies detailed balance (HMM-DB) and optimizes model parameters by gradient search. We used free energy of stable and transition states as independent fitting parameters and considered both normal and skew normal distributions of the measurement noise. We validated our method by analyzing simulated extension trajectories that mimicked experimental data of single protein folding from optical tweezers. We then applied HMM-DB to elucidate kinetics of regulated SNARE zippering containing degenerate states. For both simulated and measured trajectories, we found that HMM-DB significantly reduced overfitting of short trajectories compared to the standard HMM based on an expectation-maximization algorithm, leading to more accurate and reliable model fitting by HMM-DB. We revealed how HMM-DB could be conveniently used to derive a simplified energy landscape of protein folding. Finally, we extended HMM-DB to correct the baseline drift in single-molecule trajectories. Together, we demonstrated an efficient, versatile, and reliable method of HMM for kinetics studies of macromolecules under thermodynamic equilibrium.
隐马尔可夫模型(HMM)彻底改变了大分子的动力学研究。然而,当应用于热力学平衡下的转变时,HMM的结果常常违反细致平衡,而这种违反的后果尚未得到很好的理解。在此,据我们所知,我们开发了一种满足细致平衡的新HMM方法(HMM-DB),并通过梯度搜索优化模型参数。我们将稳定态和过渡态的自由能用作独立的拟合参数,并考虑了测量噪声的正态分布和斜正态分布。我们通过分析模拟的伸展轨迹来验证我们的方法,这些轨迹模仿了来自光镊的单蛋白折叠的实验数据。然后,我们应用HMM-DB来阐明包含简并态的受调控SNARE拉链的动力学。对于模拟轨迹和测量轨迹,我们发现与基于期望最大化算法的标准HMM相比,HMM-DB显著减少了短轨迹的过拟合,从而使HMM-DB的模型拟合更准确、更可靠。我们揭示了HMM-DB如何能够方便地用于推导蛋白质折叠的简化能量景观。最后,我们扩展了HMM-DB以校正单分子轨迹中的基线漂移。总之,我们展示了一种用于热力学平衡下大分子动力学研究的高效、通用且可靠的HMM方法。