School of Mathematics, University of Bristol, Bristol, United Kingdom.
PLoS One. 2012;7(1):e29703. doi: 10.1371/journal.pone.0029703. Epub 2012 Jan 11.
We present a new method for inferring hidden Markov models from noisy time sequences without the necessity of assuming a model architecture, thus allowing for the detection of degenerate states. This is based on the statistical prediction techniques developed by Crutchfield et al. and generates so called causal state models, equivalent in structure to hidden Markov models. The new method is applicable to any continuous data which clusters around discrete values and exhibits multiple transitions between these values such as tethered particle motion data or Fluorescence Resonance Energy Transfer (FRET) spectra. The algorithms developed have been shown to perform well on simulated data, demonstrating the ability to recover the model used to generate the data under high noise, sparse data conditions and the ability to infer the existence of degenerate states. They have also been applied to new experimental FRET data of Holliday Junction dynamics, extracting the expected two state model and providing values for the transition rates in good agreement with previous results and with results obtained using existing maximum likelihood based methods. The method differs markedly from previous Markov-model reconstructions in being able to uncover truly hidden states.
我们提出了一种从嘈杂的时间序列中推断隐马尔可夫模型的新方法,无需假设模型结构,从而可以检测到简并状态。这是基于 Crutchfield 等人开发的统计预测技术,并生成所谓的因果状态模型,在结构上与隐马尔可夫模型等效。该新方法适用于任何围绕离散值聚集且表现出这些值之间多次转换的连续数据,例如系泊粒子运动数据或荧光共振能量转移 (FRET) 光谱。所开发的算法在模拟数据上表现良好,证明了在高噪声、稀疏数据条件下能够恢复用于生成数据的模型的能力,并且能够推断出简并状态的存在。它们还被应用于新的 Holliday Junction 动力学 FRET 实验数据,提取了预期的两状态模型,并提供了与先前结果以及使用现有最大似然法获得的结果一致的跃迁率值。该方法与以前的马尔可夫模型重建方法明显不同,因为它能够揭示真正的隐藏状态。