Liu Yu-Ying, Li Shuang, Li Fuxin, Song Le, Rehg James M
College of Computing Georgia Institute of Technology Atlanta, GA.
Adv Neural Inf Process Syst. 2015;28:3599-3607.
The Continuous-Time Hidden Markov Model (CT-HMM) is an attractive approach to modeling disease progression due to its ability to describe noisy observations arriving irregularly in time. However, the lack of an efficient parameter learning algorithm for CT-HMM restricts its use to very small models or requires unrealistic constraints on the state transitions. In this paper, we present the first complete characterization of efficient EM-based learning methods for CT-HMM models. We demonstrate that the learning problem consists of two challenges: the estimation of posterior state probabilities and the computation of end-state conditioned statistics. We solve the first challenge by reformulating the estimation problem in terms of an equivalent discrete time-inhomogeneous hidden Markov model. The second challenge is addressed by adapting three approaches from the continuous time Markov chain literature to the CT-HMM domain. We demonstrate the use of CT-HMMs with more than 100 states to visualize and predict disease progression using a glaucoma dataset and an Alzheimer's disease dataset.
连续时间隐马尔可夫模型(CT - HMM)是一种用于对疾病进展进行建模的有吸引力的方法,因为它能够描述随时间不规则到达的噪声观测值。然而,缺乏一种针对CT - HMM的高效参数学习算法限制了其应用于非常小的模型,或者需要对状态转移施加不切实际的约束。在本文中,我们首次全面刻画了基于期望最大化(EM)的CT - HMM模型高效学习方法。我们证明学习问题包含两个挑战:后验状态概率的估计和终态条件统计量的计算。我们通过将估计问题重新表述为等效的离散时间非齐次隐马尔可夫模型来解决第一个挑战。通过将连续时间马尔可夫链文献中的三种方法应用于CT - HMM领域来解决第二个挑战。我们展示了使用具有100多个状态的CT - HMM,通过青光眼数据集和阿尔茨海默病数据集来可视化和预测疾病进展。