1 Department of Biostatistics, Christian Medical College, Vellore, India.
2 Department of Clinical Microbiology, Christian Medical College, Vellore, India.
Stat Methods Med Res. 2019 May;28(5):1552-1563. doi: 10.1177/0962280218766964. Epub 2018 Apr 4.
Hidden Markov models are stochastic models in which the observations are assumed to follow a mixture distribution, but the parameters of the components are governed by a Markov chain which is unobservable. The issues related to the estimation of Poisson-hidden Markov models in which the observations are coming from mixture of Poisson distributions and the parameters of the component Poisson distributions are governed by an m-state Markov chain with an unknown transition probability matrix are explained here. These methods were applied to the data on Vibrio cholerae counts reported every month for 11-year span at Christian Medical College, Vellore, India. Using Viterbi algorithm, the best estimate of the state sequence was obtained and hence the transition probability matrix. The mean passage time between the states were estimated. The 95% confidence interval for the mean passage time was estimated via Monte Carlo simulation. The three hidden states of the estimated Markov chain are labelled as 'Low', 'Moderate' and 'High' with the mean counts of 1.4, 6.6 and 20.2 and the estimated average duration of stay of 3, 3 and 4 months, respectively. Environmental risk factors were studied using Markov ordinal logistic regression analysis. No significant association was found between disease severity levels and climate components.
隐马尔可夫模型是一种随机模型,其中假设观测值遵循混合分布,但组成部分的参数受不可观测的马尔可夫链控制。这里解释了与泊松隐马尔可夫模型的估计相关的问题,其中观测值来自泊松分布的混合,组成部分泊松分布的参数由具有未知转移概率矩阵的 m 状态马尔可夫链控制。这些方法应用于印度维洛尔基督教医学院报告的为期 11 年的霍乱弧菌计数每月数据。使用维特比算法,获得了状态序列的最佳估计值,从而获得了转移概率矩阵。估计了状态之间的平均通过时间。通过蒙特卡罗模拟估计了平均通过时间的 95%置信区间。估计的马尔可夫链的三个隐藏状态标记为“低”、“中”和“高”,平均计数分别为 1.4、6.6 和 20.2,估计的平均停留时间分别为 3、3 和 4 个月。使用马尔可夫有序逻辑回归分析研究了环境风险因素。没有发现疾病严重程度水平与气候成分之间存在显著关联。