Lagona Francesco, Jdanov Dmitri, Shkolnikova Maria
University of Roma Tre, Rome, Italy; Max Planck Institute for Demographic Research, Rostock, Germany.
Stat Med. 2014 Oct 15;33(23):4116-34. doi: 10.1002/sim.6220. Epub 2014 Jun 2.
Longitudinal data are often segmented by unobserved time-varying factors, which introduce latent heterogeneity at the observation level, in addition to heterogeneity across subjects. We account for this latent structure by a linear mixed hidden Markov model. It integrates subject-specific random effects and Markovian sequences of time-varying effects in the linear predictor. We propose an expectationŰ-maximization algorithm for maximum likelihood estimation, based on data augmentation. It reduces to the iterative maximization of the expected value of a complete likelihood function, derived from an augmented dataset with case weights, alternated with weights updating. In a case study of the Survey on Stress Aging and Health in Russia, the model is exploited to estimate the influence of the observed covariates under unobserved time-varying factors, which affect the cardiovascular activity of each subject during the observation period.
纵向数据通常会被未观察到的随时间变化的因素分割,这些因素除了会导致个体间的异质性外,还会在观测层面引入潜在的异质性。我们通过线性混合隐马尔可夫模型来考虑这种潜在结构。该模型在线性预测器中整合了个体特定的随机效应和随时间变化效应的马尔可夫序列。我们基于数据增强提出了一种期望最大化算法用于最大似然估计。它简化为对一个完整似然函数的期望值进行迭代最大化,该似然函数来自一个带有病例权重的扩充数据集,同时交替进行权重更新。在俄罗斯压力、衰老与健康调查的案例研究中,该模型被用于估计在未观察到的随时间变化因素下观测协变量的影响,这些因素在观测期内影响每个个体的心血管活动。