Punzo Antonio, Ingrassia Salvatore, Maruotti Antonello
Dipartimento di Economia e Impresa, Università di Catania, Catania, Italy.
Dipartimento di Giurisprudenza, Economia, Politica e Lingue Moderne, Libera Università Maria Ss. Assunta, Roma, Italy.
Stat Med. 2018 Apr 22. doi: 10.1002/sim.7687.
A time-varying latent variable model is proposed to jointly analyze multivariate mixed-support longitudinal data. The proposal can be viewed as an extension of hidden Markov regression models with fixed covariates (HMRMFCs), which is the state of the art for modelling longitudinal data, with a special focus on the underlying clustering structure. HMRMFCs are inadequate for applications in which a clustering structure can be identified in the distribution of the covariates, as the clustering is independent from the covariates distribution. Here, hidden Markov regression models with random covariates are introduced by explicitly specifying state-specific distributions for the covariates, with the aim of improving the recovering of the clusters in the data with respect to a fixed covariates paradigm. The hidden Markov regression models with random covariates class is defined focusing on the exponential family, in a generalized linear model framework. Model identifiability conditions are sketched, an expectation-maximization algorithm is outlined for parameter estimation, and various implementation and operational issues are discussed. Properties of the estimators of the regression coefficients, as well as of the hidden path parameters, are evaluated through simulation experiments and compared with those of HMRMFCs. The method is applied to physical activity data.
提出了一种时变潜变量模型,用于联合分析多变量混合支持纵向数据。该提议可被视为具有固定协变量的隐马尔可夫回归模型(HMRMFC)的扩展,HMRMFC是纵向数据建模的最新技术,特别关注潜在的聚类结构。HMRMFC不适用于可以在协变量分布中识别聚类结构的应用,因为聚类与协变量分布无关。在此,通过明确指定协变量的特定状态分布,引入了具有随机协变量的隐马尔可夫回归模型,目的是相对于固定协变量范式改进数据中聚类的恢复。具有随机协变量类的隐马尔可夫回归模型是在广义线性模型框架内,围绕指数族定义的。概述了模型可识别性条件,概述了用于参数估计的期望最大化算法,并讨论了各种实现和操作问题。通过模拟实验评估回归系数估计器以及隐藏路径参数估计器的性质,并与HMRMFC的性质进行比较。该方法应用于身体活动数据。