Larsen Klaus
Clinical Research Unit, Hvidovre Hospital, Kettegård Allé 30, DK-2650 Hvidovre, Denmark.
Biometrics. 2004 Mar;60(1):85-92. doi: 10.1111/j.0006-341X.2004.00141.x.
Multiple categorical variables are commonly used in medical and epidemiological research to measure specific aspects of human health and functioning. To analyze such data, models have been developed considering these categorical variables as imperfect indicators of an individual's "true" status of health or functioning. In this article, the latent class regression model is used to model the relationship between covariates, a latent class variable (the unobserved status of health or functioning), and the observed indicators (e.g., variables from a questionnaire). The Cox model is extended to encompass a latent class variable as predictor of time-to-event, while using information about latent class membership available from multiple categorical indicators. The expectation-maximization (EM) algorithm is employed to obtain maximum likelihood estimates, and standard errors are calculated based on the profile likelihood, treating the nonparametric baseline hazard as a nuisance parameter. A sampling-based method for model checking is proposed. It allows for graphical investigation of the assumption of proportional hazards across latent classes. It may also be used for checking other model assumptions, such as no additional effect of the observed indicators given latent class. The usefulness of the model framework and the proposed techniques are illustrated in an analysis of data from the Women's Health and Aging Study concerning the effect of severe mobility disability on time-to-death for elderly women.
多个分类变量常用于医学和流行病学研究,以衡量人类健康和功能的特定方面。为了分析此类数据,已开发出一些模型,将这些分类变量视为个体“真实”健康或功能状态的不完美指标。在本文中,潜在类别回归模型用于对协变量、一个潜在类别变量(健康或功能的未观察状态)以及观察指标(例如,来自问卷的变量)之间的关系进行建模。Cox模型被扩展,以纳入一个潜在类别变量作为事件发生时间的预测因子,同时利用从多个分类指标中获得的关于潜在类别成员身份的信息。采用期望最大化(EM)算法来获得最大似然估计,并基于轮廓似然计算标准误差,将非参数基线风险视为一个干扰参数。提出了一种基于抽样的模型检验方法。它允许对潜在类别之间的比例风险假设进行图形化研究。它还可用于检验其他模型假设,例如在给定潜在类别的情况下观察指标无额外效应。通过对妇女健康与衰老研究中关于严重行动不便对老年妇女死亡时间影响的数据进行分析,说明了模型框架和所提出技术的实用性。