Rabe-Hesketh S, Yang S, Pickles A
Department of Biostatistics and Computing, Institute of Psychiatry, King's College, London, UK.
Stat Methods Med Res. 2001 Dec;10(6):409-27. doi: 10.1177/096228020101000604.
Multilevel models were originally developed to allow linear regression or ANOVA models to be applied to observations that are not mutually independent. This lack of independence commonly arises due to clustering of the units of observations into 'higher level units' such as patients in hospitals. In linear mixed models, the within-cluster correlations are modelled by including random effects in a linear model. In this paper, we discuss generalizations of linear mixed models suitable for responses subject to systematic and random measurement error and interval censoring. The first example uses data from two cross-sectional surveys of schoolchildren to investigate risk factors for early first experimentation with cigarettes. Here the recalled times of the children's first cigarette are likely to be subject to both systematic and random measurement errors as well as being interval censored. We describe multilevel models for interval censored survival times as special cases of generalized linear mixed models and discuss methods of estimating systematic recall bias. The second example is a longitudinal study of mental health problems of patients nested in clinics. Here the outcome is measured by multiple questionnaires allowing the measurement errors to be modelled within a linear latent growth curve model. The resulting model is a multilevel structural equation model. We briefly discuss such models both as extensions of linear mixed models and as extensions of structural equation models. Several different model structures are examined. An important goal of the paper is to place a number of methods that readers may have considered as being distinct within a single overall modelling framework.
多层模型最初是为了使线性回归或方差分析模型能够应用于并非相互独立的观测值而开发的。观测值缺乏独立性通常是由于观测单位聚集成“更高层次的单位”,例如医院中的患者。在线性混合模型中,通过在线性模型中纳入随机效应来对聚类内的相关性进行建模。在本文中,我们讨论适用于存在系统和随机测量误差以及区间删失的响应变量的线性混合模型的推广。第一个例子使用来自两项学童横断面调查的数据,以研究首次尝试吸烟的早期风险因素。在这里,儿童首次吸烟的回忆时间可能既存在系统测量误差又存在随机测量误差,并且是区间删失的。我们将区间删失生存时间的多层模型描述为广义线性混合模型的特殊情况,并讨论估计系统回忆偏差的方法。第二个例子是对嵌套在诊所中的患者心理健康问题的纵向研究。在这里,通过多个问卷来测量结果,从而可以在线性潜在增长曲线模型中对测量误差进行建模。所得模型是一个多层结构方程模型。我们简要讨论此类模型,既将其视为线性混合模型的扩展,也将其视为结构方程模型的扩展。研究了几种不同的模型结构。本文的一个重要目标是将读者可能认为不同的一些方法置于一个统一的总体建模框架内。