Schluchter M D
Department of Epidemiology and Biostatistics, University of South Carolina, Columbia 29208.
Stat Med. 1988 Jan-Feb;7(1-2):317-24. doi: 10.1002/sim.4780070132.
Incomplete and unbalanced multivariate data often arise in longitudinal studies due to missing or unequally-timed repeated measurements and/or the presence of time-varying covariates. A general approach to analysing such data is through maximum likelihood analysis using a linear model for the expected responses, and structural models for the within-subject covariances. Two important advantages of this approach are: (1) the generality of the model allows the analyst to consider a wider range of models than were previously possible using classical methods developed for balanced and complete data, and (2) maximum likelihood estimates obtained from incomplete data are often preferable to other estimates such as those obtained from complete cases from the standpoint of bias and efficiency. A variety of applications of the model are discussed, including univariate and multivariate analysis of incomplete repeated measures data, analysis of growth curves with missing data using random effects and time-series models, and applications to unbalanced longitudinal data.
在纵向研究中,由于重复测量缺失或时间不均等,以及/或者存在随时间变化的协变量,常常会出现不完整和不平衡的多变量数据。分析此类数据的一般方法是通过最大似然分析,使用线性模型来估计预期响应,并使用结构模型来估计受试者内协方差。这种方法的两个重要优点是:(1)该模型的通用性使分析师能够考虑比以前使用为平衡和完整数据开发的经典方法更广泛的模型,(2)从不完整数据中获得的最大似然估计在偏差和效率方面通常优于其他估计,例如从完整病例中获得的估计。文中讨论了该模型的各种应用,包括不完整重复测量数据的单变量和多变量分析、使用随机效应和时间序列模型对缺失数据的生长曲线进行分析,以及对不平衡纵向数据的应用。