Jansen Ivy, Molenberghs Geert, Aerts Marc, Thijs Herbert, Van Steen Kristel
Biostatistics, Center for Statistics, Limburgs Universitair Centrum, Universitaire Campus, B-3590 Diepenbeek, Belgium.
Biometrics. 2003 Jun;59(2):410-9. doi: 10.1111/1541-0420.00048.
Recently, a lot of concern has been raised about assumptions needed in order to fit statistical models to incomplete multivariate and longitudinal data. In response, research efforts are being devoted to the development of tools that assess the sensitivity of such models to often strong but always, at least in part, unverifiable assumptions. Many efforts have been devoted to longitudinal data, primarily in the selection model context, although some researchers have expressed interest in the pattern-mixture setting as well. A promising tool, proposed by Verbeke et al. (2001, Biometrics 57, 43-50), is based on local influence (Cook, 1986, Journal of the Royal Statistical Society, Series B 48, 133-169). These authors considered the Diggle and Kenward (1994, Applied Statistics 43, 49-93) model, which is based on a selection model, integrating a linear mixed model for continuous outcomes with logistic regression for dropout. In this article, we show that a similar idea can be developed for multivariate and longitudinal binary data, subject to nonmonotone missingness. We focus on the model proposed by Baker, Rosenberger, and DerSimonian (1992, Statistics in Medicine 11, 643-657). The original model is first extended to allow for (possibly continuous) covariates, whereafter a local influence strategy is developed to support the model-building process. The model is able to deal with nonmonotone missingness but has some limitations as well, stemming from the conditional nature of the model parameters. Some analytical insight is provided into the behavior of the local influence graphs.
最近,为了将统计模型应用于不完整的多变量和纵向数据,人们对所需的假设提出了诸多关注。作为回应,研究工作致力于开发工具,以评估此类模型对通常很强但至少部分无法验证的假设的敏感性。许多工作都集中在纵向数据上,主要是在选择模型的背景下,尽管一些研究人员也对模式混合设定表示了兴趣。Verbeke等人(2001年,《生物统计学》57卷,43 - 50页)提出的一个很有前景的工具基于局部影响(Cook,1986年,《皇家统计学会学报》,B辑48卷,133 - 169页)。这些作者考虑了Diggle和Kenward(1994年,《应用统计学》43卷,49 - 93页)的模型,该模型基于选择模型,将连续结果的线性混合模型与用于失访的逻辑回归相结合。在本文中,我们表明,对于多变量和纵向二元数据,在存在非单调缺失的情况下,可以开发类似的思路。我们关注Baker、Rosenberger和DerSimonian(1992年,《医学统计学》11卷,643 - 657页)提出的模型。原始模型首先进行扩展以允许(可能是连续的)协变量,之后开发一种局部影响策略来支持模型构建过程。该模型能够处理非单调缺失,但也存在一些局限性,这源于模型参数的条件性质。我们对局部影响图的行为提供了一些分析见解。