Rhemtulla Mijke, Savalei Victoria, Little Todd D
Programme Group Psychological Methods, Department of Psychology, University of Amsterdam, Weesperplein 4, Room 208, 1018XA, Amsterdam, Netherlands.
Department of Psychology, University of British Columbia, 3410-2136 West Mall, Vancouver, BC, V6T 1Z4, Canada.
Psychometrika. 2016 Mar;81(1):60-89. doi: 10.1007/s11336-014-9422-0. Epub 2014 Sep 13.
In planned missingness (PM) designs, certain data are set a priori to be missing. PM designs can increase validity and reduce cost; however, little is known about the loss of efficiency that accompanies these designs. The present paper compares PM designs to reduced sample (RN) designs that have the same total number of data points concentrated in fewer participants. In 4 studies, we consider models for both observed and latent variables, designs that do or do not include an "X set" of variables with complete data, and a full range of between- and within-set correlation values. All results are obtained using asymptotic relative efficiency formulas, and thus no data are generated; this novel approach allows us to examine whether PM designs have theoretical advantages over RN designs removing the impact of sampling error. Our primary findings are that (a) in manifest variable regression models, estimates of regression coefficients have much lower relative efficiency in PM designs as compared to RN designs, (b) relative efficiency of factor correlation or latent regression coefficient estimates is maximized when the indicators of each latent variable come from different sets, and (c) the addition of an X set improves efficiency in manifest variable regression models only for the parameters that directly involve the X-set variables, but it substantially improves efficiency of most parameters in latent variable models. We conclude that PM designs can be beneficial when the model of interest is a latent variable model; recommendations are made for how to optimize such a design.
在计划缺失(PM)设计中,某些数据被预先设定为缺失。PM设计可以提高效度并降低成本;然而,对于这些设计所伴随的效率损失却知之甚少。本文将PM设计与缩减样本(RN)设计进行比较,RN设计中相同数量的数据点集中在较少的参与者身上。在4项研究中,我们考虑了观测变量和潜在变量的模型、包含或不包含具有完整数据的“X组”变量的设计,以及一系列组间和组内相关值。所有结果均使用渐近相对效率公式获得,因此未生成数据;这种新颖的方法使我们能够检验PM设计是否比RN设计具有理论优势,消除了抽样误差的影响。我们的主要发现是:(a)在显变量回归模型中,与RN设计相比,PM设计中回归系数的估计相对效率要低得多;(b)当每个潜在变量的指标来自不同组时,因子相关或潜在回归系数估计的相对效率最大化;(c)添加X组仅对直接涉及X组变量的参数提高了显变量回归模型的效率,但它显著提高了潜在变量模型中大多数参数的效率。我们得出结论,当感兴趣的模型是潜在变量模型时,PM设计可能是有益的;并就如何优化此类设计提出了建议。