Bernaards C A, Sijtsma K
Multivariate Behav Res. 2000 Jul 1;35(3):321-64. doi: 10.1207/S15327906MBR3503_03.
This study deals with the influence of each of twelve imputation methods and two methods using the EM algorithm on the results of maximum likelihood factor analysis as compared with results obtained from the complete data factor analysis (no missing scores). Complete questionnaire rating scale data were simulated and, next, missing item scores were created under both ignorable and nonignorable nonresponse mechanisms. Next, imputation methods were used to fill the gaps and factor analysis was applied to both the original complete data and to the data sets including imputed scores. Each imputation method was implemented once with residual error and once without residual error. Also, one EM method estimated the factor loadings directly and the other estimated the complete data covariance matrix, which subsequently was factor analyzed. A design was analyzed with design factors Latent Trait Structure (technically called Mixing Configuration), Correlation Between Latent Traits, Nonresponse Mechanism, Percentage of Missingness, Sample Size, and Imputation Method. We found that, in general, methods that impute a score based on a respondent's mean score obtained from his/her observed item scores best recovered the factor loadings structure from the complete data. Moreover, for unidimensional data person mean methods with a residual error gave better results than the other imputation methods, either with or without a residual error component. For the EM methods a smaller design was analyzed. The conclusion was that both EM methods better recovered the complete data factor loadings than the imputation methods.
本研究探讨了十二种插补方法中的每一种以及两种使用期望最大化(EM)算法的方法对最大似然因子分析结果的影响,并将其与完全数据因子分析(无缺失分数)得到的结果进行比较。模拟了完整的问卷评分量表数据,接下来,在可忽略和不可忽略的无应答机制下创建缺失项目分数。然后,使用插补方法填补空白,并对原始完整数据和包括插补分数的数据集进行因子分析。每种插补方法都在有残差误差和无残差误差的情况下各实施一次。此外,一种EM方法直接估计因子载荷,另一种方法估计完整数据协方差矩阵,随后对其进行因子分析。对一个设计进行了分析,设计因素包括潜在特质结构(技术上称为混合配置)、潜在特质之间的相关性、无应答机制、缺失率、样本量和插补方法。我们发现,一般来说,基于受访者从其观察到的项目分数中获得的平均分数来插补分数的方法,能最好地从完整数据中恢复因子载荷结构。此外,对于单维数据,有残差误差的个人均值方法比其他插补方法(无论有无残差误差成分)能给出更好的结果。对于EM方法,分析了一个较小的设计。结论是,两种EM方法比插补方法能更好地恢复完整数据的因子载荷。