Lai Mark H C, Zhang Jiaqi
School of Education, University of CincinnatiCincinnati, OH, United States.
Front Psychol. 2017 Jul 28;8:1286. doi: 10.3389/fpsyg.2017.01286. eCollection 2017.
In conventional structural equation modeling (SEM), with the presence of even a tiny amount of data contamination due to outliers or influential observations, normal-theory maximum likelihood (ML-Normal) is not efficient and can be severely biased. The multivariate--based SEM, which recently got implemented in Mplus as an approach for mixture modeling, represents a robust estimation alternative to downweigh the impact of outliers and influential observations. To our knowledge, the use of maximum likelihood estimation with a multivariate- model (ML-) to handle outliers has not been shown in SEM literature. In this paper we demonstrate the use of ML- using the classic Holzinger and Swineford (1939) data set with a few observations modified as outliers or influential observations. A simulation study is then conducted to examine the performance of fit indices and information criteria under ML-Normal and ML- in the presence of outliers. Results showed that whereas all fit indices got worse for ML-Normal with increasing amount of outliers and influential observations, their values were relatively stable with ML-, and the use of information criteria was effective in selecting ML-normal without data contamination and selecting ML- with data contamination, especially when the sample size was at least 200.
在传统的结构方程模型(SEM)中,即使存在由于异常值或有影响的观测值导致的少量数据污染,正态理论最大似然法(ML-Normal)也效率不高且可能存在严重偏差。基于多变量的SEM最近在Mplus中作为一种混合建模方法得以实现,它是一种稳健估计方法,可减轻异常值和有影响的观测值的影响。据我们所知,在结构方程模型文献中尚未展示使用多变量模型的最大似然估计(ML-)来处理异常值的情况。在本文中,我们使用经典的霍尔津格和斯温福德(1939)数据集,并将一些观测值修改为异常值或有影响的观测值,以此展示ML-的使用方法。接着进行了一项模拟研究,以检验在存在异常值的情况下,ML-Normal和ML-下拟合指数和信息准则的性能。结果表明,随着异常值和有影响的观测值数量增加,ML-Normal的所有拟合指数都会变差,而ML-的拟合指数值相对稳定,并且信息准则的使用在选择无数据污染情况下的ML-normal和有数据污染情况下的ML-时是有效的,尤其是当样本量至少为200时。