Suppr超能文献

评估存在数据污染情况下基于多变量的结构方程模型的拟合指数

Evaluating Fit Indices for Multivariate -Based Structural Equation Modeling with Data Contamination.

作者信息

Lai Mark H C, Zhang Jiaqi

机构信息

School of Education, University of CincinnatiCincinnati, OH, United States.

出版信息

Front Psychol. 2017 Jul 28;8:1286. doi: 10.3389/fpsyg.2017.01286. eCollection 2017.

Abstract

In conventional structural equation modeling (SEM), with the presence of even a tiny amount of data contamination due to outliers or influential observations, normal-theory maximum likelihood (ML-Normal) is not efficient and can be severely biased. The multivariate--based SEM, which recently got implemented in Mplus as an approach for mixture modeling, represents a robust estimation alternative to downweigh the impact of outliers and influential observations. To our knowledge, the use of maximum likelihood estimation with a multivariate- model (ML-) to handle outliers has not been shown in SEM literature. In this paper we demonstrate the use of ML- using the classic Holzinger and Swineford (1939) data set with a few observations modified as outliers or influential observations. A simulation study is then conducted to examine the performance of fit indices and information criteria under ML-Normal and ML- in the presence of outliers. Results showed that whereas all fit indices got worse for ML-Normal with increasing amount of outliers and influential observations, their values were relatively stable with ML-, and the use of information criteria was effective in selecting ML-normal without data contamination and selecting ML- with data contamination, especially when the sample size was at least 200.

摘要

在传统的结构方程模型(SEM)中,即使存在由于异常值或有影响的观测值导致的少量数据污染,正态理论最大似然法(ML-Normal)也效率不高且可能存在严重偏差。基于多变量的SEM最近在Mplus中作为一种混合建模方法得以实现,它是一种稳健估计方法,可减轻异常值和有影响的观测值的影响。据我们所知,在结构方程模型文献中尚未展示使用多变量模型的最大似然估计(ML-)来处理异常值的情况。在本文中,我们使用经典的霍尔津格和斯温福德(1939)数据集,并将一些观测值修改为异常值或有影响的观测值,以此展示ML-的使用方法。接着进行了一项模拟研究,以检验在存在异常值的情况下,ML-Normal和ML-下拟合指数和信息准则的性能。结果表明,随着异常值和有影响的观测值数量增加,ML-Normal的所有拟合指数都会变差,而ML-的拟合指数值相对稳定,并且信息准则的使用在选择无数据污染情况下的ML-normal和有数据污染情况下的ML-时是有效的,尤其是当样本量至少为200时。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ca93/5532449/f1dd65132e0c/fpsyg-08-01286-g0001.jpg

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验