Guastadisegni Lucia, Cagnone Silvia, Moustaki Irini, Vasdekis Vassilis
University of Bologna, Bologna, Italy.
London School of Economics and Political Science, London, UK.
Educ Psychol Meas. 2022 Apr;82(2):254-280. doi: 10.1177/00131644211020355. Epub 2021 Jun 2.
This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach, the generalized Lagrange multiplier test and the generalized jackknife score test. The two model misspecifications are those of local dependence among items and nonnormal distribution of the latent variable. The power of the tests is computed in two ways, empirically through Monte Carlo simulation methods and asymptotically, using the asymptotic distribution of each test under the alternative hypothesis. The performance of these tests is evaluated by means of a simulation study. The results highlight that, under mild model misspecification, all tests have good performance while, under strong model misspecification, the tests performance deteriorates, especially for false positive rates under local dependence and power for small sample size under misspecification of the latent variable distribution. In general, the Lagrange multiplier test computed with the Hessian approach and the generalized Lagrange multiplier test have better performance in terms of false positive rates while the Lagrange multiplier test computed with the cross-product approach has the highest power for small sample sizes. The asymptotic power turns out to be a good alternative to the classic empirical power because it is less time consuming. The Lagrange tests studied here have been also applied to a real data set.
本文研究了在模型设定错误的情况下,用于检测二元数据的项目反应理论(IRT)模型中测量非不变性的拉格朗日乘数检验的四个版本的第一类错误、误报率和功效。所考虑的检验包括使用海森矩阵和交叉乘积方法计算的拉格朗日乘数检验、广义拉格朗日乘数检验和广义刀切分数检验。两种模型设定错误分别是项目之间的局部依赖性和潜在变量的非正态分布。检验的功效通过两种方式计算,一种是通过蒙特卡罗模拟方法进行实证计算,另一种是在备择假设下使用每个检验的渐近分布进行渐近计算。通过模拟研究对这些检验的性能进行了评估。结果表明,在轻度模型设定错误下,所有检验都具有良好的性能,而在强模型设定错误下,检验性能会下降,特别是在局部依赖性下的误报率以及潜在变量分布设定错误下小样本量的功效方面。一般来说,用海森矩阵方法计算的拉格朗日乘数检验和广义拉格朗日乘数检验在误报率方面表现更好,而用交叉乘积方法计算的拉格朗日乘数检验在小样本量时具有最高的功效。渐近功效结果是经典实证功效的一个很好的替代方法,因为它耗时较少。这里研究的拉格朗日检验也已应用于一个实际数据集。