在模型设定错误情况下使用拉格朗日乘数检验评估测量不变性。

Use of the Lagrange Multiplier Test for Assessing Measurement Invariance Under Model Misspecification.

作者信息

Guastadisegni Lucia, Cagnone Silvia, Moustaki Irini, Vasdekis Vassilis

机构信息

University of Bologna, Bologna, Italy.

London School of Economics and Political Science, London, UK.

出版信息

Educ Psychol Meas. 2022 Apr;82(2):254-280. doi: 10.1177/00131644211020355. Epub 2021 Jun 2.

DOI:10.1177/00131644211020355

PMID:35185159

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8850767/

Abstract

This article studies the Type I error, false positive rates, and power of four versions of the Lagrange multiplier test to detect measurement noninvariance in item response theory (IRT) models for binary data under model misspecification. The tests considered are the Lagrange multiplier test computed with the Hessian and cross-product approach, the generalized Lagrange multiplier test and the generalized jackknife score test. The two model misspecifications are those of local dependence among items and nonnormal distribution of the latent variable. The power of the tests is computed in two ways, empirically through Monte Carlo simulation methods and asymptotically, using the asymptotic distribution of each test under the alternative hypothesis. The performance of these tests is evaluated by means of a simulation study. The results highlight that, under mild model misspecification, all tests have good performance while, under strong model misspecification, the tests performance deteriorates, especially for false positive rates under local dependence and power for small sample size under misspecification of the latent variable distribution. In general, the Lagrange multiplier test computed with the Hessian approach and the generalized Lagrange multiplier test have better performance in terms of false positive rates while the Lagrange multiplier test computed with the cross-product approach has the highest power for small sample sizes. The asymptotic power turns out to be a good alternative to the classic empirical power because it is less time consuming. The Lagrange tests studied here have been also applied to a real data set.

摘要

本文研究了在模型设定错误的情况下，用于检测二元数据的项目反应理论（IRT）模型中测量非不变性的拉格朗日乘数检验的四个版本的第一类错误、误报率和功效。所考虑的检验包括使用海森矩阵和交叉乘积方法计算的拉格朗日乘数检验、广义拉格朗日乘数检验和广义刀切分数检验。两种模型设定错误分别是项目之间的局部依赖性和潜在变量的非正态分布。检验的功效通过两种方式计算，一种是通过蒙特卡罗模拟方法进行实证计算，另一种是在备择假设下使用每个检验的渐近分布进行渐近计算。通过模拟研究对这些检验的性能进行了评估。结果表明，在轻度模型设定错误下，所有检验都具有良好的性能，而在强模型设定错误下，检验性能会下降，特别是在局部依赖性下的误报率以及潜在变量分布设定错误下小样本量的功效方面。一般来说，用海森矩阵方法计算的拉格朗日乘数检验和广义拉格朗日乘数检验在误报率方面表现更好，而用交叉乘积方法计算的拉格朗日乘数检验在小样本量时具有最高的功效。渐近功效结果是经典实证功效的一个很好的替代方法，因为它耗时较少。这里研究的拉格朗日检验也已应用于一个实际数据集。

相似文献

Use of the Lagrange Multiplier Test for Assessing Measurement Invariance Under Model Misspecification.在模型设定错误情况下使用拉格朗日乘数检验评估测量不变性。

Educ Psychol Meas. 2022 Apr;82(2):254-280. doi: 10.1177/00131644211020355. Epub 2021 Jun 2.

On Lagrange Multiplier Tests in Multidimensional Item Response Theory: Information Matrices and Model Misspecification.关于多维项目反应理论中的拉格朗日乘数检验：信息矩阵与模型误设

Educ Psychol Meas. 2018 Aug;78(4):653-678. doi: 10.1177/0013164417714506. Epub 2017 Jul 6.

New roles of Lagrange multiplier method in generalizability theory: Inference of estimating the optimal sample size for teaching ability evaluation of college teachers.拉格朗日乘数法在概化理论中的新角色：推断最佳样本量以评估高校教师教学能力。

PLoS One. 2024 Oct 17;19(10):e0307710. doi: 10.1371/journal.pone.0307710. eCollection 2024.

Testing Linear Models for Ability Parameters in Item Response Models.测试项目反应模型中能力参数的线性模型。

Multivariate Behav Res. 2005 Jan 1;40(1):25-51. doi: 10.1207/s15327906mbr4001_2.

Comparing score tests and other local dependence diagnostics for the graded response model.比较等级反应模型的比分检验和其他局部相依性诊断方法。

Br J Math Stat Psychol. 2014 Nov;67(3):496-513. doi: 10.1111/bmsp.12030. Epub 2013 Nov 25.

Model Modification in Covariance Structure Modeling: A Comparison among Likelihood Ratio, Lagrange Multiplier, and Wald Tests.协方差结构建模中的模型修正：似然比、拉格朗日乘数和 Wald 检验的比较。

Multivariate Behav Res. 1990 Jan 1;25(1):115-36. doi: 10.1207/s15327906mbr2501_13.

Impact of error structure misspecification when testing measurement invariance and latent-factor mean difference using MIMIC and multiple-group confirmatory factor analysis.使用 MIMIC 和多群组验证性因子分析检验测量不变性和潜在因子均值差异时，误差结构误设的影响。

Behav Res Methods. 2019 Dec;51(6):2688-2699. doi: 10.3758/s13428-018-1124-6.

Assessing and Resolving Model Misspecifications in Metabolic Flux Analysis.评估与解决代谢通量分析中的模型误设问题

Bioengineering (Basel). 2017 May 24;4(2):48. doi: 10.3390/bioengineering4020048.

Fit Indexes, Lagrange Multipliers, Constraint Changes and Incomplete Data in Structural Models.结构模型中的拟合指数、拉格朗日乘数、约束变化和不完整数据。

Multivariate Behav Res. 1990 Apr 1;25(2):163-72. doi: 10.1207/s15327906mbr2502_3.

Lagrange multiplier based transport theory for quantum wires.基于拉格朗日乘数法的量子线输运理论。

J Chem Phys. 2004 Apr 15;120(15):7165-8. doi: 10.1063/1.1687316.

引用本文的文献

Investigating heterogeneity in IRTree models for multiple response processes with score-based partitioning.使用基于分数的划分方法研究用于多个响应过程的IRT树模型中的异质性。

Br J Math Stat Psychol. 2025 May;78(2):420-439. doi: 10.1111/bmsp.12367. Epub 2024 Nov 4.

Power Analysis for the Wald, LR, Score, and Gradient Tests in a Marginal Maximum Likelihood Framework: Applications in IRT.边缘极大似然框架下 Wald、LR、Score 和梯度检验的功效分析：IRT 中的应用。

Psychometrika. 2023 Dec;88(4):1249-1298. doi: 10.1007/s11336-022-09883-5. Epub 2022 Aug 27.

本文引用的文献

Educ Psychol Meas. 2018 Aug;78(4):653-678. doi: 10.1177/0013164417714506. Epub 2017 Jul 6.

Statistical power of likelihood ratio and Wald tests in latent class models with covariates.具有协变量的潜类模型中似然比和 Wald 检验的统计功效。

Behav Res Methods. 2017 Oct;49(5):1824-1837. doi: 10.3758/s13428-016-0825-y.

A Monte Carlo Investigation of Methods for Controlling Type I Errors with Specification Searches in Structural Equation Modeling.结构方程模型中通过规范搜索控制I型错误方法的蒙特卡洛研究

Multivariate Behav Res. 1998 Jul 1;33(3):365-83. doi: 10.1207/s15327906mbr3303_3.

Comparing score tests and other local dependence diagnostics for the graded response model.比较等级反应模型的比分检验和其他局部相依性诊断方法。

Br J Math Stat Psychol. 2014 Nov;67(3):496-513. doi: 10.1111/bmsp.12030. Epub 2013 Nov 25.

Goodness-of-fit testing using components based on marginal frequencies of multinomial data.使用基于多项数据边际频率的组件进行拟合优度检验。

Br J Math Stat Psychol. 2008 Nov;61(Pt 2):331-60. doi: 10.1348/000711007X204215. Epub 2007 Apr 21.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验