Identifying the Source of Misfit in Item Response Theory Models.

Author information

Liu Yang, Maydeu-Olivares Alberto

Affiliations

a The University of North Carolina at Chapel Hill.

b Faculty of Psychology, University of Barcelona.

Publication information

Multivariate Behav Res. 2014 Jul-Aug;49(4):354-71. doi: 10.1080/00273171.2014.910744.

DOI:10.1080/00273171.2014.910744

PMID:26765803

Abstract

When an item response theory model fails to fit adequately, the items for which the model provides a good fit and those for which it does not must be determined. To this end, we compare the performance of several fit statistics for item pairs with known asymptotic distributions under maximum likelihood estimation of the item parameters: (a) a mean and variance adjustment to bivariate Pearson's X², (b) a bivariate subtable analog to Reiser's (1996) overall goodness-of-fit test, (c) a z statistic for the bivariate residual cross product, and (d) Maydeu-Olivares and Joe's (2006) M₂ statistic applied to bivariate subtables. The unadjusted Pearson's X² with heuristically determined degrees of freedom is also included in the comparison. For binary and ordinal data, our simulation results suggest that the z statistic has the best Type I error and power behavior among all the statistics under investigation when the observed information matrix is used in its computation. However, if one has to use the cross-product information, the mean and variance adjusted X² is recommended. We illustrate the use of pairwise fit statistics in 2 real-data examples and discuss possible extensions of the current research in various directions.
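As a rough illustration of the raw quantities behind the pairwise statistics named in the abstract, the sketch below computes the unadjusted Pearson's X² and a residual cross product for a single item pair. It is not the authors' implementation. The function names, the 0, 1, ..., K-1 category scoring, and the toy numbers are assumptions made for this example, and the model-implied probabilities are assumed to come from an IRT model that has already been fitted. The mean and variance adjustment and the standard error needed for the z statistic both depend on the asymptotic covariance matrix of the item parameter estimates, which is not computed here.

```python
def pairwise_pearson_x2(observed_counts, model_probs):
    """Unadjusted Pearson X^2 for the bivariate subtable of one item pair.

    observed_counts: nested list of observed cell frequencies (rows = item i).
    model_probs: nested list of model-implied cell probabilities, same shape
                 (assumed to come from an already fitted model).
    """
    n = sum(sum(row) for row in observed_counts)
    x2 = 0.0
    for obs_row, prob_row in zip(observed_counts, model_probs):
        for count, prob in zip(obs_row, prob_row):
            expected = n * prob
            x2 += (count - expected) ** 2 / expected
    return x2


def residual_cross_product(observed_counts, model_probs):
    """Observed minus model-implied mean of the product of the two item scores.

    Categories are scored 0, 1, ..., K-1 (an assumption of this sketch).
    """
    n = sum(sum(row) for row in observed_counts)
    resid = 0.0
    for a, (obs_row, prob_row) in enumerate(zip(observed_counts, model_probs)):
        for b, (count, prob) in enumerate(zip(obs_row, prob_row)):
            resid += a * b * (count / n - prob)
    return resid


# Hypothetical data for two binary items answered by 100 respondents.
observed = [[40, 10], [20, 30]]
implied = [[0.35, 0.15], [0.15, 0.35]]

x2 = pairwise_pearson_x2(observed, implied)
rcp = residual_cross_product(observed, implied)
print(f"X2 = {x2:.4f}, residual cross product = {rcp:.4f}")
```

Neither number should be referred to a reference distribution on its own: the abstract's point is that the unadjusted X² needs heuristic degrees of freedom or a mean and variance adjustment, and the residual has to be divided by its asymptotic standard error to give the z statistic.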

Similar articles

Limited-information goodness-of-fit testing of diagnostic classification item response models.

Br J Math Stat Psychol. 2016 Nov;69(3):225-252. doi: 10.1111/bmsp.12074.

Assessing Approximate Fit in Categorical Data Analysis.

Multivariate Behav Res. 2014 Jul-Aug;49(4):305-28. doi: 10.1080/00273171.2014.911075.

Limited-information goodness-of-fit testing of hierarchical item factor models.

Br J Math Stat Psychol. 2013 May;66(2):245-76. doi: 10.1111/j.2044-8317.2012.02050.x. Epub 2012 May 29.

Incremental Model Fit Assessment in the Case of Categorical Data: Tucker-Lewis Index for Item Response Theory Modeling.

Prev Sci. 2023 Apr;24(3):455-466. doi: 10.1007/s11121-021-01253-4. Epub 2021 May 10.

How should we assess the fit of Rasch-type models? Approximating the power of goodness-of-fit statistics in categorical data analysis.

Psychometrika. 2013 Jan;78(1):116-33. doi: 10.1007/s11336-012-9293-1. Epub 2012 Oct 20.

Investigating the Behaviors of and RMSEA in Fitting a Unidimensional Model to Multidimensional Data.

Appl Psychol Meas. 2017 Nov;41(8):632-644. doi: 10.1177/0146621617710464. Epub 2017 May 30.

An Extended GFfit Statistic Defined on Orthogonal Components of Pearson's Chi-Square.

Psychometrika. 2023 Mar;88(1):208-240. doi: 10.1007/s11336-022-09866-6. Epub 2022 Jun 3.

Limited-information goodness-of-fit testing of item response theory models for sparse 2^P tables.

Br J Math Stat Psychol. 2006 May;59(Pt 1):173-94. doi: 10.1348/000711005X66419.

Identifying measurement disturbance effects using Rasch item fit statistics and the Logit Residual Index.

J Outcome Meas. 1998;2(4):338-50.

Cited by

Getting started with the graded response model: An introduction and tutorial in R.

Int J Psychol. 2025 Feb;60(1):e13265. doi: 10.1002/ijop.13265. Epub 2024 Nov 12.

Modified Item-Fit Indices for Dichotomous IRT Models with Missing Data.

Appl Psychol Meas. 2022 Nov;46(8):705-719. doi: 10.1177/01466216221125176. Epub 2022 Sep 19.

An Extended GFfit Statistic Defined on Orthogonal Components of Pearson's Chi-Square.

Psychometrika. 2023 Mar;88(1):208-240. doi: 10.1007/s11336-022-09866-6. Epub 2022 Jun 3.

Analyzing the Fit of IRT Models With the Hausman Test.

Front Psychol. 2020 Feb 11;11:149. doi: 10.3389/fpsyg.2020.00149. eCollection 2020.

Restricted Recalibration of Item Response Theory Models.

Psychometrika. 2019 Jun;84(2):529-553. doi: 10.1007/s11336-019-09667-4. Epub 2019 Mar 20.

Assessing Item-Level Fit for Higher Order Item Response Theory Models.

Appl Psychol Meas. 2018 Nov;42(8):644-659. doi: 10.1177/0146621618762740. Epub 2018 Mar 21.

Generalized Fiducial Inference for Binary Logistic Item Response Models.

Psychometrika. 2016 Jun;81(2):290-324. doi: 10.1007/s11336-015-9492-7. Epub 2016 Jan 14.
