项目反应理论模型的受限再校准。

Restricted Recalibration of Item Response Theory Models.

机构信息

Department of Human Development and Quantitative Methodology, University of Maryland, College Park, USA.

Department of Psychology, University of South Carolina, Columbia, USA.

出版信息

Psychometrika. 2019 Jun;84(2):529-553. doi: 10.1007/s11336-019-09667-4. Epub 2019 Mar 20.

DOI:10.1007/s11336-019-09667-4

PMID:30895437

Abstract

In item response theory (IRT), it is often necessary to perform restricted recalibration (RR) of the model: A set of (focal) parameters is estimated holding a set of (nuisance) parameters fixed. Typical applications of RR include expanding an existing item bank, linking multiple test forms, and associating constructs measured by separately calibrated tests. In the current work, we provide full statistical theory for RR of IRT models under the framework of pseudo-maximum likelihood estimation. We describe the standard error calculation for the focal parameters, the assessment of overall goodness-of-fit (GOF) of the model, and the identification of misfitting items. We report a simulation study to evaluate the performance of these methods in the scenario of adding a new item to an existing test. Parameter recovery for the focal parameters as well as Type I error and power of the proposed tests are examined. An empirical example is also included, in which we validate the pediatric fatigue short-form scale in the Patient-Reported Outcome Measurement Information System (PROMIS), compute global and local GOF statistics, and update parameters for the misfitting items.

摘要

在项目反应理论 (IRT) 中，通常需要对模型进行受限再校准 (RR)：固定一组 (干扰) 参数，估计一组 (焦点) 参数。RR 的典型应用包括扩展现有的项目库、链接多个测试表单，以及关联由单独校准测试测量的结构。在当前的工作中，我们在拟最大似然估计框架下为 IRT 模型的 RR 提供了完整的统计理论。我们描述了焦点参数的标准误差计算、模型的整体拟合优度 (GOF) 评估以及不匹配项目的识别。我们报告了一项模拟研究，以评估这些方法在向现有测试中添加新项目的情况下的性能。还检查了焦点参数的参数恢复、拟议测试的Ⅰ类错误和功效。还包括一个实证示例，我们验证了患者报告的结局测量信息系统 (PROMIS) 中的儿科疲劳简短量表，计算了全球和局部 GOF 统计数据，并更新了不匹配项目的参数。

相似文献

Restricted Recalibration of Item Response Theory Models.项目反应理论模型的受限再校准。

Psychometrika. 2019 Jun;84(2):529-553. doi: 10.1007/s11336-019-09667-4. Epub 2019 Mar 20.

Impact of IRT item misfit on score estimates and severity classifications: an examination of PROMIS depression and pain interference item banks.IRT项目不匹配对分数估计和严重程度分类的影响：对患者报告结果测量信息系统（PROMIS）抑郁和疼痛干扰项目库的检验

Qual Life Res. 2017 Mar;26(3):555-564. doi: 10.1007/s11136-016-1467-3. Epub 2016 Dec 1.

Comparing five depression measures in depressed Chinese patients using item response theory: an examination of item properties, measurement precision and score comparability.运用项目反应理论比较中国抑郁症患者的五种抑郁量表：项目特性、测量精度及分数可比性检验

Health Qual Life Outcomes. 2017 Apr 4;15(1):60. doi: 10.1186/s12955-017-0631-y.

Characterizing Sampling Variability for Item Response Theory Scale Scores in a Fixed-Parameter Calibrated Projection Design.在固定参数校准投影设计中刻画项目反应理论量表分数的抽样变异性

Appl Psychol Meas. 2022 Sep;46(6):509-528. doi: 10.1177/01466216221108136. Epub 2022 Jun 20.

Differential item functioning in the Patient Reported Outcomes Measurement Information System Pediatric Short Forms in a sample of children and adolescents with cerebral palsy.脑瘫儿童和青少年样本中患者报告结局测量信息系统儿科简表中的项目功能差异。

Dev Med Child Neurol. 2016 Nov;58(11):1132-1138. doi: 10.1111/dmcn.13138. Epub 2016 Apr 21.

The PROMIS fatigue item bank has good measurement properties in patients with fibromyalgia and severe fatigue.患者报告结果测量信息系统（PROMIS）疲劳条目库在纤维肌痛和严重疲劳患者中具有良好的测量属性。

Qual Life Res. 2017 Jun;26(6):1417-1426. doi: 10.1007/s11136-017-1501-0. Epub 2017 Jan 30.

Limited-information goodness-of-fit testing of diagnostic classification item response models.诊断分类项目反应模型的有限信息拟合优度检验

Br J Math Stat Psychol. 2016 Nov;69(3):225-252. doi: 10.1111/bmsp.12074.

The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.irtQ R 包：一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。

J Educ Eval Health Prof. 2024;21:23. doi: 10.3352/jeehp.2024.21.23. Epub 2024 Sep 12.

Rasch fit statistics as a test of the invariance of item parameter estimates.拉施拟合统计作为项目参数估计不变性的一种检验。

J Appl Meas. 2003;4(2):153-63.

Testing item response theory invariance of the standardized Quality-of-life Disease Impact Scale (QDIS(®)) in acute coronary syndrome patients: differential functioning of items and test.急性冠状动脉综合征患者中标准化生活质量疾病影响量表（QDIS(®)）的项目反应理论不变性测试：项目和测试的差异功能

Qual Life Res. 2015 Aug;24(8):1809-22. doi: 10.1007/s11136-015-0916-8. Epub 2015 Jan 20.

引用本文的文献

Appl Psychol Meas. 2022 Sep;46(6):509-528. doi: 10.1177/01466216221108136. Epub 2022 Jun 20.

Psychometric Properties of an Instrument to Assess the Fear of COVID-19 in a Sample in Argentina: a Mixed Approach.一种用于评估阿根廷样本中对新冠病毒恐惧的工具的心理测量特性：一种混合方法。

Int J Ment Health Addict. 2022 Jan 14:1-14. doi: 10.1007/s11469-021-00742-5.

本文引用的文献

Bootstrap-Calibrated Interval Estimates for Latent Variable Scores in Item Response Theory.项目反应理论中潜在变量分数的引导校准区间估计。

Psychometrika. 2018 Jun;83(2):333-354. doi: 10.1007/s11336-017-9582-9. Epub 2017 Sep 6.

Identifying the Source of Misfit in Item Response Theory Models.识别项目反应理论模型中的不匹配来源。

Multivariate Behav Res. 2014 Jul-Aug;49(4):354-71. doi: 10.1080/00273171.2014.910744.

Assessing Approximate Fit in Categorical Data Analysis.评估分类数据分析中的近似拟合度。

Multivariate Behav Res. 2014 Jul-Aug;49(4):305-28. doi: 10.1080/00273171.2014.911075.

Factor Analysis of Ordinal Variables: A Comparison of Three Approaches.有序变量的因子分析：三种方法的比较

Multivariate Behav Res. 2001 Jul 1;36(3):347-87. doi: 10.1207/S15327906347-387.

Item diagnostics in multivariate discrete data.多元离散数据中的项目诊断

Psychol Methods. 2015 Jun;20(2):276-92. doi: 10.1037/a0039015. Epub 2015 Apr 13.

Comparing score tests and other local dependence diagnostics for the graded response model.比较等级反应模型的比分检验和其他局部相依性诊断方法。

Br J Math Stat Psychol. 2014 Nov;67(3):496-513. doi: 10.1111/bmsp.12030. Epub 2013 Nov 25.

Assessing item fit for unidimensional item response theory models using residuals from estimated item response functions.使用估计的项目反应函数的残差评估一维项目反应理论模型的项目拟合度。

Psychometrika. 2013 Jul;78(3):417-40. doi: 10.1007/s11336-012-9305-1. Epub 2012 Dec 14.

Development and psychometric properties of the PROMIS(®) pediatric fatigue item banks.PROMIS(®) 儿童疲劳项目库的开发和心理计量学特性。

Qual Life Res. 2013 Nov;22(9):2417-27. doi: 10.1007/s11136-013-0357-1. Epub 2013 Feb 2.

Characterizing Sources of Uncertainty in IRT Scale Scores.刻画IRT量表分数中的不确定性来源

Educ Psychol Meas. 2012 Apr 1;72(2):264-290. doi: 10.1177/0013164411410056. Epub 2011 Aug 25.

Limited-information goodness-of-fit testing of hierarchical item factor models.层次项目因子模型的有限信息拟合优度检验。

Br J Math Stat Psychol. 2013 May;66(2):245-76. doi: 10.1111/j.2044-8317.2012.02050.x. Epub 2012 May 29.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

项目反应理论模型的受限再校准。

Restricted Recalibration of Item Response Theory Models.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献