• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探究单维IRT模型中模型失配的实际后果。

Investigating the Practical Consequences of Model Misfit in Unidimensional IRT Models.

作者信息

Crişan Daniela R, Tendeiro Jorge N, Meijer Rob R

机构信息

University of Groningen, The Netherlands.

出版信息

Appl Psychol Meas. 2017 Sep;41(6):439-455. doi: 10.1177/0146621617695522. Epub 2017 Mar 17.

DOI:10.1177/0146621617695522
PMID:28804181
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5533251/
Abstract

In this article, the consequences of violations of unidimensionality on selection decisions in the framework of unidimensional item response theory (IRT) models are investigated based on simulated data. The factors manipulated include the severity of violations, the proportion of misfitting items, and test length. The outcomes that were considered are the precision and accuracy of the estimated model parameters, the correlations of estimated ability ([Formula: see text]) and number-correct ([Formula: see text]) scores with the true ability ([Formula: see text]), the ranks of the examinees and the overlap between sets of examinees selected based on either [Formula: see text], [Formula: see text], or [Formula: see text] scores, and the bias in criterion-related validity estimates. Results show that the [Formula: see text] values were unbiased by violations of unidimensionality, but their precision decreased as multidimensionality and the proportion of misfitting items increased; the estimated item parameters were robust to violations of unidimensionality. The correlations between [Formula: see text], [Formula: see text], and [Formula: see text] scores, the agreement between the three selection criteria, and the accuracy of criterion-related validity estimates are all negatively affected, to some extent, by increasing levels of multidimensionality and the proportion of misfitting items. However, removing the misfitting items only improved the results in the case of severe multidimensionality and large proportion of misfitting items, and deteriorated them otherwise.

摘要

在本文中,基于模拟数据研究了在单维项目反应理论(IRT)模型框架下违反单维性对选择决策的影响。所操纵的因素包括违反的严重程度、不拟合项目的比例和测验长度。所考虑的结果包括估计模型参数的精度和准确性、估计能力([公式:见原文])和答对题数([公式:见原文])分数与真实能力([公式:见原文])的相关性、考生的排名以及基于[公式:见原文]、[公式:见原文]或[公式:见原文]分数选择的考生集合之间的重叠情况,以及与标准相关的效度估计中的偏差。结果表明,[公式:见原文]值不受违反单维性的影响,但随着多维性和不拟合项目比例的增加,其精度会降低;估计的项目参数对违反单维性具有稳健性。[公式:见原文]、[公式:见原文]和[公式:见原文]分数之间的相关性、三种选择标准之间的一致性以及与标准相关的效度估计的准确性在一定程度上都受到多维性水平和不拟合项目比例增加的负面影响。然而,去除不拟合项目仅在严重多维性和不拟合项目比例较大的情况下改善了结果,否则会使其恶化。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/bbed92237fb6/10.1177_0146621617695522-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/a08f5091cfa0/10.1177_0146621617695522-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/76304cc49a9d/10.1177_0146621617695522-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/88b11a42f05a/10.1177_0146621617695522-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/ec081376f965/10.1177_0146621617695522-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/bbed92237fb6/10.1177_0146621617695522-fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/a08f5091cfa0/10.1177_0146621617695522-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/76304cc49a9d/10.1177_0146621617695522-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/88b11a42f05a/10.1177_0146621617695522-fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/ec081376f965/10.1177_0146621617695522-fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/36e1/5978523/bbed92237fb6/10.1177_0146621617695522-fig5.jpg

相似文献

1
Investigating the Practical Consequences of Model Misfit in Unidimensional IRT Models.探究单维IRT模型中模型失配的实际后果。
Appl Psychol Meas. 2017 Sep;41(6):439-455. doi: 10.1177/0146621617695522. Epub 2017 Mar 17.
2
Impact of IRT item misfit on score estimates and severity classifications: an examination of PROMIS depression and pain interference item banks.IRT项目不匹配对分数估计和严重程度分类的影响:对患者报告结果测量信息系统(PROMIS)抑郁和疼痛干扰项目库的检验
Qual Life Res. 2017 Mar;26(3):555-564. doi: 10.1007/s11136-016-1467-3. Epub 2016 Dec 1.
3
Practical Consequences of Item Response Theory Model Misfit in the Context of Test Equating with Mixed-Format Test Data.在混合格式测试数据的测试等值背景下,项目反应理论模型失配的实际后果。
Front Psychol. 2017 Apr 4;8:484. doi: 10.3389/fpsyg.2017.00484. eCollection 2017.
4
On the Practical Consequences of Misfit in Mokken Scaling.关于莫肯量表中不匹配的实际后果
Appl Psychol Meas. 2020 Sep;44(6):482-496. doi: 10.1177/0146621620920925. Epub 2020 Jun 1.
5
Practical Significance of Item Misfit in Educational Assessments.教育评估中项目不匹配的实际意义。
Appl Psychol Meas. 2017 Jul;41(5):388-400. doi: 10.1177/0146621617692978. Epub 2017 Mar 1.
6
Unidimensional IRT Item Parameter Estimates Across Equivalent Test Forms With Confounding Specifications Within Dimensions.具有维度内混杂规范的等效测试形式间的单维项目反应理论项目参数估计
Educ Psychol Meas. 2016 Apr;76(2):258-279. doi: 10.1177/0013164415589756. Epub 2015 Jun 9.
7
Assessing Item-Level Fit for Higher Order Item Response Theory Models.评估高阶项目反应理论模型的项目水平拟合度。
Appl Psychol Meas. 2018 Nov;42(8):644-659. doi: 10.1177/0146621618762740. Epub 2018 Mar 21.
8
Genetic parameters and expected responses to selection for components of feed efficiency in a Duroc pig line.杜洛克猪系饲料效率构成因素的遗传参数和选择反应。
Genet Sel Evol. 2017 Dec 1;49(1):86. doi: 10.1186/s12711-017-0362-x.
9
Asymptotically Correct Standardization of Person-Fit Statistics Beyond Dichotomous Items.二分法项目之外的人适切性统计量的渐近正确标准化
Psychometrika. 2016 Dec;81(4):992-1013. doi: 10.1007/s11336-015-9465-x. Epub 2015 May 8.
10
Detecting and Explaining Aberrant Responding to the Outcome Questionnaire-45.检测并解释对结果问卷-45的异常反应
Assessment. 2015 Aug;22(4):513-24. doi: 10.1177/1073191114560882. Epub 2014 Dec 16.

引用本文的文献

1
Development of self-report measures of physical, mental, and emotional fatigability: the michigan fatigability index (MIFI).身体、心理和情绪易疲劳性自我报告测量方法的开发:密歇根易疲劳指数(MIFI)。
Qual Life Res. 2025 Jun;34(6):1735-1748. doi: 10.1007/s11136-025-03934-x. Epub 2025 Mar 6.
2
Using ECHO program data to develop a brief measure of caregiver support and cognitive stimulation using the home observation for measurement of the environment-infant/toddler (HOME-IT).利用 ECHO 项目数据,采用家庭观察评估婴儿/学步儿环境(HOME-IT)开发一种简要的照料者支持和认知刺激测量工具。
Child Dev. 2024 Nov-Dec;95(6):2241-2251. doi: 10.1111/cdev.14137. Epub 2024 Jul 30.
3

本文引用的文献

1
On the Assessment of Monte Carlo Error in Simulation-Based Statistical Analyses.基于模拟的统计分析中蒙特卡罗误差的评估
Am Stat. 2009 May 1;63(2):155-162. doi: 10.1198/tast.2009.0030.
2
A power primer.强力底漆。
Psychol Bull. 1992 Jul;112(1):155-9. doi: 10.1037//0033-2909.112.1.155.
3
The role of the bifactor model in resolving dimensionality issues in health outcomes measures.双因素模型在解决健康结果测量中的维度问题方面的作用。
How to screen for social withdrawal in primary care: An evaluation of the alarm distress baby scale using item response theory.
如何在初级保健中筛查社交退缩:使用项目反应理论对警报苦恼婴儿量表的评估。
Int J Nurs Stud Adv. 2021 Jul 15;3:100038. doi: 10.1016/j.ijnsa.2021.100038. eCollection 2021 Nov.
4
New Dizziness Impact Measures of Positional, Functional, and Emotional Status Were Supported for Reliability, Validity, and Efficiency.新的关于位置、功能和情绪状态的头晕影响测量方法在可靠性、有效性和效率方面得到了验证。
Arch Rehabil Res Clin Transl. 2024 Jan 5;6(1):100320. doi: 10.1016/j.arrct.2024.100320. eCollection 2024 Mar.
5
Development of the PROMIS pediatric stigma and extension to the PROMIS pediatric stigma: skin item banks.开发 PROMIS 儿童污名量表及其在 PROMIS 儿童污名:皮肤项目库中的扩展。
Qual Life Res. 2024 Mar;33(3):865-873. doi: 10.1007/s11136-023-03574-z. Epub 2024 Jan 3.
6
Novel measures to assess ventricular assist device patient-reported outcomes: Findings from the MCS A-QOL study.评估心室辅助装置患者报告结局的新方法:来自 MCS A-QOL 研究的结果。
J Heart Lung Transplant. 2024 Jan;43(1):36-50. doi: 10.1016/j.healun.2023.08.007. Epub 2023 Aug 15.
7
Item Response Theory Modeling of the Verb Naming Test.动词命名测验的项目反应理论建模。
J Speech Lang Hear Res. 2023 May 9;66(5):1718-1739. doi: 10.1044/2023_JSLHR-22-00458. Epub 2023 Mar 31.
8
Developing an Exposure Burden Score for Chemical Mixtures Using Item Response Theory, with Applications to PFAS Mixtures.运用项目反应理论为化学混合物开发暴露负担评分,应用于 PFAS 混合物。
Environ Health Perspect. 2022 Nov;130(11):117001. doi: 10.1289/EHP10125. Epub 2022 Nov 2.
9
Development and calibration data for the Healthcare Access Item Bank: a new computer adaptive test for persons with type 2 diabetes mellitus.医疗保健获取项目库的开发和校准数据:一种用于 2 型糖尿病患者的新型计算机自适应测试。
Qual Life Res. 2023 Mar;32(3):781-796. doi: 10.1007/s11136-022-03278-w. Epub 2022 Oct 31.
10
Development and calibration data for the Medication Adherence Item Bank: a new computer adaptive test for persons with type 2 diabetes mellitus.用于药物依从性项目库的开发和校准数据:一种用于 2 型糖尿病患者的新型计算机自适应测试。
Qual Life Res. 2023 Mar;32(3):813-826. doi: 10.1007/s11136-022-03275-z. Epub 2022 Oct 28.
Qual Life Res. 2007;16 Suppl 1:19-31. doi: 10.1007/s11136-007-9183-7. Epub 2007 May 4.
4
Examining assumptions about item responding in personality assessment: should ideal point methods be considered for scale development and scoring?审视人格评估中关于项目反应的假设:量表开发和计分是否应考虑理想点法?
J Appl Psychol. 2006 Jan;91(1):25-39. doi: 10.1037/0021-9010.91.1.25.
5
Evaluation of global testing procedures for item fit to the Rasch model.评估项目与拉施模型拟合度的整体测试程序。
Br J Math Stat Psychol. 2003 May;56(Pt 1):127-43. doi: 10.1348/000711003321645395.
6
Using item mean squares to evaluate fit to the Rasch model.使用项目均方来评估对拉施模型的拟合度。
J Outcome Meas. 1998;2(1):66-78.
7
Statistical methods for assessing agreement between two methods of clinical measurement.评估两种临床测量方法之间一致性的统计方法。
Lancet. 1986 Feb 8;1(8476):307-10.