• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多重假设违背下努力调节项目反应理论模型的参数估计准确性

Parameter Estimation Accuracy of the Effort-Moderated Item Response Theory Model Under Multiple Assumption Violations.

作者信息

Rios Joseph A, Soland James

机构信息

University of Minnesota, Minneapolis, MN, USA.

University of Virginia, Charlottesville, VA, USA.

出版信息

Educ Psychol Meas. 2021 Jun;81(3):569-594. doi: 10.1177/0013164420949896. Epub 2020 Sep 2.

DOI:10.1177/0013164420949896
PMID:33994564
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8072948/
Abstract

As low-stakes testing contexts increase, low test-taking effort may serve as a serious validity threat. One common solution to this problem is to identify noneffortful responses and treat them as missing during parameter estimation via the effort-moderated item response theory (EM-IRT) model. Although this model has been shown to outperform traditional IRT models (e.g., two-parameter logistic [2PL]) in parameter estimation under simulated conditions, prior research has failed to examine its performance under violations to the model's assumptions. Therefore, the objective of this simulation study was to examine item and mean ability parameter recovery when violating the assumptions that noneffortful responding occurs randomly (Assumption 1) and is unrelated to the underlying ability of examinees (Assumption 2). Results demonstrated that, across conditions, the EM-IRT model provided robust item parameter estimates to violations of Assumption 1. However, bias values greater than 0.20 were observed for the EM-IRT model when violating Assumption 2; nonetheless, these values were still lower than the 2PL model. In terms of mean ability estimates, model results indicated equal performance between the EM-IRT and 2PL models across conditions. Across both models, mean ability estimates were found to be biased by more than 0.25 when violating Assumption 2. However, our accompanying empirical study suggested that this biasing occurred under extreme conditions that may not be present in some operational settings. Overall, these results suggest that the EM-IRT model provides superior item and equal mean ability parameter estimates in the presence of model violations under realistic conditions when compared with the 2PL model.

摘要

随着低风险测试情境的增加,低测试投入可能成为严重的效度威胁。解决这个问题的一个常见方法是识别非投入性回答,并在通过投入调节项目反应理论(EM-IRT)模型进行参数估计时将其视为缺失值。尽管该模型在模拟条件下的参数估计中已被证明优于传统的IRT模型(如两参数逻辑斯蒂模型[2PL]),但先前的研究未能考察其在违反模型假设情况下的表现。因此,本模拟研究的目的是在违反非投入性回答随机出现(假设1)且与考生潜在能力无关(假设2)的假设时,检验项目和平均能力参数的恢复情况。结果表明,在各种条件下,EM-IRT模型对违反假设1的情况能提供稳健的项目参数估计。然而,当违反假设2时,EM-IRT模型观察到偏差值大于0.20;尽管如此,这些值仍低于2PL模型。就平均能力估计而言,模型结果表明在各种条件下EM-IRT模型和2PL模型的表现相当。在两个模型中,当违反假设2时,发现平均能力估计的偏差超过0.25。然而,我们附带的实证研究表明,这种偏差发生在某些实际操作环境中可能不存在的极端条件下。总体而言,这些结果表明,与2PL模型相比,在现实条件下存在模型违反的情况下,EM-IRT模型能提供更优的项目参数估计和相当的平均能力参数估计。

相似文献

1
Parameter Estimation Accuracy of the Effort-Moderated Item Response Theory Model Under Multiple Assumption Violations.多重假设违背下努力调节项目反应理论模型的参数估计准确性
Educ Psychol Meas. 2021 Jun;81(3):569-594. doi: 10.1177/0013164420949896. Epub 2020 Sep 2.
2
Investigating the Impact of Noneffortful Responses on Individual-Level Scores: Can the Effort-Moderated IRT Model Serve as a Solution?探究非努力性反应对个体水平分数的影响:努力调节的项目反应理论模型能否成为一种解决方案?
Appl Psychol Meas. 2021 Sep;45(6):391-406. doi: 10.1177/01466216211013896. Epub 2021 Jun 11.
3
Assessing the Accuracy of Parameter Estimates in the Presence of Rapid Guessing Misclassifications.在存在快速猜测错误分类的情况下评估参数估计的准确性。
Educ Psychol Meas. 2022 Feb;82(1):122-150. doi: 10.1177/00131644211003640. Epub 2021 Apr 21.
4
Effects of violating local independence on IRT parameter estimation for the Binomial Trials Model.
Res Q Exerc Sport. 1992 Dec;63(4):356-9. doi: 10.1080/02701367.1992.10608756.
5
Modeling Sequential Dependencies in Progressive Matrices: An Auto-Regressive Item Response Theory (AR-IRT) Approach.渐进矩阵中序列依赖关系的建模:一种自回归项目反应理论(AR-IRT)方法。
J Intell. 2024 Jan 15;12(1):7. doi: 10.3390/jintelligence12010007.
6
Investigating the Impact of Item Parameter Drift for Item Response Theory Models with Mixture Distributions.研究项目参数漂移对具有混合分布的项目反应理论模型的影响。
Front Psychol. 2016 Feb 24;7:255. doi: 10.3389/fpsyg.2016.00255. eCollection 2016.
7
A flexible approach to modelling over-, under- and equidispersed count data in IRT: The Two-Parameter Conway-Maxwell-Poisson Model.一种用于在项目反应理论(IRT)中对过度离散、不足离散和等离散计数数据进行建模的灵活方法:双参数康威-麦克斯韦-泊松模型。
Br J Math Stat Psychol. 2022 Nov;75(3):411-443. doi: 10.1111/bmsp.12273. Epub 2022 Jun 9.
8
Rasch Model Parameter Estimation in the Presence of a Nonnormal Latent Trait Using a Nonparametric Bayesian Approach.使用非参数贝叶斯方法在存在非正态潜在特质的情况下进行拉施模型参数估计。
Educ Psychol Meas. 2016 Aug;76(4):662-684. doi: 10.1177/0013164415608418. Epub 2015 Oct 12.
9
Modeling Test-Taking Non-effort in MIRT Models.在多指标多因索模型中对考试不努力行为进行建模
Front Psychol. 2019 Feb 4;10:145. doi: 10.3389/fpsyg.2019.00145. eCollection 2019.
10
Semisupervised Learning Method to Adjust Biased Item Difficulty Estimates Caused by Nonignorable Missingness in a Virtual Learning Environment.用于调整虚拟学习环境中由不可忽视的缺失值导致的有偏差项目难度估计的半监督学习方法。
Educ Psychol Meas. 2022 Jun;82(3):539-567. doi: 10.1177/00131644211020494. Epub 2021 Jun 4.

引用本文的文献

1
Treating Noneffortful Responses as Missing.将非努力性反应视为缺失数据。
Educ Psychol Meas. 2024 Nov 29:00131644241297925. doi: 10.1177/00131644241297925.
2
Is Effort Moderated Scoring Robust to Multidimensional Rapid Guessing?努力调节评分对多维快速猜测是否稳健?
Educ Psychol Meas. 2025 Feb;85(1):134-155. doi: 10.1177/00131644241246749. Epub 2024 Apr 27.
3
Enhancing Effort-Moderated Item Response Theory Models by Evaluating a Two-Step Estimation Method and Multidimensional Variations on the Model.通过评估两步估计方法和模型的多维变体来增强努力调节项目反应理论模型
Educ Psychol Meas. 2024 Oct 6:00131644241280727. doi: 10.1177/00131644241280727.
4
A Comparison of Response Time Threshold Scoring Procedures in Mitigating Bias From Rapid Guessing Behavior.缓解快速猜测行为偏差的反应时间阈值评分程序比较
Educ Psychol Meas. 2024 Apr;84(2):387-420. doi: 10.1177/00131644231168398. Epub 2023 Apr 26.
5
Identifying Ability and Nonability Groups: Incorporating Response Times Using Mixture Modeling.识别能力组和非能力组:使用混合模型纳入反应时间。
Educ Psychol Meas. 2022 Dec;82(6):1087-1106. doi: 10.1177/00131644211072833. Epub 2022 Jan 20.
6
Estimation of Person Ability under Rapid and Effortful Responding.快速且费力反应下的个体能力评估
J Intell. 2022 Sep 13;10(3):67. doi: 10.3390/jintelligence10030067.
7
The Role of Response Times on the Measurement of Mental Ability.反应时间在心理能力测量中的作用。
Front Psychol. 2022 Jun 17;13:892317. doi: 10.3389/fpsyg.2022.892317. eCollection 2022.
8
A Comparison of Robust Likelihood Estimators to Mitigate Bias From Rapid Guessing.稳健似然估计器用于减轻快速猜测偏差的比较
Appl Psychol Meas. 2022 May;46(3):236-249. doi: 10.1177/01466216221084371. Epub 2022 Apr 4.
9
Assessing the Accuracy of Parameter Estimates in the Presence of Rapid Guessing Misclassifications.在存在快速猜测错误分类的情况下评估参数估计的准确性。
Educ Psychol Meas. 2022 Feb;82(1):122-150. doi: 10.1177/00131644211003640. Epub 2021 Apr 21.
10
Investigating the Impact of Noneffortful Responses on Individual-Level Scores: Can the Effort-Moderated IRT Model Serve as a Solution?探究非努力性反应对个体水平分数的影响:努力调节的项目反应理论模型能否成为一种解决方案?
Appl Psychol Meas. 2021 Sep;45(6):391-406. doi: 10.1177/01466216211013896. Epub 2021 Jun 11.

本文引用的文献

1
Methods of Detecting Insufficient Effort Responding: Comparisons and Practical Recommendations.检测努力反应不足的方法:比较与实用建议。
Educ Psychol Meas. 2020 Apr;80(2):312-345. doi: 10.1177/0013164419865316. Epub 2019 Jul 31.
2
A mixture model for responses and response times with a higher-order ability structure to detect rapid guessing behaviour.一种用于反应和反应时间的混合模型,具有高阶能力结构以检测快速猜测行为。
Br J Math Stat Psychol. 2020 May;73(2):261-288. doi: 10.1111/bmsp.12175. Epub 2019 Aug 6.
3
A Two-Stage Approach to Differentiating Normal and Aberrant Behavior in Computer Based Testing.基于计算机的测试中正常与异常行为的两阶段区分方法。
Psychometrika. 2018 Mar;83(1):223-254. doi: 10.1007/s11336-016-9525-x. Epub 2016 Oct 28.
4
A mixture hierarchical model for response times and response accuracy.一种用于反应时间和反应准确性的混合层次模型。
Br J Math Stat Psychol. 2015 Nov;68(3):456-77. doi: 10.1111/bmsp.12054. Epub 2015 Apr 15.
5
Identifying careless responses in survey data.识别调查数据中的粗心回答。
Psychol Methods. 2012 Sep;17(3):437-55. doi: 10.1037/a0028085. Epub 2012 Apr 16.