Rios, Joseph A.
University of Minnesota, Twin Cities, Minneapolis, MN, USA.
Educ Psychol Meas. 2022 Feb;82(1):122-150. doi: 10.1177/00131644211003640. Epub 2021 Apr 21.
The presence of rapid guessing (RG) presents a challenge to practitioners in obtaining accurate estimates of measurement properties and examinee ability. In response to this concern, researchers have used response times as a proxy for RG and have attempted to improve parameter estimation accuracy by filtering RG responses with popular scoring approaches, such as the effort-moderated item response theory (EM-IRT) model. However, this approach assumes that RG can be correctly identified from an indirect proxy of examinee behavior. A failure to meet this assumption leads to the inclusion of distortive and psychometrically uninformative responses in parameter estimates. To address this issue, a simulation study was conducted to examine how violations of the assumption of correct RG classification influence EM-IRT item and ability parameter estimation accuracy and to compare these results with parameter estimates from the three-parameter logistic (3PL) model, which includes RG responses in scoring. Two RG misclassification factors were manipulated: type (underclassification vs. overclassification) and rate (10%, 30%, and 50%). Results indicated that the EM-IRT model provided improved item parameter estimation over the 3PL model regardless of misclassification type and rate. Furthermore, under most conditions, increased rates of RG underclassification were associated with the greatest bias in ability parameter estimates from the EM-IRT model. In spite of this, the EM-IRT model with RG misclassifications demonstrated more accurate ability parameter estimation than the 3PL model when the mean ability of RG subgroups did not differ. These findings suggest that in certain situations it may be better for practitioners to (a) imperfectly identify RG than to ignore the presence of such invalid responses and (b) select liberal over conservative response time thresholds to mitigate bias from underclassified RG.
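The threshold-based RG classification the abstract refers to can be illustrated with a minimal sketch. All values below (response times, per-item thresholds) are hypothetical and only show the mechanics: a response is flagged as RG when its response time falls below an item's threshold, and a liberal (higher) threshold flags at least as many responses as a conservative (lower) one, trading possible overclassification for fewer underclassified RG responses. This is not the article's simulation design.

```python
import numpy as np

def flag_rapid_guesses(response_times, thresholds):
    """Flag a response as rapid guessing (RG) when its response time
    (seconds) falls strictly below the corresponding item threshold."""
    return response_times < thresholds

# Hypothetical response-time matrix: 4 examinees x 3 items (seconds)
rt = np.array([
    [ 2.1, 15.0, 30.2],
    [12.5,  3.0, 28.1],
    [14.0, 16.5,  2.5],
    [11.0, 14.2, 27.9],
])

# Conservative (low) vs. liberal (higher) per-item time thresholds
conservative = np.array([3.0, 3.0, 3.0])
liberal      = np.array([5.0, 5.0, 5.0])

rg_cons = flag_rapid_guesses(rt, conservative)  # flags 2 responses
rg_lib  = flag_rapid_guesses(rt, liberal)       # flags 3 responses

# In an effort-moderated scoring approach, flagged responses would be
# excluded from ability estimation rather than scored as incorrect.
print(int(rg_cons.sum()), int(rg_lib.sum()))
```

Under this sketch, every response the conservative threshold flags is also flagged by the liberal threshold, which is why the abstract frames underclassification (missed RG) as the risk of conservative thresholds.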