
A new person-fit statistic for the detection of aberrant responses in polytomous cognitive diagnostic models.

Author information

Gao Xuliang, Hou Minmin, Wang Fang, Zhou Jinyu

Affiliations

School of Psychology, Guizhou Normal University, Huaxi University Town, Guian New District, Guiyang, 550025, Guizhou Province, China.

School of General Education, Guizhou University of Commerce, Twenty-Sixth Avenue, Maijia Town, Baiyun District, Guiyang, 550001, Guizhou Province, China.

Publication information

Behav Res Methods. 2025 Apr 9;57(5):138. doi: 10.3758/s13428-025-02659-6.

DOI: 10.3758/s13428-025-02659-6
PMID: 40205248
Abstract

Assessing person fit in cognitive diagnostic assessments is a critical research area. Failure to identify misfitting responses can lead to misinterpretation of students' attribute profiles and, in turn, to incorrect remedial actions. Despite its importance, research on person-fit statistics for polytomous cognitive diagnostic models (CDMs) is lacking. To address this gap, we propose a new person-fit statistic, WR, designed specifically for polytomous items in CDMs. Through simulation studies, we evaluated WR's ability to detect three types of aberrant behavior and compared its performance with that of established statistics, including l, infit, and outfit. WR showed stable and superior detection across all experimental scenarios. The traditional statistics detected different anomalies inconsistently: l was more effective at detecting cheating, whereas infit was better at detecting creative responses. Under high-quality test conditions, WR performed best, although its advantage over the traditional statistics was not significant. Under low-quality conditions, however, WR significantly outperformed them. Overall, WR proved to be an effective tool for detecting person misfit in polytomously scored CDMs. Finally, we analyzed a real educational assessment dataset to evaluate the practical application of WR.
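
To make the comparison statistics concrete, the following is a minimal sketch of the textbook mean-square definitions of infit and outfit, together with a raw log-likelihood, for one examinee on polytomous items. It is not the authors' implementation. The function name `person_fit` and its inputs are hypothetical, and it assumes the model-implied category probabilities (in a CDM, these would be conditional on the examinee's estimated attribute profile) have already been computed. The abstract does not define WR, so WR is not sketched here, and the paper's l may be a standardized form rather than the raw log-likelihood shown.

```python
import numpy as np

def person_fit(responses, probs):
    """Return (log_lik, infit, outfit) for a single examinee.

    responses: length-J integer array of observed category scores (0..K-1).
    probs:     J x K array; probs[j, k] is the model-implied probability
               that this examinee scores k on item j (assumed given).
    Assumes every item has nonzero response variance.
    """
    responses = np.asarray(responses)
    probs = np.asarray(probs, dtype=float)
    n_items, n_cats = probs.shape
    cats = np.arange(n_cats)

    expected = probs @ cats                      # E[X_j]
    variance = probs @ cats**2 - expected**2     # Var[X_j]
    resid_sq = (responses - expected) ** 2       # squared raw residuals

    # Raw log-likelihood of the observed response pattern.
    log_lik = np.log(probs[np.arange(n_items), responses]).sum()
    # Outfit: unweighted mean of squared standardized residuals.
    outfit = np.mean(resid_sq / variance)
    # Infit: information-weighted mean square.
    infit = resid_sq.sum() / variance.sum()
    return log_lik, infit, outfit

# Two three-category items with identical category probabilities.
probs = np.array([[0.2, 0.5, 0.3],
                  [0.2, 0.5, 0.3]])
typical = person_fit([1, 1], probs)    # most likely category on both items
unusual = person_fit([0, 2], probs)    # less likely categories on both items
```

Under these definitions, values near 1 indicate responses about as variable as the model expects, and larger values flag unexpected response patterns. In the toy example, the `unusual` pattern has a lower log-likelihood and larger infit and outfit than the `typical` pattern.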

