• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

计算机自适应测试中认知错误的多分类建模

Polytomous modeling of cognitive errors in computer adaptive testing.

作者信息

Wang L, Li C S

机构信息

Teachers College, University of Cincinnati, OH 45221-0002, USA.

出版信息

J Appl Meas. 2001;2(4):356-78.

PMID:12011504
Abstract

In the past two decades of psychometric research, an array of extended item response models has been proposed to capture the complex nature of human cognition. While the literature abounds in model fit analysis, the debate on model selection in different testing conditions continues. This study examines the problems of model selection in computer adaptive testing (CAT) of cognitive errors by comparing the relative measurement efficiency of polytomous modeling over dichotomous modeling under different scoring schemes and termination criteria. Monte Carlo simulation was adopted as the inquiry paradigm to generate 1000 subjects and 100 items in the calibration sample and 200 simulees in the CAT sample. The results suggest that polytomous CAT yields marginal gains over dichotomous CAT when termination criteria are more stringent (shorter test length or smaller standard error of ability estimate). When the conventional dichotomous scoring scheme is adopted, in which all partially correct answers are scored as incorrect, polytomous CAT cannot prevent the non-uniform gain in test information as was observed in paper-and-pencil testing.

摘要

在过去二十年的心理测量学研究中,人们提出了一系列扩展的项目反应模型,以捕捉人类认知的复杂本质。虽然文献中充斥着模型拟合分析,但关于不同测试条件下模型选择的争论仍在继续。本研究通过比较在不同评分方案和终止标准下,多分类建模相对于二分类建模的相对测量效率,考察了认知错误的计算机自适应测试(CAT)中的模型选择问题。采用蒙特卡罗模拟作为探究范式,在校准样本中生成1000名受试者和100个项目,在CAT样本中生成200个模拟受试者。结果表明,当终止标准更严格(测试长度更短或能力估计的标准误差更小)时,多分类CAT比二分类CAT有微小的优势。当采用传统的二分类评分方案,即所有部分正确的答案都被计为错误时,多分类CAT无法像纸笔测试那样防止测试信息的不均匀增加。

相似文献

1
Polytomous modeling of cognitive errors in computer adaptive testing.计算机自适应测试中认知错误的多分类建模
J Appl Meas. 2001;2(4):356-78.
2
The maximum priority index method for severely constrained item selection in computerized adaptive testing.计算机化自适应测试中严重受限项目选择的最大优先级指数法。
Br J Math Stat Psychol. 2009 May;62(Pt 2):369-83. doi: 10.1348/000711008X304376. Epub 2008 Jun 2.
3
The effect of item pool restriction on the precision of ability measurement for a Rasch-based CAT: comparisons to traditional fixed length examinations.项目池限制对基于拉施模型的计算机自适应测试中能力测量精度的影响:与传统固定长度考试的比较
J Outcome Meas. 1998;2(2):97-122.
4
Rasch fit statistics as a test of the invariance of item parameter estimates.拉施拟合统计作为项目参数估计不变性的一种检验。
J Appl Meas. 2003;4(2):153-63.
5
Multilevel IRT using dichotomous and polytomous response data.使用二分法和多分法响应数据的多级项目反应理论
Br J Math Stat Psychol. 2005 May;58(Pt 1):145-72. doi: 10.1348/000711005X38951.
6
Marginal likelihood inference for a model for item responses and response times.项目反应和反应时间模型的边缘似然推断。
Br J Math Stat Psychol. 2010 Nov;63(Pt 3):603-26. doi: 10.1348/000711009X481360. Epub 2010 Jan 28.
7
Binary items and beyond: a simulation of computer adaptive testing using the Rasch partial credit model.二分制项目及其他:使用拉施克部分计分模型的计算机自适应测试模拟
J Appl Meas. 2008;9(1):81-104.
8
A comparison of three polytomous item response theory models in the context of testlet scoring.在分块计分背景下三种多分类项目反应理论模型的比较
J Outcome Meas. 1999;3(1):1-20.
9
Improving measurement in health education and health behavior research using item response modeling: introducing item response modeling.使用项目反应模型改进健康教育与健康行为研究中的测量:介绍项目反应模型
Health Educ Res. 2006 Dec;21 Suppl 1:i4-18. doi: 10.1093/her/cyl108. Epub 2006 Oct 3.
10
Using the dichotomous Rasch model to analyze polytomous items.使用二分法Rasch模型分析多分类项目。
J Appl Meas. 2013;14(1):44-56.