• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

缺失数据对参数估计的影响:计算机自适应测试中的三个例子

The Impact of Missing Data on Parameter Estimation: Three Examples in Computerized Adaptive Testing.

作者信息

Liu Xiaowen, Loken Eric

机构信息

Key Research Base of Humanities and Social Sciences of the Ministry of Education, Academy of Psychology and Behavior, Tianjin Normal University, Tianjin, China.

Faculty of Psychology, Tianjin Normal University, China.

出版信息

Educ Psychol Meas. 2025 Jan 7:00131644241306990. doi: 10.1177/00131644241306990.

DOI:10.1177/00131644241306990
PMID:39780953
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11705310/
Abstract

In computerized adaptive testing (CAT), examinees see items targeted to their ability level. Postoperational data have a high degree of missing information relative to designs where everyone answers all questions. Item responses are observed over a restricted range of abilities, reducing item-total score correlations. However, if the adaptive item selection depends only on observed responses, the data are missing at random (MAR). We simulated data from three different testing designs (common items, randomly selected items, and CAT) and found that it was possible to re-estimate both person and item parameters from postoperational CAT data. In a multidimensional CAT, we show that it is necessary to include all responses from the testing phase to avoid violating missing data assumptions. We also observed that some CAT designs produced "reversals" where item discriminations became negative causing dramatic under and over-estimation of abilities. Our results apply to situations where researchers work with data drawn from adaptive testing or from instructional tools with adaptive delivery. To avoid bias, researchers must make sure they use all the data necessary to meet the MAR assumptions.

摘要

在计算机自适应测试(CAT)中,考生会看到针对其能力水平的题目。与所有人都回答所有问题的设计相比,测试后的数据存在高度的信息缺失。在能力的有限范围内观察到题目回答情况,这降低了题目总分相关性。然而,如果自适应题目选择仅取决于观察到的回答,那么数据就是随机缺失(MAR)。我们模拟了来自三种不同测试设计(共同题目、随机选择题目和CAT)的数据,发现从测试后CAT数据中重新估计人和题目的参数是可行的。在多维CAT中,我们表明有必要纳入测试阶段的所有回答,以避免违反缺失数据假设。我们还观察到,一些CAT设计会产生“反转”,即题目区分度变为负数,导致能力的严重低估和高估。我们的结果适用于研究人员处理来自自适应测试或具有自适应交付功能的教学工具的数据的情况。为避免偏差,研究人员必须确保他们使用满足MAR假设所需的所有数据。

相似文献

1
The Impact of Missing Data on Parameter Estimation: Three Examples in Computerized Adaptive Testing.缺失数据对参数估计的影响:计算机自适应测试中的三个例子
Educ Psychol Meas. 2025 Jan 7:00131644241306990. doi: 10.1177/00131644241306990.
2
An Adaptive Design for Item Parameter Online Estimation and Q-Matrix Online Calibration in CD-CAT.一种用于计算机化自适应测验中项目参数在线估计和Q矩阵在线校准的自适应设计。
Front Psychol. 2021 Aug 24;12:710497. doi: 10.3389/fpsyg.2021.710497. eCollection 2021.
3
Item selection methods in multidimensional computerized adaptive testing for forced-choice items using Thurstonian IRT model.多维计算机化自适应测试中使用 Thurstonian IRT 模型的多选题的项目选择方法。
Behav Res Methods. 2024 Feb;56(2):600-614. doi: 10.3758/s13428-022-02037-6. Epub 2023 Feb 7.
4
The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.irtQ R 包:一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。
J Educ Eval Health Prof. 2024;21:23. doi: 10.3352/jeehp.2024.21.23. Epub 2024 Sep 12.
5
Applications of computerized adaptive testing (CAT) to the assessment of headache impact.计算机自适应测试(CAT)在头痛影响评估中的应用。
Qual Life Res. 2003 Dec;12(8):935-52. doi: 10.1023/a:1026115230284.
6
Adaptive measurement of cognitive function based on multidimensional item response theory.基于多维项目反应理论的认知功能自适应测量
Alzheimers Dement (N Y). 2024 Nov 30;10(4):e70018. doi: 10.1002/trc2.70018. eCollection 2024 Oct-Dec.
7
Best Design for Multidimensional Computerized Adaptive Testing With the Bifactor Model.基于双因素模型的多维计算机自适应测试的最佳设计
Educ Psychol Meas. 2015 Dec;75(6):954-978. doi: 10.1177/0013164415575147. Epub 2015 Mar 25.
8
The Impact of Item Model Parameter Variations on Person Parameter Estimation in Computerized Adaptive Testing With Automatically Generated Items.自动生成试题的计算机自适应测试中试题模型参数变化对人员参数估计的影响
Appl Psychol Meas. 2023 Jun;47(4):275-290. doi: 10.1177/01466216231165313. Epub 2023 Mar 17.
9
Combining Cognitive Diagnostic Computerized Adaptive Testing With Multidimensional Item Response Theory.将认知诊断计算机化自适应测试与多维项目反应理论相结合。
Appl Psychol Meas. 2022 Jun;46(4):288-302. doi: 10.1177/01466216221084214. Epub 2022 Apr 18.
10
Developing new online calibration methods for multidimensional computerized adaptive testing.开发用于多维计算机自适应测试的新型在线校准方法。
Br J Math Stat Psychol. 2017 Feb;70(1):81-117. doi: 10.1111/bmsp.12083.

引用本文的文献

1
A random forest dynamic threshold imputation method for handling missing data in cognitive diagnosis assessments.一种用于处理认知诊断评估中缺失数据的随机森林动态阈值插补方法。
Front Psychol. 2025 Aug 5;16:1487111. doi: 10.3389/fpsyg.2025.1487111. eCollection 2025.

本文引用的文献

1
Control Theory Forecasts of Optimal Training Dosage to Facilitate Children's Arithmetic Learning in a Digital Educational Application.控制理论预测优化训练剂量,以促进儿童在数字教育应用中的算术学习。
Psychometrika. 2022 Jun;87(2):559-592. doi: 10.1007/s11336-021-09829-3. Epub 2022 Mar 15.
2
Clinical validity of PROMIS Depression, Anxiety, and Anger across diverse clinical samples.患者报告结果测量信息系统(PROMIS)抑郁、焦虑和愤怒量表在不同临床样本中的临床效度
J Clin Epidemiol. 2016 May;73:119-27. doi: 10.1016/j.jclinepi.2015.08.036. Epub 2016 Feb 27.
3
Item response theory, computerized adaptive testing, and PROMIS: assessment of physical function.项目反应理论、计算机化自适应测验和 PROMIS:身体功能评估。
J Rheumatol. 2014 Jan;41(1):153-8. doi: 10.3899/jrheum.130813. Epub 2013 Nov 15.
4
The use of PROMIS and assessment center to deliver patient-reported outcome measures in clinical research.在临床研究中使用患者报告结局测量信息系统(PROMIS)和评估中心来提供患者报告的结局指标。
J Appl Meas. 2010;11(3):304-14.
5
An item response analysis of the pediatric PROMIS anxiety and depressive symptoms scales.儿童 PROMIS 焦虑和抑郁症状量表的项目反应分析。
Qual Life Res. 2010 May;19(4):595-607. doi: 10.1007/s11136-010-9619-3. Epub 2010 Mar 7.