使用SimulCAT进行计算机自适应测试的模拟研究：一篇指导性文章。

Conducting simulation studies for computerized adaptive testing using SimulCAT: an instructional piece.

作者信息

Han Kyung Chris Tyek

机构信息

Graduate Management Admission Council, Reston, VA, USA.

出版信息

J Educ Eval Health Prof. 2018;15:20. doi: 10.3352/jeehp.2018.15.20. Epub 2018 Aug 17.

DOI:10.3352/jeehp.2018.15.20

PMID:30114899

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6194482/

Abstract

Computerized adaptive testing (CAT) technology is widely used in a variety of licensing and certification examinations administered to health professionals in the United States. Many more countries worldwide are expected to adopt CAT for their national licensing examinations for health professionals due to its reduced test time and more accurate estimation of a test-taker's performance ability. Continuous improvements to CAT algorithms promote the stability and reliability of the results of such examinations. For this reason, conducting simulation studies is a critically important component of evaluating the design of CAT programs and their implementation. This report introduces the principles of SimulCAT, a software program developed for conducting CAT simulation studies. The key evaluation criteria for CAT simulation studies are explained and some guidelines are offered for practitioners and test developers. A step-by-step tutorial example of a SimulCAT run is also presented. The SimulCAT program supports most of the methods used for the 3 key components of item selection in CAT: the item selection criterion, item exposure control, and content balancing. Methods for determining the test length (fixed or variable) and score estimation algorithms are also covered. The simulation studies presented include output files for the response string, item use, standard error of estimation, Newton-Raphson iteration information, theta estimation, the full response matrix, and the true standard error of estimation. In CAT simulations, one condition cannot be generalized to another; therefore, it is recommended that practitioners perform CAT simulation studies in each stage of CAT development.

摘要

计算机自适应测试（CAT）技术在美国广泛应用于各类针对卫生专业人员的执照和认证考试。由于CAT能够缩短考试时间并更准确地评估考生的表现能力，预计全球更多国家将在其卫生专业人员国家执照考试中采用CAT。对CAT算法的持续改进提升了此类考试结果的稳定性和可靠性。因此，开展模拟研究是评估CAT项目设计及其实施的至关重要的组成部分。本报告介绍了SimulCAT的原理，SimulCAT是一款为开展CAT模拟研究而开发的软件程序。阐述了CAT模拟研究的关键评估标准，并为从业者和考试开发者提供了一些指导方针。还给出了一个SimulCAT运行的分步教程示例。SimulCAT程序支持用于CAT中项目选择的3个关键组成部分的大多数方法：项目选择标准、项目曝光控制和内容平衡。还涵盖了确定考试长度（固定或可变）的方法以及分数估计算法。所呈现的模拟研究包括响应字符串、项目使用、估计标准误差、牛顿-拉弗森迭代信息、θ估计、完整响应矩阵以及真实估计标准误差的输出文件。在CAT模拟中，一种情况不能推广到另一种情况；因此，建议从业者在CAT开发的每个阶段都进行CAT模拟研究。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/600c/6194482/2fe37d4421df/jeehp-15-20f1.jpg

相似文献

Conducting simulation studies for computerized adaptive testing using SimulCAT: an instructional piece.使用SimulCAT进行计算机自适应测试的模拟研究：一篇指导性文章。

J Educ Eval Health Prof. 2018;15:20. doi: 10.3352/jeehp.2018.15.20. Epub 2018 Aug 17.

Overview and current management of computerized adaptive testing in licensing/certification examinations.执照/认证考试中计算机自适应测试的概述与当前管理

J Educ Eval Health Prof. 2017 Jul 26;14:17. doi: 10.3352/jeehp.2017.14.17. eCollection 2017.

Post-hoc simulation study of computerized adaptive testing for the Korean Medical Licensing Examination.韩国医师执照考试计算机化自适应测试的事后模拟研究

J Educ Eval Health Prof. 2018 May 17;15:14. doi: 10.3352/jeehp.2018.15.14. eCollection 2018.

Components of the item selection algorithm in computerized adaptive testing.计算机自适应测试中项目选择算法的组成部分。

J Educ Eval Health Prof. 2018 Mar 24;15:7. doi: 10.3352/jeehp.2018.15.7. eCollection 2018.

The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.irtQ R 包：一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。

J Educ Eval Health Prof. 2024;21:23. doi: 10.3352/jeehp.2024.21.23. Epub 2024 Sep 12.

The impacts of computer adaptive testing from a variety of perspectives.从各种角度看计算机自适应测试的影响。

J Educ Eval Health Prof. 2017 May 29;14:12. doi: 10.3352/jeehp.2017.14.12. eCollection 2017.

The Asymptotic Distribution of Average Test Overlap Rate in Computerized Adaptive Testing.计算机化自适应测验中平均测验重叠率的渐近分布。

Psychometrika. 2019 Dec;84(4):1129-1151. doi: 10.1007/s11336-019-09674-5. Epub 2019 Jul 1.

Combining CAT with cognitive diagnosis: a weighted item selection approach.结合计算机辅助翻译与认知诊断：一种加权项目选择方法。

Behav Res Methods. 2012 Mar;44(1):95-109. doi: 10.3758/s13428-011-0143-3.

Utilizing response times in cognitive diagnostic computerized adaptive testing under the higher-order deterministic input, noisy 'and' gate model.利用高阶确定性输入、噪声“与”门模型下认知诊断计算机自适应测验中的反应时间。

Br J Math Stat Psychol. 2020 Feb;73(1):109-141. doi: 10.1111/bmsp.12160. Epub 2019 Feb 22.

Comparing computer adaptive testing stopping rules under the generalized partial-credit model.比较广义部分信用模型下的计算机自适应测试停止规则。

Behav Res Methods. 2019 Jun;51(3):1305-1320. doi: 10.3758/s13428-018-1068-x.

引用本文的文献

exploring counselor-client agreement on clients' work capacity in established and consultative dyads.探讨在既定和咨询性二元组中，咨询师与来访者就来访者工作能力达成的共识。

J Employ Couns. 2020 Sep;57(3):98-114. doi: 10.1002/joec.12148. Epub 2020 Sep 11.

From Development to Validation: Exploring the Efficiency of Numetrive, a Computerized Adaptive Assessment of Numerical Reasoning.从开发到验证：探索Numetrive的效率，一种数字推理的计算机自适应评估。

Behav Sci (Basel). 2025 Feb 25;15(3):268. doi: 10.3390/bs15030268.

Effect of Differential Item Functioning on Computer Adaptive Testing Under Different Conditions.不同条件下项目功能差异对计算机自适应测试的影响。

Appl Psychol Meas. 2024 Nov;48(7-8):303-322. doi: 10.1177/01466216241284295. Epub 2024 Sep 17.

Presidential address: improving item validity and adopting computer-based testing, clinical skills assessments, artificial intelligence, and virtual reality in health professions licensing examinations in Korea.主席致辞：提高项目效度并在韩国卫生专业执照考试中采用计算机化考试、临床技能评估、人工智能和虚拟现实技术

J Educ Eval Health Prof. 2023;20:8. doi: 10.3352/jeehp.2023.20.8. Epub 2023 Mar 27.

Development of a Computerized Adaptive Testing for Internet Addiction.网络成瘾计算机自适应测试的开发。

Front Psychol. 2019 May 7;10:1010. doi: 10.3389/fpsyg.2019.01010. eCollection 2019.

Updates from 2018: Being indexed in EMBASE, becoming an affiliated journal of the World Federation for Medical Education, implementing an optional open data policy, adopting principles of transparency and best practice in scholarly publishing, and appreciation to reviewers.2018年的更新内容：被EMBASE数据库收录，成为世界医学教育联合会的附属期刊，实施可选的开放数据政策，采用学术出版中的透明度原则和最佳实践，以及对审稿人的感谢。

J Educ Eval Health Prof. 2018;15:36. doi: 10.3352/jeehp.2018.15.36. Epub 2018 Dec 28.

本文引用的文献

Maximum Likelihood Score Estimation Method With Fences for Short-Length Tests and Computerized Adaptive Tests.用于短长度测试和计算机自适应测试的带边界的最大似然分数估计方法

Appl Psychol Meas. 2016 Jun;40(4):289-301. doi: 10.1177/0146621616631317. Epub 2016 Feb 15.

Components of the item selection algorithm in computerized adaptive testing.计算机自适应测试中项目选择算法的组成部分。

J Educ Eval Health Prof. 2018 Mar 24;15:7. doi: 10.3352/jeehp.2018.15.7. eCollection 2018.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用SimulCAT进行计算机自适应测试的模拟研究：一篇指导性文章。

Conducting simulation studies for computerized adaptive testing using SimulCAT: an instructional piece.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献