irtQ R 包：一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。

The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.

机构信息

College of Education, Inha University, Incheon, Korea.

出版信息

J Educ Eval Health Prof. 2024;21:23. doi: 10.3352/jeehp.2024.21.23. Epub 2024 Sep 12.

DOI:10.3352/jeehp.2024.21.23

PMID:39262318

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11561393/

Abstract

Computerized adaptive testing (CAT) has become a widely adopted test design for high-stakes licensing and certification exams, particularly in the health professions in the United States, due to its ability to tailor test difficulty in real time, reducing testing time while providing precise ability estimates. A key component of CAT is item response theory (IRT), which facilitates the dynamic selection of items based on examinees' ability levels during a test. Accurate estimation of item and ability parameters is essential for successful CAT implementation, necessitating convenient and reliable software to ensure precise parameter estimation. This paper introduces the irtQ R package (http://CRAN.R-project.org/), which simplifies IRTbased analysis and item calibration under unidimensional IRT models. While it does not directly simulate CAT, it provides essential tools to support CAT development, including parameter estimation using marginal maximum likelihood estimation via the expectation-maximization algorithm, pretest item calibration through fixed item parameter calibration and fixed ability parameter calibration methods, and examinee ability estimation. The package also enables users to compute item and test characteristic curves and information functions necessary for evaluating the psychometric properties of a test. This paper illustrates the key features of the irtQ package through examples using simulated datasets, demonstrating its utility in IRT applications such as test data analysis and ability scoring. By providing a user-friendly environment for IRT analysis, irtQ significantly enhances the capacity for efficient adaptive testing research and operations. Finally, the paper highlights additional core functionalities of irtQ, emphasizing its broader applicability to the development and operation of IRT-based assessments.

摘要

计算机化自适应测验（CAT）已成为美国高风险许可和认证考试中广泛采用的测试设计，这主要是因为其能够实时调整测试难度，在减少测试时间的同时提供精确的能力估计。CAT 的一个关键组成部分是项目反应理论（IRT），它可以根据考生在测试中的能力水平动态选择项目。准确估计项目和能力参数对于成功实施 CAT 至关重要，这需要方便可靠的软件来确保精确的参数估计。本文介绍了 irtQ R 包（http://CRAN.R-project.org/），它简化了基于 IRT 的分析和一维 IRT 模型下的项目校准。虽然它不直接模拟 CAT，但它提供了支持 CAT 开发的基本工具，包括通过期望最大化算法的边际最大似然估计进行参数估计、通过固定项目参数校准和固定能力参数校准方法进行预测试项目校准，以及考生能力估计。该包还使用户能够计算项目和测试特征曲线以及信息函数，这些是评估测试心理测量特性所必需的。本文通过使用模拟数据集的示例说明了 irtQ 包的主要功能，展示了它在 IRT 应用中的效用，如测试数据分析和能力评分。通过为 IRT 分析提供用户友好的环境，irtQ 大大增强了高效自适应测试研究和操作的能力。最后，本文强调了 irtQ 的其他核心功能，强调了它在基于 IRT 的评估的开发和操作中的更广泛适用性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/a331/11561393/98e72d390225/jeehp-21-23f1.jpg

相似文献

The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.irtQ R 包：一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。

J Educ Eval Health Prof. 2024;21:23. doi: 10.3352/jeehp.2024.21.23. Epub 2024 Sep 12.

Overview and current management of computerized adaptive testing in licensing/certification examinations.执照/认证考试中计算机自适应测试的概述与当前管理

J Educ Eval Health Prof. 2017 Jul 26;14:17. doi: 10.3352/jeehp.2017.14.17. eCollection 2017.

Developing new online calibration methods for multidimensional computerized adaptive testing.开发用于多维计算机自适应测试的新型在线校准方法。

Br J Math Stat Psychol. 2017 Feb;70(1):81-117. doi: 10.1111/bmsp.12083.

Introduction to the LIVECAT web-based computerized adaptive testing platform.LIVECAT 网络版计算机化自适应测试平台简介。

J Educ Eval Health Prof. 2020;17:27. doi: 10.3352/jeehp.2020.17.27. Epub 2020 Sep 29.

: An R Package for Online Item Calibration, Scoring, Evaluation of Model Fit, and Useful Functions for Unidimensional IRT.一个用于在线项目校准、评分、模型拟合评估以及一维项目反应理论实用函数的R包。

Appl Psychol Meas. 2020 Oct;44(7-8):563-565. doi: 10.1177/0146621620921247. Epub 2020 May 21.

Conducting simulation studies for computerized adaptive testing using SimulCAT: an instructional piece.使用SimulCAT进行计算机自适应测试的模拟研究：一篇指导性文章。

J Educ Eval Health Prof. 2018;15:20. doi: 10.3352/jeehp.2018.15.20. Epub 2018 Aug 17.

Utilizing response times in cognitive diagnostic computerized adaptive testing under the higher-order deterministic input, noisy 'and' gate model.利用高阶确定性输入、噪声“与”门模型下认知诊断计算机自适应测验中的反应时间。

Br J Math Stat Psychol. 2020 Feb;73(1):109-141. doi: 10.1111/bmsp.12160. Epub 2019 Feb 22.

Methods for online calibration of Q-matrix and item parameters for polytomous responses in cognitive diagnostic computerized adaptive testing.多类别反应认知诊断计算机化自适应测验中 Q 矩阵和项目参数的在线标定方法。

Behav Res Methods. 2024 Oct;56(7):6792-6811. doi: 10.3758/s13428-024-02392-6. Epub 2024 Apr 30.

Statistical Foundations for Computerized Adaptive Testing with Response Revision.基于响应修正的计算机自适应测试的统计基础。

Psychometrika. 2019 Jun;84(2):375-394. doi: 10.1007/s11336-019-09662-9. Epub 2019 Feb 25.

Optimal Item Calibration for Computerized Achievement Tests.计算机化成就测验的最佳项目标定。

Psychometrika. 2019 Dec;84(4):1101-1128. doi: 10.1007/s11336-019-09673-6. Epub 2019 Jun 10.

本文引用的文献

Introduction to the LIVECAT web-based computerized adaptive testing platform.LIVECAT 网络版计算机化自适应测试平台简介。

J Educ Eval Health Prof. 2020;17:27. doi: 10.3352/jeehp.2020.17.27. Epub 2020 Sep 29.

Maximum Likelihood Score Estimation Method With Fences for Short-Length Tests and Computerized Adaptive Tests.用于短长度测试和计算机自适应测试的带边界的最大似然分数估计方法

Appl Psychol Meas. 2016 Jun;40(4):289-301. doi: 10.1177/0146621616631317. Epub 2016 Feb 15.

Overview and current management of computerized adaptive testing in licensing/certification examinations.执照/认证考试中计算机自适应测试的概述与当前管理

J Educ Eval Health Prof. 2017 Jul 26;14:17. doi: 10.3352/jeehp.2017.14.17. eCollection 2017.

A New Online Calibration Method for Multidimensional Computerized Adaptive Testing.一种用于多维计算机自适应测试的新型在线校准方法。

Psychometrika. 2016 Sep;81(3):674-701. doi: 10.1007/s11336-015-9482-9. Epub 2015 Nov 25.

Preparing the implementation of computerized adaptive testing for high-stakes examinations.为高风险考试准备计算机自适应测试的实施。

J Educ Eval Health Prof. 2008;5:1. doi: 10.3352/jeehp.2008.5.1. Epub 2008 Dec 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

irtQ R 包：一个用于基于项目反应理论的测试数据分析和校准的用户友好型工具。

The irtQ R package: a user-friendly tool for item response theorybased test data analysis and calibration.

机构信息

出版信息

相似文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

本文引用的文献