使用 Rasch 和确定性输入、嘈杂“与”门诊断分类模型对每个内容领域的掌握程度的准确性和一致性：一项基于韩国医师执照考试数据的模拟研究和真实世界分析。

The accuracy and consistency of mastery for each content domain using the Rasch and deterministic inputs, noisy “and” gate diagnostic classification models: a simulation study and a real-world analysis using data from the Korean Medical Licensing Examination.

机构信息

Department of Psychology, College of Social Science, Hallym University, Chuncheon, Korea.

Hallym Applied Psychology Institute, College of Social Science, Hallym University, Chuncheon, Korea.

出版信息

J Educ Eval Health Prof. 2021;18:15. doi: 10.3352/jeehp.2021.18.15. Epub 2021 Jul 5.

DOI:10.3352/jeehp.2021.18.15

PMID:34225413

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8381025/

Abstract

PURPOSE

Diagnostic classification models (DCMs) were developed to identify the mastery or non-mastery of the attributes required for solving test items, but their application has been limited to very low-level attributes, and the accuracy and consistency of high-level attributes using DCMs have rarely been reported compared with classical test theory (CTT) and item response theory models. This paper compared the accuracy of high-level attribute mastery between deterministic inputs, noisy “and” gate (DINA) and Rasch models, along with sub-scores based on CTT.

METHODS

First, a simulation study explored the effects of attribute length (number of items per attribute) and the correlations among attributes with respect to the accuracy of mastery. Second, a real-data study examined model and item fit and investigated the consistency of mastery for each attribute among the 3 models using the 2017 Korean Medical Licensing Examination with 360 items.

RESULTS

Accuracy of mastery increased with a higher number of items measuring each attribute across all conditions. The DINA model was more accurate than the CTT and Rasch models for attributes with high correlations (>0.5) and few items. In the real-data analysis, the DINA and Rasch models generally showed better item fits and appropriate model fit. The consistency of mastery between the Rasch and DINA models ranged from 0.541 to 0.633 and the correlations of person attribute scores between the Rasch and DINA models ranged from 0.579 to 0.786.

CONCLUSION

Although all 3 models provide a mastery decision for each examinee, the individual mastery profile using the DINA model provides more accurate decisions for attributes with high correlations than the CTT and Rasch models. The DINA model can also be directly applied to tests with complex structures, unlike the CTT and Rasch models, and it provides different diagnostic information from the CTT and Rasch models.

摘要

目的

诊断分类模型（DCM）旨在识别解决测试项目所需属性的掌握或未掌握情况，但它们的应用仅限于非常低层次的属性，与经典测试理论（CTT）和项目反应理论模型相比，使用 DCM 报告高级属性的准确性和一致性的情况很少。本文比较了确定性输入、噪声“与”门（DINA）和 Rasch 模型之间的高级属性掌握的准确性，以及基于 CTT 的子分数。

方法

首先，一项模拟研究探讨了属性长度（每个属性的项目数）和属性之间的相关性对掌握准确性的影响。其次，一项真实数据研究检验了模型和项目拟合情况，并使用 2017 年韩国医师执照考试的 360 个项目，研究了 3 种模型中每个属性的掌握情况的一致性。

结果

在所有条件下，随着用于测量每个属性的项目数量的增加，掌握的准确性也随之提高。在属性相关性较高（>0.5）和项目较少的情况下，DINA 模型比 CTT 和 Rasch 模型更准确。在真实数据分析中，DINA 和 Rasch 模型通常显示出更好的项目拟合和适当的模型拟合。Rasch 和 DINA 模型之间的掌握一致性范围为 0.541 至 0.633，Rasch 和 DINA 模型之间的人格属性得分的相关性范围为 0.579 至 0.786。

结论

尽管所有 3 种模型都为每个考生提供了掌握决策，但 DINA 模型提供的高相关性属性的个人掌握情况比 CTT 和 Rasch 模型更准确。与 CTT 和 Rasch 模型不同，DINA 模型可直接应用于具有复杂结构的测试，并且提供了与 CTT 和 Rasch 模型不同的诊断信息。

相似文献

The accuracy and consistency of mastery for each content domain using the Rasch and deterministic inputs, noisy “and” gate diagnostic classification models: a simulation study and a real-world analysis using data from the Korean Medical Licensing Examination.使用 Rasch 和确定性输入、嘈杂“与”门诊断分类模型对每个内容领域的掌握程度的准确性和一致性：一项基于韩国医师执照考试数据的模拟研究和真实世界分析。

J Educ Eval Health Prof. 2021;18:15. doi: 10.3352/jeehp.2021.18.15. Epub 2021 Jul 5.

Estimation of item parameters and examinees' mastery probability in each domain of the Korean medical licensing examination using deterministic inputs, noisy and gate(DINA) model.使用确定性输入、噪声与门(DINA)模型，对韩国医师执照考试各领域项目参数和考生掌握概率进行估计。

J Educ Eval Health Prof. 2020;17:35. doi: 10.3352/jeehp.2020.17.35. Epub 2020 Nov 17.

Examining Parameter Invariance in a General Diagnostic Classification Model.检验一般诊断分类模型中的参数不变性。

Front Psychol. 2020 Jan 13;10:2930. doi: 10.3389/fpsyg.2019.02930. eCollection 2019.

Bayesian Estimation of the DINA Q matrix.贝叶斯估计 DINA Q 矩阵。

Psychometrika. 2018 Mar;83(1):89-108. doi: 10.1007/s11336-017-9579-4. Epub 2017 Aug 31.

An Empirical Q-Matrix Validation Method for the Polytomous G-DINA Model.多选项 G-DINA 模型的实证 Q 矩阵验证方法。

Psychometrika. 2022 Jun;87(2):693-724. doi: 10.1007/s11336-021-09821-x. Epub 2021 Nov 29.

Scalable Bayesian Approach for the Dina Q-Matrix Estimation Combining Stochastic Optimization and Variational Inference.可扩展的贝叶斯方法用于结合随机优化和变分推断的 Dina Q-矩阵估计。

Psychometrika. 2023 Mar;88(1):302-331. doi: 10.1007/s11336-022-09884-4. Epub 2022 Sep 12.

Utilizing response times in cognitive diagnostic computerized adaptive testing under the higher-order deterministic input, noisy 'and' gate model.利用高阶确定性输入、噪声“与”门模型下认知诊断计算机自适应测验中的反应时间。

Br J Math Stat Psychol. 2020 Feb;73(1):109-141. doi: 10.1111/bmsp.12160. Epub 2019 Feb 22.

Sample Size Requirements for Applying Diagnostic Classification Models.应用诊断分类模型的样本量要求。

Front Psychol. 2021 Jan 25;11:621251. doi: 10.3389/fpsyg.2020.621251. eCollection 2020.

Probabilistic-Input, Noisy Conjunctive Models for Cognitive Diagnosis.用于认知诊断的概率输入噪声合取模型

Front Psychol. 2018 Jun 14;9:997. doi: 10.3389/fpsyg.2018.00997. eCollection 2018.

Estimating the DINA model parameters using the No-U-Turn Sampler.使用无回转采样器估计DINA模型参数。

Biom J. 2018 Mar;60(2):352-368. doi: 10.1002/bimj.201600225. Epub 2017 Dec 1.

引用本文的文献

Development of a character qualities test for medical students in Korea using polytomous item response theory and factor analysis: a preliminary scale development study.使用多项/item 反应理论和因素分析开发韩国医学生品格特质测验：初步量表开发研究。

J Educ Eval Health Prof. 2023;20:20. doi: 10.3352/jeehp.2023.20.20. Epub 2023 Jun 26.

本文引用的文献

Post-hoc simulation study of computerized adaptive testing for the Korean Medical Licensing Examination.韩国医师执照考试计算机化自适应测试的事后模拟研究

J Educ Eval Health Prof. 2018 May 17;15:14. doi: 10.3352/jeehp.2018.15.14. eCollection 2018.

Usefulness of the DETECT program for assessing the internal structure of dimensionality in simulated data and results of the Korean nursing licensing examination.DETECT程序在评估模拟数据维度的内部结构及韩国护士执照考试结果方面的效用。

J Educ Eval Health Prof. 2017 Dec 27;14:32. doi: 10.3352/jeehp.2017.14.32. eCollection 2017.

An analysis of dimensionality using factor analysis (true-score theory) and Rasch measurement: what is the difference? Which method is better?使用因子分析（真分数理论）和拉施测量法进行维度分析：有何差异？哪种方法更好？

J Appl Meas. 2005;6(1):80-99.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用 Rasch 和确定性输入、嘈杂“与”门诊断分类模型对每个内容领域的掌握程度的准确性和一致性：一项基于韩国医师执照考试数据的模拟研究和真实世界分析。

The accuracy and consistency of mastery for each content domain using the Rasch and deterministic inputs, noisy “and” gate diagnostic classification models: a simulation study and a real-world analysis using data from the Korean Medical Licensing Examination.

机构信息

出版信息

PURPOSE

METHODS

RESULTS

CONCLUSION

目的

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献