• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

强制选择项目和非认知测试的诊断分类模型。

Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests.

作者信息

Huang Hung-Yu

机构信息

University of Taipei, Taiwan.

出版信息

Educ Psychol Meas. 2023 Feb;83(1):146-180. doi: 10.1177/00131644211069906. Epub 2022 Jan 7.

DOI:10.1177/00131644211069906
PMID:36601255
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9806518/
Abstract

The forced-choice (FC) item formats used for noncognitive tests typically develop a set of response options that measure different traits and instruct respondents to make judgments among these options in terms of their preference to control the response biases that are commonly observed in normative tests. Diagnostic classification models (DCMs) can provide information regarding the mastery status of test takers on latent discrete variables and are more commonly used for cognitive tests employed in educational settings than for noncognitive tests. The purpose of this study is to develop a new class of DCM for FC items under the higher-order DCM framework to meet the practical demands of simultaneously controlling for response biases and providing diagnostic classification information. By conducting a series of simulations and calibrating the model parameters with a Bayesian estimation, the study shows that, in general, the model parameters can be recovered satisfactorily with the use of long tests and large samples. More attributes improve the precision of the second-order latent trait estimation in a long test, but decrease the classification accuracy and the estimation quality of the structural parameters. When statements are allowed to load on two distinct attributes in paired comparison items, the specific-attribute condition produces better a parameter estimation than the overlap-attribute condition. Finally, an empirical analysis related to work-motivation measures is presented to demonstrate the applications and implications of the new model.

摘要

用于非认知测试的强制选择(FC)项目格式通常会开发一组测量不同特质的响应选项,并指示受访者根据自己的偏好对这些选项进行判断,以控制在规范性测试中常见的响应偏差。诊断分类模型(DCM)可以提供有关考生在潜在离散变量上的掌握状态的信息,并且在教育环境中用于认知测试的情况比用于非认知测试更为常见。本研究的目的是在高阶DCM框架下为FC项目开发一类新的DCM,以满足同时控制响应偏差和提供诊断分类信息的实际需求。通过进行一系列模拟并使用贝叶斯估计对模型参数进行校准,研究表明,一般来说,使用长测试和大样本可以令人满意地恢复模型参数。更多属性在长测试中提高了二阶潜在特质估计的精度,但降低了分类准确性和结构参数的估计质量。当在配对比较项目中允许陈述加载到两个不同的属性上时,特定属性条件比重叠属性条件产生更好的参数估计。最后,给出了一项与工作动机测量相关的实证分析,以展示新模型的应用和意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/9806518/de51c98e00ab/10.1177_00131644211069906-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/9806518/6e3821798e59/10.1177_00131644211069906-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/9806518/de51c98e00ab/10.1177_00131644211069906-fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/9806518/6e3821798e59/10.1177_00131644211069906-fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7ffb/9806518/de51c98e00ab/10.1177_00131644211069906-fig2.jpg

相似文献

1
Diagnostic Classification Model for Forced-Choice Items and Noncognitive Tests.强制选择项目和非认知测试的诊断分类模型。
Educ Psychol Meas. 2023 Feb;83(1):146-180. doi: 10.1177/00131644211069906. Epub 2022 Jan 7.
2
Mixture Random-Effect IRT Models for Controlling Extreme Response Style on Rating Scales.用于控制量表极端反应风格的混合随机效应项目反应理论模型。
Front Psychol. 2016 Nov 2;7:1706. doi: 10.3389/fpsyg.2016.01706. eCollection 2016.
3
Examining Parameter Invariance in a General Diagnostic Classification Model.检验一般诊断分类模型中的参数不变性。
Front Psychol. 2020 Jan 13;10:2930. doi: 10.3389/fpsyg.2019.02930. eCollection 2019.
4
Item Response Theory Models for Ipsative Tests With Multidimensional Pairwise Comparison Items.具有多维成对比较项目的自比测验的项目反应理论模型
Appl Psychol Meas. 2017 Nov;41(8):600-613. doi: 10.1177/0146621617703183. Epub 2017 Apr 9.
5
The accuracy and consistency of mastery for each content domain using the Rasch and deterministic inputs, noisy “and” gate diagnostic classification models: a simulation study and a real-world analysis using data from the Korean Medical Licensing Examination.使用 Rasch 和确定性输入、嘈杂“与”门诊断分类模型对每个内容领域的掌握程度的准确性和一致性:一项基于韩国医师执照考试数据的模拟研究和真实世界分析。
J Educ Eval Health Prof. 2021;18:15. doi: 10.3352/jeehp.2021.18.15. Epub 2021 Jul 5.
6
A Dominance Variant Under the Multi-Unidimensional Pairwise-Preference Framework: Model Formulation and Markov Chain Monte Carlo Estimation.多维度成对偏好框架下的一个显性变异:模型构建与马尔可夫链蒙特卡罗估计
Appl Psychol Meas. 2016 Oct;40(7):500-516. doi: 10.1177/0146621616662226. Epub 2016 Aug 20.
7
On Bank Assembly and Block Selection in Multidimensional Forced-Choice Adaptive Assessments.多维强制选择自适应评估中的题库组装与题目选择
Educ Psychol Meas. 2023 Apr;83(2):294-321. doi: 10.1177/00131644221087986. Epub 2022 Apr 28.
8
Comparing Traditional and IRT Scoring of Forced-Choice Tests.比较强制选择测试的传统评分与项目反应理论评分
Appl Psychol Meas. 2015 Nov;39(8):598-612. doi: 10.1177/0146621615585851. Epub 2015 May 19.
9
MCMC Z-G: An IRT Computer Program for Forced-Choice Noncognitive Measurement.MCMC Z - G:一种用于强制选择非认知测量的IRT计算机程序。
Appl Psychol Meas. 2016 Oct;40(7):551-553. doi: 10.1177/0146621616663682. Epub 2016 Aug 20.
10
A Bayesian Random Block Item Response Theory Model for Forced-Choice Formats.一种用于强制选择格式的贝叶斯随机块项目反应理论模型。
Educ Psychol Meas. 2020 Jun;80(3):578-603. doi: 10.1177/0013164419871659. Epub 2019 Aug 27.

引用本文的文献

1
Improving reliability estimation in cognitive diagnosis modeling.提高认知诊断建模中的可靠性估计
Behav Res Methods. 2023 Oct;55(7):3446-3460. doi: 10.3758/s13428-022-01967-5. Epub 2022 Sep 20.

本文引用的文献

1
Inferring Latent Structure in Polytomous Data with a Higher-Order Diagnostic Model.用高阶诊断模型推断多项数据中的潜在结构。
Multivariate Behav Res. 2023 Mar-Apr;58(2):368-386. doi: 10.1080/00273171.2021.1985949. Epub 2021 Oct 26.
2
On the Statistical and Practical Limitations of Thurstonian IRT Models.关于瑟斯顿IRT模型的统计及实际局限性
Educ Psychol Meas. 2019 Oct;79(5):827-854. doi: 10.1177/0013164419832063. Epub 2019 Feb 22.
3
Adaptive testing with the GGUM-RANK multidimensional forced choice model: Comparison of pair, triplet, and tetrad scoring.
使用 GGUM-RANK 多维迫选模型进行自适应测试:对偶、三元组和四元组评分的比较。
Behav Res Methods. 2020 Apr;52(2):761-772. doi: 10.3758/s13428-019-01274-6.
4
A Law of Comparative Preference: Distinctions Between Models of Personal Preference and Impersonal Judgment in Pair Comparison Designs.比较偏好定律:成对比较设计中个人偏好模型与客观判断模型之间的区别
Appl Psychol Meas. 2019 May;43(3):181-194. doi: 10.1177/0146621617738014. Epub 2017 Nov 2.
5
Utilizing response times in cognitive diagnostic computerized adaptive testing under the higher-order deterministic input, noisy 'and' gate model.利用高阶确定性输入、噪声“与”门模型下认知诊断计算机自适应测验中的反应时间。
Br J Math Stat Psychol. 2020 Feb;73(1):109-141. doi: 10.1111/bmsp.12160. Epub 2019 Feb 22.
6
On the Identifiability of Diagnostic Classification Models.诊断分类模型的可识别性研究
Psychometrika. 2019 Mar;84(1):19-40. doi: 10.1007/s11336-018-09658-x. Epub 2019 Jan 23.
7
Validation of a Questionnaire for Personality Profiling using Cognitive Diagnostic Modeling.使用认知诊断模型对一份人格剖析问卷进行验证
Span J Psychol. 2018 Dec 3;21:E63. doi: 10.1017/sjp.2018.62.
8
Retrofitting Diagnostic Classification Models to Responses From IRT-Based Assessment Forms.将诊断分类模型改造为基于项目反应理论(IRT)的评估表的响应。
Educ Psychol Meas. 2018 Jun;78(3):357-383. doi: 10.1177/0013164416685599. Epub 2017 Jan 8.
9
Item Response Theory Models for Ipsative Tests With Multidimensional Pairwise Comparison Items.具有多维成对比较项目的自比测验的项目反应理论模型
Appl Psychol Meas. 2017 Nov;41(8):600-613. doi: 10.1177/0146621617703183. Epub 2017 Apr 9.
10
A Dominance Variant Under the Multi-Unidimensional Pairwise-Preference Framework: Model Formulation and Markov Chain Monte Carlo Estimation.多维度成对偏好框架下的一个显性变异:模型构建与马尔可夫链蒙特卡罗估计
Appl Psychol Meas. 2016 Oct;40(7):500-516. doi: 10.1177/0146621616662226. Epub 2016 Aug 20.