Hsu Chia-Ling, Wang Wen-Chung
The Education University of Hong Kong, New Territories, China.
Hong Kong Examinations and Assessment Authority, New Territories, China.
Appl Psychol Meas. 2022 May;46(3):185-199. doi: 10.1177/01466216211066610. Epub 2022 Mar 1.
Cognitive diagnosis computerized adaptive testing (CD-CAT) aims to identify each examinee's strengths and weaknesses on latent attributes for appropriate classification into an attribute profile. As the cost of a CD-CAT misclassification differs across user needs (e.g., remedial program vs. scholarship eligibilities), item selection can incorporate such costs to improve measurement efficiency. This study proposes such a method, (MER), based on Bayesian decision theory. According to simulations, using MER to identify examinees with no mastery (MER-U0) or full mastery (MER-U1) showed greater classification accuracy and efficiency than other methods for these attribute profiles, especially for shorter tests or low quality item banks. For other attribute profiles, regardless of item quality or termination criterion, MER methods, modified posterior-weighted Kullback-Leibler information (MPWKL), posterior-weighted CDM discrimination index (PWCDI), and Shannon entropy (SHE) performed similarly and outperformed posterior-weighted attribute-level CDM discrimination index (PWACDI) in classification accuracy and test efficiency, especially on short tests. MER with a zero-one loss function, MER-U0, MER-U1, and PWACDI utilized item banks more effectively than the other methods. Overall, these results show the feasibility of using MER in CD-CAT to increase the accuracy for specific attribute profiles to address different user needs.
认知诊断计算机自适应测试(CD - CAT)旨在识别每个考生在潜在属性方面的优势和劣势,以便将其合理分类到一个属性配置文件中。由于CD - CAT误分类的成本因用户需求而异(例如,辅导计划与奖学金资格),项目选择可以纳入此类成本以提高测量效率。本研究基于贝叶斯决策理论提出了这样一种方法,即最大期望收益(MER)。根据模拟结果,使用MER来识别未掌握(MER - U0)或完全掌握(MER - U1)的考生,对于这些属性配置文件,比其他方法具有更高的分类准确性和效率,特别是对于较短的测试或质量较低的题库。对于其他属性配置文件,无论项目质量或终止标准如何,MER方法、修正后的后验加权库尔贝克 - 莱布勒信息(MPWKL)、后验加权CDM鉴别指数(PWCDI)和香农熵(SHE)的表现相似,并且在分类准确性和测试效率方面优于后验加权属性级CDM鉴别指数(PWACDI),尤其是在短测试中。具有零一损失函数的MER、MER - U0、MER - U1和PWACDI比其他方法更有效地利用了题库。总体而言,这些结果表明在CD - CAT中使用MER来提高特定属性配置文件的准确性以满足不同用户需求的可行性。