Nishio Mizuho, Ota Eiji
Kobe University, Kobe, Japan.
Futaba Numerical Technologies, Iruma, Japan.
PeerJ Comput Sci. 2024 Oct 4;10:e2380. doi: 10.7717/peerj-cs.2380. eCollection 2024.
This study aimed to enhance the multidimensional nominal response model (MDNRM) for multiclass classification in diagnostic radiology.
This retrospective study involved the extension of the conventional nominal response model (NRM) to create the two-parameter MDNRM (2PL-MDNRM). Seven models of MDNRM, including the original MDNRM and subtypes of 2PL-MDNRM, were employed to estimate test-takers' abilities and test item complexity. These models were applied to a clinical diagnostic radiology dataset. Rhat values were calculated to evaluate model convergence. Additionally, values of the widely applicable information criterion (wAIC) and Pareto-smoothed importance sampling leave-one-out cross-validation (LOO) were calculated to evaluate the goodness of fit of the seven models. The best-performing model was selected based on the values of wAIC and LOO. Probability of direction (PD) was used to evaluate whether one estimated parameter significantly differed.
All estimated parameters across the seven models demonstrated Rhat values below 1.10, indicating stable convergence. The best wAIC and LOO values (988 and 1,121, respectively) were achieved with 2PL-MDNRM using the truncated normal distribution and 2PL-MDNRM using the truncated normal distribution. Notably, one test-taker (radiologist) exhibited significantly superior ability compared to another based on PD results from the best models, while no significant difference was observed in nonoptimal models.
2PL-MDNRM successfully achieved parameter estimation convergence, and its superiority over the original MDNRM was demonstrated through wAIC and LOO values.
本研究旨在改进用于放射诊断学多类别分类的多维名义反应模型(MDNRM)。
这项回顾性研究涉及对传统名义反应模型(NRM)进行扩展,以创建双参数MDNRM(2PL-MDNRM)。采用七种MDNRM模型,包括原始MDNRM和2PL-MDNRM的亚型,来估计考生的能力和试题难度。这些模型应用于临床放射诊断数据集。计算Rhat值以评估模型收敛情况。此外,计算广泛适用信息准则(wAIC)值和帕累托平滑重要性抽样留一法交叉验证(LOO)值,以评估这七种模型的拟合优度。根据wAIC和LOO值选择表现最佳的模型。使用方向概率(PD)来评估一个估计参数是否存在显著差异。
七个模型的所有估计参数的Rhat值均低于1.10,表明收敛稳定。使用截断正态分布的2PL-MDNRM和使用截断正态分布的2PL-MDNRM分别获得了最佳的wAIC和LOO值(分别为988和1121)。值得注意的是,根据最佳模型的PD结果,一名考生(放射科医生)表现出明显优于另一名考生的能力,而在非最优模型中未观察到显著差异。
2PL-MDNRM成功实现了参数估计收敛,并且通过wAIC和LOO值证明了其优于原始MDNRM。