Optimizing statistical evaluation of multiclass classification in diagnostic radiology: a study of the two-parameter multidimensional nominal response model.

Author information

Nishio Mizuho, Ota Eiji

Affiliations

Kobe University, Kobe, Japan.

Futaba Numerical Technologies, Iruma, Japan.

Publication information

PeerJ Comput Sci. 2024 Oct 4;10:e2380. doi: 10.7717/peerj-cs.2380. eCollection 2024.

Abstract

PURPOSE

This study aimed to enhance the multidimensional nominal response model (MDNRM) for multiclass classification in diagnostic radiology.

MATERIALS AND METHODS

This retrospective study extended the conventional nominal response model (NRM) to create the two-parameter MDNRM (2PL-MDNRM). Seven MDNRM models, comprising the original MDNRM and subtypes of the 2PL-MDNRM, were used to estimate test-takers' abilities and test item complexity. These models were applied to a clinical diagnostic radiology dataset. Rhat values were calculated to evaluate model convergence. Additionally, the widely applicable information criterion (wAIC) and Pareto-smoothed importance sampling leave-one-out cross-validation (LOO) were calculated to evaluate the goodness of fit of the seven models, and the best-performing model was selected based on these values. Probability of direction (PD) was used to evaluate whether one estimated parameter differed significantly from another.
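
As an illustration of the PD step described above, the following minimal sketch computes the probability of direction for the difference between two ability parameters. It assumes posterior draws are available as plain Python lists; the draws here are synthetic stand-ins generated with the standard library, and the names `probability_of_direction`, `ability_a`, and `ability_b` are hypothetical, not taken from the study. The decision threshold used by the authors is not stated in this abstract, so none is applied.

```python
import random


def probability_of_direction(draws):
    """Return the share of posterior draws on the dominant side of zero."""
    n = len(draws)
    positive = sum(1 for d in draws if d > 0)
    negative = sum(1 for d in draws if d < 0)
    return max(positive, negative) / n


# Synthetic stand-ins for posterior draws of two test-takers' abilities
# (assumption: the real draws would come from the fitted model).
rng = random.Random(0)
ability_a = [rng.gauss(1.0, 0.5) for _ in range(4000)]
ability_b = [rng.gauss(0.0, 0.5) for _ in range(4000)]

# PD is evaluated on the draw-by-draw difference between the two parameters.
difference = [a - b for a, b in zip(ability_a, ability_b)]
pd_value = probability_of_direction(difference)
print(f"PD of ability difference: {pd_value:.3f}")
```

A PD close to 1 means nearly all posterior mass for the difference lies on one side of zero; a PD near 0.5 means the direction of the difference is undetermined.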

RESULTS

All estimated parameters across the seven models had Rhat values below 1.10, indicating stable convergence. The best wAIC and LOO values (988 and 1,121, respectively) were both achieved with 2PL-MDNRM using the truncated normal distribution. Notably, based on PD results from the best models, one test-taker (radiologist) exhibited significantly superior ability compared with another, whereas no significant difference was observed in the nonoptimal models.
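
The convergence check reported above can be sketched as follows. This is a minimal version of the classic Gelman-Rubin Rhat on synthetic chains, compared against the 1.10 cutoff mentioned in the results. It is an assumption that this basic form is adequate for illustration; the exact Rhat variant used in the study is not stated in this abstract, and current Bayesian software often reports a rank-normalized split version instead. The function name `gelman_rubin_rhat` is hypothetical.

```python
import random
from statistics import mean, variance


def gelman_rubin_rhat(chains):
    """Classic (non-split) Gelman-Rubin Rhat for equally long chains."""
    n = len(chains[0])
    chain_means = [mean(chain) for chain in chains]
    # W: average within-chain sample variance.
    within = mean([variance(chain) for chain in chains])
    # B: between-chain variance, scaled by the chain length.
    between = n * variance(chain_means)
    pooled = (n - 1) / n * within + between / n
    return (pooled / within) ** 0.5


rng = random.Random(1)
# Four synthetic chains drawn from the same distribution (well mixed).
mixed = [[rng.gauss(0.0, 1.0) for _ in range(1000)] for _ in range(4)]
# Four synthetic chains stuck around two different values (not mixed).
stuck = [[rng.gauss(mu, 1.0) for _ in range(1000)] for mu in (0.0, 0.0, 5.0, 5.0)]

for label, chains in (("mixed", mixed), ("stuck", stuck)):
    rhat = gelman_rubin_rhat(chains)
    print(f"{label}: Rhat = {rhat:.3f}, below 1.10: {rhat < 1.10}")
```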

CONCLUSION

2PL-MDNRM successfully achieved parameter estimation convergence, and its superiority over the original MDNRM was demonstrated through wAIC and LOO values.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5828/11623241/bdb826e372ab/peerj-cs-10-2380-g001.jpg
