Gamble Cooper, Faghani Shahriar, Erickson Bradley J
From the Radiology Informatics Laboratory, Department of Radiology, Mayo Clinic, 200 1st St SW, Rochester, MN 55905.
Radiol Artif Intell. 2025 Mar;7(2):e240032. doi: 10.1148/ryai.240032.
Purpose To apply conformal prediction to a deep learning (DL) model for intracranial hemorrhage (ICH) detection and evaluate model performance in detection as well as model accuracy in identifying challenging cases. Materials and Methods This was a retrospective (November-December 2017) study of 491 noncontrast head CT volumes from the CQ500 dataset, in which three senior radiologists annotated sections containing ICH. The dataset was split into definite and challenging (uncertain) subsets, in which challenging images were defined as those in which there was disagreement among readers. A DL model was trained on patients from the definite data (training dataset) to perform ICH localization and classification into five classes. To develop an uncertainty-aware DL model, 1546 sections of the definite data (calibration dataset) were used for Mondrian conformal prediction (MCP). The uncertainty-aware DL model was tested on 8401 definite and challenging sections to assess its ability to identify challenging sections. The difference in predictive performance ( value) and ability to identify challenging sections (accuracy) were reported. Results The study included 146 patients (mean age, 45.7 years ± 9.9 [SD]; 76 [52.1%] men, 70 [47.9%] women). After the MCP procedure, the model achieved an F1 score of 0.919 for localization and classification. Additionally, it correctly identified patients with challenging cases with 95.3% (143 of 150) accuracy. It did not incorrectly label any definite sections as challenging. Conclusion The uncertainty-aware MCP-augmented DL model achieved high performance in ICH detection and high accuracy in identifying challenging sections, suggesting its usefulness in automated ICH detection and potential to increase trustworthiness of DL models in radiology. CT, Head and Neck, Brain, Brain Stem, Hemorrhage, Feature Detection, Diagnosis, Supervised Learning © RSNA, 2025 See also commentary by Ngum and Filippi in this issue.
目的 将共形预测应用于用于颅内出血(ICH)检测的深度学习(DL)模型,并评估该模型在检测方面的性能以及识别具有挑战性病例的模型准确性。材料与方法 这是一项对CQ500数据集中491例非增强头部CT容积进行的回顾性研究(2017年11月至12月),其中三位资深放射科医生对包含ICH的切片进行了标注。数据集被分为明确和具有挑战性(不确定)的子集,其中具有挑战性的图像被定义为读者之间存在分歧的图像。在明确数据(训练数据集)的患者上训练一个DL模型,以进行ICH定位并分类为五个类别。为了开发一个不确定性感知DL模型,使用了1546个明确数据的切片(校准数据集)进行蒙德里安共形预测(MCP)。在8401个明确和具有挑战性的切片上对不确定性感知DL模型进行测试,以评估其识别具有挑战性切片的能力。报告了预测性能(F1值)和识别具有挑战性切片的能力(准确性)的差异。结果 该研究纳入了146例患者(平均年龄45.7岁±9.9[标准差];男性76例[52.1%],女性70例[47.9%])。经过MCP程序后,该模型在定位和分类方面的F1分数为0.919。此外,它正确识别具有挑战性病例的患者的准确率为95.3%(150例中的143例)。它没有将任何明确的切片错误标记为具有挑战性。结论 不确定性感知的MCP增强DL模型在ICH检测中实现了高性能,在识别具有挑战性切片方面具有高准确性,表明其在自动ICH检测中的有用性以及提高放射学中DL模型可信度的潜力。CT、头颈部、脑、脑干、出血、特征检测、诊断、监督学习 © RSNA,2025 另见本期Ngum和Filippi的评论。