Mikulić Mateo, Vičević Dominik, Nagy Eszter, Napravnik Mateja, Štajduhar Ivan, Tschauner Sebastian, Hržić Franko
University of Rijeka, Faculty of Engineering, Department of Computer Engineering, Vukovarska 58, Rijeka, 51000, Croatia.
Medical University of Graz, Department of Radiology, Division of Pediatric Radiology, Graz, 8036, Austria.
J Imaging Inform Med. 2025 Feb;38(1):177-190. doi: 10.1007/s10278-024-01194-8. Epub 2024 Jul 17.
Multiple studies within the medical field have highlighted the remarkable effectiveness of using convolutional neural networks for predicting medical conditions, sometimes even surpassing that of medical professionals. Despite their great performance, convolutional neural networks operate as black boxes, potentially arriving at correct conclusions for incorrect reasons or areas of focus. Our work explores the possibility of mitigating this phenomenon by identifying and occluding confounding variables within images. Specifically, we focused on the prediction of osteopenia, a serious medical condition, using the publicly available GRAZPEDWRI-DX dataset. After detection of the confounding variables in the dataset, we generated masks that occlude regions of images associated with those variables. By doing so, models were forced to focus on different parts of the images for classification. Model evaluation using F1-score, precision, and recall showed that models trained on non-occluded images typically outperformed models trained on occluded images. However, a test where radiologists had to choose a model based on the focused regions extracted by the GRAD-CAM method showcased different outcomes. The radiologists' preference shifted towards models trained on the occluded images. These results suggest that while occluding confounding variables may degrade model performance, it enhances interpretability, providing more reliable insights into the reasoning behind predictions. The code to repeat our experiment is available on the following link: https://github.com/mikulicmateo/osteopenia .
医学领域的多项研究强调了使用卷积神经网络预测医疗状况的显著有效性,有时甚至超过医学专业人员。尽管卷积神经网络表现出色,但它们如同黑箱运作,可能会因错误的原因或关注点得出正确结论。我们的工作探索了通过识别和遮挡图像中的混杂变量来减轻这一现象的可能性。具体而言,我们使用公开可用的GRAZPEDWRI-DX数据集专注于骨质减少(一种严重的医疗状况)的预测。在检测到数据集中的混杂变量后,我们生成了遮挡与这些变量相关图像区域的掩码。通过这样做,模型被迫专注于图像的不同部分进行分类。使用F1分数、精确率和召回率进行的模型评估表明,在未遮挡图像上训练的模型通常优于在遮挡图像上训练的模型。然而,在一项放射科医生必须根据GRAD-CAM方法提取的聚焦区域选择模型的测试中,结果却有所不同。放射科医生的偏好转向了在遮挡图像上训练的模型。这些结果表明,虽然遮挡混杂变量可能会降低模型性能,但它增强了可解释性,为预测背后的推理提供了更可靠的见解。重复我们实验的代码可在以下链接获取:https://github.com/mikulicmateo/osteopenia 。