Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, Canada.
Children's Hospital of Eastern Ontario Research Institute, Ottawa, Canada.
Sci Rep. 2024 Apr 19;14(1):9013. doi: 10.1038/s41598-024-59248-4.
Deep learning algorithms have demonstrated remarkable potential in clinical diagnostics, particularly in the field of medical imaging. In this study, we investigated the application of deep learning models in early detection of fetal kidney anomalies. To provide an enhanced interpretation of those models' predictions, we proposed an adapted two-class representation and developed a multi-class model interpretation approach for problems with more than two labels and variable hierarchical grouping of labels. Additionally, we employed the explainable AI (XAI) visualization tools Grad-CAM and HiResCAM, to gain insights into model predictions and identify reasons for misclassifications. The study dataset consisted of 969 ultrasound images from unique patients; 646 control images and 323 cases of kidney anomalies, including 259 cases of unilateral urinary tract dilation and 64 cases of unilateral multicystic dysplastic kidney. The best performing model achieved a cross-validated area under the ROC curve of 91.28% ± 0.52%, with an overall accuracy of 84.03% ± 0.76%, sensitivity of 77.39% ± 1.99%, and specificity of 87.35% ± 1.28%. Our findings emphasize the potential of deep learning models in predicting kidney anomalies from limited prenatal ultrasound imagery. The proposed adaptations in model representation and interpretation represent a novel solution to multi-class prediction problems.
深度学习算法在临床诊断中表现出了显著的潜力,特别是在医学成像领域。在这项研究中,我们研究了深度学习模型在早期检测胎儿肾脏异常中的应用。为了更深入地理解这些模型的预测结果,我们提出了一种自适应的两分类表示方法,并开发了一种适用于多标签和标签可变层次分组问题的多分类模型解释方法。此外,我们还使用了可解释人工智能(XAI)可视化工具 Grad-CAM 和 HiResCAM,以深入了解模型预测并确定分类错误的原因。研究数据集由 969 张来自独特患者的超声图像组成;其中 646 张为对照组图像,323 张为肾脏异常病例,包括 259 例单侧尿路扩张和 64 例单侧多囊性发育不良肾脏。表现最佳的模型在交叉验证的 ROC 曲线下面积为 91.28%±0.52%,整体准确率为 84.03%±0.76%,灵敏度为 77.39%±1.99%,特异性为 87.35%±1.28%。我们的研究结果强调了深度学习模型在预测有限产前超声图像中的肾脏异常方面的潜力。模型表示和解释方面的改进代表了多分类预测问题的一种新的解决方案。