Department of Nuclear Medicine, The Second Affiliated Hospital of Chongqing Medical University, Chong Qing, China.
Front Endocrinol (Lausanne). 2023 Aug 11;14:1224191. doi: 10.3389/fendo.2023.1224191. eCollection 2023.
The aim of this study was to improve the diagnostic performance of nuclear medicine physicians using a deep convolutional neural network (DCNN) model and validate the results with two multicenter datasets for thyroid disease by analyzing clinical single-photon emission computed tomography (SPECT) image data.
In this multicenter retrospective study, 3194 SPECT thyroid images were collected for model training (n=2067), internal validation (n=514) and external validation (n=613). First, four pretrained DCNN models (AlexNet, ShuffleNetV2, MobileNetV3 and ResNet-34) for were tested multiple medical image classification of thyroid disease types (i.e., Graves' disease, subacute thyroiditis, thyroid tumor and normal thyroid). The best performing model was then subjected to fivefold cross-validation to further assess its performance, and the diagnostic performance of this model was compared with that of junior and senior nuclear medicine physicians. Finally, class-specific attentional regions were visualized with attention heatmaps using gradient-weighted class activation mapping.
Each of the four pretrained neural networks attained an overall accuracy of more than 0.85 for the classification of SPECT thyroid images. The improved ResNet-34 model performed best, with an accuracy of 0.944. For the internal validation set, the ResNet-34 model showed higher accuracy ( < 0.001) when compared to that of the senior nuclear medicine physician, with an improvement of nearly 10%. Our model achieved an overall accuracy of 0.931 for the external dataset, a significantly higher accuracy than that of the senior physician (0.931 vs. 0.868, < 0.001).
The DCNN-based model performed well in terms of diagnosing thyroid scintillation images. The DCNN model showed higher sensitivity and greater specificity in identifying Graves' disease, subacute thyroiditis, and thyroid tumors compared to those of nuclear medicine physicians, illustrating the feasibility of deep learning models to improve the diagnostic efficiency for assisting clinicians.
本研究旨在通过分析临床单光子发射计算机断层扫描(SPECT)甲状腺影像数据,利用深度卷积神经网络(DCNN)模型提高核医学医师的诊断性能,并通过两个多中心甲状腺疾病数据集进行验证。
在这项多中心回顾性研究中,共采集了 3194 例 SPECT 甲状腺图像用于模型训练(n=2067)、内部验证(n=514)和外部验证(n=613)。首先,测试了四种预训练的 DCNN 模型(AlexNet、ShuffleNetV2、MobileNetV3 和 ResNet-34)对甲状腺疾病类型(即格雷夫斯病、亚急性甲状腺炎、甲状腺肿瘤和正常甲状腺)的多种医学图像分类。然后,将表现最好的模型进行五重交叉验证,以进一步评估其性能,并将该模型的诊断性能与初级和高级核医学医师进行比较。最后,使用梯度加权类激活映射可视化注意力热图,显示类特异性注意力区域。
四种预训练的神经网络对 SPECT 甲状腺图像的分类均达到了 0.85 以上的总体准确率。改进后的 ResNet-34 模型表现最佳,准确率为 0.944。对于内部验证集,ResNet-34 模型的准确率(<0.001)明显高于高级核医学医师,提高了近 10%。我们的模型对外部数据集的总体准确率为 0.931,明显高于高级医师(0.931 比 0.868,<0.001)。
基于 DCNN 的模型在诊断甲状腺闪烁成像方面表现良好。与核医学医师相比,DCNN 模型在识别格雷夫斯病、亚急性甲状腺炎和甲状腺肿瘤方面具有更高的敏感性和特异性,表明深度学习模型在提高辅助临床医生诊断效率方面具有可行性。