Department of Healthcare Information Technology, Inje University, Gimhae, Republic of Korea.
Department of Radiology, Gangnam Severance Hospital, Yonsei University College of Medicine, 211 Eonju-ro, Gangnam-gu, Seoul, 06273, Republic of Korea.
Sci Rep. 2021 Dec 14;11(1):23925. doi: 10.1038/s41598-021-03516-0.
This study aimed to assess the diagnostic performance of deep convolutional neural networks (DCNNs) in classifying breast microcalcification in screening mammograms. To this end, 1579 mammographic images were collected retrospectively from patients exhibiting suspicious microcalcification in screening mammograms between July 2007 and December 2019. Five pre-trained DCNN models and an ensemble model were used to classify the microcalcifications as either malignant or benign. Approximately one million images from the ImageNet database had been used to train the five DCNN models. Herein, 1121 mammographic images were used for individual model fine-tuning, 198 for validation, and 260 for testing. Gradient-weighted class activation mapping (Grad-CAM) was used to confirm the validity of the DCNN models in highlighting the microcalcification regions most critical for determining the final class. The ensemble model yielded the best AUC (0.856). The DenseNet-201 model achieved the best sensitivity (82.47%) and negative predictive value (NPV; 86.92%). The ResNet-101 model yielded the best accuracy (81.54%), specificity (91.41%), and positive predictive value (PPV; 81.82%). The high PPV and specificity achieved by the ResNet-101 model, in particular, demonstrated the model effectiveness in microcalcification diagnosis, which, in turn, may considerably help reduce unnecessary biopsies.
本研究旨在评估深度卷积神经网络(DCNN)在筛查性乳房 X 光片中对乳腺微钙化进行分类的诊断性能。为此,回顾性地从 2007 年 7 月至 2019 年 12 月在筛查性乳房 X 光片中出现可疑微钙化的患者中收集了 1579 张乳房 X 光图像。使用了五个预训练的 DCNN 模型和一个集成模型来对微钙化进行分类,判断其为恶性或良性。这五个 DCNN 模型均使用来自 ImageNet 数据库的约 100 万张图像进行训练。在这里,1121 张乳房 X 光图像用于对单个模型进行微调,198 张用于验证,260 张用于测试。使用梯度加权类激活映射(Grad-CAM)来确认 DCNN 模型在突出对确定最终类别最关键的微钙化区域方面的有效性。集成模型产生了最佳的 AUC(0.856)。DenseNet-201 模型实现了最佳的敏感性(82.47%)和阴性预测值(NPV;86.92%)。ResNet-101 模型产生了最佳的准确率(81.54%)、特异性(91.41%)和阳性预测值(PPV;81.82%)。ResNet-101 模型的高 PPV 和特异性尤其证明了该模型在微钙化诊断方面的有效性,这反过来又可以极大地帮助减少不必要的活检。