Department of Radiology, Xinhua Hospital, Shanghai Jiao Tong University School of Medicine, No. 1665 Kongjiang Road, Shanghai, 200092, China.
Beijing Infervision Technology Co. Ltd., Beijing, 100025, China.
Eur Radiol. 2021 Aug;31(8):5902-5912. doi: 10.1007/s00330-020-07659-y. Epub 2021 Jan 26.
To investigate the value of full-field digital mammography-based deep learning (DL) in predicting malignancy of Breast Imaging Reporting and Data System (BI-RADS) 4 microcalcifications.
A total of 384 patients with 414 pathologically confirmed microcalcifications (221 malignant and 193 benign) were randomly allocated into the training, validation, and testing datasets (272/71/71 lesions) in this retrospective study. A combined DL model was developed incorporating mammography and clinical variables. Model performance was evaluated by using areas under the receiver operating characteristic curve (AUC) and compared with the clinical model, stand-alone DL image model, and BI-RADS approach. The predictive performance for malignancy was also compared between the combined model and human readers (2 juniors and 2 seniors).
The combined DL model demonstrated favorable AUC, sensitivity, and specificity of 0.910, 85.3%, and 91.9% in predicting BI-RADS 4 malignant microcalcifications in the testing dataset, which outperformed the clinical model, DL image model, and BI-RADS with AUCs of 0.799, 0.841, and 0.804, respectively. The combined model achieved non-inferior performance as senior radiologists (p = 0.860, p = 0.800) and outperformed junior radiologists (p = 0.155, p = 0.029). The diagnostic performance of two junior radiologists was improved after artificial intelligence assistance with AUCs increased to 0.854 and 0.901 from 0.816 (p = 0.556) and 0.773 (p = 0.046), while the interobserver agreement was improved with a kappa value increased to 0.843 from 0.331.
The combined deep learning model can improve the malignancy prediction of BI-RADS 4 microcalcifications in screening mammography and assist junior radiologists to achieve better performance, which can facilitate clinical decision-making.
• The combined deep learning model demonstrated high diagnostic power, sensitivity, and specificity for predicting malignant BI-RADS 4 mammographic microcalcifications. • The combined model achieved similar performance with senior breast radiologists, while it outperformed junior breast radiologists. • Deep learning could improve the diagnostic performance of junior radiologists and facilitate clinical decision-making.
研究基于全数字化乳腺摄影的深度学习(DL)在预测乳腺影像报告和数据系统(BI-RADS)4 微钙化恶性程度中的价值。
本回顾性研究共纳入 384 例经病理证实的 414 处微钙化患者(221 例恶性,193 例良性),将其随机分配至训练集、验证集和测试集(分别有 272/71/71 个病灶)。该研究构建了一个包含乳腺影像学和临床变量的联合 DL 模型。采用受试者工作特征曲线下面积(AUC)评估模型性能,并与临床模型、独立 DL 图像模型和 BI-RADS 方法进行比较。还比较了联合模型和 2 名初级和 2 名高级放射科医师的预测恶性肿瘤的性能。
在测试集中,联合 DL 模型对 BI-RADS 4 恶性微钙化的预测具有较高的 AUC、敏感度和特异度(AUC 为 0.910、85.3%和 91.9%),优于临床模型(AUC 为 0.799)、DL 图像模型(AUC 为 0.841)和 BI-RADS(AUC 为 0.804)。联合模型与高级放射科医师的表现相当(p=0.860,p=0.800),优于初级放射科医师(p=0.155,p=0.029)。人工智能辅助后,两名初级放射科医师的诊断性能提高,AUC 分别从 0.816(p=0.556)和 0.773(p=0.046)提高至 0.854(p=0.056)和 0.901(p=0.029),观察者间一致性也有所提高,kappa 值从 0.331 提高至 0.843。
联合深度学习模型可提高筛查性乳腺钼靶 BI-RADS 4 微钙化的恶性预测能力,辅助初级放射科医师获得更好的诊断性能,有助于临床决策。
• 联合深度学习模型对预测 BI-RADS 4 乳腺钼靶微钙化的恶性程度具有较高的诊断效能、敏感度和特异度。• 联合模型与高级乳腺放射科医师的表现相当,优于初级乳腺放射科医师。• 深度学习可提高初级放射科医师的诊断性能,有助于临床决策。