Sun Yingshi, Qu Yuhong, Wang Dong, Li Yi, Ye Lin, Du Jingbo, Xu Bing, Li Baoqing, Li Xiaoting, Zhang Kexin, Shi Yanjie, Sun Ruijia, Wang Yichuan, Long Rong, Chen Dengbo, Li Haijiao, Wang Liwei, Cao Min
Key Laboratory of Carcinogenesis and Translational Research (Ministry of Education/Beijing), Department of Radiology, Peking University Cancer Hospital & Institute, Beijing 100142, China.
Department of Radiology, Beijing Chaoyang Hospital, Capital Medical University, Beijing 100020, China.
Chin J Cancer Res. 2021 Dec 31;33(6):682-693. doi: 10.21147/j.issn.1000-9604.2021.06.05.
Computer-aided diagnosis using deep learning algorithms has been initially applied in the field of mammography, but there is no large-scale clinical application.
This study proposed to develop and verify an artificial intelligence model based on mammography. Firstly, mammograms retrospectively collected from six centers were randomized to a training dataset and a validation dataset for establishing the model. Secondly, the model was tested by comparing 12 radiologists' performance with and without it. Finally, prospectively enrolled women with mammograms from six centers were diagnosed by radiologists with the model. The detection and diagnostic capabilities were evaluated using the free-response receiver operating characteristic (FROC) curve and ROC curve.
The sensitivity of model for detecting lesions after matching was 0.908 for false positive rate of 0.25 in unilateral images. The area under ROC curve (AUC) to distinguish the benign lesions from malignant lesions was 0.855 [95% confidence interval (95% CI): 0.830, 0.880]. The performance of 12 radiologists with the model was higher than that of radiologists alone (AUC: 0.852 . 0.805, P=0.005). The mean reading time of with the model was shorter than that of reading alone (80.18 s . 62.28 s, P=0.032). In prospective application, the sensitivity of detection reached 0.887 at false positive rate of 0.25; the AUC of radiologists with the model was 0.983 (95% CI: 0.978, 0.988), with sensitivity, specificity, positive predictive value (PPV), and negative predictive value (NPV) of 94.36%, 98.07%, 87.76%, and 99.09%, respectively.
The artificial intelligence model exhibits high accuracy for detecting and diagnosing breast lesions, improves diagnostic accuracy and saves time.
利用深度学习算法的计算机辅助诊断已初步应用于乳腺钼靶领域,但尚无大规模临床应用。
本研究旨在开发并验证一种基于乳腺钼靶的人工智能模型。首先,将从六个中心回顾性收集的乳腺钼靶随机分为训练数据集和验证数据集以建立模型。其次,通过比较12名放射科医生在使用和不使用该模型时的表现来测试模型。最后,由放射科医生使用该模型对六个中心前瞻性纳入的有乳腺钼靶检查的女性进行诊断。使用自由响应接收器操作特征(FROC)曲线和ROC曲线评估检测和诊断能力。
在单侧图像中,模型在匹配后检测病变的灵敏度为0.908,假阳性率为0.25。区分良性病变和恶性病变的ROC曲线下面积(AUC)为0.855 [95%置信区间(95%CI):0.830,0.880]。12名放射科医生使用该模型时的表现高于单独使用时(AUC:0.852对0.805,P = 0.005)。使用模型时的平均阅读时间短于单独阅读时(80.18秒对62.28秒,P = 0.032)。在前瞻性应用中,假阳性率为0.25时检测灵敏度达到0.887;放射科医生使用该模型时的AUC为0.983(95%CI:0.978,0.988),灵敏度、特异度、阳性预测值(PPV)和阴性预测值(NPV)分别为94.36%、98.07%、87.76%和99.09%。
该人工智能模型在检测和诊断乳腺病变方面具有较高准确性,提高了诊断准确性并节省了时间。