Department of Biomedical Science and Technology, Graduate School, Kyung Hee University, #23 Kyungheedae-ro, Dongdaemun-gu, Seoul 02447, Republic of Korea.
Department of Radiology, Kyung Hee University Hospital, Kyung Hee University College of Medicine, #23 Kyungheedae-ro, Dongdaemun-gu, Seoul 02447, Republic of Korea.
Eur J Radiol. 2022 Sep;154:110369. doi: 10.1016/j.ejrad.2022.110369. Epub 2022 May 23.
Mammography is the initial examination to detect breast cancer symptoms, and quality control of mammography devices is crucial to maintain accurate diagnosis and to safeguard against degradation of performance. The objective of this study was to assist radiologists in mammography phantom image evaluation by developing and validating an interpretable deep learning model capable of objectively evaluating the quality of standard phantom images for mammography.
A total of 2,208 mammography phantom images were collected for periodic accreditation of the scanner from 1,755 institutions. The dataset was randomly split into training (1,808 images) and testing (400 images) datasets with subgroups (76 images) for the multi-reader study. To develop an interpretable model that contains two deep learning networks in series, five processing steps were performed: mammography phantom detection, phantom object detection, post-processing, score evaluation, and a report with a comment about ambiguous results.
For phantom detection, the accuracy and mean intersection over union (mIOU) were 1.00 and 0.938 in the test dataset, respectively. During phantom object detection, a total of 6,369 out of 6,400 objects were detected as the correct object class, and the accuracy and mIOU were 0.995 and 0.813, respectively. The predicted score for each object showed a consensus of 97.40% excluding ambiguous points and 59.10% for ambiguous points of the groups.
The interpretable deep learning model using large-scale data from multiple centers shows high performance and reasonable object scoring, successfully validating the reliability and feasibility of mammography phantom image quality management.
乳腺 X 线摄影是检测乳腺癌症状的初始检查,乳腺 X 线摄影设备的质量控制对于保持准确诊断和防止性能下降至关重要。本研究的目的是通过开发和验证一个可解释的深度学习模型来协助放射科医生进行乳腺 X 线摄影体模图像评估,该模型能够客观地评估乳腺 X 线摄影体模标准图像的质量。
共收集了 1755 家机构的 2208 张乳腺 X 线摄影体模图像,用于扫描仪的定期认证。数据集随机分为训练集(1808 张图像)和测试集(400 张图像),其中子组(76 张图像)用于多读者研究。为了开发一个可解释的模型,该模型包含两个串联的深度学习网络,共进行了五个处理步骤:乳腺 X 线摄影体模检测、体模目标检测、后处理、评分评估以及带有模糊结果注释的报告。
在测试数据集,乳腺 X 线摄影体模检测的准确率和平均交并比(mIOU)分别为 1.00 和 0.938。在乳腺 X 线摄影体模目标检测中,总共检测到 6369 个目标对象,均为正确的目标对象类别,准确率和 mIOU 分别为 0.995 和 0.813。预测的每个目标对象的分数,在排除模糊点时,有 97.40%的一致性,在模糊点时,有 59.10%的一致性。
使用来自多个中心的大规模数据的可解释深度学习模型显示出了较高的性能和合理的目标评分,成功验证了乳腺 X 线摄影体模图像质量管理的可靠性和可行性。