School of Science, Zhejiang University of Science and Technology, Hangzhou 310012, China.
School of Microelectronics, Shandong University, Jinan 250101, China; Shenzhen Research Institute of Shandong University, A301 Virtual University Park in South District of Shenzhen, China.
Comput Biol Med. 2022 Nov;150:106082. doi: 10.1016/j.compbiomed.2022.106082. Epub 2022 Sep 15.
Early accurate mammography screening and diagnosis can reduce the mortality of breast cancer. Although CNN-based breast cancer computer-aided diagnosis (CAD) systems have achieved significant results in recent years, precise diagnosis of lesions in mammogram remains a challenge due to low signal-to-noise ratio (SNR) and physiological characteristics. Many researchers achieved excellent performance in detecting mammographic images by inputting region of interest (ROI) annotations while ROI annotations require a great quantity of manual labor, time and resources. We propose a two-stage method that combines images preprocessing and model optimization to address the aforementioned challenges. Firstly, we propose the breast database preprocess (BDP) method to preprocess INbreast then we get INbreast. The only label we need is benign or malignant label of one mammogram, not manual labeling such as ROI annotations. Secondly, we apply focal loss to ECA-Net50 which is an improved model based on ResNet50 with efficient channel attention (ECA) module. Our method can adaptively extract the key features of mammograms, meanwhile solving the problem of hard-to-classify samples and unbalanced categories. The AUC value of our method on INbreast is 0.960, accuracy is 0.929, Recall is 0.928. The precision of our method on INbreast is 0.883 which improved by 0.254 compared to ResNet50. In addition, we use Grad-CAM to visualize the effect of our model. The visualized heatmaps extracted by our method can focus more on lesion regions. Both numerical and visualized experiments demonstrate that our method achieves satisfactory performance.
早期准确的乳腺 X 线摄影筛查和诊断可以降低乳腺癌的死亡率。尽管基于 CNN 的乳腺癌计算机辅助诊断 (CAD) 系统近年来已经取得了显著的成果,但由于信噪比 (SNR) 和生理特征较低,对乳腺 X 线片中病变的精确诊断仍然是一个挑战。许多研究人员通过输入感兴趣区域 (ROI) 注释来实现对乳腺 X 线图像的出色检测性能,而 ROI 注释需要大量的人工、时间和资源。我们提出了一种两阶段的方法,结合图像预处理和模型优化来解决上述挑战。首先,我们提出了乳腺数据库预处理 (BDP) 方法来预处理 INbreast,然后得到 INbreast。我们只需要一个乳腺 X 线片的良性或恶性标签,而不是 ROI 注释等手动标记。其次,我们将焦点损失应用于基于 ResNet50 的改进模型 ECA-Net50,该模型具有高效的通道注意力 (ECA) 模块。我们的方法可以自适应地提取乳腺 X 线片的关键特征,同时解决难以分类样本和不平衡类别问题。我们的方法在 INbreast 上的 AUC 值为 0.960,准确率为 0.929,召回率为 0.928。我们的方法在 INbreast 上的精度为 0.883,与 ResNet50 相比提高了 0.254。此外,我们使用 Grad-CAM 来可视化我们模型的效果。我们的方法提取的可视化热图可以更集中于病变区域。数值和可视化实验均表明,我们的方法取得了令人满意的性能。