Cui Rongsheng, Yang Runzhuo, Liu Feng, Geng Hua
College of Electronic Information and Optical Engineering, Nankai University, Tianjin, China.
College of Electronic Information and Optical Engineering, Nankai University, Tianjin, China; Tianjin Key Laboratory of Optoelectronic Sensor and Sensing Network Technology, Nankai University, Tianjin, China.
Comput Biol Med. 2023 Jan;152:106384. doi: 10.1016/j.compbiomed.2022.106384. Epub 2022 Nov 30.
The convolutional neural networks (CNNs) have been widely proposed in the medical image analysis tasks, especially in the image segmentations. In recent years, the encoder-decoder structures, such as the U-Net, were rendered. However, the multi-scale information transmission and effective modeling for long-range feature dependencies in these structures were not sufficiently considered. To improve the performance of the existing methods, we propose a novel hybrid dual dilated attention network (HDA-Net) to conduct the lesion region segmentations. In the proposed network, we innovatively present the comprehensive hybrid dilated convolution (CHDC) module, which facilitates the transmission of the multi-scale information. Based on the CHDC module and the attention mechanisms, we design a novel dual dilated gated attention (DDGA) block to enhance the saliency of related regions from the multi-scale aspect. Besides, a dilated dense (DD) block is designed to expand the receptive fields. The ablation studies were performed to verify our proposed blocks. Besides, the interpretability of the HDA-Net was analyzed through the visualization of the attention weight maps from the key blocks. Compared to the state-of-the-art methods including CA-Net, DeepLabV3+, and Attention U-Net, the HDA-Net outperforms significantly, with the metrics of Dice, Average Symmetric Surface Distance (ASSD), and mean Intersection-over-Union (mIoU) reaching 93.16%, 93.63%, and 94.72%, 0.36 pix, 0.69 pix, and 0.52 pix, and 88.03%, 88.67%, and 90.33% on three publicly available medical image datasets: MAEDE-MAFTOUNI (COVID-19 CT), ISIC-2018 (Melanoma Dermoscopy), and Kvasir-SEG (Gastrointestinal Disease Polyp), respectively.
卷积神经网络(CNN)已在医学图像分析任务中被广泛提出,尤其是在图像分割方面。近年来,诸如U-Net等编码器-解码器结构被提出。然而,这些结构中多尺度信息传输以及对长距离特征依赖的有效建模未得到充分考虑。为提高现有方法的性能,我们提出一种新颖的混合双扩张注意力网络(HDA-Net)来进行病变区域分割。在所提出的网络中,我们创新性地提出了综合混合扩张卷积(CHDC)模块,它有助于多尺度信息的传输。基于CHDC模块和注意力机制,我们设计了一种新颖的双扩张门控注意力(DDGA)块,从多尺度方面增强相关区域的显著性。此外,设计了一个扩张密集(DD)块来扩大感受野。进行了消融研究以验证我们提出的块。此外,通过关键块的注意力权重图可视化分析了HDA-Net的可解释性。与包括CA-Net、DeepLabV3+和注意力U-Net在内的当前最先进方法相比,HDA-Net表现显著更优,在三个公开可用的医学图像数据集:MAEDE-MAFTOUNI(COVID-19 CT)、ISIC-2018(黑色素瘤皮肤镜检查)和Kvasir-SEG(胃肠道疾病息肉)上,Dice、平均对称表面距离(ASSD)和平均交并比(mIoU)指标分别达到93.16%、93.63%和94.72%,0.36像素、0.69像素和0.52像素,以及88.03%、88.67%和90.33%。