Mubashar Mehreen, Ali Hazrat, Grönlund Christer, Azmat Shoaib
Present Address: Department of Electrical and Computer Engineering, COMSATS University Islamabad, Abbottabad Campus, Abbottabad, Pakistan.
Department of Radiation Sciences, Umeå University, Umeå, Sweden.
Neural Comput Appl. 2022;34(20):17723-17739. doi: 10.1007/s00521-022-07419-7. Epub 2022 Jun 3.
U-Net is a widely adopted neural network in the domain of medical image segmentation. Despite its quick embracement by the medical imaging community, its performance suffers on complicated datasets. The problem can be ascribed to its simple feature extracting blocks: encoder/decoder, and the semantic gap between encoder and decoder. Variants of U-Net (such as R2U-Net) have been proposed to address the problem of simple feature extracting blocks by making the network deeper, but it does not deal with the semantic gap problem. On the other hand, another variant UNET++ deals with the semantic gap problem by introducing dense skip connections but has simple feature extraction blocks. To overcome these issues, we propose a new U-Net based medical image segmentation architecture R2U++. In the proposed architecture, the adapted changes from vanilla U-Net are: (1) the plain convolutional backbone is replaced by a deeper recurrent residual convolution block. The increased field of view with these blocks aids in extracting crucial features for segmentation which is proven by improvement in the overall performance of the network. (2) The semantic gap between encoder and decoder is reduced by dense skip pathways. These pathways accumulate features coming from multiple scales and apply concatenation accordingly. The modified architecture has embedded multi-depth models, and an ensemble of outputs taken from varying depths improves the performance on foreground objects appearing at various scales in the images. The performance of R2U++ is evaluated on four distinct medical imaging modalities: electron microscopy, X-rays, fundus, and computed tomography. The average gain achieved in IoU score is 1.5 ± 0.37% and in dice score is 0.9 ± 0.33% over UNET++, whereas, 4.21 ± 2.72 in IoU and 3.47 ± 1.89 in dice score over R2U-Net across different medical imaging segmentation datasets.
U-Net是医学图像分割领域中广泛采用的神经网络。尽管它很快被医学成像界所接受,但其在复杂数据集上的性能不佳。这个问题可以归因于其简单的特征提取模块:编码器/解码器,以及编码器和解码器之间的语义鸿沟。已经提出了U-Net的变体(如R2U-Net)来通过使网络更深来解决简单特征提取模块的问题,但它没有解决语义鸿沟问题。另一方面,另一个变体UNET++通过引入密集跳跃连接来解决语义鸿沟问题,但具有简单的特征提取模块。为了克服这些问题,我们提出了一种基于U-Net的新的医学图像分割架构R2U++。在所提出的架构中,相对于原始U-Net的适应性变化包括:(1)将普通卷积主干替换为更深的循环残差卷积块。这些块增加的视野有助于提取用于分割的关键特征,这通过网络整体性能的提高得到了证明。(2)通过密集跳跃路径减少编码器和解码器之间的语义鸿沟。这些路径累积来自多个尺度的特征并相应地应用拼接。修改后的架构嵌入了多深度模型,并且从不同深度获取的输出集合提高了在图像中不同尺度出现的前景对象的性能。R2U++的性能在四种不同的医学成像模态上进行了评估:电子显微镜、X射线、眼底和计算机断层扫描。在不同的医学成像分割数据集上,相对于UNET++,IoU分数的平均增益为1.5±0.37%,骰子分数的平均增益为0.9±0.33%;相对于R2U-Net,IoU分数为4.21±2.72,骰子分数为3.47±1.89。