Su Run, Zhang Deyun, Liu Jinhuai, Cheng Chuandong
Institute of Intelligent Machines, Hefei Institutes of Physical Science, Chinese Academy of Sciences, Hefei, China.
Science Island Branch of Graduate School, University of Science and Technology of China, Hefei, China.
Front Genet. 2021 Feb 11;12:639930. doi: 10.3389/fgene.2021.639930. eCollection 2021.
Aiming at the limitation of the convolution kernel with a fixed receptive field and unknown prior to optimal network width in U-Net, multi-scale U-Net (MSU-Net) is proposed by us for medical image segmentation. First, multiple convolution sequence is used to extract more semantic features from the images. Second, the convolution kernel with different receptive fields is used to make features more diverse. The problem of unknown network width is alleviated by efficient integration of convolution kernel with different receptive fields. In addition, the multi-scale block is extended to other variants of the original U-Net to verify its universality. Five different medical image segmentation datasets are used to evaluate MSU-Net. A variety of imaging modalities are included in these datasets, such as electron microscopy, dermoscope, ultrasound, etc. Intersection over Union (IoU) of MSU-Net on each dataset are 0.771, 0.867, 0.708, 0.900, and 0.702, respectively. Experimental results show that MSU-Net achieves the best performance on different datasets. Our implementation is available at https://github.com/CN-zdy/MSU_Net.
针对U-Net中具有固定感受野的卷积核以及最优网络宽度未知的局限性,我们提出了多尺度U-Net(MSU-Net)用于医学图像分割。首先,使用多个卷积序列从图像中提取更多语义特征。其次,使用具有不同感受野的卷积核使特征更加多样化。通过有效整合具有不同感受野的卷积核,缓解了网络宽度未知的问题。此外,将多尺度块扩展到原始U-Net的其他变体以验证其通用性。使用五个不同的医学图像分割数据集来评估MSU-Net。这些数据集中包含了多种成像模态,如电子显微镜、皮肤镜、超声等。MSU-Net在每个数据集上的交并比(IoU)分别为0.771、0.867、0.708、0.900和0.702。实验结果表明,MSU-Net在不同数据集上取得了最佳性能。我们的实现可在https://github.com/CN-zdy/MSU_Net获取。