Wang Yongtao, Tian Shengwei, Yu Long, Wu Weidong, Zhang Dezhi, Wang Junwen, Cheng Junlong
College of Software Engineering, Xinjiang University, Urumqi, Xinjiang, China.
Key Laboratory of Software Engineering Technology, Xinjiang University, Urumqi, Xinjiang, China.
Technol Health Care. 2023;31(1):181-195. doi: 10.3233/THC-220174.
The results of medical image segmentation can provide reliable evidence for clinical diagnosis and treatment. The U-Net proposed previously has been widely used in the field of medical image segmentation. Its encoder extracts semantic features of different scales at different stages, but does not carry out special processing for semantic features of each scale.
To improve the feature expression ability and segmentation performance of U-Net, we proposed a feature supplement and optimization U-Net (FSOU-Net).
First, we put forward the view that semantic features of different scales should be treated differently. Based on this view, we classify the semantic features automatically extracted by encoders into two categories: shallow semantic features and deep semantic features. Then, we propose the shallow feature supplement module (SFSM), which obtains fine-grained semantic features through up-sampling to supplement the shallow semantic information. Finally, we propose the deep feature optimization module (DFOM), which uses the expansive convolution of different receptive fields to obtain multi-scale features and then performs multi-scale feature fusion to optimize the deep semantic information.
The proposed model is experimented on three medical image segmentation public datasets, and the experimental results prove the correctness of the proposed idea. The segmentation performance of the model is higher than the advanced models for medical image segmentation. Compared with baseline network U-NET, the main index of Dice index is 0.75% higher on the RITE dataset, 2.3% higher on the Kvasir-SEG dataset, and 0.24% higher on the GlaS dataset.
The proposed method can greatly improve the feature representation ability and segmentation performance of the model.
医学图像分割的结果可为临床诊断和治疗提供可靠依据。先前提出的U-Net已在医学图像分割领域得到广泛应用。其编码器在不同阶段提取不同尺度的语义特征,但未对各尺度的语义特征进行特殊处理。
为提高U-Net的特征表达能力和分割性能,我们提出了一种特征补充与优化的U-Net(FSOU-Net)。
首先,我们提出应区别对待不同尺度的语义特征这一观点。基于此观点,我们将编码器自动提取的语义特征分为两类:浅层语义特征和深层语义特征。然后,我们提出浅层特征补充模块(SFSM),通过上采样获得细粒度语义特征以补充浅层语义信息。最后,我们提出深层特征优化模块(DFOM),利用不同感受野的扩张卷积获取多尺度特征,然后进行多尺度特征融合以优化深层语义信息。
所提出的模型在三个医学图像分割公共数据集上进行了实验,实验结果证明了所提想法的正确性。该模型的分割性能高于医学图像分割的先进模型。与基线网络U-Net相比,在RITE数据集上,Dice指数的主要指标高出0.75%,在Kvasir-SEG数据集上高出2.3%,在GlaS数据集上高出0.24%。
所提出的方法可大幅提高模型的特征表示能力和分割性能。