School of Information Engineering, Southwest University of Science and Technology, Mianyang 621010, China.
Mianyang Central Hospital, School of Medicine, University of Electronic Science and Technology of China, Mianyang 621000, China; NHC Key Laboratory of Nuclear Technology Medical Transformation, Mianyang Central Hospital, Mianyang 621000, China.
Comput Biol Med. 2024 Sep;180:108947. doi: 10.1016/j.compbiomed.2024.108947. Epub 2024 Aug 1.
Recently, ViT and CNNs based on encoder-decoder architecture have become the dominant model in the field of medical image segmentation. However, there are some deficiencies for each of them: (1) It is difficult for CNNs to capture the interaction between two locations with consideration of the longer distance. (2) ViT cannot acquire the interaction of local context information and carries high computational complexity. To optimize the above deficiencies, we propose a new network for medical image segmentation, which is called FCSU-Net. FCSU-Net uses the proposed collaborative fusion of multi-scale feature block that enables the network to obtain more abundant and more accurate features. In addition, FCSU-Net fuses full-scale feature information through the FFF (Full-scale Feature Fusion) structure instead of simple skip connections, and establishes long-range dependencies on multiple dimensions through the CS (Cross-dimension Self-attention) mechanism. Meantime, every dimension is complementary to each other. Also, CS mechanism has the advantage of convolutions capturing local contextual weights. Finally, FCSU-Net is validated on several datasets, and the results show that FCSU-Net not only has a relatively small number of parameters, but also has a leading segmentation performance.
最近,基于编解码器架构的 ViT 和 CNNs 已成为医学图像分割领域的主导模型。然而,它们各自都存在一些缺陷:(1)CNN 很难捕捉两个位置之间的长距离交互。(2)ViT 无法获取局部上下文信息的交互,并且计算复杂度高。为了优化上述缺陷,我们提出了一种新的医学图像分割网络,称为 FCSU-Net。FCSU-Net 使用了我们提出的多尺度特征块的协作融合,使网络能够获得更丰富、更准确的特征。此外,FCSU-Net 通过 FFF(全尺度特征融合)结构融合全尺度特征信息,而不是简单的跳过连接,并通过 CS(跨维度自注意力)机制在多个维度上建立长程依赖关系。同时,每个维度都是相互补充的。此外,CS 机制具有卷积捕捉局部上下文权重的优势。最后,我们在多个数据集上验证了 FCSU-Net,结果表明,FCSU-Net 不仅具有相对较少的参数,而且具有领先的分割性能。