Wu Wei, Yang Xin, Yao Chenggui, Liu Ou, Zhao Qi, Shuai Jianwei
School of Computer Science and Software Engineering, University of Science and Technology Liaoning, Anshan 114051, China.
College of Data Science, Jiaxing University, Jiaxing 314000, China.
Research (Wash D C). 2025 Sep 3;8:0869. doi: 10.34133/research.0869. eCollection 2025.
U-structure has become a foundational approach in medical image segmentation, consistently demonstrating strong performance across various segmentation tasks. Most current models are based on this framework, customizing encoder-decoder components to achieve higher accuracy across various segmentation challenges. However, this often comes at the cost of increased parameter counts, which inevitably limit their practicality in real-world applications. In this study, we provide an E-shaped segmentation framework that discards the traditional step-by-step resolution recovery decoding process, instead directly aggregating multi-scale features extracted by the encoder at each stage for deep cross-level integration. Additionally, we propose an innovative multi-scale large-kernel convolution (MLKConv) module, designed to enhance high-level feature representation by effectively capturing both local and global contextual information. Compared to U-structure, the proposed E-structured approach substantially reduces parameters while delivering superior performance, especially in complex segmentation tasks. Based on this structure, we develop 2 segmentation networks specifically for 2-dimensional (2D) and 3D medical images. 2D E-SegNet is evaluated on four 2D segmentation benchmark datasets (Synapse multi-organ, ACDC, Kvasir-Seg, and BUSI), while 3D E-SegNet is assessed on four 3D segmentation benchmark datasets (Synapse, ACDC, NIH Pancreas, and Lung). Experimental results demonstrate that our approach outperforms the current leading U-shaped models across multiple datasets, achieving new state-of-the-art (SOTA) performance with fewer parameters. In summary, our research introduces a novel approach to medical image segmentation, offering potential improvements and contributing to ongoing advancements in the field. Our code is publicly available on https://github.com/zhaoqi106/E-SegNet.
U结构已成为医学图像分割的一种基础方法,在各种分割任务中始终表现出强大的性能。当前大多数模型都基于此框架,通过定制编码器-解码器组件来在各种分割挑战中实现更高的精度。然而,这通常是以增加参数数量为代价的,这不可避免地限制了它们在实际应用中的实用性。在本研究中,我们提供了一种E形分割框架,该框架摒弃了传统的逐步分辨率恢复解码过程,而是直接聚合编码器在每个阶段提取的多尺度特征以进行深度跨层集成。此外,我们提出了一种创新的多尺度大内核卷积(MLKConv)模块,旨在通过有效捕获局部和全局上下文信息来增强高级特征表示。与U结构相比,所提出的E形方法在大幅减少参数的同时还能提供卓越的性能,尤其是在复杂的分割任务中。基于这种结构,我们专门为二维(2D)和三维(3D)医学图像开发了2个分割网络。2D E-SegNet在四个2D分割基准数据集(Synapse多器官、ACDC、Kvasir-Seg和BUSI)上进行评估,而3D E-SegNet在四个3D分割基准数据集(Synapse、ACDC、NIH胰腺和肺)上进行评估。实验结果表明,我们的方法在多个数据集上优于当前领先的U形模型,以更少的参数实现了新的最优(SOTA)性能。总之,我们的研究引入了一种新颖的医学图像分割方法,提供了潜在的改进,并为该领域的持续发展做出了贡献。我们的代码可在https://github.com/zhaoqi106/E-SegNet上公开获取。