Zhang Chenghao, Wang Lingfei, Zhang Chunyu, Zhang Yu, Wang Peng, Li Jin
College of Intelligent Systems Science and Engineering, Harbin Engineering University, Harbin 150001, China.
College of Computer and Control Engineering, Qiqihar University, Qiqihar 161006, China.
Bioengineering (Basel). 2025 Jun 11;12(6):636. doi: 10.3390/bioengineering12060636.
Semantic segmentation plays a critical role in medical image analysis, offering indispensable information for the diagnosis and treatment planning of liver diseases. However, due to the complex anatomical structure of the liver and significant inter-patient variability, the current methods exhibit notable limitations in feature extraction and fusion, which pose a major challenge to achieving accurate liver segmentation. To address these challenges, this study proposes an improved U-Net-based liver semantic segmentation method that enhances segmentation performance through optimized feature extraction and fusion mechanisms. Firstly, a multi-scale input strategy is employed to account for the variability in liver features at different scales. A multi-scale convolutional attention (MSCA) mechanism is integrated into the encoder to aggregate multi-scale information and improve feature representation. Secondly, an atrous spatial pyramid pooling (ASPP) module is incorporated into the bottleneck layer to capture features at various receptive fields using dilated convolutions, while global pooling is applied to enhance the acquisition of contextual information and ensure efficient feature transmission. Furthermore, a Channel Transformer module replaces the traditional skip connections to strengthen the interaction and fusion between encoder and decoder features, thereby reducing the semantic gap. The effectiveness of this method was validated on integrated public datasets, achieving an Intersection over Union (IoU) of 0.9315 for liver segmentation tasks, outperforming other mainstream approaches. This provides a novel solution for precise liver image segmentation and holds significant clinical value for liver disease diagnosis and treatment.
语义分割在医学图像分析中起着关键作用,为肝脏疾病的诊断和治疗规划提供不可或缺的信息。然而,由于肝脏复杂的解剖结构和患者之间显著的变异性,当前方法在特征提取和融合方面存在明显局限性,这对实现准确的肝脏分割构成了重大挑战。为应对这些挑战,本研究提出了一种基于改进U-Net的肝脏语义分割方法,通过优化特征提取和融合机制来提高分割性能。首先,采用多尺度输入策略来考虑不同尺度下肝脏特征的变异性。将多尺度卷积注意力(MSCA)机制集成到编码器中,以聚合多尺度信息并改善特征表示。其次,在瓶颈层引入空洞空间金字塔池化(ASPP)模块,使用扩张卷积捕获不同感受野的特征,同时应用全局池化来增强上下文信息的获取并确保高效的特征传递。此外,通道变换器模块取代传统的跳跃连接,以加强编码器和解码器特征之间的交互和融合,从而缩小语义差距。该方法的有效性在综合公共数据集上得到验证,在肝脏分割任务中实现了0.9315的交并比(IoU),优于其他主流方法。这为精确的肝脏图像分割提供了一种新的解决方案,对肝脏疾病的诊断和治疗具有重要的临床价值。