Liu Qi, Zhao Zhenfeng, Wu Yingbo, Wu Siqi, He Yutong, Wang Haibin, Wang Shenwen
School of Information Engineering, Hebei GEO University, Shijiazhuang, 050031, China.
Hebei Huiji Technology Development Co., Ltd, Shijiazhuang, 050000, China.
Sci Rep. 2025 Aug 7;15(1):28895. doi: 10.1038/s41598-025-13636-6.
In the pathological diagnosis of colorectal cancer, the precise segmentation of glandular and cellular contours serves as the fundamental basis for achieving accurate clinical diagnosis. However, this task presents significant challenges due to complex phenomena such as nuclear staining heterogeneity, variations in nuclear size, boundary overlap, and nuclear clustering. With the continuous advancement of deep learning techniques-particularly encoder-decoder architectures-and the emergence of various high-performance functional modules, multi module collaborative fusion has become an effective approach to enhance segmentation performance. To this end, this study proposes the RPAU-Net++ model, which integrates the ResNet-50 encoder (R), the Joint Pyramid Fusion Module (P), and the Convolutional Block Attention Module (A) into the UNet++ framework, forming a multi-module-enhanced segmentation architecture. Specifically, ResNet-50 mitigates gradient vanishing and degradation issues in deep network training through residual skip connections, thereby improving model convergence stability and feature representation depth. JPFM achieves progressive fusion of cross-layer features via a multi-scale feature pyramid, enhancing the encoding capability for complex tissue structures and fine boundary information. CBAM employs adaptive weight allocation in both spatial and channel dimensions to focus on target region features while effectively suppressing irrelevant background noise, thereby improving feature discriminability. Comparative experiments on the GlaS and CoNIC colorectal cancer pathology datasets, as well as the more challenging PanNuke dataset, demonstrate that RPAU-Net++ significantly outperforms mainstream models in key segmentation metrics such as IoU and Dice, providing a more accurate solution for pathological image segmentation in colorectal cancer.
在结直肠癌的病理诊断中,腺轮廓和细胞轮廓的精确分割是实现准确临床诊断的基本依据。然而,由于核染色异质性、核大小变化、边界重叠和核聚集等复杂现象,这项任务面临重大挑战。随着深度学习技术特别是编码器-解码器架构的不断进步以及各种高性能功能模块的出现,多模块协同融合已成为提高分割性能的有效方法。为此,本研究提出了RPAU-Net++模型,该模型将ResNet-50编码器(R)、联合金字塔融合模块(P)和卷积块注意力模块(A)集成到UNet++框架中,形成了一种多模块增强分割架构。具体而言,ResNet-50通过残差跳跃连接减轻了深度网络训练中的梯度消失和退化问题,从而提高了模型收敛稳定性和特征表示深度。JPFM通过多尺度特征金字塔实现跨层特征的渐进融合,增强了对复杂组织结构和精细边界信息的编码能力。CBAM在空间和通道维度上采用自适应权重分配,专注于目标区域特征,同时有效抑制无关背景噪声,从而提高特征可辨别性。在GlaS和CoNIC结直肠癌病理数据集以及更具挑战性的PanNuke数据集上的对比实验表明,RPAU-Net++在IoU和Dice等关键分割指标上显著优于主流模型,为结直肠癌病理图像分割提供了更准确的解决方案。