Zhou Yinghong, Xie Yiying, Cai Nian, Liang Yuchen, Gong Ruifeng, Wang Ping
School of Information Engineering, Guangdong University of Technology, Guangzhou, 510006, China.
Department of Hepatobiliary Surgery in the First Affiliated Hospital of Guangzhou Medical University, Guangzhou, 510120, China.
Med Biol Eng Comput. 2025 Jan;63(1):127-138. doi: 10.1007/s11517-024-03183-z. Epub 2024 Aug 23.
Image segmentation is a key step of the 3D reconstruction of the hepatobiliary duct tree, which is significant for preoperative planning. In this paper, a novel 3D U-Net variant is designed for CT image segmentation of hepatobiliary ducts from the abdominal CT scans, which is composed of a 3D encoder-decoder and a 3D multi-feedforward self-attention module (MFSAM). To well sufficient semantic and spatial features with high inference speed, the 3D ConvNeXt block is designed as the 3D extension of the 2D ConvNeXt. To improve the ability of semantic feature extraction, the MFSAM is designed to transfer the semantic and spatial features at different scales from the encoder to the decoder. Also, to balance the losses for the voxels and the edges of the hepatobiliary ducts, a boundary-aware overlap cross-entropy loss is proposed by combining the cross-entropy loss, the Dice loss, and the boundary loss. Experimental results indicate that the proposed method is superior to some existing deep networks as well as the radiologist without rich experience in terms of CT segmentation of hepatobiliary ducts, with a segmentation performance of 76.54% Dice and 6.56 HD.
图像分割是肝胆管树三维重建的关键步骤,对术前规划具有重要意义。本文设计了一种新颖的3D U-Net变体,用于从腹部CT扫描中对肝胆管进行CT图像分割,它由一个3D编码器-解码器和一个3D多前馈自注意力模块(MFSAM)组成。为了以高推理速度充分利用语义和空间特征,将3D ConvNeXt块设计为2D ConvNeXt的3D扩展。为了提高语义特征提取能力,设计MFSAM将不同尺度的语义和空间特征从编码器传递到解码器。此外,为了平衡肝胆管体素和边缘的损失,通过结合交叉熵损失、Dice损失和边界损失,提出了一种边界感知重叠交叉熵损失。实验结果表明,所提出的方法在肝胆管CT分割方面优于一些现有的深度网络以及经验不足的放射科医生,分割性能为Dice 76.54%和HD 6.56。