School of Life & Environmental Science, Guilin University of Electronic Technology, Guilin, 541004, China.
State Key Laboratory of Oncology in South China, Sun Yat-sen University Cancer Center, Collaborative Innovation Center for Cancer Medicine, Guangzhou, 510060, China.
Comput Biol Med. 2024 Mar;171:108120. doi: 10.1016/j.compbiomed.2024.108120. Epub 2024 Feb 6.
The blurriness of boundaries in medical image target regions hinders further improvement in automatic segmentation accuracy and is a challenging problem. To address this issue, we propose a model called long-distance perceptual UNet (LD-UNet), which has a powerful long-distance perception ability and can effectively perceive the semantic context of an entire image. Specifically, LD-UNet utilizes global and local long-distance induction modules, which endow the model with contextual semantic induction capabilities for long-distance feature dependencies. The modules perform long-distance semantic perception at the high and low stages of LD-UNet, respectively, effectively improving the accuracy of local blurred information assessment. We also propose a top-down deep supervision method to enhance the ability of the model to fit data. Then, extensive experiments on four types of tumor data with blurred boundaries are conducted. The dataset includes nasopharyngeal carcinoma, esophageal carcinoma, pancreatic carcinoma, and colorectal carcinoma. The dice similarity coefficient scores obtained by LD-UNet on the four datasets are 73.35%, 85.93%, 70.04%, and 82.71%. Experimental results demonstrate that LD-UNet is more effective in improving the segmentation accuracy of blurred boundary regions than other methods with long-distance perception, such as transformers. Among all models, LD-UNet achieves the best performance. By visualizing the feature dependency field of the models, we further explore the advantages of LD-UNet in segmenting blurred boundaries.
医学图像目标区域边界的模糊性阻碍了自动分割精度的进一步提高,是一个具有挑战性的问题。为了解决这个问题,我们提出了一种名为长距离感知 U-Net(LD-UNet)的模型,它具有强大的长距离感知能力,可以有效地感知整个图像的语义上下文。具体来说,LD-UNet 利用全局和局部长距离感应模块,赋予模型对长距离特征依赖性的上下文语义感应能力。这些模块分别在 LD-UNet 的高低阶段进行长距离语义感知,有效地提高了对局部模糊信息的评估精度。我们还提出了一种自上而下的深度监督方法,以增强模型对数据的拟合能力。然后,我们在具有模糊边界的四种类型的肿瘤数据上进行了广泛的实验。数据集包括鼻咽癌、食管癌、胰腺癌和结直肠癌。LD-UNet 在这四个数据集上的 Dice 相似系数得分为 73.35%、85.93%、70.04%和 82.71%。实验结果表明,LD-UNet 在提高模糊边界区域的分割精度方面比具有长距离感知的其他方法(如变压器)更有效。在所有模型中,LD-UNet 的性能最佳。通过可视化模型的特征依赖字段,我们进一步探讨了 LD-UNet 在分割模糊边界方面的优势。