Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:2123-2127. doi: 10.1109/EMBC48229.2022.9871945.
Recently, convolutional neural network(CNN) has achieved great success in medical image segmentation. However, due to the limitation of convolutional receptive field, the pure convolutional neural network is difficult to further improve its performance. Given the outstanding ability of transformers in extracting the long-range dependency, some works have successfully applied it to computer vision and achieved better results than CNN in some tasks. Based on transformers could remedy the shortage of CNN, in this paper, we propose ITUnet, a segmentation network using CNN and transformers as features extractor. The combination of CNN and transformers enables the network to learn both short- and long-range dependency of features, which is beneficial to segmentation tasks. We evaluate our method on a head-and-neck CT dataset which has 18 kinds of organs to be segmented. The experimental results demonstrate that our proposed method shows better accuracy and robustness, the proposed methods achieve the Dice score of 77.72 and the 95% Hausdorff Distance of 2.31, outperforming the existing methods.
最近,卷积神经网络(CNN)在医学图像分割中取得了巨大成功。然而,由于卷积感受野的限制,纯卷积神经网络难以进一步提高其性能。鉴于变压器在提取长程依赖关系方面的出色能力,一些工作已成功将其应用于计算机视觉,并在某些任务中取得了比 CNN 更好的结果。基于变压器可以弥补 CNN 的不足,在本文中,我们提出了 ITUnet,这是一种使用 CNN 和变压器作为特征提取器的分割网络。CNN 和变压器的结合使网络能够学习特征的短程和长程依赖关系,这有利于分割任务。我们在一个包含 18 种器官的头颈部 CT 数据集上评估了我们的方法。实验结果表明,我们提出的方法具有更好的准确性和鲁棒性,提出的方法的 Dice 得分为 77.72,95%Hausdorff 距离为 2.31,优于现有方法。