IEEE Trans Cybern. 2024 Sep;54(9):5040-5053. doi: 10.1109/TCYB.2024.3368154. Epub 2024 Aug 26.
Segmenting polyps from colonoscopy images is very important in clinical practice since it provides valuable information for colorectal cancer. However, polyp segmentation remains a challenging task as polyps have camouflage properties and vary greatly in size. Although many polyp segmentation methods have been recently proposed and produced remarkable results, most of them cannot yield stable results due to the lack of features with distinguishing properties and those with high-level semantic details. Therefore, we proposed a novel polyp segmentation framework called contrastive Transformer network (CTNet), with three key components of contrastive Transformer backbone, self-multiscale interaction module (SMIM), and collection information module (CIM), which has excellent learning and generalization abilities. The long-range dependence and highly structured feature map space obtained by CTNet through contrastive Transformer can effectively localize polyps with camouflage properties. CTNet benefits from the multiscale information and high-resolution feature maps with high-level semantic obtained by SMIM and CIM, respectively, and thus can obtain accurate segmentation results for polyps of different sizes. Without bells and whistles, CTNet yields significant gains of 2.3%, 3.7%, 3.7%, 18.2%, and 10.1% over classical method PraNet on Kvasir-SEG, CVC-ClinicDB, Endoscene, ETIS-LaribPolypDB, and CVC-ColonDB respectively. In addition, CTNet has advantages in camouflaged object detection and defect detection. The code is available at https://github.com/Fhujinwu/CTNet.
从结肠镜图像中分割息肉在临床实践中非常重要,因为它为结直肠癌提供了有价值的信息。然而,由于息肉具有伪装特性并且大小差异很大,因此息肉分割仍然是一项具有挑战性的任务。尽管最近已经提出了许多息肉分割方法并取得了显著的成果,但由于缺乏具有区分特性和具有高级语义细节的特征,大多数方法都无法产生稳定的结果。因此,我们提出了一种名为对比 Transformer 网络(CTNet)的新型息肉分割框架,它具有三个关键组件:对比 Transformer 骨干、自多尺度交互模块(SMIM)和集合信息模块(CIM),具有出色的学习和泛化能力。CTNet 通过对比 Transformer 获得的长程依赖关系和高度结构化的特征图空间可以有效地定位具有伪装特性的息肉。CTNet 受益于 SMIM 和 CIM 分别获得的多尺度信息和具有高级语义的高分辨率特征图,从而可以为不同大小的息肉获得准确的分割结果。没有花里胡哨的东西,CTNet 在 Kvasir-SEG、CVC-ClinicDB、Endoscene、ETIS-LaribPolypDB 和 CVC-ColonDB 上分别比经典方法 PraNet 提高了 2.3%、3.7%、3.7%、18.2%和 10.1%。此外,CTNet 在伪装物体检测和缺陷检测方面具有优势。代码可在 https://github.com/Fhujinwu/CTNet 上获得。