Lijin P, Ullah Mohib, Vats Anuja, Cheikh Faouzi Alaya, Santhosh Kumar G, Nair Madhu S
Artificial Intelligence and Computer Vision Lab, Department of Computer Science, Cochin University of Science and Technology, Kochi, Kerala 682022 India.
Norwegian University of Science and Technology, Teknologivegen 22, 2815 Gjøvik, Norway.
Biomed Eng Lett. 2024 Aug 20;14(6):1421-1431. doi: 10.1007/s13534-024-00415-x. eCollection 2024 Nov.
Colorectal cancer ranks as the second most prevalent cancer worldwide, with a high mortality rate. Colonoscopy stands as the preferred procedure for diagnosing colorectal cancer. Detecting polyps at an early stage is critical for effective prevention and diagnosis. However, challenges in colonoscopic procedures often lead medical practitioners to seek support from alternative techniques for timely polyp identification. Polyp segmentation emerges as a promising approach to identify polyps in colonoscopy images. In this paper, we propose an advanced method, PolySegNet, that leverages both Vision Transformer and Swin Transformer, coupled with a Convolutional Neural Network (CNN) decoder. The fusion of these models facilitates a comprehensive analysis of various modules in our proposed architecture.To assess the performance of PolySegNet, we evaluate it on three colonoscopy datasets, a combined dataset, and their augmented versions. The experimental results demonstrate that PolySegNet achieves competitive results in terms of polyp segmentation accuracy and efficacy, achieving a mean Dice score of 0.92 and a mean Intersection over Union (IoU) of 0.86. These metrics highlight the superior performance of PolySegNet in accurately delineating polyp boundaries compared to existing methods. PolySegNet has shown great promise in accurately and efficiently segmenting polyps in medical images. The proposed method could be the foundation for a new class of transformer-based segmentation models in medical image analysis.
结直肠癌是全球第二大常见癌症,死亡率很高。结肠镜检查是诊断结直肠癌的首选方法。早期检测息肉对于有效预防和诊断至关重要。然而,结肠镜检查过程中的挑战常常促使医生寻求替代技术的支持,以便及时识别息肉。息肉分割成为一种在结肠镜图像中识别息肉的有前景的方法。在本文中,我们提出了一种先进的方法PolySegNet,它利用视觉Transformer和Swin Transformer,并结合卷积神经网络(CNN)解码器。这些模型的融合有助于对我们提出的架构中的各个模块进行全面分析。为了评估PolySegNet的性能,我们在三个结肠镜检查数据集、一个组合数据集及其增强版本上对其进行评估。实验结果表明,PolySegNet在息肉分割的准确性和有效性方面取得了有竞争力的结果,平均Dice分数为0.92,平均交并比(IoU)为0.86。这些指标突出了PolySegNet与现有方法相比在准确描绘息肉边界方面的卓越性能。PolySegNet在准确高效地分割医学图像中的息肉方面显示出了巨大的潜力。所提出的方法可能成为医学图像分析中一类基于Transformer的新分割模型的基础。