NA-segformer：一种基于邻域注意力的多层次 Transformer 模型，用于结肠镜下息肉分割。

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation.

机构信息

Hunan Engineering Research Center of Advanced Embedded Computing and Intelligent Medical Systems, Xiangnan University, Chenzhou, 423300, China.

School of Computer and Artificial Intelligence, Xiangnan University, Chenzhou, 423300, China.

出版信息

Sci Rep. 2024 Sep 28;14(1):22527. doi: 10.1038/s41598-024-74123-y.

DOI:10.1038/s41598-024-74123-y

PMID:39342011

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438879/

Abstract

In various countries worldwide, the incidence of colon cancer-related deaths has been on the rise in recent years. Early detection of symptoms and identification of intestinal polyps are crucial for improving the cure rate of colon cancer patients. Automated computer-aided diagnosis (CAD) has emerged as a solution to the low efficiency of traditional methods relying on manual diagnosis by physicians. Deep learning is the latest direction of CAD development and has shown promise for colonoscopic polyp segmentation. In this paper, we present a multi-level encoder-decoder architecture for polyp segmentation based on the Transformer architecture, termed NA-SegFormer. To improve the performance of existing Transformer-based segmentation algorithms for edge segmentation on colon polyps, we propose a patch merging module with a neighbor attention mechanism based on overlap patch merging. Since colon tract polyps vary greatly in size and different datasets have different sample sizes, we used a unified focal loss to solve the problem of category imbalance in colon tract polyp data. To assess the effectiveness of our proposed method, we utilized video capsule endoscopy and typical colonoscopy polyp datasets, as well as a dataset containing surgical equipment. On the datasets Kvasir-SEG, Kvasir-Instrument and KvasirCapsule-SEG, the Dice score of our proposed model reached 94.30%, 94.59% and 82.73%, with an accuracy of 98.26%, 99.02% and 81.84% respectively. The proposed method achieved inference speed with an Frame-per-second (FPS) of 125.01. The results demonstrated that our suggested model effectively segmented polyps better than several well-known and latest models. In addition, the proposed method has advantages in trade-off between inference speed and accuracy, and it will be of great significance to real-time colonoscopic polyp segmentation. The code is available at https://github.com/promisedong/NAFormer .

摘要

在世界各国，结肠癌相关死亡率近年来呈上升趋势。早期发现症状和识别肠息肉对于提高结肠癌患者的治愈率至关重要。自动化计算机辅助诊断 (CAD) 已成为解决传统方法依靠医生手动诊断效率低下的一种解决方案。深度学习是 CAD 发展的最新方向，已显示出在结肠镜息肉分割方面的应用前景。在本文中，我们提出了一种基于 Transformer 架构的用于息肉分割的多层次编码器-解码器架构，称为 NA-SegFormer。为了提高现有的基于 Transformer 的分割算法对结肠息肉边缘分割的性能，我们提出了一种基于重叠补丁合并的带有邻居注意力机制的补丁合并模块。由于结肠管腔息肉大小差异很大，并且不同数据集的样本大小不同，我们使用统一的焦点损失来解决结肠管腔息肉数据中类别不平衡的问题。为了评估我们提出的方法的有效性，我们利用了视频胶囊内窥镜和典型结肠镜息肉数据集，以及包含手术设备的数据集。在 Kvasir-SEG、Kvasir-Instrument 和 KvasirCapsule-SEG 数据集上，我们提出的模型的 Dice 分数分别达到了 94.30%、94.59%和 82.73%，准确率分别达到了 98.26%、99.02%和 81.84%。该方法的推断速度达到了 125.01 帧每秒 (FPS)。结果表明，我们提出的模型能够更好地分割息肉，优于几个知名和最新的模型。此外，该方法在推断速度和准确性之间的权衡方面具有优势，对于实时结肠镜息肉分割具有重要意义。代码可在 https://github.com/promisedong/NAFormer 获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/99f0/11438879/9c64e6fba11e/41598_2024_74123_Fig1_HTML.jpg

相似文献

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation.

Sci Rep. 2024 Sep 28;14(1):22527. doi: 10.1038/s41598-024-74123-y.

Multi-scale nested UNet with transformer for colorectal polyp segmentation.

J Appl Clin Med Phys. 2024 Jun;25(6):e14351. doi: 10.1002/acm2.14351. Epub 2024 Mar 29.

Dual-branch multi-information aggregation network with transformer and convolution for polyp segmentation.

Comput Biol Med. 2024 Jan;168:107760. doi: 10.1016/j.compbiomed.2023.107760. Epub 2023 Nov 30.

A lighter hybrid feature fusion framework for polyp segmentation.

Sci Rep. 2024 Oct 5;14(1):23179. doi: 10.1038/s41598-024-72763-8.

SR-AttNet: An Interpretable Stretch-Relax Attention based Deep Neural Network for Polyp Segmentation in Colonoscopy Images.

Comput Biol Med. 2023 Jun;160:106945. doi: 10.1016/j.compbiomed.2023.106945. Epub 2023 Apr 21.

PolypSegNet: A modified encoder-decoder architecture for automated polyp segmentation from colonoscopy images.

Comput Biol Med. 2021 Jan;128:104119. doi: 10.1016/j.compbiomed.2020.104119. Epub 2020 Nov 13.

UViT-Seg: An Efficient ViT and U-Net-Based Framework for Accurate Colorectal Polyp Segmentation in Colonoscopy and WCE Images.

J Imaging Inform Med. 2024 Oct;37(5):2354-2374. doi: 10.1007/s10278-024-01124-8. Epub 2024 Apr 26.

CTNet: Contrastive Transformer Network for Polyp Segmentation.

IEEE Trans Cybern. 2024 Sep;54(9):5040-5053. doi: 10.1109/TCYB.2024.3368154. Epub 2024 Aug 26.

DLGRAFE-Net: A double loss guided residual attention and feature enhancement network for polyp segmentation.

PLoS One. 2024 Sep 12;19(9):e0308237. doi: 10.1371/journal.pone.0308237. eCollection 2024.

Iterative feedback-based models for image and video polyp segmentation.

Comput Biol Med. 2024 Jul;177:108569. doi: 10.1016/j.compbiomed.2024.108569. Epub 2024 May 11.

本文引用的文献

TransUNet: Rethinking the U-Net architecture design for medical image segmentation through the lens of transformers.

Med Image Anal. 2024 Oct;97:103280. doi: 10.1016/j.media.2024.103280. Epub 2024 Jul 22.

CTNet: Contrastive Transformer Network for Polyp Segmentation.

IEEE Trans Cybern. 2024 Sep;54(9):5040-5053. doi: 10.1109/TCYB.2024.3368154. Epub 2024 Aug 26.

Automated classification of polyps using deep learning architectures and few-shot learning.

BMC Med Imaging. 2023 Apr 20;23(1):59. doi: 10.1186/s12880-023-01007-4.

MISSFormer: An Effective Transformer for 2D Medical Image Segmentation.

IEEE Trans Med Imaging. 2023 May;42(5):1484-1494. doi: 10.1109/TMI.2022.3230943. Epub 2023 May 2.

A Real-Time Polyp-Detection System with Clinical Application in Colonoscopy Using Deep Convolutional Neural Networks.

J Imaging. 2023 Jan 24;9(2):26. doi: 10.3390/jimaging9020026.

Cancer statistics, 2023.

CA Cancer J Clin. 2023 Jan;73(1):17-48. doi: 10.3322/caac.21763.

Frame-by-Frame Analysis of a Commercially Available Artificial Intelligence Polyp Detection System in Full-Length Colonoscopies.

Digestion. 2022;103(5):378-385. doi: 10.1159/000525345. Epub 2022 Jun 29.

Fast machine learning annotation in the medical domain: a semi-automated video annotation tool for gastroenterologists.

Biomed Eng Online. 2022 May 25;21(1):33. doi: 10.1186/s12938-022-01001-x.

Development and evaluation of a deep learning model to improve the usability of polyp detection systems during interventions.

United European Gastroenterol J. 2022 Jun;10(5):477-484. doi: 10.1002/ueg2.12235. Epub 2022 May 5.

Non-equivalent images and pixels: Confidence-aware resampling with meta-learning mixup for polyp segmentation.

Med Image Anal. 2022 May;78:102394. doi: 10.1016/j.media.2022.102394. Epub 2022 Feb 18.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

NA-segformer：一种基于邻域注意力的多层次 Transformer 模型，用于结肠镜下息肉分割。

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation.

机构信息

Hunan Engineering Research Center of Advanced Embedded Computing and Intelligent Medical Systems, Xiangnan University, Chenzhou, 423300, China.

School of Computer and Artificial Intelligence, Xiangnan University, Chenzhou, 423300, China.

出版信息

Sci Rep. 2024 Sep 28;14(1):22527. doi: 10.1038/s41598-024-74123-y.

DOI:10.1038/s41598-024-74123-y

PMID:39342011

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11438879/

Abstract

摘要

NA-segformer：一种基于邻域注意力的多层次 Transformer 模型，用于结肠镜下息肉分割。

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation.

机构信息

出版信息

相似文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

NA-segformer：一种基于邻域注意力的多层次 Transformer 模型，用于结肠镜下息肉分割。

NA-segformer: A multi-level transformer model based on neighborhood attention for colonoscopic polyp segmentation.

机构信息

出版信息

相似文献

本文引用的文献