School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), Shenzhen, 518055, China; Department of Electronic Information Engineering, Beihang University, Beijing, 100191, China.
Med Image Anal. 2024 Oct;97:103288. doi: 10.1016/j.media.2024.103288. Epub 2024 Jul 29.
Automatic polyp segmentation in endoscopic images is critical for the early diagnosis of colorectal cancer. Despite the availability of powerful segmentation models, two challenges still impede the accuracy of polyp segmentation algorithms. First, during a colonoscopy, physicians frequently adjust the orientation of the colonoscope tip to capture underlying lesions, resulting in viewpoint changes in the colonoscopy images. These variations increase the diversity of polyp appearance, making it difficult to learn robust polyp features. Second, polyps often exhibit properties similar to the surrounding tissue, leading to indistinct polyp boundaries. To address these problems, we propose a viewpoint-aware framework named VANet for precise polyp segmentation. In VANet, polyps are treated as a discriminative cue and can therefore be localized by class activation maps during a viewpoint classification process. Guided by these polyp locations, we design a viewpoint-aware Transformer (VAFormer) to alleviate the erosion of attention by the surrounding tissue, thereby inducing better polyp representations. Additionally, to enhance the network's perception of polyp boundaries, we develop a boundary-aware Transformer (BAFormer) that encourages self-attention towards uncertain regions. Together, the two modules calibrate predictions and significantly improve polyp segmentation performance. Extensive experiments on seven public datasets across six metrics show that our method achieves state-of-the-art results and that VANet handles colonoscopy images effectively in real-world scenarios. The source code is available at https://github.com/1024803482/Viewpoint-Aware-Network.
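The abstract does not give implementation details for VAFormer or BAFormer. As a rough illustration of the general idea only, the sketch below shows how a CAM-derived polyp-location prior and a boundary-uncertainty map could be injected as additive biases on self-attention logits. The module name PriorBiasedAttention, the boundary_uncertainty helper, the additive-bias formulation, and all shapes are assumptions for illustration and are not the paper's actual design.

```python
# Minimal sketch (not the authors' implementation): biasing self-attention
# over flattened patch tokens with (a) a coarse location prior such as a
# class activation map and (b) a boundary-uncertainty map. All names and
# the additive-bias formulation are hypothetical.

import torch
import torch.nn as nn


class PriorBiasedAttention(nn.Module):
    """Single-head self-attention whose logits are shifted by a spatial prior.

    Key tokens with a high prior value (likely polyp, or uncertain boundary)
    receive a larger additive bias, so every query attends to them more.
    """

    def __init__(self, dim: int, bias_scale: float = 1.0):
        super().__init__()
        self.qkv = nn.Linear(dim, dim * 3)
        self.proj = nn.Linear(dim, dim)
        self.scale = dim ** -0.5
        self.bias_scale = bias_scale

    def forward(self, x: torch.Tensor, prior: torch.Tensor) -> torch.Tensor:
        # x: (B, N, C) flattened patch tokens; prior: (B, N), values in [0, 1]
        q, k, v = self.qkv(x).chunk(3, dim=-1)
        logits = q @ k.transpose(-2, -1) * self.scale            # (B, N, N)
        logits = logits + self.bias_scale * prior.unsqueeze(1)   # bias the key columns
        attn = logits.softmax(dim=-1)
        return self.proj(attn @ v)


def boundary_uncertainty(coarse_logits: torch.Tensor) -> torch.Tensor:
    """Turn a coarse segmentation map into an uncertainty prior:
    highest where the foreground probability is near 0.5 (ambiguous boundary)."""
    p = torch.sigmoid(coarse_logits)        # (B, 1, H, W)
    return 1.0 - 2.0 * (p - 0.5).abs()      # 1 at p = 0.5, 0 at p in {0, 1}


if __name__ == "__main__":
    B, H, W, C = 2, 16, 16, 64
    tokens = torch.randn(B, H * W, C)

    # Hypothetical CAM-derived polyp prior and coarse prediction.
    cam = torch.rand(B, 1, H, W)
    coarse = torch.randn(B, 1, H, W)

    attn = PriorBiasedAttention(dim=C)
    out_loc = attn(tokens, cam.flatten(2).squeeze(1))                                 # location prior
    out_bnd = attn(tokens, boundary_uncertainty(coarse).flatten(2).squeeze(1))        # boundary prior
    print(out_loc.shape, out_bnd.shape)  # torch.Size([2, 256, 64]) twice
```

In this reading, the same attention mechanism serves both roles: feeding it a location prior steers attention toward the suspected polyp region, while feeding it a boundary-uncertainty prior concentrates attention on ambiguous pixels near the polyp contour.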