IRv2-Net: A Deep Learning Framework for Enhanced Polyp Segmentation Performance Integrating InceptionResNetV2 and UNet Architecture with Test Time Augmentation Techniques.

Author information

Department of Computer Science & Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh.

Department of Electrical & Computer Engineering, Rajshahi University of Engineering & Technology, Rajshahi 6204, Bangladesh.

Publication information

Sensors (Basel). 2023 Sep 7;23(18):7724. doi: 10.3390/s23187724.

DOI: 10.3390/s23187724

PMID: 37765780

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10534485/

Abstract

Colorectal polyps are precancerous growths in the colon or rectum that can progress to colorectal cancer. Accurate segmentation of polyps in medical imaging data is essential for effective diagnosis. However, manual segmentation by endoscopists is time-consuming, error-prone, and expensive, and it leads to a high rate of missed anomalies. To address this problem, an automated diagnostic system based on deep learning is proposed for finding polyps. The proposed IRv2-Net model is built on the UNet architecture with a pre-trained InceptionResNetV2 encoder that extracts features from the input samples. Test Time Augmentation (TTA), which uses the original image together with its horizontally and vertically flipped versions, is applied to obtain precise boundary information and multi-scale image features. The proposed model is compared with numerous state-of-the-art (SOTA) models using several metrics, including accuracy, Dice Similarity Coefficient (DSC), Intersection over Union (IoU), precision, and recall. It is tested on the Kvasir-SEG and CVC-ClinicDB datasets and demonstrates superior performance on unseen real-time data. It achieves the largest area under the Receiver Operating Characteristic curve (ROC-AUC) and the largest area under the Precision-Recall curve (AUC-PR). The model shows excellent qualitative results across different types of polyps, including larger, smaller, over-saturated, sessile, and flat polyps, both within the same dataset and across different datasets. The approach can significantly reduce the number of missed detections. Lastly, a graphical interface is developed for producing the mask in real time. The findings of this study have potential applications in clinical colonoscopy procedures and can serve as a basis for further research and development.
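The flip-based TTA described in the abstract can be illustrated with a minimal sketch. The abstract does not say how the three predictions are combined, so simple averaging is assumed here. The names `predict_with_tta`, `predict_fn`, and `toy_model` are hypothetical and are not taken from the paper; `toy_model` is only a stand-in for a trained segmentation network.

```python
import numpy as np


def predict_with_tta(predict_fn, image):
    """Average predictions over the original image and its two flips.

    predict_fn is a hypothetical callable that maps an (H, W, C) image
    to an (H, W) array of foreground probabilities. Averaging is an
    assumption; the abstract does not specify the aggregation rule.
    """
    # Flips are their own inverse, so the same function maps the
    # prediction back into the original image frame.
    flips = [
        lambda a: a,           # original
        lambda a: a[:, ::-1],  # horizontal flip (reverse columns)
        lambda a: a[::-1, :],  # vertical flip (reverse rows)
    ]
    aligned = []
    for flip in flips:
        prob = predict_fn(flip(image))  # predict on the transformed image
        aligned.append(flip(prob))      # undo the flip so pixels line up
    return np.mean(aligned, axis=0)


def toy_model(image):
    # Stand-in for a trained network: per-pixel mean over channels.
    return image.mean(axis=-1)


image = np.random.default_rng(0).random((4, 6, 3))
prob = predict_with_tta(toy_model, image)
mask = (prob >= 0.5).astype(np.uint8)  # 0.5 threshold is an assumed choice
```

Because `toy_model` acts on each pixel independently, the averaged output equals its plain prediction; with a real network the three predictions generally differ, and averaging them tends to smooth out flip-dependent errors near boundaries.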


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/0b67/10534485/e67d3d96bf27/sensors-23-07724-g001.jpg
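The overlap metrics named in the abstract (accuracy, DSC, IoU, precision, recall) have standard definitions in terms of true and false positives and negatives on binary masks. The sketch below computes them with NumPy. The function name `segmentation_metrics` and the small `eps` smoothing term are assumptions for illustration and may differ from the evaluation code used in the paper.

```python
import numpy as np


def segmentation_metrics(pred, target, eps=1e-7):
    """Compute pixel-wise metrics for two binary masks of equal shape.

    eps is an assumed smoothing term that avoids division by zero when
    a mask contains no foreground pixels.
    """
    pred = np.asarray(pred, dtype=bool)
    target = np.asarray(target, dtype=bool)
    tp = np.logical_and(pred, target).sum()    # polyp predicted, polyp present
    fp = np.logical_and(pred, ~target).sum()   # polyp predicted, none present
    fn = np.logical_and(~pred, target).sum()   # polyp missed
    tn = np.logical_and(~pred, ~target).sum()  # background correctly rejected
    return {
        "accuracy": (tp + tn) / (tp + tn + fp + fn),
        "dice": (2 * tp + eps) / (2 * tp + fp + fn + eps),
        "iou": (tp + eps) / (tp + fp + fn + eps),
        "precision": (tp + eps) / (tp + fp + eps),
        "recall": (tp + eps) / (tp + fn + eps),
    }


# Tiny hand-made masks: 2 true positives, 1 false positive,
# 1 false negative, 2 true negatives.
pred = np.array([[1, 1, 0], [0, 1, 0]])
target = np.array([[1, 0, 0], [0, 1, 1]])
metrics = segmentation_metrics(pred, target)
```

For these masks the Dice coefficient is 4/6 and the IoU is 2/4, which shows the usual relationship that DSC is at least as large as IoU for the same prediction.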

Similar articles

Focus U-Net: A novel dual attention-gated CNN for polyp segmentation during colonoscopy.

Comput Biol Med. 2021 Oct;137:104815. doi: 10.1016/j.compbiomed.2021.104815. Epub 2021 Sep 2.

Enhanced accuracy with Segmentation of Colorectal Polyp using NanoNetB, and Conditional Random Field Test-Time Augmentation.

Front Robot AI. 2024 Aug 9;11:1387491. doi: 10.3389/frobt.2024.1387491. eCollection 2024.

Using DUCK-Net for polyp image segmentation.

Sci Rep. 2023 Jun 16;13(1):9803. doi: 10.1038/s41598-023-36940-5.

Multi-scale nested UNet with transformer for colorectal polyp segmentation.

J Appl Clin Med Phys. 2024 Jun;25(6):e14351. doi: 10.1002/acm2.14351. Epub 2024 Mar 29.

UViT-Seg: An Efficient ViT and U-Net-Based Framework for Accurate Colorectal Polyp Segmentation in Colonoscopy and WCE Images.

J Imaging Inform Med. 2024 Oct;37(5):2354-2374. doi: 10.1007/s10278-024-01124-8. Epub 2024 Apr 26.

HMA-Net: A deep U-shaped network combined with HarDNet and multi-attention mechanism for medical image segmentation.

Med Phys. 2023 Mar;50(3):1635-1646. doi: 10.1002/mp.16065. Epub 2022 Nov 3.

A Comprehensive Study on Colorectal Polyp Segmentation With ResUNet++, Conditional Random Field and Test-Time Augmentation.

IEEE J Biomed Health Inform. 2021 Jun;25(6):2029-2040. doi: 10.1109/JBHI.2021.3049304. Epub 2021 Jun 3.

GAR-Net: Guided Attention Residual Network for Polyp Segmentation from Colonoscopy Video Frames.

Diagnostics (Basel). 2022 Dec 30;13(1):123. doi: 10.3390/diagnostics13010123.

Li-SegPNet: Encoder-Decoder Mode Lightweight Segmentation Network for Colorectal Polyps Analysis.

IEEE Trans Biomed Eng. 2023 Apr;70(4):1330-1339. doi: 10.1109/TBME.2022.3216269. Epub 2023 Mar 21.

Cited by

Fine tuned CatBoost machine learning approach for early detection of cardiovascular disease through predictive modeling.

Sci Rep. 2025 Aug 25;15(1):31199. doi: 10.1038/s41598-025-13790-x.

Mamba-fusion for privacy-preserving disease prediction.

Sci Rep. 2025 Jul 1;15(1):21819. doi: 10.1038/s41598-025-06306-0.

An efficient fine tuning strategy of segment anything model for polyp segmentation.

Sci Rep. 2025 Apr 23;15(1):14088. doi: 10.1038/s41598-025-97802-w.

HDL-ACO hybrid deep learning and ant colony optimization for ocular optical coherence tomography image classification.

Sci Rep. 2025 Feb 18;15(1):5888. doi: 10.1038/s41598-025-89961-7.

MugenNet: A Novel Combined Convolution Neural Network and Transformer Network with Application in Colonic Polyp Image Segmentation.

Sensors (Basel). 2024 Nov 23;24(23):7473. doi: 10.3390/s24237473.
