• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无线胶囊内镜中的精度增强:一种基于新型变压器的实时视频目标检测方法。

Precision enhancement in wireless capsule endoscopy: a novel transformer-based approach for real-time video object detection.

作者信息

Habe Tsedeke Temesgen, Haataja Keijo, Toivanen Pekka

机构信息

School of Computing, University of Eastern Finland, Kuopio, North Savo, Finland.

出版信息

Front Artif Intell. 2025 Apr 30;8:1529814. doi: 10.3389/frai.2025.1529814. eCollection 2025.

DOI:10.3389/frai.2025.1529814
PMID:40370809
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12075415/
Abstract

BACKGROUND

Wireless Capsule Endoscopy (WCE) enables non-invasive imaging of the gastrointestinal tract but generates vast video data, making real-time and accurate abnormality detection challenging. Traditional detection methods struggle with uncontrolled illumination, complex textures, and high-speed processing demands.

METHODS

This study presents a novel approach using Real-Time Detection Transformer (RT-DETR), a transformer-based object detection model, specifically optimized for WCE video analysis. The model captures contextual information between frames and handles variable image conditions. It was evaluated using the Kvasir-Capsule dataset, with performance assessed across three RT-DETR variants: Small (S), Medium (M), and X-Large (X).

RESULTS

RT-DETR-X achieved the highest detection precision. RT-DETR-M offered a practical trade-off between accuracy and speed, while RT-DETR-S processed frames at 270 FPS, enabling real-time performance. All three models demonstrated improved detection accuracy and computational efficiency compared to baseline methods.

DISCUSSION

The RT-DETR framework significantly enhances precision and real-time performance in gastrointestinal abnormality detection using WCE. Its clinical potential lies in supporting faster and more accurate diagnosis. Future work will focus on further optimization and deployment in endoscopic video analysis systems.

摘要

背景

无线胶囊内镜(WCE)能够对胃肠道进行无创成像,但会产生大量视频数据,这使得实时准确地检测异常具有挑战性。传统的检测方法难以应对光照不受控制、纹理复杂以及高速处理需求等问题。

方法

本研究提出了一种使用实时检测变压器(RT-DETR)的新方法,RT-DETR是一种基于变压器的目标检测模型,专门针对WCE视频分析进行了优化。该模型捕捉帧之间的上下文信息,并处理各种图像条件。使用Kvasir-Capsule数据集对其进行评估,通过三种RT-DETR变体进行性能评估:小(S)、中(M)和超大(X)。

结果

RT-DETR-X实现了最高的检测精度。RT-DETR-M在准确性和速度之间提供了一个实际的权衡,而RT-DETR-S以270帧每秒的速度处理帧,实现了实时性能。与基线方法相比,所有三种模型都展示了更高的检测精度和计算效率。

讨论

RT-DETR框架显著提高了使用WCE进行胃肠道异常检测的精度和实时性能。其临床潜力在于支持更快、更准确的诊断。未来的工作将集中在内窥镜视频分析系统中的进一步优化和部署。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/5545f10e7325/frai-08-1529814-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/fee18dad61d2/frai-08-1529814-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/c5956ebd6c9d/frai-08-1529814-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/d746138fdf29/frai-08-1529814-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/39cf857cbce5/frai-08-1529814-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/9ef1ca2b12db/frai-08-1529814-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/54514dd1899f/frai-08-1529814-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/abdf5df07504/frai-08-1529814-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/5545f10e7325/frai-08-1529814-g0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/fee18dad61d2/frai-08-1529814-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/c5956ebd6c9d/frai-08-1529814-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/d746138fdf29/frai-08-1529814-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/39cf857cbce5/frai-08-1529814-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/9ef1ca2b12db/frai-08-1529814-g0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/54514dd1899f/frai-08-1529814-g0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/abdf5df07504/frai-08-1529814-g0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2c64/12075415/5545f10e7325/frai-08-1529814-g0008.jpg

相似文献

1
Precision enhancement in wireless capsule endoscopy: a novel transformer-based approach for real-time video object detection.无线胶囊内镜中的精度增强:一种基于新型变压器的实时视频目标检测方法。
Front Artif Intell. 2025 Apr 30;8:1529814. doi: 10.3389/frai.2025.1529814. eCollection 2025.
2
Colored Video Analysis in Wireless Capsule Endoscopy: A Survey of State-of-the-Art.无线胶囊内镜中的彩色视频分析:现状综述。
Curr Med Imaging. 2020;16(9):1074-1084. doi: 10.2174/1573405616666200124140915.
3
Automatic patient-level recognition of four species on thin blood smear by a real-time detection transformer (RT-DETR) object detection algorithm: a proof-of-concept and evaluation.实时检测变压器 (RT-DETR) 目标检测算法对薄血涂片上四种 物种的自动患者水平识别:概念验证和评估。
Microbiol Spectr. 2024 Feb 6;12(2):e0144023. doi: 10.1128/spectrum.01440-23. Epub 2024 Jan 3.
4
DV-DETR: Improved UAV Aerial Small Target Detection Algorithm Based on RT-DETR.DV-DETR:基于RT-DETR的改进型无人机航空小目标检测算法
Sensors (Basel). 2024 Nov 19;24(22):7376. doi: 10.3390/s24227376.
5
Multiple kidney stones prediction with efficient RT-DETR model.基于高效RT-DETR模型的多发性肾结石预测
Comput Biol Med. 2025 May;190:110023. doi: 10.1016/j.compbiomed.2025.110023. Epub 2025 Mar 18.
6
Plant recognition of maize seedling stage in UAV remote sensing images based on H-RT-DETR.基于H-RT-DETR的无人机遥感图像中玉米苗期植株识别
Plant Methods. 2025 May 14;21(1):60. doi: 10.1186/s13007-025-01382-9.
7
Pavement-DETR: A High-Precision Real-Time Detection Transformer for Pavement Defect Detection.路面DETR:一种用于路面缺陷检测的高精度实时检测变压器
Sensors (Basel). 2025 Apr 11;25(8):2426. doi: 10.3390/s25082426.
8
Bearing-DETR: A Lightweight Deep Learning Model for Bearing Defect Detection Based on RT-DETR.轴承-DETR:一种基于RT-DETR的用于轴承缺陷检测的轻量级深度学习模型。
Sensors (Basel). 2024 Jun 30;24(13):4262. doi: 10.3390/s24134262.
9
RT-DETR-SoilCuc: detection method for cucumber germinationinsoil based environment.RT-DETR-SoilCuc:基于土壤环境的黄瓜种子萌发检测方法
Front Plant Sci. 2024 Aug 22;15:1425103. doi: 10.3389/fpls.2024.1425103. eCollection 2024.
10
River floating object detection with transformer model in real time.基于变压器模型的河流漂浮物实时检测
Sci Rep. 2025 Mar 16;15(1):9026. doi: 10.1038/s41598-025-93659-1.

引用本文的文献

1
Can YOLO Detect Retinal Pathologies? A Step Towards Automated OCT Analysis.YOLO能检测视网膜病变吗?迈向自动光学相干断层扫描分析的一步。
Diagnostics (Basel). 2025 Jul 19;15(14):1823. doi: 10.3390/diagnostics15141823.

本文引用的文献

1
Breast Tumor Detection and Diagnosis Using an Improved Faster R-CNN in DCE-MRI.在动态对比增强磁共振成像中使用改进的更快区域卷积神经网络进行乳腺肿瘤检测与诊断
Bioengineering (Basel). 2024 Dec 1;11(12):1217. doi: 10.3390/bioengineering11121217.
2
CRH-YOLO for precise and efficient detection of gastrointestinal polyps.用于精确高效检测胃肠道息肉的CRH-YOLO
Sci Rep. 2024 Dec 3;14(1):30033. doi: 10.1038/s41598-024-81842-9.
3
Research on Microscale Vehicle Logo Detection Based on Real-Time DEtection TRansformer (RT-DETR).基于实时检测变压器(RT-DETR)的微尺度车辆标志检测研究
Sensors (Basel). 2024 Oct 30;24(21):6987. doi: 10.3390/s24216987.
4
Pixel-wise annotation for clear and contaminated regions segmentation in wireless capsule endoscopy images: A multicentre database.用于无线胶囊内窥镜图像中清晰和污染区域分割的逐像素标注:一个多中心数据库。
Data Brief. 2024 Sep 10;57:110927. doi: 10.1016/j.dib.2024.110927. eCollection 2024 Dec.
5
A semantic feature enhanced YOLOv5-based network for polyp detection from colonoscopy images.基于语义特征增强的 YOLOv5 网络的结肠镜图像息肉检测方法。
Sci Rep. 2024 Jul 5;14(1):15478. doi: 10.1038/s41598-024-66642-5.
6
Multi-scale coupled attention for visual object detection.用于视觉目标检测的多尺度耦合注意力机制
Sci Rep. 2024 May 16;14(1):11191. doi: 10.1038/s41598-024-60897-8.
7
Automatic patient-level recognition of four species on thin blood smear by a real-time detection transformer (RT-DETR) object detection algorithm: a proof-of-concept and evaluation.实时检测变压器 (RT-DETR) 目标检测算法对薄血涂片上四种 物种的自动患者水平识别:概念验证和评估。
Microbiol Spectr. 2024 Feb 6;12(2):e0144023. doi: 10.1128/spectrum.01440-23. Epub 2024 Jan 3.
8
High-Speed and Accurate Diagnosis of Gastrointestinal Disease: Learning on Endoscopy Images Using Lightweight Transformer with Local Feature Attention.胃肠道疾病的高速准确诊断:使用具有局部特征注意力的轻量级Transformer在内窥镜图像上进行学习
Bioengineering (Basel). 2023 Dec 13;10(12):1416. doi: 10.3390/bioengineering10121416.
9
Wireless capsule endoscopy multiclass classification using three-dimensional deep convolutional neural network model.基于三维深度卷积神经网络模型的无线胶囊内镜多分类。
Biomed Eng Online. 2023 Dec 15;22(1):124. doi: 10.1186/s12938-023-01186-9.
10
SI-ViT: Shuffle instance-based Vision Transformer for pancreatic cancer ROSE image classification.SI-ViT:用于胰腺癌ROSE图像分类的基于实例混洗的视觉Transformer
Comput Methods Programs Biomed. 2024 Feb;244:107969. doi: 10.1016/j.cmpb.2023.107969. Epub 2023 Dec 8.