快速且强大的内镜内容区域估计：基于精简GPU的流程及精选基准数据集。

Rapid and robust endoscopic content area estimation: A lean GPU-based pipeline and curated benchmark dataset.

作者信息

Budd Charlie, Garcia-Peraza-Herrera Luis C, Huber Martin, Ourselin Sebastien, Vercauteren Tom

机构信息

King's College London, UK.

Hypervision Surgical Ltd, UK.

出版信息

Comput Methods Biomech Biomed Eng Imaging Vis. 2023 Jul 4;11(4):1215-1224. doi: 10.1080/21681163.2022.2156393. Epub 2023 Jan 4.

DOI:10.1080/21681163.2022.2156393

PMID:38600897

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC7615255/

Abstract

Endoscopic content area refers to the informative area enclosed by the dark, non-informative, border regions present in most endoscopic footage. The estimation of the content area is a common task in endoscopic image processing and computer vision pipelines. Despite the apparent simplicity of the problem, several factors make reliable real-time estimation surprisingly challenging. The lack of rigorous investigation into the topic combined with the lack of a common benchmark dataset for this task has been a long-lasting issue in the field. In this paper, we propose two variants of a lean GPU-based computational pipeline combining edge detection and circle fitting. The two variants differ by relying on handcrafted features, and learned features respectively to extract content area edge point candidates. We also present a first-of-its-kind dataset of manually annotated and pseudo-labelled content areas across a range of surgical indications. To encourage further developments, the curated dataset, and an implementation of both algorithms, has been made public (https://doi.org/10.7303/syn32148000, https://github.com/charliebudd/torch-content-area). We compare our proposed algorithm with a state-of-the-art U-Net-based approach and demonstrate significant improvement in terms of both accuracy (Hausdorff distance: 6.3 px versus 118.1 px) and computational time (Average runtime per frame: 0.13 ms versus 11.2 ms).

摘要

内镜内容区域是指在大多数内镜视频中由黑暗的、无信息的边界区域所包围的信息区域。内容区域的估计是内镜图像处理和计算机视觉流程中的一项常见任务。尽管这个问题表面上很简单，但有几个因素使得可靠的实时估计极具挑战性。对该主题缺乏严格的研究，再加上缺乏针对此任务的通用基准数据集，一直是该领域长期存在的问题。在本文中，我们提出了一种基于精简GPU的计算流程的两个变体，该流程结合了边缘检测和圆拟合。这两个变体的不同之处在于，分别依靠手工制作的特征和学习到的特征来提取内容区域边缘点候选。我们还展示了首个跨一系列手术指征的手动标注和伪标注内容区域的数据集。为鼓励进一步发展，精心策划的数据集以及这两种算法的实现已公开（https://doi.org/10.7303/syn32148000，https://github.com/charliebudd/torch-content-area）。我们将我们提出的算法与基于U-Net的最先进方法进行比较，并在准确性（豪斯多夫距离：6.3像素对118.1像素）和计算时间（每帧平均运行时间：0.13毫秒对11.2毫秒）方面都展示出显著改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2ab1/7615255/4d3912264a8c/EMS177365-f001.jpg

相似文献

Rapid and robust endoscopic content area estimation: A lean GPU-based pipeline and curated benchmark dataset.快速且强大的内镜内容区域估计：基于精简GPU的流程及精选基准数据集。

Comput Methods Biomech Biomed Eng Imaging Vis. 2023 Jul 4;11(4):1215-1224. doi: 10.1080/21681163.2022.2156393. Epub 2023 Jan 4.

Detection, segmentation, and 3D pose estimation of surgical tools using convolutional neural networks and algebraic geometry.使用卷积神经网络和代数几何进行手术工具的检测、分割和三维姿态估计。

Med Image Anal. 2021 May;70:101994. doi: 10.1016/j.media.2021.101994. Epub 2021 Feb 7.

A fast monocular 6D pose estimation method for textureless objects based on perceptual hashing and template matching.一种基于感知哈希和模板匹配的无纹理物体快速单目6D姿态估计方法。

Front Robot AI. 2025 Jan 8;11:1424036. doi: 10.3389/frobt.2024.1424036. eCollection 2024.

Robust segmentation of arterial walls in intravascular ultrasound images using Dual Path U-Net.使用双通道 U-Net 对血管内超声图像中的动脉壁进行稳健分割。

Ultrasonics. 2019 Jul;96:24-33. doi: 10.1016/j.ultras.2019.03.014. Epub 2019 Mar 23.

Dense GPU-enhanced surface reconstruction from stereo endoscopic images for intraoperative registration.基于双目内窥镜图像的密集 GPU 增强表面重建用于术中配准。

Med Phys. 2012 Mar;39(3):1632-45. doi: 10.1118/1.3681017.

EA-Net: Edge-aware network for brain structure segmentation via decoupled high and low frequency features.EA-Net：通过解耦高低频特征实现脑结构分割的边缘感知网络。

Comput Biol Med. 2022 Nov;150:106139. doi: 10.1016/j.compbiomed.2022.106139. Epub 2022 Sep 21.

LumVertCancNet: A novel 3D lumbar vertebral body cancellous bone location and segmentation method based on hybrid Swin-transformer.LumVertCancNet：一种基于混合 Swin-Transformer 的新型 3D 腰椎松质骨定位与分割方法。

Comput Biol Med. 2024 Mar;171:108237. doi: 10.1016/j.compbiomed.2024.108237. Epub 2024 Feb 28.

An uncertainty-aware deep learning architecture with outlier mitigation for prostate gland segmentation in radiotherapy treatment planning.具有异常值缓解的不确定性感知深度学习架构，用于放射治疗计划中的前列腺分割。

Med Phys. 2023 Jan;50(1):311-322. doi: 10.1002/mp.15982. Epub 2022 Sep 28.

EndoSLAM dataset and an unsupervised monocular visual odometry and depth estimation approach for endoscopic videos.内镜 SLAM 数据集和一种用于内镜视频的无监督单目视觉里程计和深度估计方法。

Med Image Anal. 2021 Jul;71:102058. doi: 10.1016/j.media.2021.102058. Epub 2021 Apr 15.

StaSiS-Net: A stacked and siamese disparity estimation network for depth reconstruction in modern 3D laparoscopy.StaSiS-Net：一种用于现代三维腹腔镜深度重建的堆叠式连体视差估计网络。

Med Image Anal. 2022 Apr;77:102380. doi: 10.1016/j.media.2022.102380. Epub 2022 Jan 30.

引用本文的文献

Advancing artificial intelligence applicability in endoscopy through source-agnostic camera signal extraction from endoscopic images.通过从内镜图像中提取与源无关的相机信号来推进人工智能在内镜检查中的适用性。

PLoS One. 2025 Jun 11;20(6):e0325987. doi: 10.1371/journal.pone.0325987. eCollection 2025.

Deep Reinforcement Learning Based System for Intraoperative Hyperspectral Video Autofocusing.基于深度强化学习的术中高光谱视频自动聚焦系统

Med Image Comput Comput Assist Interv. 2023 Oct 1:658-667. doi: 10.1007/978-3-031-43996-4_63.

A Vascular Feature Detection and Matching Method Based on Dual-Branch Fusion and Structure Enhancement.基于双分支融合和结构增强的血管特征检测与匹配方法。

Sensors (Basel). 2024 Mar 15;24(6):1880. doi: 10.3390/s24061880.

本文引用的文献

Deep homography estimation in dynamic surgical scenes for laparoscopic camera motion extraction.用于腹腔镜相机运动提取的动态手术场景中的深度单应性估计。

Comput Methods Biomech Biomed Eng Imaging Vis. 2022 Feb 23;10(3):321-329. doi: 10.1080/21681163.2021.2002195. eCollection 2022.

Robotic Endoscope Control Via Autonomous Instrument Tracking.通过自主器械跟踪实现机器人内窥镜控制。

Front Robot AI. 2022 Apr 11;9:832208. doi: 10.3389/frobt.2022.832208. eCollection 2022.

EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos.EndoNet：腹腔镜视频识别任务的深度架构。

IEEE Trans Med Imaging. 2017 Jan;36(1):86-97. doi: 10.1109/TMI.2016.2593957. Epub 2016 Jul 22.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

快速且强大的内镜内容区域估计：基于精简GPU的流程及精选基准数据集。

Rapid and robust endoscopic content area estimation: A lean GPU-based pipeline and curated benchmark dataset.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献