Garcia-Peraza-Herrera Luis C, Fidon Lucas, D'Ettorre Claudia, Stoyanov Danail, Vercauteren Tom, Ourselin Sebastien
IEEE Trans Med Imaging. 2021 May;40(5):1450-1460. doi: 10.1109/TMI.2021.3057884. Epub 2021 Apr 30.
Producing manual, pixel-accurate image segmentation labels is tedious and time-consuming. This is often a rate-limiting factor when large amounts of labeled images are required, such as for training deep convolutional networks for instrument-background segmentation in surgical scenes. No large datasets comparable to industry standards in the computer vision community are available for this task. To circumvent this problem, we propose to automate the creation of a realistic training dataset by exploiting techniques stemming from special effects and harnessing them to target training performance rather than visual appeal. Foreground data is captured by placing sample surgical instruments over a chroma key (a.k.a. green screen) in a controlled environment, thereby making extraction of the relevant image segment straightforward. Multiple lighting conditions and viewpoints can be captured and introduced into the simulation by moving the instruments and camera and modulating the light source. Background data is captured by collecting videos that do not contain instruments. In the absence of pre-existing instrument-free background videos, minimal labeling effort is required, only to select frames that do not contain surgical instruments from videos of surgical interventions freely available online. We compare different methods to blend instruments over tissue and propose a novel data augmentation approach that takes advantage of this plurality of options. We show that by training a vanilla U-Net on semi-synthetic data only and applying simple post-processing, we are able to match the results of the same network trained on a publicly available manually labeled real dataset.
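The following Python/OpenCV sketch illustrates the kind of compositing pipeline the abstract describes: keying an instrument photographed over a green screen, then blending it onto an instrument-free background frame, with the blending mode sampled at random as a simple form of augmentation over the "plurality of options". The HSV thresholds, blending modes, file names, and function names are illustrative assumptions, not the authors' released implementation.

"""
Illustrative sketch (not the authors' code): build one semi-synthetic
image/label pair by chroma-keying an instrument and compositing it over
an instrument-free surgical frame.
"""
import random

import cv2
import numpy as np


def chroma_key_mask(fg_bgr, lower_hsv=(35, 60, 60), upper_hsv=(85, 255, 255)):
    """Return a binary mask that is 1 where the pixel is NOT green screen."""
    hsv = cv2.cvtColor(fg_bgr, cv2.COLOR_BGR2HSV)
    green = cv2.inRange(hsv,
                        np.array(lower_hsv, dtype=np.uint8),
                        np.array(upper_hsv, dtype=np.uint8))
    mask = (green == 0).astype(np.uint8)  # instrument pixels
    # Remove speckle left by imperfect keying.
    kernel = np.ones((5, 5), np.uint8)
    mask = cv2.morphologyEx(mask, cv2.MORPH_OPEN, kernel)
    return mask


def blend(fg_bgr, bg_bgr, mask, mode):
    """Composite the keyed instrument over the tissue background using one
    of several simple blending strategies (assumed modes for this sketch)."""
    mask3 = np.repeat(mask[:, :, None], 3, axis=2).astype(np.float32)
    fg = fg_bgr.astype(np.float32)
    bg = bg_bgr.astype(np.float32)
    if mode == "hard":          # naive cut-and-paste
        out = mask3 * fg + (1.0 - mask3) * bg
    elif mode == "feathered":   # soften the instrument boundary
        soft = cv2.GaussianBlur(mask3, (21, 21), 0)
        out = soft * fg + (1.0 - soft) * bg
    elif mode == "poisson":     # gradient-domain (seamless) cloning;
        # the keyed region must fit inside bg at the chosen center.
        center = (bg_bgr.shape[1] // 2, bg_bgr.shape[0] // 2)
        out = cv2.seamlessClone(fg_bgr, bg_bgr, mask * 255, center,
                                cv2.NORMAL_CLONE)
        return out, mask
    else:
        raise ValueError(mode)
    return out.astype(np.uint8), mask


if __name__ == "__main__":
    # Hypothetical input file names.
    fg = cv2.imread("instrument_on_green_screen.png")
    bg = cv2.imread("instrument_free_surgical_frame.png")
    bg = cv2.resize(bg, (fg.shape[1], fg.shape[0]))

    m = chroma_key_mask(fg)
    # Sampling a blending mode per composite is one way to realise the
    # blending-based augmentation described in the abstract.
    image, label = blend(fg, bg, m,
                         mode=random.choice(["hard", "feathered", "poisson"]))
    cv2.imwrite("semi_synthetic_image.png", image)
    cv2.imwrite("semi_synthetic_label.png", label * 255)

Each composite comes with a pixel-accurate label for free (the chroma-key mask), which is what removes the need for manual annotation of the training set.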