Department of Translational Surgical Oncology, National Centre for Tumor Diseases (NCT/UCC), Dresden, 01307, Germany.
SECAI, TU Dresden, Dresden, Germany.
Int J Comput Assist Radiol Surg. 2024 Jun;19(6):985-993. doi: 10.1007/s11548-024-03079-1. Epub 2024 Feb 26.
In surgical computer vision applications, data privacy concerns and the cost of expert annotation impede the acquisition of labeled training data. Unpaired image-to-image translation techniques have been explored to automatically generate annotated datasets by translating synthetic images into a realistic domain. Preserving structure and semantic consistency, i.e., the per-class distribution, during translation poses a significant challenge, particularly when the semantic distributions of the two domains are mismatched.
This study empirically investigates various translation methods for generating data in surgical applications, with an explicit focus on semantic consistency. Based on this analysis, we introduce a novel and simple combination of effective approaches, which we call ConStructS. The losses defined in this approach operate on multiple image patches and spatial resolutions during translation (an illustrative sketch follows the abstract).
Various state-of-the-art models were extensively evaluated on two challenging surgical datasets. Two evaluation schemes were used to assess both the semantic consistency of the translated images and their usefulness for downstream semantic segmentation. The results demonstrate the effectiveness of ConStructS in minimizing semantic distortion, and the images generated by this model show superior utility for downstream training.
In this study, we tackle semantic inconsistency in unpaired image translation for surgical applications with minimal labeled data. The resulting simple model, ConStructS, enhances consistency during translation and offers a practical way to generate fully labeled, semantically consistent datasets at minimal cost. Our code is available at https://gitlab.com/nct_tso_public/constructs .
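To make the idea of patch- and resolution-level consistency losses concrete, the following is a minimal PyTorch sketch of one way such a loss could be structured: a PatchNCE-style contrastive term computed over randomly sampled feature patches at several image scales. All names, the encoder choice, and the hyperparameters are illustrative assumptions; this is not the authors' ConStructS implementation (see the linked repository for that).

    # Minimal, illustrative sketch of a patchwise contrastive loss applied at
    # multiple spatial resolutions. All names and hyperparameters are
    # assumptions for exposition; NOT the authors' ConStructS implementation.
    import torch
    import torch.nn.functional as F

    def patch_nce_loss(feat_src, feat_tgt, num_patches=64, tau=0.07):
        """PatchNCE-style loss: corresponding patches of the source and the
        translated image are positives; other sampled patches are negatives.

        feat_src, feat_tgt: (B, C, H, W) feature maps at the same resolution.
        """
        b, c, h, w = feat_src.shape
        p = min(num_patches, h * w)
        # Sample the same spatial positions in both maps so that the diagonal
        # of the similarity matrix holds the positive pairs.
        idx = torch.randperm(h * w, device=feat_src.device)[:p]
        src = F.normalize(feat_src.flatten(2).permute(0, 2, 1)[:, idx], dim=-1)  # (B, P, C)
        tgt = F.normalize(feat_tgt.flatten(2).permute(0, 2, 1)[:, idx], dim=-1)
        logits = torch.bmm(tgt, src.transpose(1, 2)) / tau  # (B, P, P)
        labels = torch.arange(p, device=logits.device).unsqueeze(0).expand(b, -1)
        return F.cross_entropy(logits.reshape(-1, p), labels.reshape(-1))

    def multiscale_consistency_loss(encoder, x, y, scales=(1.0, 0.5, 0.25)):
        """Average the patchwise loss over several image resolutions, so that
        consistency is enforced for both fine and coarse structures."""
        total = 0.0
        for s in scales:
            xs = x if s == 1.0 else F.interpolate(x, scale_factor=s, mode="bilinear", align_corners=False)
            ys = y if s == 1.0 else F.interpolate(y, scale_factor=s, mode="bilinear", align_corners=False)
            total = total + patch_nce_loss(encoder(xs), encoder(ys))
        return total / len(scales)

    # Example usage with a toy encoder (any image-to-feature-map module works):
    if __name__ == "__main__":
        encoder = torch.nn.Conv2d(3, 16, kernel_size=3, padding=1)
        x = torch.randn(2, 3, 64, 64)  # synthetic source image batch
        y = torch.randn(2, 3, 64, 64)  # its translated counterpart
        print(multiscale_consistency_loss(encoder, x, y))

In CUT-style setups the feature encoder is often the first layers of the generator itself; for this sketch, any module mapping images to (B, C, H, W) features would serve.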