Exploring semantic consistency in unpaired image translation to generate data for surgical applications.

Affiliations

Department of Translational Surgical Oncology, National Centre for Tumor Diseases(NCT/UCC), Dresden, 01307, Germany.

SECAI, TU Dresden, Dresden, Germany.

Publication information

Int J Comput Assist Radiol Surg. 2024 Jun;19(6):985-993. doi: 10.1007/s11548-024-03079-1. Epub 2024 Feb 26.

DOI:10.1007/s11548-024-03079-1
PMID:38407730
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11599420/
Abstract

PURPOSE

In surgical computer vision applications, data privacy and expert annotation challenges impede the acquisition of labeled training data. Unpaired image-to-image translation techniques have been explored to automatically generate annotated datasets by translating synthetic images into a realistic domain. The preservation of structure and semantic consistency, i.e., per-class distribution during translation, poses a significant challenge, particularly in cases of semantic distributional mismatch.
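The notion of semantic consistency as a preserved per-class distribution can be illustrated with a minimal sketch. It assumes label maps are small nested lists of integer class ids, and it uses total variation distance as one possible way to compare two distributions; the function names and toy data are hypothetical and are not taken from the paper.

```python
from collections import Counter


def class_distribution(label_map, num_classes):
    """Return the fraction of pixels assigned to each class id in [0, num_classes)."""
    counts = Counter(pixel for row in label_map for pixel in row)
    total = sum(counts.values())
    if total == 0:
        raise ValueError("label_map must contain at least one pixel")
    return [counts.get(c, 0) / total for c in range(num_classes)]


def total_variation(p, q):
    """Total variation distance between two discrete distributions (0 = identical)."""
    return 0.5 * sum(abs(a - b) for a, b in zip(p, q))


# Toy 4x4 label map of a synthetic image (hypothetical data).
synthetic_labels = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 1, 1],
    [2, 2, 1, 1],
]
# Hypothetical segmentation of the translated image: part of class 2
# has been turned into class 0, i.e. the translation distorted semantics.
translated_labels = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 1, 1],
]

p = class_distribution(synthetic_labels, 3)
q = class_distribution(translated_labels, 3)
tv = total_variation(p, q)
print(p, q, tv)
```

A distance of zero would mean the class proportions survived translation unchanged; larger values indicate that some classes grew or shrank. Matching proportions alone do not guarantee that each pixel kept its class.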

METHOD

This study empirically investigates various translation methods for generating data in surgical applications, explicitly focusing on semantic consistency. Through our analysis, we introduce a novel and simple combination of effective approaches, which we call ConStructS. The defined losses within this approach operate on multiple image patches and spatial resolutions during translation.
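The abstract does not specify the ConStructS losses, so the following is only a generic sketch of what "operating on multiple image patches and spatial resolutions" can look like: a patch-wise L1 discrepancy averaged over several resolutions. Everything here is an assumption for illustration (the function names, the average-pooling downsampler, the non-overlapping patch grid, and the use of raw intensities); practical methods typically compare learned features or structural descriptors instead.

```python
def downsample(image, factor):
    """Average-pool a 2D nested-list image by an integer factor.

    Assumes the image height and width are divisible by `factor`.
    """
    h, w = len(image), len(image[0])
    return [
        [
            sum(
                image[y * factor + dy][x * factor + dx]
                for dy in range(factor)
                for dx in range(factor)
            ) / factor ** 2
            for x in range(w // factor)
        ]
        for y in range(h // factor)
    ]


def patch_l1(a, b, top, left, size):
    """Mean absolute difference between two images inside one square patch."""
    total = 0.0
    for y in range(top, top + size):
        for x in range(left, left + size):
            total += abs(a[y][x] - b[y][x])
    return total / size ** 2


def multiscale_patch_loss(source, translated, factors=(1, 2), patch_size=2):
    """Average the patch-wise L1 discrepancy over a grid of patches and over scales."""
    per_scale = []
    for factor in factors:
        s = downsample(source, factor)
        t = downsample(translated, factor)
        h, w = len(s), len(s[0])
        # Non-overlapping grid of patches at this resolution.
        patch_values = [
            patch_l1(s, t, top, left, patch_size)
            for top in range(0, h - patch_size + 1, patch_size)
            for left in range(0, w - patch_size + 1, patch_size)
        ]
        per_scale.append(sum(patch_values) / len(patch_values))
    return sum(per_scale) / len(per_scale)


# Toy 4x4 images (hypothetical data): the translation altered the top-left block.
source = [[0.0] * 4 for _ in range(4)]
translated = [
    [1.0, 1.0, 0.0, 0.0],
    [1.0, 1.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 0.0, 0.0],
]
loss = multiscale_patch_loss(source, translated)
print(loss)
```

Evaluating the discrepancy at several resolutions lets both fine local changes and coarse layout changes contribute to the penalty.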

RESULTS

Various state-of-the-art models were extensively evaluated on two challenging surgical datasets. With two different evaluation schemes, the semantic consistency and the usefulness of the translated images on downstream semantic segmentation tasks were evaluated. The results demonstrate the effectiveness of the ConStructS method in minimizing semantic distortion, with images generated by this model showing superior utility for downstream training.
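The abstract does not name the metrics behind its two evaluation schemes. As an assumption, the sketch below uses per-class intersection over union (IoU), a common segmentation metric, to score how well a segmentation of the translated image agrees with the label map of the synthetic source image. The function names and toy data are hypothetical.

```python
def per_class_iou(reference, prediction, num_classes):
    """Return IoU per class; None for classes absent from both maps."""
    intersection = [0] * num_classes
    union = [0] * num_classes
    for ref_row, pred_row in zip(reference, prediction):
        for r, p in zip(ref_row, pred_row):
            if r == p:
                intersection[r] += 1
                union[r] += 1
            else:
                # A mismatched pixel enlarges the union of both classes involved.
                union[r] += 1
                union[p] += 1
    return [
        intersection[c] / union[c] if union[c] else None
        for c in range(num_classes)
    ]


def mean_iou(ious):
    """Average IoU over the classes that actually occur."""
    valid = [v for v in ious if v is not None]
    return sum(valid) / len(valid)


# Label map of the synthetic source image (hypothetical data).
reference = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 1, 1],
    [2, 2, 1, 1],
]
# Hypothetical segmentation predicted on the translated image.
prediction = [
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [0, 0, 1, 1],
    [2, 2, 1, 1],
]

ious = per_class_iou(reference, prediction, 3)
miou = mean_iou(ious)
print(ious, miou)
```

Unlike a comparison of class proportions, IoU is computed pixel by pixel, so it also penalizes classes that keep their overall share but move to different locations.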

CONCLUSION

In this study, we tackle semantic inconsistency in unpaired image translation for surgical applications with minimal labeled data. The simple model (ConStructS) enhances consistency during translation and serves as a practical way of generating fully labeled and semantically consistent datasets at minimal cost. Our code is available at https://gitlab.com/nct_tso_public/constructs .


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f42b/11599420/99aa419ed263/11548_2024_3079_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f42b/11599420/6faf5ad24913/11548_2024_3079_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f42b/11599420/ee3524868a87/11548_2024_3079_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/f42b/11599420/e46c8dec354f/11548_2024_3079_Fig3_HTML.jpg

Similar articles

1
Exploring semantic consistency in unpaired image translation to generate data for surgical applications.
Int J Comput Assist Radiol Surg. 2024 Jun;19(6):985-993. doi: 10.1007/s11548-024-03079-1. Epub 2024 Feb 26.
2
Semantic-Oriented Labeled-to-Unlabeled Distribution Translation for Image Segmentation.
IEEE Trans Med Imaging. 2022 Feb;41(2):434-445. doi: 10.1109/TMI.2021.3114329. Epub 2022 Feb 2.
3
CycleSGAN: A cycle-consistent and semantics-preserving generative adversarial network for unpaired MR-to-CT image synthesis.
Comput Med Imaging Graph. 2024 Oct;117:102431. doi: 10.1016/j.compmedimag.2024.102431. Epub 2024 Sep 4.
4
A bidirectional multilayer contrastive adaptation network with anatomical structure preservation for unpaired cross-modality medical image segmentation.
Comput Biol Med. 2022 Oct;149:105964. doi: 10.1016/j.compbiomed.2022.105964. Epub 2022 Aug 19.
5
Mining core information by evaluating semantic importance for unpaired image captioning.
Neural Netw. 2024 Nov;179:106519. doi: 10.1016/j.neunet.2024.106519. Epub 2024 Jul 9.
6
Dual domain distribution disruption with semantics preservation: Unsupervised domain adaptation for medical image segmentation.
Med Image Anal. 2024 Oct;97:103275. doi: 10.1016/j.media.2024.103275. Epub 2024 Jul 14.
7
CyCMIS: Cycle-consistent Cross-domain Medical Image Segmentation via diverse image augmentation.
Med Image Anal. 2022 Feb;76:102328. doi: 10.1016/j.media.2021.102328. Epub 2021 Dec 8.
8
A modality-collaborative convolution and transformer hybrid network for unpaired multi-modal medical image segmentation with limited annotations.
Med Phys. 2023 Sep;50(9):5460-5478. doi: 10.1002/mp.16338. Epub 2023 Mar 15.
9
PolypMixNet: Enhancing semi-supervised polyp segmentation with polyp-aware augmentation.
Comput Biol Med. 2024 Mar;170:108006. doi: 10.1016/j.compbiomed.2024.108006. Epub 2024 Jan 15.
10
DEPAS: De-novo Pathology Semantic Masks using a Generative Model.
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-7. doi: 10.1109/EMBC40787.2023.10340437.

Cited by

1
Semantic hyperspectral image synthesis for cross-modality knowledge transfer in surgical data science.
Int J Comput Assist Radiol Surg. 2025 Apr 24. doi: 10.1007/s11548-025-03364-7.
2
Deep learning-enabled transformation of anterior segment images to corneal fluorescein staining images for enhanced corneal disease screening.
Comput Struct Biotechnol J. 2025 Mar 7;28:94-105. doi: 10.1016/j.csbj.2025.02.039. eCollection 2025.
