• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于手术场景分割的基于类别感知语义扩散模型的图像合成

Image synthesis with class-aware semantic diffusion models for surgical scene segmentation.

作者信息

Zhou Yihang, Towning Rebecca, Awad Zaid, Giannarou Stamatia

机构信息

Hamlyn Centre for Robotic Surgery, Department of Surgery and Cancer Imperial College London London UK.

Imperial College Healthcare NHS Trust London UK.

出版信息

Healthc Technol Lett. 2025 Jan 31;12(1):e70003. doi: 10.1049/htl2.70003. eCollection 2025 Jan-Dec.

DOI:10.1049/htl2.70003
PMID:39897096
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11783686/
Abstract

Surgical scene segmentation is essential for enhancing surgical precision, yet it is frequently compromised by the scarcity and imbalance of available data. To address these challenges, semantic image synthesis methods based on generative adversarial networks and diffusion models have been developed. However, these models often yield non-diverse images and fail to capture small, critical tissue classes, limiting their effectiveness. In response, a class-aware semantic diffusion model (CASDM), a novel approach which utilizes segmentation maps as conditions for image synthesis to tackle data scarcity and imbalance is proposed. Novel class-aware mean squared error and class-aware self-perceptual loss functions have been defined to prioritize critical, less visible classes, thereby enhancing image quality and relevance. Furthermore, to the authors' knowledge, they are the first to generate multi-class segmentation maps using text prompts in a novel fashion to specify their contents. These maps are then used by CASDM to generate surgical scene images, enhancing datasets for training and validating segmentation models. This evaluation assesses both image quality and downstream segmentation performance, demonstrates the strong effectiveness and generalisability of CASDM in producing realistic image-map pairs, significantly advancing surgical scene segmentation across diverse and challenging datasets.

摘要

手术场景分割对于提高手术精度至关重要,但它经常因可用数据的稀缺和不平衡而受到影响。为应对这些挑战,基于生成对抗网络和扩散模型的语义图像合成方法已被开发出来。然而,这些模型往往生成的图像缺乏多样性,并且无法捕捉到小的关键组织类别,从而限制了它们的有效性。作为回应,提出了一种类感知语义扩散模型(CASDM),这是一种新颖的方法,它利用分割图作为图像合成的条件来解决数据稀缺和不平衡问题。定义了新颖的类感知均方误差和类感知自感知损失函数,以优先处理关键的、不太明显的类别,从而提高图像质量和相关性。此外,据作者所知,他们首次以新颖的方式使用文本提示生成多类分割图来指定其内容。然后,CASDM使用这些分割图来生成手术场景图像,增强用于训练和验证分割模型的数据集。该评估同时评估了图像质量和下游分割性能,证明了CASDM在生成逼真的图像-分割图对方面的强大有效性和通用性,显著推进了跨不同且具有挑战性的数据集的手术场景分割。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/29763a3e7c6a/HTL2-12-e70003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/18743ff184af/HTL2-12-e70003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/88dbceadc7eb/HTL2-12-e70003-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/373ef350b9f5/HTL2-12-e70003-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/c7476c325519/HTL2-12-e70003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/24ac88a50074/HTL2-12-e70003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/29763a3e7c6a/HTL2-12-e70003-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/18743ff184af/HTL2-12-e70003-g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/88dbceadc7eb/HTL2-12-e70003-g007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/373ef350b9f5/HTL2-12-e70003-g006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/c7476c325519/HTL2-12-e70003-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/24ac88a50074/HTL2-12-e70003-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ae66/11783686/29763a3e7c6a/HTL2-12-e70003-g001.jpg

相似文献

1
Image synthesis with class-aware semantic diffusion models for surgical scene segmentation.用于手术场景分割的基于类别感知语义扩散模型的图像合成
Healthc Technol Lett. 2025 Jan 31;12(1):e70003. doi: 10.1049/htl2.70003. eCollection 2025 Jan-Dec.
2
Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像(MRI)中进行脑肿瘤分割与检测
Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.
3
Generative multi-adversarial network for striking the right balance in abdominal image segmentation.生成式多对抗网络在腹部图像分割中实现良好平衡。
Int J Comput Assist Radiol Surg. 2020 Nov;15(11):1847-1858. doi: 10.1007/s11548-020-02254-4. Epub 2020 Sep 8.
4
Advancements in urban scene segmentation using deep learning and generative adversarial networks for accurate satellite image analysis.深度学习和生成对抗网络在城市场景分割中的应用进展,用于准确的卫星图像分析。
PLoS One. 2024 Jul 18;19(7):e0307187. doi: 10.1371/journal.pone.0307187. eCollection 2024.
5
Semi-Supervised Semantic Image Segmentation by Deep Diffusion Models and Generative Adversarial Networks.基于深度扩散模型和生成对抗网络的半监督语义图像分割。
Int J Neural Syst. 2024 Nov;34(11):2450057. doi: 10.1142/S0129065724500576. Epub 2024 Aug 15.
6
Label-informed cardiac magnetic resonance image synthesis through conditional generative adversarial networks.通过条件生成对抗网络实现标签引导的心脏磁共振图像合成。
Comput Med Imaging Graph. 2022 Oct;101:102123. doi: 10.1016/j.compmedimag.2022.102123. Epub 2022 Sep 11.
7
Guided image generation for improved surgical image segmentation.引导图像生成以改进手术图像分割。
Med Image Anal. 2024 Oct;97:103263. doi: 10.1016/j.media.2024.103263. Epub 2024 Jul 3.
8
Exploration of Semantic Label Decomposition and Dataset Size in Semantic Indoor Scenes Synthesis via Optimized Residual Generative Adversarial Networks.通过优化残差生成对抗网络探索语义室内场景合成中的语义标签分解和数据集大小
Sensors (Basel). 2022 Oct 29;22(21):8306. doi: 10.3390/s22218306.
9
Semantic hyperspectral image synthesis for cross-modality knowledge transfer in surgical data science.用于手术数据科学中跨模态知识转移的语义高光谱图像合成
Int J Comput Assist Radiol Surg. 2025 Apr 24. doi: 10.1007/s11548-025-03364-7.
10
Class-Aware Adversarial Transformers for Medical Image Segmentation.用于医学图像分割的类别感知对抗变压器
Adv Neural Inf Process Syst. 2022 Dec;35:29582-29596.

本文引用的文献

1
Anatomy segmentation in laparoscopic surgery: comparison of machine learning and human expertise - an experimental study.腹腔镜手术中的解剖分割:机器学习和人类专业知识的比较——一项实验研究。
Int J Surg. 2023 Oct 1;109(10):2962-2974. doi: 10.1097/JS9.0000000000000595.
2
Unsupervised Medical Image Translation With Adversarial Diffusion Models.基于对抗扩散模型的无监督医学图像翻译。
IEEE Trans Med Imaging. 2023 Dec;42(12):3524-3539. doi: 10.1109/TMI.2023.3290149. Epub 2023 Nov 30.
3
Diffusion models in medical imaging: A comprehensive survey.
扩散模型在医学成像中的应用:全面综述。
Med Image Anal. 2023 Aug;88:102846. doi: 10.1016/j.media.2023.102846. Epub 2023 May 23.
4
Deep learning for detection and segmentation of artefact and disease instances in gastrointestinal endoscopy.深度学习在胃肠内镜中用于检测和分割伪影和病变实例。
Med Image Anal. 2021 May;70:102002. doi: 10.1016/j.media.2021.102002. Epub 2021 Feb 17.
5
Artificial Intelligence for Intraoperative Guidance: Using Semantic Segmentation to Identify Surgical Anatomy During Laparoscopic Cholecystectomy.人工智能在术中指导中的应用:利用语义分割技术在腹腔镜胆囊切除术中识别手术解剖结构。
Ann Surg. 2022 Aug 1;276(2):363-369. doi: 10.1097/SLA.0000000000004594. Epub 2020 Nov 13.
6
EndoNet: A Deep Architecture for Recognition Tasks on Laparoscopic Videos.EndoNet:腹腔镜视频识别任务的深度架构。
IEEE Trans Med Imaging. 2017 Jan;36(1):86-97. doi: 10.1109/TMI.2016.2593957. Epub 2016 Jul 22.