用于医学图像分割的具有空间注意力和潜在嵌入的条件扩散模型

Conditional Diffusion Model with Spatial Attention and Latent Embedding for Medical Image Segmentation.

作者信息

Hejrati Behzad, Banerjee Soumyanil, Glide-Hurst Carri, Dong Ming

机构信息

Department of Computer Science, Wayne State University, Detroit, MI, USA.

Department of Human Oncology, University of Wisconsin-Madison, Madison, WI, USA.

出版信息

Med Image Comput Comput Assist Interv. 2024 Oct;15009:202-212. doi: 10.1007/978-3-031-72114-4_20. Epub 2024 Oct 3.

DOI:10.1007/978-3-031-72114-4_20

PMID:40196356

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11974562/

Abstract

Diffusion models have been used extensively for high quality image and video generation tasks. In this paper, we propose a novel conditional diffusion model with spatial attention and latent embedding (cDAL) for medical image segmentation. In cDAL, a convolutional neural network (CNN) based discriminator is used at every time-step of the diffusion process to distinguish between the generated labels and the real ones. A spatial attention map is computed based on the features learned by the discriminator to help cDAL generate more accurate segmentation of discriminative regions in an input image. Additionally, we incorporated a random latent embedding into each layer of our model to significantly reduce the number of training and sampling time-steps, thereby making it much faster than other diffusion models for image segmentation. We applied cDAL on 3 publicly available medical image segmentation datasets (MoNuSeg, Chest X-ray and Hippocampus) and observed significant qualitative and quantitative improvements with higher Dice scores and mIoU over the state-of-the-art algorithms. The source code is publicly available at https://github.com/Hejrati/cDAL/.

摘要

扩散模型已被广泛应用于高质量图像和视频生成任务。在本文中，我们提出了一种用于医学图像分割的新型条件扩散模型，即带空间注意力和潜在嵌入的模型（cDAL）。在cDAL中，基于卷积神经网络（CNN）的鉴别器在扩散过程的每个时间步用于区分生成的标签和真实标签。基于鉴别器学习到的特征计算空间注意力图，以帮助cDAL在输入图像中对判别区域生成更准确的分割。此外，我们在模型的每一层中加入了随机潜在嵌入，以显著减少训练和采样时间步的数量，从而使其在图像分割方面比其他扩散模型快得多。我们将cDAL应用于3个公开可用的医学图像分割数据集（MoNuSeg、胸部X光和海马体），并观察到与最先进算法相比，在定性和定量方面都有显著改进，具有更高的Dice分数和平均交并比（mIoU）。源代码可在https://github.com/Hejrati/cDAL/上公开获取。

相似文献

Conditional Diffusion Model with Spatial Attention and Latent Embedding for Medical Image Segmentation.用于医学图像分割的具有空间注意力和潜在嵌入的条件扩散模型

Med Image Comput Comput Assist Interv. 2024 Oct;15009:202-212. doi: 10.1007/978-3-031-72114-4_20. Epub 2024 Oct 3.

Brain tumor segmentation and detection in MRI using convolutional neural networks and VGG16.使用卷积神经网络和VGG16在磁共振成像（MRI）中进行脑肿瘤分割与检测

Cancer Biomark. 2025 Mar;42(3):18758592241311184. doi: 10.1177/18758592241311184. Epub 2025 Apr 4.

TAC-UNet: transformer-assisted convolutional neural network for medical image segmentation.TAC-UNet：用于医学图像分割的Transformer辅助卷积神经网络。

Quant Imaging Med Surg. 2024 Dec 5;14(12):8824-8839. doi: 10.21037/qims-24-1229. Epub 2024 Nov 5.

Collaborative networks of transformers and convolutional neural networks are powerful and versatile learners for accurate 3D medical image segmentation.基于转换器和卷积神经网络的协作网络是精确的 3D 医学图像分割的强大且多功能的学习者。

Comput Biol Med. 2023 Sep;164:107228. doi: 10.1016/j.compbiomed.2023.107228. Epub 2023 Jul 5.

Liver tumor segmentation method combining multi-axis attention and conditional generative adversarial networks.结合多轴注意力和条件生成对抗网络的肝脏肿瘤分割方法

PLoS One. 2024 Dec 3;19(12):e0312105. doi: 10.1371/journal.pone.0312105. eCollection 2024.

Domain-Generalized Discrete Diffusion Model for Cross-Domain Medical Image Segmentation.

IEEE Trans Med Imaging. 2025 Apr 25;PP. doi: 10.1109/TMI.2025.3564474.

TSCA-Net: Transformer based spatial-channel attention segmentation network for medical images.TSCA-Net：基于Transformer 的空间-通道注意力分割网络用于医学图像。

Comput Biol Med. 2024 Mar;170:107938. doi: 10.1016/j.compbiomed.2024.107938. Epub 2024 Jan 3.

Boosting medical image segmentation via conditional-synergistic convolution and lesion decoupling.通过条件协同卷积和病灶解耦来提升医学图像分割。

Comput Med Imaging Graph. 2022 Oct;101:102110. doi: 10.1016/j.compmedimag.2022.102110. Epub 2022 Aug 24.

EMCAH-Net: an effective multi-scale context aggregation hybrid network for medical image segmentation.EMCAH-Net：一种用于医学图像分割的高效多尺度上下文聚合混合网络。

Quant Imaging Med Surg. 2025 Apr 1;15(4):3064-3083. doi: 10.21037/qims-24-1983. Epub 2025 Mar 28.

BiU-net: A dual-branch structure based on two-stage fusion strategy for biomedical image segmentation.BiU-net：一种基于两阶段融合策略的双分支结构，用于生物医学图像分割。

Comput Methods Programs Biomed. 2024 Jul;252:108235. doi: 10.1016/j.cmpb.2024.108235. Epub 2024 May 18.

本文引用的文献

MSU-Net: Multi-Scale U-Net for 2D Medical Image Segmentation.MSU-Net：用于二维医学图像分割的多尺度U-Net

Front Genet. 2021 Feb 11;12:639930. doi: 10.3389/fgene.2021.639930. eCollection 2021.

UNet++: A Nested U-Net Architecture for Medical Image Segmentation.U-Net++：一种用于医学图像分割的嵌套U-Net架构。

Deep Learn Med Image Anal Multimodal Learn Clin Decis Support (2018). 2018 Sep;11045:3-11. doi: 10.1007/978-3-030-00889-5_1. Epub 2018 Sep 20.

A Multi-Organ Nucleus Segmentation Challenge.多器官细胞核分割挑战赛

IEEE Trans Med Imaging. 2020 May;39(5):1380-1391. doi: 10.1109/TMI.2019.2947628. Epub 2019 Oct 23.

A Dataset and a Technique for Generalized Nuclear Segmentation for Computational Pathology.用于计算病理学中通用核分割的数据集和技术。

IEEE Trans Med Imaging. 2017 Jul;36(7):1550-1560. doi: 10.1109/TMI.2017.2677499. Epub 2017 Mar 6.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验