Cascaded Multi-path Shortcut Diffusion Model for Medical Image Translation.

Affiliations

Department of Biomedical Engineering, Yale University, New Haven, CT, USA.

Department of Computer Science, University of California Irvine, Irvine, CA, USA.

Publication information

Med Image Anal. 2024 Dec;98:103300. doi: 10.1016/j.media.2024.103300. Epub 2024 Aug 13.

DOI:10.1016/j.media.2024.103300

PMID:39226710

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11979896/

Abstract

Image-to-image translation is a vital component of medical image processing, with applications across a wide range of imaging modalities and clinical scenarios. Previous methods include Generative Adversarial Networks (GANs) and Diffusion Models (DMs), which offer realism but suffer from instability and lack uncertainty estimation. Although GANs and DMs have each shown their capability in medical image translation tasks, the potential of combining a GAN with a DM to further improve translation performance and to enable uncertainty estimation remains largely unexplored. In this work, we address these challenges by proposing a Cascaded Multi-path Shortcut Diffusion Model (CMDM) for high-quality medical image translation and uncertainty estimation. To reduce the required number of iterations and ensure robust performance, our method first obtains a prior image from a conditional GAN, which is then used for efficient reverse translation with a DM in the subsequent step. Additionally, a multi-path shortcut diffusion strategy is employed to refine translation results and estimate uncertainty. A cascaded pipeline further enhances translation quality, incorporating residual averaging between cascades. We collected three different medical image datasets, with two sub-tasks for each dataset, to test the generalizability of our approach. Our experiments show that CMDM can produce high-quality translations comparable to state-of-the-art methods while providing reasonable uncertainty estimates that correlate well with the translation error.
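The multi-path shortcut idea described in the abstract can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: the linear noise schedule, the standard DDPM update rule, the function names `make_schedule` and `multi_path_shortcut`, the parameters `t_start` and `num_paths`, and the use of the per-pixel standard deviation as the uncertainty map are all assumptions made for illustration. The trained conditional denoiser is replaced by a hypothetical `oracle_eps` that already knows the target image, and the cascade with residual averaging is left out because the abstract does not specify it.

```python
import numpy as np

def make_schedule(num_steps=1000, beta_start=1e-4, beta_end=0.02):
    """Linear DDPM noise schedule (an assumed, commonly used default)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    alphas = 1.0 - betas
    return betas, alphas, np.cumprod(alphas)

def multi_path_shortcut(prior, source, eps_model, t_start=50, num_paths=4, seed=0):
    """Run several short reverse-diffusion paths that all start from a prior image.

    prior     : image from a first-stage translator (a conditional GAN in the abstract)
    source    : conditioning image, passed through to eps_model unchanged
    eps_model : hypothetical callable (x_t, t, source) -> predicted noise
    Returns the per-pixel mean and standard deviation across the paths.
    """
    betas, alphas, alpha_bars = make_schedule()
    rng = np.random.default_rng(seed)
    paths = []
    for _ in range(num_paths):
        # Shortcut: noise the prior only up to step t_start instead of
        # starting the reverse process from pure Gaussian noise.
        noise = rng.standard_normal(prior.shape)
        x = np.sqrt(alpha_bars[t_start]) * prior + np.sqrt(1.0 - alpha_bars[t_start]) * noise
        for t in range(t_start, -1, -1):
            eps = eps_model(x, t, source)
            # Standard DDPM posterior mean for step t.
            x = (x - betas[t] / np.sqrt(1.0 - alpha_bars[t]) * eps) / np.sqrt(alphas[t])
            if t > 0:
                # Fresh noise per path is what makes the paths differ.
                x = x + np.sqrt(betas[t]) * rng.standard_normal(x.shape)
        paths.append(x)
    paths = np.stack(paths)
    return paths.mean(axis=0), paths.std(axis=0)

# Toy usage: an "oracle" denoiser that knows the target stands in for a trained network.
target = np.linspace(0.0, 1.0, 64).reshape(8, 8)
prior = target + 0.1  # pretend first-stage output with a constant bias
_, _, alpha_bars = make_schedule()

def oracle_eps(x_t, t, source):
    # Noise that would map the target to x_t under the forward process.
    return (x_t - np.sqrt(alpha_bars[t]) * target) / np.sqrt(1.0 - alpha_bars[t])

mean_img, uncertainty = multi_path_shortcut(prior, source=None, eps_model=oracle_eps)
```

Because the oracle denoiser is exact, every path in this toy run lands on the target and the spread across paths is negligible. With a learned denoiser the paths would disagree in harder regions, and that disagreement is what a per-pixel uncertainty map of this kind would capture.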
