School of Electrical Engineering, Iran University of Science and Technology, Tehran, Iran.
Department of Electrical Engineering, Shahid Beheshti University, Tehran, Iran.
Med Image Anal. 2023 Aug;88:102846. doi: 10.1016/j.media.2023.102846. Epub 2023 May 23.
Denoising diffusion models, a class of generative models, have garnered immense interest lately in various deep-learning problems. A diffusion probabilistic model defines a forward diffusion stage where the input data is gradually perturbed over several steps by adding Gaussian noise and then learns to reverse the diffusion process to retrieve the desired noise-free data from noisy data samples. Diffusion models are widely appreciated for their strong mode coverage and quality of the generated samples in spite of their known computational burdens. Capitalizing on the advances in computer vision, the field of medical imaging has also observed a growing interest in diffusion models. With the aim of helping the researcher navigate this profusion, this survey intends to provide a comprehensive overview of diffusion models in the discipline of medical imaging. Specifically, we start with an introduction to the solid theoretical foundation and fundamental concepts behind diffusion models and the three generic diffusion modeling frameworks, namely, diffusion probabilistic models, noise-conditioned score networks, and stochastic differential equations. Then, we provide a systematic taxonomy of diffusion models in the medical domain and propose a multi-perspective categorization based on their application, imaging modality, organ of interest, and algorithms. To this end, we cover extensive applications of diffusion models in the medical domain, including image-to-image translation, reconstruction, registration, classification, segmentation, denoising, 2/3D generation, anomaly detection, and other medically-related challenges. Furthermore, we emphasize the practical use case of some selected approaches, and then we discuss the limitations of the diffusion models in the medical domain and propose several directions to fulfill the demands of this field. Finally, we gather the overviewed studies with their available open-source implementations at our GitHub. We aim to update the relevant latest papers within it regularly.
去噪扩散模型是一类生成模型,最近在各种深度学习问题中引起了极大的兴趣。扩散概率模型定义了一个前向扩散阶段,在该阶段中,输入数据通过添加高斯噪声逐步进行扰动,经过几个步骤,然后学习反向扩散过程,从噪声数据样本中恢复所需的无噪声数据。扩散模型因其强大的模式覆盖和生成样本的质量而受到广泛赞赏,尽管它们的计算负担是已知的。利用计算机视觉的进步,医学成像领域也对扩散模型产生了越来越大的兴趣。本综述旨在为研究人员提供医学成像领域扩散模型的全面概述,以帮助他们应对这一复杂局面。具体来说,我们首先介绍扩散模型的坚实理论基础和基本概念,以及三种通用的扩散建模框架,即扩散概率模型、噪声条件得分网络和随机微分方程。然后,我们提出了一种基于其应用、成像模态、感兴趣器官和算法的扩散模型在医学领域的系统分类法。为此,我们涵盖了扩散模型在医学领域的广泛应用,包括图像到图像的转换、重建、配准、分类、分割、去噪、2/3D 生成、异常检测和其他与医学相关的挑战。此外,我们强调了一些选定方法的实际用例,然后讨论了扩散模型在医学领域的局限性,并提出了几个方向,以满足该领域的需求。最后,我们在我们的 GitHub 上汇集了经过综述的研究及其可用的开源实现。我们的目标是定期在其中更新相关的最新论文。