IEEE J Biomed Health Inform. 2024 May;28(5):2904-2915. doi: 10.1109/JBHI.2024.3367375. Epub 2024 May 6.
Three-dimensional images are frequently used in medical imaging research for classification, segmentation, and detection. However, the limited availability of 3D images hinders research progress due to network training difficulties. Generative methods have been proposed to create medical images using AI techniques. Nevertheless, 2D approaches have difficulty dealing with 3D anatomical structures, which can result in discontinuities between slices. To mitigate these discontinuities, several 3D generative networks have been proposed. However, the scarcity of available 3D images makes training these networks with limited samples inadequate for producing high-fidelity 3D images. We propose a data-guided generative adversarial network to provide high fidelity in 3D image generation. The generator creates fake images with noise using reference code obtained by extracting features from real images. The generator also creates decoded images using reference code without noise. These decoded images are compared to the real images to evaluate fidelity in the reference code. This generation process can create high-fidelity 3D images from only a small amount of real training data. Additionally, our method employs three types of discriminator: volume (evaluates all the slices), slab (evaluates a set of consecutive slices), and slice (evaluates randomly selected slices). The proposed discriminator enhances fidelity by differentiating between real and fake images based on detailed characteristics. Results from our method are compared with existing methods by using quantitative analysis such as Fréchet inception distance and maximum mean discrepancy. The results demonstrate that our method produces more realistic 3D images than existing methods.
三维图像在医学成像研究中常用于分类、分割和检测。然而,由于网络训练困难,三维图像的可用性有限,阻碍了研究进展。已经提出了使用人工智能技术生成医学图像的生成方法。然而,二维方法难以处理三维解剖结构,这可能导致切片之间存在不连续。为了减轻这些不连续性,已经提出了几种三维生成网络。然而,可用的三维图像稀缺,使得使用有限的样本训练这些网络不足以生成高保真的三维图像。我们提出了一种数据引导的生成对抗网络,以提供三维图像生成的高保真度。生成器使用从真实图像中提取特征得到的参考代码,通过添加噪声来生成假图像。生成器还使用没有噪声的参考代码生成解码图像。将这些解码图像与真实图像进行比较,以评估参考代码中的保真度。这种生成过程可以仅使用少量的真实训练数据生成高保真度的三维图像。此外,我们的方法使用了三种类型的鉴别器:体(评估所有切片)、板(评估一组连续的切片)和片(评估随机选择的切片)。所提出的鉴别器通过基于详细特征区分真实图像和假图像来提高保真度。通过使用 Fréchet inception 距离和最大均值差异等定量分析方法,将我们的方法与现有方法的结果进行比较。结果表明,我们的方法生成的三维图像比现有方法更逼真。