Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:430-433. doi: 10.1109/EMBC48229.2022.9871217.
Synthetic medical images have an important role to play in developing data-driven medical image processing systems. Using a relatively small amount of patient data to train generative models that can produce an abundance of additional samples could bridge the gap towards big-data in niche medical domains. These generative models are evaluated in terms of the synthetic data they generate using the Visual Turing Test (VTT), Fréchet Inception Distance (FID), and other metrics. However, these are generally interpreted at the group level, and do not measure the artificiality of individual synthetic images. The present study attempts to address the challenge of automatically identifying artificial images that are obviously-artificial-looking, which may be necessary for filtering out poorly constructed synthetic images that might otherwise deteriorate the performance of assimilating systems. Synthetic computed tomography (CT) images from a progressively-grown generative adversarial network (PGGAN) were evaluated with a VTT and their image embeddings were analyzed for correlation with artificiality. Images categorized as obviously-artificial (≥0. 7 probability of being rated as fake) were classified using a battery of algorithms. The top-performing classifier, a support vector machine, exhibited accuracy of 75.5%, sensitivity of 0.743, and specificity of 0.769. This is an encouraging result that suggests a potential approach for validating synthetic medical image datasets. Clinical Relevance - Next-generation medical AI systems for image processing will utilize synthetic images produced by generative models. This paper presents an approach towards verifying artificial image legibility for quality-control before being deployed for these purposes.
合成医学图像在开发数据驱动的医学图像处理系统中具有重要作用。使用相对较少的患者数据来训练生成模型,这些模型可以生成大量额外的样本,可以弥合小众医学领域大数据的差距。这些生成模型是根据它们使用视觉图灵测试 (VTT)、Fréchet inception 距离 (FID) 和其他指标生成的合成数据进行评估的。然而,这些通常是在群体水平上进行解释的,并没有衡量单个合成图像的人工性。本研究试图解决自动识别明显人为的人工图像的挑战,这可能对于过滤掉构造不良的合成图像是必要的,否则这些图像可能会降低同化系统的性能。使用 VTT 评估了来自逐步生长的生成对抗网络 (PGGAN)的合成计算机断层扫描 (CT) 图像,并分析了它们的图像嵌入与人工性的相关性。将被归类为明显人为的图像(被评为假的概率≥0.7)使用一系列算法进行分类。表现最佳的分类器是支持向量机,其准确率为 75.5%,灵敏度为 0.743,特异性为 0.769。这是一个令人鼓舞的结果,表明了验证合成医学图像数据集的潜在方法。临床意义 - 用于图像处理的下一代医学人工智能系统将利用生成模型生成的合成图像。本文提出了一种在将其用于这些目的之前验证人工图像可读性的方法,以进行质量控制。