Ilanchezian Indu, Boreiko Valentyn, Kühlewein Laura, Huang Ziwei, Seçkin Ayhan Murat, Hein Matthias, Koch Lisa, Berens Philipp
Hertie Institute for AI in Brain Health, University of Tübingen, Tübingen, Germany.
Tübingen AI Center, Tübingen, Germany.
PLOS Digit Health. 2025 May 15;4(5):e0000853. doi: 10.1371/journal.pdig.0000853. eCollection 2025 May.
Counterfactual reasoning is often used by humans in clinical settings. For imaging based specialties such as ophthalmology, it would be beneficial to have an AI model that can create counterfactual images, illustrating answers to questions like "If the subject had had diabetic retinopathy, how would the fundus image have looked?". Such an AI model could aid in training of clinicians or in patient education through visuals that answer counterfactual queries. We used large-scale retinal image datasets containing color fundus photography (CFP) and optical coherence tomography (OCT) images to train ordinary and adversarially robust classifiers that classify healthy and disease categories. In addition, we trained an unconditional diffusion model to generate diverse retinal images including ones with disease lesions. During sampling, we then combined the diffusion model with classifier guidance to achieve realistic and meaningful counterfactual images maintaining the subject's retinal image structure. We found that our method generated counterfactuals by introducing or removing the necessary disease-related features. We conducted an expert study to validate that generated counterfactuals are realistic and clinically meaningful. Generated color fundus images were indistinguishable from real images and were shown to contain clinically meaningful lesions. Generated OCT images appeared realistic, but could be identified by experts with higher than chance probability. This shows that combining diffusion models with classifier guidance can achieve realistic and meaningful counterfactuals even for high-resolution medical images such as CFP images. Such images could be used for patient education or training of medical professionals.
反事实推理在临床环境中经常被人类使用。对于眼科等基于成像的专业领域而言,拥有一个能够创建反事实图像的人工智能模型会大有裨益,这些图像可以说明诸如“如果受试者患有糖尿病性视网膜病变,眼底图像会是什么样子?”这类问题的答案。这样的人工智能模型可以通过回答反事实查询的视觉效果,辅助临床医生的培训或患者教育。我们使用了包含彩色眼底摄影(CFP)和光学相干断层扫描(OCT)图像的大规模视网膜图像数据集,来训练对健康和疾病类别进行分类的普通分类器和对抗鲁棒分类器。此外,我们训练了一个无条件扩散模型,以生成包括带有疾病病变的各种视网膜图像。在采样过程中,我们将扩散模型与分类器引导相结合,以生成逼真且有意义的反事实图像,同时保持受试者视网膜图像的结构。我们发现,我们的方法通过引入或去除必要的疾病相关特征来生成反事实图像。我们进行了一项专家研究,以验证生成的反事实图像是逼真且具有临床意义的。生成的彩色眼底图像与真实图像难以区分,并且显示包含具有临床意义的病变。生成的OCT图像看起来很逼真,但专家识别的概率高于随机概率。这表明,即使对于CFP图像等高分辨率医学图像,将扩散模型与分类器引导相结合也可以实现逼真且有意义的反事实图像。这样的图像可用于患者教育或医学专业人员的培训。