Müller Sarah, Koch Lisa M, Lensch Hendrik P A, Berens Philipp
Hertie Institute for AI in Brain Health, Faculty of Medicine, University of Tübingen, Tübingen, Germany; Tübingen AI Center, Tübingen, Germany.
Hertie Institute for AI in Brain Health, Faculty of Medicine, University of Tübingen, Tübingen, Germany; Tübingen AI Center, Tübingen, Germany; Department of Diabetes, Endocrinology, Nutritional Medicine and Metabolism UDEM, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland.
Med Image Anal. 2025 Oct;105:103628. doi: 10.1016/j.media.2025.103628. Epub 2025 Jun 6.
Retinal fundus images play a crucial role in the early detection of eye diseases. However, the impact of technical factors on these images can pose challenges for reliable AI applications in ophthalmology. For example, large fundus cohorts are often confounded by factors such as camera type, bearing the risk of learning shortcuts rather than the causal relationships behind the image generation process. Here, we introduce a population model for retinal fundus images that effectively disentangles patient attributes from camera effects, enabling controllable and highly realistic image generation. To achieve this, we propose a disentanglement loss based on distance correlation. Through qualitative and quantitative analyses, we show that our models encode desired information in disentangled subspaces and enable controllable image generation based on the learned subspaces, demonstrating the effectiveness of our disentanglement loss. The project code is publicly available at https://github.com/berenslab/disentangling-retinal-images.
眼底图像在眼部疾病的早期检测中起着至关重要的作用。然而,技术因素对这些图像的影响可能给眼科领域可靠的人工智能应用带来挑战。例如,大型眼底图像队列常常受到相机类型等因素的干扰,存在学习捷径而非图像生成过程背后因果关系的风险。在此,我们引入一种用于眼底图像的总体模型,该模型能有效区分患者属性与相机效果,实现可控且高度逼真的图像生成。为实现这一目标,我们提出一种基于距离相关性的解缠损失。通过定性和定量分析,我们表明我们的模型在解缠子空间中编码所需信息,并基于所学子空间实现可控图像生成,证明了解缠损失的有效性。该项目代码可在https://github.com/berenslab/disentangling-retinal-images上公开获取。