Tong Nuo, Hui Ying, Gou Shui-Ping, Chen Ling-Xi, Wang Xiang-Hong, Chen Shuo-Hua, Li Jing, Li Xiao-Shuai, Wu Yun-Tao, Wu Shou-Ling, Wang Zhen-Chang, Sun Jing, Lv Han
Key Laboratory of Intelligent Perception and Image Understanding of the Ministry of Education, School of Artificial Intelligence, Xidian University, Xi'an, 710071, China.
Guangzhou Institute of Technology, Xidian University, Guangzhou, 510555, China.
Mil Med Res. 2025 Aug 6;12(1):47. doi: 10.1186/s40779-025-00630-2.
Brain volume measurement serves as a critical approach for assessing brain health status. Considering the close biological connection between the eyes and brain, this study aims to investigate the feasibility of estimating brain volume through retinal fundus imaging integrated with clinical metadata, and to offer a cost-effective approach for assessing brain health.
Based on clinical information, retinal fundus images, and neuroimaging data derived from a multicenter, population-based cohort study, the KaiLuan Study, we proposed a cross-modal correlation representation (CMCR) network to elucidate the intricate co-degenerative relationships between the eyes and brain for 755 subjects. Specifically, individual clinical information, which has been followed up for as long as 12 years, was encoded as a prompt to enhance the accuracy of brain volume estimation. Independent internal validation and external validation were performed to assess the robustness of the proposed model. Root mean square error (RMSE), peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM) metrics were employed to quantitatively evaluate the quality of synthetic brain images derived from retinal imaging data.
The proposed framework yielded average RMSE, PSNR, and SSIM values of 98.23, 35.78 dB, and 0.64, respectively, which significantly outperformed 5 other methods: multi-channel Variational Autoencoder (mcVAE), Pixel-to-Pixel (Pixel2pixel), transformer-based U-Net (TransUNet), multi-scale transformer network (MT-Net), and residual vision transformer (ResViT). The two- (2D) and three-dimensional (3D) visualization results showed that the shape and texture of the synthetic brain images generated by the proposed method most closely resembled those of actual brain images. Thus, the CMCR framework accurately captured the latent structural correlations between the fundus and the brain. The average difference between predicted and actual brain volumes was 61.36 cm, with a relative error of 4.54%. When all of the clinical information (including age and sex, daily habits, cardiovascular factors, metabolic factors, and inflammatory factors) was encoded, the difference was decreased to 53.89 cm, with a relative error of 3.98%. Based on the synthesized brain MR images from retinal fundus images, the volumes of brain tissues could be estimated with high accuracy.
This study provides an innovative, accurate, and cost-effective approach to characterize brain health status through readily accessible retinal fundus images.
NCT05453877 ( https://clinicaltrials.gov/ ).
脑容量测量是评估脑健康状况的关键方法。鉴于眼睛与大脑之间密切的生物学联系,本研究旨在探讨通过结合临床元数据的视网膜眼底成像来估计脑容量的可行性,并提供一种经济有效的脑健康评估方法。
基于来自多中心、基于人群的队列研究——开滦研究的临床信息、视网膜眼底图像和神经影像数据,我们提出了一种跨模态相关表示(CMCR)网络,以阐明755名受试者眼睛与大脑之间复杂的共同退化关系。具体而言,长达12年随访的个体临床信息被编码为一个提示,以提高脑容量估计的准确性。进行了独立的内部验证和外部验证,以评估所提出模型的稳健性。采用均方根误差(RMSE)、峰值信噪比(PSNR)和结构相似性指数测量(SSIM)指标来定量评估从视网膜成像数据生成的合成脑图像的质量。
所提出的框架产生的平均RMSE、PSNR和SSIM值分别为98.23、35.78 dB和0.64,显著优于其他5种方法:多通道变分自编码器(mcVAE)、像素到像素(Pixel2pixel)、基于变压器的U-Net(TransUNet)、多尺度变压器网络(MT-Net)和残差视觉变压器(ResViT)。二维(2D)和三维(3D)可视化结果表明,所提出方法生成的合成脑图像的形状和纹理与实际脑图像最为相似。因此,CMCR框架准确地捕捉了眼底与大脑之间潜在的结构相关性。预测脑容量与实际脑容量的平均差异为61.36 cm,相对误差为4.54%。当所有临床信息(包括年龄和性别、日常习惯、心血管因素、代谢因素和炎症因素)都被编码时,差异降至53.89 cm,相对误差为3.98%。基于从视网膜眼底图像合成的脑磁共振图像,可以高精度地估计脑组织的体积。
本研究提供了一种创新、准确且经济有效的方法,通过易于获取的视网膜眼底图像来表征脑健康状况。
NCT05453877(https://clinicaltrials.gov/)