Kim Hong Kyu, Yoo Tae Keun
Department of Ophthalmology, Dankook University Hospital, Dankook University College of Medicine, Cheonan, South Korea.
Prof. Kim Eye Center, Cheonan, Chungcheongnam-do, South Korea.
Int Ophthalmol. 2025 Mar 18;45(1):111. doi: 10.1007/s10792-025-03500-x.
This meta-analysis evaluated the diagnostic performance of oculomics approaches, including deep learning, machine learning, and logistic regression models, in detecting major mental disorders using retinal imaging.
A systematic review identified 11 studies for inclusion. Study quality was assessed using the QUADAS-2 tool, revealing a high risk of bias, particularly in patient selection and index test design. Pooled sensitivity and specificity were estimated using random-effects models, and diagnostic performance was evaluated through a summary receiver operating characteristic curve.
The analysis included 13 diagnostic models across 11 studies, covering major depressive disorder, bipolar disorder, schizophrenia, obsessive-compulsive disorder, and autism spectrum disorder using color fundus photography, and optical coherence tomography (OCT), and OCT angiography. The pooled sensitivity was 0.89 (95% CI: 0.78-0.94), and specificity was 0.87 (95% CI: 0.74-0.95). The pooled area under the curve was 0.904, indicating high diagnostic accuracy. However, all studies exhibited a high risk of bias, primarily due to case-control study designs, lack of external validation, and selection bias in 77% of studies. Some models showed signs of overfitting, likely due to small sample sizes, insufficient validation, or dataset limitations. Additionally, no distinct retinal patterns specific to mental disorders were identified.
While oculomics demonstrates potential for detecting mental disorders through retinal imaging, significant methodological limitations, including high bias, overfitting risks, and the absence of disease-specific retinal biomarkers, limit its current clinical applicability. Future research should focus on large-scale, externally validated studies with prospective designs to establish reliable retinal markers for psychiatric diagnosis.
本荟萃分析评估了眼科学方法(包括深度学习、机器学习和逻辑回归模型)在使用视网膜成像检测主要精神障碍方面的诊断性能。
一项系统评价确定了11项纳入研究。使用QUADAS-2工具评估研究质量,结果显示存在高偏倚风险,尤其是在患者选择和指标测试设计方面。使用随机效应模型估计合并敏感性和特异性,并通过汇总的受试者工作特征曲线评估诊断性能。
该分析纳入了11项研究中的13种诊断模型,涵盖了使用彩色眼底摄影、光学相干断层扫描(OCT)和OCT血管造影术检测重度抑郁症、双相情感障碍、精神分裂症、强迫症和自闭症谱系障碍。合并敏感性为0.89(95%CI:0.78 - 0.94),特异性为0.87(95%CI:0.74 - 0.95)。曲线下合并面积为0.904,表明诊断准确性高。然而,所有研究均表现出高偏倚风险,主要原因是病例对照研究设计、缺乏外部验证以及77%的研究存在选择偏倚。一些模型显示出过度拟合的迹象,可能是由于样本量小、验证不足或数据集限制。此外,未发现特定于精神障碍的独特视网膜模式。
虽然眼科学通过视网膜成像检测精神障碍具有潜力,但显著的方法学局限性,包括高偏倚、过度拟合风险以及缺乏疾病特异性视网膜生物标志物,限制了其目前的临床适用性。未来的研究应侧重于大规模、具有前瞻性设计且经过外部验证的研究,以建立用于精神疾病诊断的可靠视网膜标志物。