Mariottoni Eduardo B, Jammal Alessandro A, Berchuck Samuel I, Shigueoka Leonardo S, Tavares Ivan M, Medeiros Felipe A
Vision, Imaging and Performance (VIP) Laboratory, Department of Ophthalmology, Duke Eye Center, Duke University, 2351 Erwin Rd, Durham, NC, 27701, USA.
Department of Ophthalmology, Federal University of São Paulo, São Paulo, Brazil.
Sci Rep. 2021 Jan 18;11(1):1752. doi: 10.1038/s41598-021-80993-3.
The current lack of consensus for diagnosing glaucoma makes it difficult to develop diagnostic tests derived from deep learning (DL) algorithms. In the present study, we propose an objective definition of glaucomatous optic neuropathy (GON) using clearly defined parameters from optical coherence tomography and standard automated perimetry. We then use the proposed objective definition as reference standard to develop a DL algorithm to detect GON on fundus photos. A DL algorithm was trained to detect GON on fundus photos, using the proposed objective definition as reference standard. The performance was evaluated on an independent test sample with sensitivity, specificity, area under the receiver operating characteristic curve (AUC), and likelihood ratios (LR). The test sample had 2118 fundus photos from 585 eyes of 405 individuals. The AUC to discriminate between GON and normal was 0.92 with sensitivity of 77% at 95% specificity. LRs indicated that the DL algorithm provided large changes in the post-test probability of disease for the majority of eyes. A DL algorithm to evaluate fundus photos had high performance to discriminate GON from normal. The newly proposed objective definition of GON used as reference standard may increase the comparability of diagnostic studies of glaucoma across devices and populations.
目前在青光眼诊断方面缺乏共识,这使得难以开发基于深度学习(DL)算法的诊断测试。在本研究中,我们使用光学相干断层扫描和标准自动视野检查中明确定义的参数,提出了青光眼性视神经病变(GON)的客观定义。然后,我们将所提出的客观定义用作参考标准,开发一种DL算法以在眼底照片上检测GON。使用所提出的客观定义作为参考标准,训练了一种DL算法以在眼底照片上检测GON。在一个独立测试样本上评估了该算法的性能,评估指标包括敏感性、特异性、受试者操作特征曲线下面积(AUC)和似然比(LR)。该测试样本包含来自405名个体的585只眼睛的2118张眼底照片。区分GON和正常情况的AUC为0.92,在特异性为95%时敏感性为77%。似然比表明,对于大多数眼睛,DL算法在疾病的测试后概率方面提供了很大变化。一种评估眼底照片的DL算法在区分GON和正常情况方面具有高性能。新提出的用作参考标准的GON客观定义可能会提高跨设备和人群的青光眼诊断研究的可比性。