Li Fei, Wang Zhe, Qu Guoxiang, Song Diping, Yuan Ye, Xu Yang, Gao Kai, Luo Guangwei, Xiao Zegu, Lam Dennis S C, Zhong Hua, Qiao Yu, Zhang Xiulan
Zhongshan Ophthalmic Center, State Key Laboratory of Ophthalmology, Sun Yat-sen University, Guangzhou, China.
SenseTime Group Limited, Hong Kong, China.
BMC Med Imaging. 2018 Oct 4;18(1):35. doi: 10.1186/s12880-018-0273-5.
To develop a deep neural network able to differentiate glaucoma from non-glaucoma visual fields based on visual filed (VF) test results, we collected VF tests from 3 different ophthalmic centers in mainland China.
Visual fields obtained by both Humphrey 30-2 and 24-2 tests were collected. Reliability criteria were established as fixation losses less than 2/13, false positive and false negative rates of less than 15%.
We split a total of 4012 PD images from 1352 patients into two sets, 3712 for training and another 300 for validation. There is no significant difference between left to right ratio (P = 0.6211), while age (P = 0.0022), VFI (P = 0.0001), MD (P = 0.0039) and PSD (P = 0.0001) exhibited obvious statistical differences. On the validation set of 300 VFs, CNN achieves the accuracy of 0.876, while the specificity and sensitivity are 0.826 and 0.932, respectively. For ophthalmologists, the average accuracies are 0.607, 0.585 and 0.626 for resident ophthalmologists, attending ophthalmologists and glaucoma experts, respectively. AGIS and GSS2 achieved accuracy of 0.459 and 0.523 respectively. Three traditional machine learning algorithms, namely support vector machine (SVM), random forest (RF), and k-nearest neighbor (k-NN) were also implemented and evaluated in the experiments, which achieved accuracy of 0.670, 0.644, and 0.591 respectively.
Our algorithm based on CNN has achieved higher accuracy compared to human ophthalmologists and traditional rules (AGIS and GSS2) in differentiation of glaucoma and non-glaucoma VFs.
为了开发一种能够根据视野(VF)测试结果区分青光眼和非青光眼视野的深度神经网络,我们收集了来自中国大陆3个不同眼科中心的VF测试数据。
收集了通过Humphrey 30-2和24-2测试获得的视野。可靠性标准设定为固视丢失少于2/13,假阳性率和假阴性率少于15%。
我们将来自1352名患者的总共4012张视野图像分为两组,3712张用于训练,另外300张用于验证。左右比例之间无显著差异(P = 0.6211),而年龄(P = 0.0022)、视野指数(VFI)(P = 0.0001)、平均缺损(MD)(P = 0.0039)和模式标准差(PSD)(P = 0.0001)表现出明显的统计学差异。在300个视野的验证集上,卷积神经网络(CNN)的准确率达到0.876,而特异性和敏感性分别为0.826和0.932。对于眼科医生,住院眼科医生、主治眼科医生和青光眼专家的平均准确率分别为0.607、0.585和0.626。AGIS和GSS2的准确率分别为0.459和0.523。实验中还实施并评估了三种传统机器学习算法,即支持向量机(SVM)、随机森林(RF)和k近邻(k-NN),它们的准确率分别为0.670、0.644和0.591。
与人类眼科医生和传统规则(AGIS和GSS2)相比,我们基于CNN的算法在区分青光眼和非青光眼视野方面取得了更高的准确率。