Oh Sejong, Park Yuli, Cho Kyong Jin, Kim Seong Jae
Software Science, College of Software Convergence, Jukjeon Campus, Dankook University, Yongin 16890, Korea.
Department of Ophthalmology, College of Medicine, Dankook University, 119, Dandae-ro, Dongnam-gu, Cheonan-si, Chungnam 31116, Korea.
Diagnostics (Basel). 2021 Mar 13;11(3):510. doi: 10.3390/diagnostics11030510.
The aim is to develop a machine learning prediction model for the diagnosis of glaucoma and an explanation system for a specific prediction. Clinical data of the patients based on a visual field test, a retinal nerve fiber layer optical coherence tomography (RNFL OCT) test, a general examination including an intraocular pressure (IOP) measurement, and fundus photography were provided for the feature selection process. Five selected features (variables) were used to develop a machine learning prediction model. The support vector machine, C5.0, random forest, and XGboost algorithms were tested for the prediction model. The performance of the prediction models was tested with 10-fold cross-validation. Statistical charts, such as gauge, radar, and Shapley Additive Explanations (SHAP), were used to explain the prediction case. All four models achieved similarly high diagnostic performance, with accuracy values ranging from 0.903 to 0.947. The XGboost model is the best model with an accuracy of 0.947, sensitivity of 0.941, specificity of 0.950, and AUC of 0.945. Three statistical charts were established to explain the prediction based on the characteristics of the XGboost model. Higher diagnostic performance was achieved with the XGboost model. These three statistical charts can help us understand why the machine learning model produces a specific prediction result. This may be the first attempt to apply "explainable artificial intelligence" to eye disease diagnosis.
目的是开发一种用于青光眼诊断的机器学习预测模型以及一个针对特定预测的解释系统。在特征选择过程中,提供了基于视野测试、视网膜神经纤维层光学相干断层扫描(RNFL OCT)测试、包括眼压(IOP)测量的全身检查以及眼底摄影的患者临床数据。使用五个选定的特征(变量)来开发机器学习预测模型。对支持向量机、C5.0、随机森林和XGboost算法进行了预测模型测试。预测模型的性能通过10折交叉验证进行测试。使用统计图表,如仪表盘图、雷达图和夏普利值加法解释(SHAP),来解释预测情况。所有四个模型都取得了相似的高诊断性能,准确率值在0.903至0.947之间。XGboost模型是最佳模型,准确率为0.947,灵敏度为0.941,特异性为0.950,曲线下面积(AUC)为0.945。基于XGboost模型的特征建立了三个统计图表来解释预测。XGboost模型实现了更高的诊断性能。这三个统计图表可以帮助我们理解为什么机器学习模型会产生特定的预测结果。这可能是将“可解释人工智能”应用于眼病诊断的首次尝试。