Fernandez Escamez Carlos Salvador, Martin Giral Elena, Perucho Martinez Susana, Toledano Fernandez Nicolas
Ophthalmology Department, Hospital de Fuenlabrada, Madrid 28942, Spain.
Doctorate Program in Health Sciences, Universidad Rey Juan Carlos, Alcorcon 28922, Madrid, Spain.
Int J Ophthalmol. 2021 Mar 18;14(3):393-398. doi: 10.18240/ijo.2021.03.10. eCollection 2021.
To develop a classifier for differentiating between healthy and early stage glaucoma eyes based on peripapillary retinal nerve fiber layer (RNFL) thicknesses measured with optical coherence tomography (OCT), using machine learning algorithms with a high interpretability.
Ninety patients with early glaucoma and 85 healthy eyes were included. Early glaucoma eyes showed a visual field (VF) defect with mean deviation >-6.00 dB and characteristic glaucomatous morphology. RNFL thickness in every quadrant, clock-hour and average thickness were used to feed machine learning algorithms. Cluster analysis was conducted to detect and exclude outliers. Tree gradient boosting algorithms were used to calculate the importance of parameters on the classifier and to check the relation between their values and its impact on the classifier. Parameters with the lowest importance were excluded and a weighted decision tree analysis was applied to obtain an interpretable classifier. Area under the ROC curve (AUC), accuracy and generalization ability of the model were estimated using cross validation techniques.
Average and 7 clock-hour RNFL thicknesses were the parameters with the highest importance. Correlation between parameter values and impact on classification displayed a stepped pattern for average thickness. Decision tree model revealed that average thickness lower than 82 µm was a high predictor for early glaucoma. Model scores had AUC of 0.953 (95%CI: 0.903-0998), with an accuracy of 89%.
Gradient boosting methods provide accurate and highly interpretable classifiers to discriminate between early glaucoma and healthy eyes. Average and 7-hour RNFL thicknesses have the best discriminant power.
基于光学相干断层扫描(OCT)测量的视乳头周围视网膜神经纤维层(RNFL)厚度,使用具有高可解释性的机器学习算法,开发一种用于区分健康眼睛和早期青光眼眼睛的分类器。
纳入90例早期青光眼患者和85只健康眼睛。早期青光眼眼睛表现出平均偏差>-6.00 dB的视野(VF)缺损和典型的青光眼形态。每个象限、钟点的RNFL厚度以及平均厚度用于输入机器学习算法。进行聚类分析以检测和排除异常值。使用树梯度提升算法计算参数对分类器的重要性,并检查其值与对分类器影响之间的关系。排除重要性最低的参数,并应用加权决策树分析以获得可解释的分类器。使用交叉验证技术估计模型的ROC曲线下面积(AUC)、准确性和泛化能力。
平均厚度和7个钟点的RNFL厚度是重要性最高的参数。参数值与对分类的影响之间的相关性显示平均厚度呈阶梯模式。决策树模型显示,平均厚度低于82 µm是早期青光眼的高预测指标。模型评分的AUC为0.953(95%CI:0.903-0.998),准确率为89%。
梯度提升方法提供了准确且高度可解释的分类器,用于区分早期青光眼和健康眼睛。平均厚度和7个钟点的RNFL厚度具有最佳判别能力。