Marouf Ahmed Al, Mottalib Md Mozaharul, Alhajj Reda, Rokne Jon, Jafarullah Omar
Department of Computer Science, University of Calgary, Calgary, AB T2N 1N4, Canada.
Department of Computer Science and Engineering, Daffodil International University, Dhaka 1341, Bangladesh.
Bioengineering (Basel). 2022 Dec 24;10(1):25. doi: 10.3390/bioengineering10010025.
The eye is generally considered to be the most important sensory organ of humans. Diseases and other degenerative conditions of the eye are therefore of great concern as they affect the function of this vital organ. With proper early diagnosis by experts and with optimal use of medicines and surgical techniques, these diseases or conditions can in many cases be either cured or greatly mitigated. Experts that perform the diagnosis are in high demand and their services are expensive, hence the appropriate identification of the cause of vision problems is either postponed or not done at all such that corrective measures are either not done or done too late. An efficient model to predict eye diseases using machine learning (ML) and ranker-based feature selection (r-FS) methods is therefore proposed which will aid in obtaining a correct diagnosis. The aim of this model is to automatically predict one or more of five common eye diseases namely, Cataracts (CT), Acute Angle-Closure Glaucoma (AACG), Primary Congenital Glaucoma (PCG), Exophthalmos or Bulging Eyes (BE) and Ocular Hypertension (OH). We have used efficient data collection methods, data annotations by professional ophthalmologists, applied five different feature selection methods, two types of data splitting techniques (train-test and stratified k-fold cross validation), and applied nine ML methods for the overall prediction approach. While applying ML methods, we have chosen suitable classic ML methods, such as Decision Tree (DT), Random Forest (RF), Naive Bayes (NB), AdaBoost (AB), Logistic Regression (LR), k-Nearest Neighbour (k-NN), Bagging (Bg), Boosting (BS) and Support Vector Machine (SVM). We have performed a symptomatic analysis of the prominent symptoms of each of the five eye diseases. The results of the analysis and comparison between methods are shown separately. While comparing the methods, we have adopted traditional performance indices, such as accuracy, precision, sensitivity, F1-Score, etc. Finally, SVM outperformed other models obtaining the highest accuracy of 99.11% for 10-fold cross-validation and LR obtained 98.58% for the split ratio of 80:20.
眼睛通常被认为是人类最重要的感官器官。因此,眼部疾病和其他退行性病变备受关注,因为它们会影响这个重要器官的功能。通过专家进行适当的早期诊断,并优化使用药物和手术技术,这些疾病或状况在许多情况下可以治愈或得到极大缓解。进行诊断的专家需求量很大,他们的服务费用高昂,因此对视力问题原因的恰当识别要么被推迟,要么根本不进行,以至于矫正措施要么没有实施,要么实施得太晚。因此,提出了一种使用机器学习(ML)和基于排序器的特征选择(r-FS)方法来预测眼部疾病的有效模型,这将有助于获得正确的诊断。该模型的目的是自动预测五种常见眼部疾病中的一种或多种,即白内障(CT)、急性闭角型青光眼(AACG)、原发性先天性青光眼(PCG)、眼球突出(BE)和高眼压症(OH)。我们使用了高效的数据收集方法,由专业眼科医生进行数据标注,应用了五种不同的特征选择方法、两种数据分割技术(训练-测试和分层k折交叉验证),并应用了九种ML方法进行整体预测。在应用ML方法时,我们选择了合适的经典ML方法,如决策树(DT)、随机森林(RF)、朴素贝叶斯(NB)、AdaBoost(AB)、逻辑回归(LR)、k近邻(k-NN)、装袋法(Bg)、提升法(BS)和支持向量机(SVM)。我们对五种眼部疾病各自的突出症状进行了症状分析。方法之间的分析和比较结果分别列出。在比较这些方法时,我们采用了传统的性能指标,如准确率、精确率、灵敏度、F1分数等。最后,支持向量机的表现优于其他模型,在10折交叉验证中获得了99.11%的最高准确率,逻辑回归在80:20的分割比例下获得了98.58%的准确率。