Jin Xiao-Lu, Li Xue-Mei, Liu Tie-Juan, Zhou Ling-Yun
Ocular Motility Disorder Treatment Centre, the First Affiliated Hospital of Harbin Medical University, Harbin 150001, Heilongjiang Province, China.
Int J Ophthalmol. 2025 May 18;18(5):757-764. doi: 10.18240/ijo.2025.05.01. eCollection 2025.
To develop different machine learning models to train and test diplopia images and data generated by the computerized diplopia test.
Diplopia images and data generated by computerized diplopia tests, along with patient medical records, were retrospectively collected from 3244 cases. Diagnostic models were constructed using logistic regression (LR), decision tree (DT), support vector machine (SVM), extreme gradient boosting (XGBoost), and deep learning (DL) algorithms. A total of 2757 diplopia images were randomly selected as training data, while the test dataset contained 487 diplopia images. The optimal diagnostic model was evaluated using test set accuracy, confusion matrix, and precision-recall curve (P-R curve).
The test set accuracy of the LR, SVM, DT, XGBoost, DL (64 categories), and DL (6 binary classifications) algorithms was 0.762, 0.811, 0.818, 0.812, 0.858 and 0.858, respectively. The accuracy in the training set was 0.785, 0.815, 0.998, 0.965, 0.968, and 0.967, respectively. The weighted precision of LR, SVM, DT, XGBoost, DL (64 categories), and DL (6 binary classifications) algorithms was 0.74, 0.77, 0.83, 0.80, 0.85, and 0.85, respectively; weighted recall was 0.76, 0.81, 0.82, 0.81, 0.86, and 0.86, respectively; weighted F1 score was 0.74, 0.79, 0.82, 0.80, 0.85, and 0.85, respectively.
In this study, the 7 machine learning algorithms all achieve automatic diagnosis of extraocular muscle palsy. The DL (64 categories) and DL (6 binary classifications) algorithms have a significant advantage over other machine learning algorithms regarding diagnostic accuracy on the test set, with a high level of consistency with clinical diagnoses made by physicians. Therefore, it can be used as a reference for diagnosis.
开发不同的机器学习模型,用于训练和测试复视图像以及计算机化复视测试生成的数据。
回顾性收集3244例患者的计算机化复视测试生成的复视图像和数据以及患者病历。使用逻辑回归(LR)、决策树(DT)、支持向量机(SVM)、极端梯度提升(XGBoost)和深度学习(DL)算法构建诊断模型。总共随机选择2757幅复视图像作为训练数据,而测试数据集包含487幅复视图像。使用测试集准确率、混淆矩阵和精确率-召回率曲线(P-R曲线)评估最佳诊断模型。
LR、SVM、DT、XGBoost、DL(64类别)和DL(6二元分类)算法的测试集准确率分别为0.762、0.811、0.818、0.812、0.858和0.858。训练集准确率分别为0.785、0.815、0.998、0.965、0.968和0.967。LR、SVM、DT、XGBoost、DL(64类别)和DL(6二元分类)算法的加权精确率分别为0.74、0.77、0.83、0.80、0.85和0.85;加权召回率分别为0.76、0.81、0.82、0.81、0.86和0.86;加权F1分数分别为0.74、0.79、0.82、0.80、0.85和0.85。
在本研究中,7种机器学习算法均实现了眼外肌麻痹的自动诊断。DL(64类别)和DL(6二元分类)算法在测试集诊断准确率方面比其他机器学习算法具有显著优势,与医生做出的临床诊断具有高度一致性。因此,可作为诊断参考。