Chuntranapaporn Suthicha, Choontanom Raveewan, Srimanan Worapot
Ophthalmology, Phramongkutklao Hospital, Bangkok, THA.
Cureus. 2024 Nov 19;16(11):e73985. doi: 10.7759/cureus.73985. eCollection 2024 Nov.
This study primarily aimed to compare the accuracy of three convolutional neural network (CNN) models in measuring the four positions of ocular duction. Further, it secondarily aimed to compare the accuracy of each CNN model in the training dataset versus the ophthalmologist measurements.
This study included 526 subjects aged over 18 who visited the ophthalmology outpatient department. Ocular images were captured using mobile phones in various gaze positions and stored anonymously as JPEG files. Ocular duction was measured by assessing corneal light reflex deviation from the central cornea. Ductions were classified into 30, 60, and 90 prism diopters (PD) and full ductions from the primary position. Three CNN models, MobileNet, ResNet, and EfficientNet, were used to classify ocular duction. Their predictive ability was evaluated using the area under the receiver operating characteristic (AUROC) curve. The dataset was divided into the training (2,001 images), evaluation (213 images), and testing (190 images) groups, which were reconstructed using the routine follow-up data of volunteers at the Ophthalmology Department of Phramongkutklao Hospital between February 2023 and June 2023.
To evaluate the data, the MobileNet_V3_Large, ResNet101, and EfficientNet_B5 models were utilized to measure duction angles with the receiver operating characteristic (ROC) curves. The training times for MobileNet, ResNet, and EfficientNet were 5.54, 9.56, and 26.39 minutes, respectively. In the testing phase, MobileNet, ResNet, and EfficientNet were used to measure each duction position: 30 PD with corresponding ROC curve values of 0.77, 0.5, and 0.58; 60 PD with ROC curve values of 0.71, 0.83, and 0.81; 90 PD with ROC curve values of 0.7, 0.73, and 0.81; and full duction with ROC curve values of 0.91, 0.93, and 0.94, respectively. Analysis of variance revealed no significant difference in the mean AUROC curves among the models, yielding a p-value of 0.936. MobileNet has the narrowest confidence intervals for average prediction accuracy across three CNN models.
The three CNN models did not significantly differ in terms of efficacy in detecting various duction positions. However, MobileNet stands out, with a narrower confidence interval and shorter training time, which indicates its potential application.
本研究主要旨在比较三种卷积神经网络(CNN)模型在测量眼球运动四个位置时的准确性。此外,其次要目的是比较每个CNN模型在训练数据集中与眼科医生测量结果的准确性。
本研究纳入了526名年龄超过18岁的眼科门诊患者。使用手机在不同注视位置拍摄眼部图像,并以JPEG文件形式匿名存储。通过评估角膜光反射偏离角膜中心来测量眼球运动。眼球运动分为30、60和90棱镜度(PD)以及从初始位置的全眼球运动。使用三种CNN模型,即MobileNet、ResNet和EfficientNet,对眼球运动进行分类。使用受试者工作特征(AUROC)曲线下面积评估它们的预测能力。数据集分为训练组(2001张图像)、评估组(213张图像)和测试组(190张图像),这些数据是利用2023年2月至2023年6月期间诗里蒙坤考医院眼科志愿者的常规随访数据重建的。
为了评估数据,使用MobileNet_V3_Large、ResNet101和EfficientNet_B5模型通过受试者工作特征(ROC)曲线测量眼球运动角度。MobileNet、ResNet和EfficientNet的训练时间分别为5.54、9.56和26.39分钟。在测试阶段,使用MobileNet、ResNet和EfficientNet测量每个眼球运动位置:30 PD时对应的ROC曲线值分别为0.77、0.5和0.58;60 PD时ROC曲线值分别为0.71、0.83和0.81;90 PD时ROC曲线值分别为0.7、0.73和0.81;全眼球运动时ROC曲线值分别为0.91、0.93和0.94。方差分析显示各模型之间平均AUROC曲线无显著差异,p值为0.936。在三个CNN模型中,MobileNet的平均预测准确性的置信区间最窄。
三种CNN模型在检测不同眼球运动位置的效能方面没有显著差异。然而,MobileNet表现突出,其置信区间更窄且训练时间更短,这表明了它的潜在应用价值。