Park Sanghyeon, Go Seokmin, Kim Seonhyo, Shim Jaeho
Helix Animal Medical Center, Seoul 06546, Republic of Korea.
Nowon N Animal Medical Center, Seoul 01704, Republic of Korea.
Animals (Basel). 2025 May 4;15(9):1327. doi: 10.3390/ani15091327.
Cataracts are a prevalent cause of vision loss in dogs, and timely diagnosis is essential for effective treatment. This study aimed to develop and evaluate deep learning models to automatically classify canine cataracts from ocular ultrasound images. A dataset of 3155 ultrasound images (comprising 1329 No cataract, 614 Cortical, 1033 Mature, and 179 Hypermature cases) was used to train and validate four widely used deep learning models (AlexNet, EfficientNetB3, ResNet50, and DenseNet161). Data augmentation and normalization techniques were applied to address category imbalance. DenseNet161 demonstrated the best performance, achieving a test accuracy of 92.03% and an F1-score of 0.8744. The confusion matrix revealed that the model attained the highest accuracy for the No cataract category (99.0%), followed by Cortical (90.3%) and Mature (86.5%) cataracts, while Hypermature cataracts were classified with lower accuracy (78.6%). Receiver Operating Characteristic (ROC) curve analysis confirmed strong discriminative ability, with an area under the curve (AUC) of 0.99. Visual interpretation using Gradient-weighted Class Activation Mapping indicated that the model effectively focused on clinically relevant regions. This deep learning-based classification framework shows significant potential for assisting veterinarians in diagnosing cataracts, thereby improving clinical decision-making in veterinary ophthalmology.
白内障是犬类视力丧失的常见原因,及时诊断对于有效治疗至关重要。本研究旨在开发和评估深度学习模型,以便从眼部超声图像中自动对犬类白内障进行分类。使用一个包含3155张超声图像的数据集(包括1329例无白内障、614例皮质性、1033例成熟性和179例过熟性病例)来训练和验证四种广泛使用的深度学习模型(AlexNet、EfficientNetB3、ResNet50和DenseNet161)。应用数据增强和归一化技术来解决类别不平衡问题。DenseNet161表现出最佳性能,测试准确率达到92.03%,F1分数为0.8744。混淆矩阵显示,该模型对无白内障类别的准确率最高(99.0%),其次是皮质性白内障(90.3%)和成熟性白内障(86.5%),而过熟性白内障的分类准确率较低(78.6%)。受试者工作特征(ROC)曲线分析证实了其强大的判别能力,曲线下面积(AUC)为0.99。使用梯度加权类激活映射的视觉解释表明,该模型有效地聚焦于临床相关区域。这种基于深度学习的分类框架在协助兽医诊断白内障方面显示出巨大潜力,从而改善兽医眼科的临床决策。