Zhao Jiani, Wan Cheng, Li Jiajun, Zhang Zhe, Yang Weihua, Li Keran
College of Electronic and Information Engineering /College of Integrated Circuits, Nanjing University of Aeronautics and Astronautics, Nanjing, Jiangsu, 211106, China.
Eye Hospital, Nanjing Medical University, Nanjing, Jiangsu, 210029, China.
Heliyon. 2024 Jul 17;10(14):e34726. doi: 10.1016/j.heliyon.2024.e34726. eCollection 2024 Jul 30.
Cataracts are a leading cause of blindness worldwide, making accurate diagnosis and effective surgical planning critical. However, grading the severity of the lens nucleus is challenging because deep learning (DL) models pretrained using ImageNet perform poorly when applied directly to medical data due to the limited availability of labeled medical images and high interclass similarity. Self-supervised pretraining offers a solution by circumventing the need for cost-intensive data annotations and bridging domain disparities. In this study, to address the challenges of intelligent grading, we proposed a hybrid model called nuclear cataract mask encoder network (NCME-Net), which utilizes self-supervised pretraining for the four-class analysis of nuclear cataract severity. A total of 792 images of nuclear cataracts were categorized into the training set (533 images), the validation set (139 images), and the test set (100 images). NCME-Net achieved a diagnostic accuracy of 91.0 % on the test set, a 5.0 % improvement over the best-performing DL model (ResNet50). Experimental results demonstrate NCME-Net's ability to distinguish between cataract severities, particularly in scenarios with limited samples, making it a valuable tool for intelligently diagnosing cataracts. In addition, the effect of different self-supervised tasks on the model's ability to capture the intrinsic structure of the data was studied. Findings indicate that image restoration tasks significantly enhance semantic information extraction.
白内障是全球失明的主要原因之一,因此准确诊断和有效的手术规划至关重要。然而,对晶状体核的严重程度进行分级具有挑战性,因为直接应用于医学数据时,使用ImageNet预训练的深度学习(DL)模型表现不佳,这是由于标记医学图像的可用性有限以及类间相似度高。自监督预训练通过避免对成本高昂的数据标注的需求并弥合领域差异提供了一种解决方案。在本研究中,为应对智能分级的挑战,我们提出了一种名为核性白内障掩码编码器网络(NCME-Net)的混合模型,该模型利用自监督预训练对核性白内障严重程度进行四类分析。总共792张核性白内障图像被分类为训练集(533张图像)、验证集(139张图像)和测试集(100张图像)。NCME-Net在测试集上实现了91.0%的诊断准确率,比表现最佳的DL模型(ResNet50)提高了5.0%。实验结果证明了NCME-Net区分白内障严重程度的能力,特别是在样本有限的情况下,使其成为智能诊断白内障的有价值工具。此外,还研究了不同自监督任务对模型捕获数据内在结构能力的影响。研究结果表明,图像恢复任务显著增强了语义信息提取。