Gao Yuan, Ma Chenbin, Guo Lishuang, Zhang Xuxiang, Ji Xunming
Department of Biomedical Engineering, School of Biological Science and Medical Engineering, Beihang University, Beijing 100191, China.
Shen Yuan Honors College, Beihang University, Beijing 100191, China.
Bioengineering (Basel). 2023 Aug 18;10(8):978. doi: 10.3390/bioengineering10080978.
Retinopathy, a prevalent disease causing visual impairment and sometimes blindness, affects many individuals in the population. Early detection and treatment of the disease can be facilitated by monitoring the retina using fundus imaging. Nonetheless, the limited availability of fundus images and the imbalanced datasets warrant the development of more precise and efficient algorithms to enhance diagnostic performance. This study presents a novel online knowledge distillation framework, called CLRD, which employs a collaborative learning approach for detecting retinopathy. By combining student models with varying scales and architectures, the CLRD framework extracts crucial pathological information from fundus images. The transfer of knowledge is accomplished by developing distortion information particular to fundus images, thereby enhancing model invariance. Our selection of student models includes the Transformer-based BEiT and the CNN-based ConvNeXt, which achieve accuracies of 98.77% and 96.88%, respectively. Furthermore, the proposed method has 5.69-23.13%, 5.37-23.73%, 5.74-23.17%, 11.24-45.21%, and 5.87-24.96% higher accuracy, precision, recall, specificity, and F1 score, respectively, compared to the advanced visual model. The results of our study indicate that the CLRD framework can effectively minimize generalization errors without compromising independent predictions made by student models, offering novel directions for further investigations into detecting retinopathy.
视网膜病变是一种常见的导致视力损害甚至失明的疾病,影响着许多人群。通过眼底成像监测视网膜有助于疾病的早期检测和治疗。然而,眼底图像的有限可用性和不平衡数据集促使人们开发更精确、高效的算法来提高诊断性能。本研究提出了一种名为CLRD的新型在线知识蒸馏框架,该框架采用协作学习方法来检测视网膜病变。通过将不同尺度和架构的学生模型相结合,CLRD框架从眼底图像中提取关键的病理信息。知识转移是通过开发特定于眼底图像的失真信息来实现的,从而增强模型的不变性。我们选择的学生模型包括基于Transformer的BEiT和基于CNN的ConvNeXt,它们的准确率分别达到了98.77%和96.88%。此外,与先进的视觉模型相比,该方法的准确率、精确率、召回率、特异性和F1分数分别提高了5.69-23.13%、5.37-23.73%、5.74-23.17%、11.24-45.21%和5.87-24.96%。我们的研究结果表明,CLRD框架可以在不影响学生模型独立预测的情况下有效减少泛化误差,为进一步研究视网膜病变检测提供了新的方向。