Department of Hepatobiliary Surgery, Jintan Affiliated Hospital of Jiangsu University, Changzhou, Jiangsu, 213200, China.
Department of Gastroenterology, The First Affiliated Hospital of Soochow University, # 899 Pinghai St., Suzhou, Jiangsu, 215006, China.
J Imaging Inform Med. 2024 Oct;37(5):2342-2353. doi: 10.1007/s10278-024-01123-9. Epub 2024 Apr 23.
Labelling medical images is an arduous and costly task that necessitates clinical expertise and large numbers of qualified images. Insufficient samples can lead to underfitting during training and poor performance of supervised learning models. In this study, we aim to develop a SimCLR-based semi-supervised learning framework to classify colorectal neoplasia based on the NICE classification. First, the proposed framework was trained under self-supervised learning using a large unlabelled dataset; subsequently, it was fine-tuned on a limited labelled dataset based on the NICE classification. The model was evaluated on an independent dataset and compared with models based on supervised transfer learning and endoscopists using accuracy, Matthew's correlation coefficient (MCC), and Cohen's kappa. Finally, Grad-CAM and t-SNE were applied to visualize the models' interpretations. A ResNet-backboned SimCLR model (accuracy of 0.908, MCC of 0.862, and Cohen's kappa of 0.896) outperformed supervised transfer learning-based models (means: 0.803, 0.698, and 0.742) and junior endoscopists (0.816, 0.724, and 0.863), while performing only slightly worse than senior endoscopists (0.916, 0.875, and 0.944). Moreover, t-SNE showed a better clustering of ternary samples through self-supervised learning in SimCLR than through supervised transfer learning. Compared with traditional supervised learning, semi-supervised learning enables deep learning models to achieve improved performance with limited labelled endoscopic images.
对医学图像进行标注是一项艰巨且昂贵的任务,需要临床专业知识和大量合格的图像。样本不足会导致训练时欠拟合和监督学习模型性能不佳。在本研究中,我们旨在开发一种基于 SimCLR 的半监督学习框架,根据 NICE 分类对结直肠肿瘤进行分类。首先,使用大型未标记数据集在自监督学习下训练提出的框架;然后,根据 NICE 分类在有限的标记数据集上进行微调。该模型在独立数据集上进行评估,并与基于监督迁移学习和内镜医生的模型进行比较,使用准确性、马修相关系数 (MCC) 和 Cohen's kappa。最后,应用 Grad-CAM 和 t-SNE 可视化模型的解释。基于 ResNet 骨干的 SimCLR 模型(准确性为 0.908,MCC 为 0.862,Cohen's kappa 为 0.896)优于基于监督迁移学习的模型(平均值:0.803、0.698 和 0.742)和初级内镜医生(0.816、0.724 和 0.863),而表现仅略逊于高级内镜医生(0.916、0.875 和 0.944)。此外,与监督迁移学习相比,通过自监督学习,t-SNE 在 SimCLR 中对三元样本的聚类效果更好。与传统监督学习相比,半监督学习使深度学习模型能够在有限的标记内镜图像上实现更好的性能。