IEEE J Biomed Health Inform. 2024 Jul;28(7):4132-4144. doi: 10.1109/JBHI.2024.3386197. Epub 2024 Jul 2.
In the field of pathology, the scarcity of certain diseases and the difficulty of annotating images hinder the development of large, high-quality datasets, which in turn affects the advancement of deep learning-assisted diagnostics. Few-shot learning has demonstrated unique advantages in modeling tasks with limited data, yet explorations of this method in the field of pathology remain in the early stages. To address this issue, we present a dual-channel prototype network (DCPN), a novel few-shot learning approach for efficiently classifying pathology images with limited data. The DCPN leverages self-supervised learning to extend the pyramid vision transformer (PVT) to few-shot classification tasks and combines it with a convolutional neural network to construct a dual-channel network for extracting multi-scale, high-precision pathological features, thereby substantially enhancing the generalizability of prototype representations. Additionally, we design a soft voting classifier based on multi-scale features to further augment the discriminative power of the model in complex pathology image classification tasks. We constructed three few-shot classification tasks with varying degrees of domain shift using three publicly available pathological datasets-CRCTP, NCTCRC, and LC25000-to emulate real-world clinical scenarios. The results demonstrated that the DCPN outperformed the prototypical network across all metrics, achieving the highest accuracies in same-domain tasks-70.86% for 1-shot, 82.57% for 5-shot, and 85.2% for 10-shot setups-corresponding to improvements of 5.51%, 5.72%, and 6.81%, respectively, over the prototypical network. Notably, in the same-domain 10-shot setting, the accuracy of the DCPN (85.2%) surpassed that of the PVT-based supervised learning model (85.15%), confirming its potential to diagnose rare diseases within few-shot learning frameworks.
在病理学领域,某些疾病的稀缺性和图像注释的困难阻碍了大型、高质量数据集的发展,进而影响了深度学习辅助诊断的进展。Few-shot learning 在数据有限的建模任务中表现出了独特的优势,但这种方法在病理学领域的探索仍处于早期阶段。为了解决这个问题,我们提出了一种双通道原型网络(DCPN),这是一种用于有效分类有限数据的病理学图像的新型 Few-shot learning 方法。DCPN 利用自监督学习将金字塔视觉转换器(PVT)扩展到 Few-shot 分类任务,并将其与卷积神经网络相结合,构建一个双通道网络,用于提取多尺度、高精度的病理特征,从而大大增强原型表示的泛化能力。此外,我们设计了一种基于多尺度特征的软投票分类器,以进一步提高模型在复杂病理学图像分类任务中的判别能力。我们使用三个公开的病理学数据集-CRCTP、NCTCRC 和 LC25000-构建了三个具有不同程度领域转移的 Few-shot 分类任务,以模拟真实的临床场景。结果表明,DCPN 在所有指标上都优于原型网络,在同领域任务中的表现最佳-1 -shot 准确率为 70.86%,5-shot 准确率为 82.57%,10-shot 准确率为 85.2%,分别比原型网络提高了 5.51%、5.72%和 6.81%。值得注意的是,在同领域的 10-shot 环境中,DCPN(85.2%)的准确率超过了基于 PVT 的监督学习模型(85.15%),证实了其在 Few-shot learning 框架内诊断罕见疾病的潜力。