Fu Wen, Chen Jie, Zhou Li
Institute of Microelectronics of Chinese Academy of Sciences, Beijing, 100029 China.
University of Chinese Academy of Sciences, Beijing, 100049 China.
Biomed Eng Lett. 2024 May 20;14(4):877-889. doi: 10.1007/s13534-024-00383-2. eCollection 2024 Jul.
Due to the difficulty in obtaining clinical samples and the high cost of labeling, rare skin diseases are characterized by data scarcity, making training deep neural networks for classification challenging. In recent years, few-shot learning has emerged as a promising solution, enabling models to recognize unseen disease classes by limited labeled samples. However, most existing methods ignored the fine-grained nature of rare skin diseases, resulting in poor performance when generalizing to highly similar classes. Moreover, the distributions learned from limited labeled data are biased, severely impairing the model's generalizability. This paper proposes a self-supervision distribution calibration network (SS-DCN) to address the above issues. Specifically, SS-DCN adopts a multi-task learning framework during pre-training. By introducing self-supervised tasks to aid in supervised learning, the model can learn more discriminative and transferable visual representations. Furthermore, SS-DCN applied an enhanced distribution calibration (EDC) strategy, which utilizes the statistics of base classes with sufficient samples to calibrate the bias distribution of novel classes with few-shot samples. By generating more samples from the calibrated distribution, EDC can provide sufficient supervision for subsequent classifier training. The proposed method is evaluated on three public skin disease datasets(i.e., ISIC2018, Derm7pt, and SD198), achieving significant performance improvements over state-of-the-art methods.
由于获取临床样本困难且标记成本高昂,罕见皮肤病的特点是数据稀缺,这使得训练用于分类的深度神经网络具有挑战性。近年来,少样本学习作为一种有前景的解决方案出现,使模型能够通过有限的标记样本识别未见的疾病类别。然而,大多数现有方法忽略了罕见皮肤病的细粒度性质,导致在推广到高度相似的类别时性能不佳。此外,从有限标记数据中学到的分布存在偏差,严重损害了模型的泛化能力。本文提出了一种自监督分布校准网络(SS-DCN)来解决上述问题。具体来说,SS-DCN在预训练期间采用多任务学习框架。通过引入自监督任务来辅助监督学习,模型可以学习到更具判别力和可转移的视觉表示。此外,SS-DCN应用了一种增强分布校准(EDC)策略,该策略利用具有足够样本的基础类别的统计信息来校准具有少样本的新类别偏差分布。通过从校准分布中生成更多样本,EDC可以为后续分类器训练提供足够的监督。所提出的方法在三个公共皮肤病数据集(即ISIC2018、Derm7pt和SD198)上进行了评估,与现有方法相比取得了显著的性能提升。