LG AI Research, 07796, Seoul, Republic of Korea.
KAIST, Kim Jaechul Graduate School of AI, 34141, Daejeon, Republic of Korea.
Sci Rep. 2024 Oct 5;14(1):23199. doi: 10.1038/s41598-024-73695-z.
Deep neural networks are increasingly used in medical imaging for tasks such as pathological classification, but they face challenges due to the scarcity of high-quality, expert-labeled training data. Recent efforts have utilized pre-trained contrastive image-text models like CLIP, adapting them for medical use by fine-tuning the model with chest X-ray images and corresponding reports for zero-shot pathology classification, thus eliminating the need for pathology-specific annotations. However, most studies continue to use the same contrastive learning objectives as in the general domain, overlooking the multi-labeled nature of medical image-report pairs. In this paper, we propose a new fine-tuning strategy that includes positive-pair loss relaxation and random sentence sampling. We aim to improve the performance of zero-shot pathology classification without relying on external knowledge. Our method can be applied to any pre-trained contrastive image-text encoder and easily transferred to out-of-domain datasets without further training, as it does not use external data. Our approach consistently improves overall zero-shot pathology classification across four chest X-ray datasets and three pre-trained models, with an average macro AUROC increase of 4.3%. Additionally, our method outperforms the state-of-the-art and marginally surpasses board-certified radiologists in zero-shot classification for the five competition pathologies in the CheXpert dataset.
深度神经网络在医学影像领域的应用越来越广泛,可用于病理分类等任务,但由于高质量、专家标注训练数据的稀缺,它们面临着挑战。最近的研究利用了预训练的对比图像-文本模型,如 CLIP,通过用胸部 X 光图像和相应的报告对模型进行微调,实现零样本病理学分类,从而无需进行特定于病理学的注释,将其应用于医学领域。然而,大多数研究仍然使用与通用领域相同的对比学习目标,忽略了医学图像-报告对的多标签性质。在本文中,我们提出了一种新的微调策略,包括正样本对损失松弛和随机句子采样。我们旨在在不依赖外部知识的情况下提高零样本病理学分类的性能。我们的方法可以应用于任何预训练的对比图像-文本编码器,并可以轻松地转移到无领域数据集,而无需进一步训练,因为它不使用外部数据。我们的方法在四个胸部 X 光数据集和三个预训练模型上均提高了整体零样本病理学分类的性能,平均宏观 AUROC 提高了 4.3%。此外,我们的方法在 CheXpert 数据集的五个竞赛病理学的零样本分类中优于最新技术水平,略优于董事会认证的放射科医生。