Jiang Xiaoben, Yang Dawei, Feng Li, Zhu Yu, Wang Mingliang, Feng Yinzhou, Bai Chunxue, Fang Hao
School of Information Science and Technology, East China University of Science and Technology, Shanghai, China.
Department of Pulmonary and Critical Care Medicine, Zhongshan Hospital, Fudan University, Shanghai, China.
Front Med (Lausanne). 2024 May 2;11:1360143. doi: 10.3389/fmed.2024.1360143. eCollection 2024.
Deep learning-based methods can promote and save critical time for the diagnosis of pneumonia from computed tomography (CT) images of the chest, where the methods usually rely on large amounts of labeled data to learn good visual representations. However, medical images are difficult to obtain and need to be labeled by professional radiologists.
To address this issue, a novel contrastive learning model with token projection, namely CoTP, is proposed for improving the diagnostic quality of few-shot chest CT images. Specifically, (1) we utilize solely unlabeled data for fitting CoTP, along with a small number of labeled samples for fine-tuning, (2) we present a new Omicron dataset and modify the data augmentation strategy, i.e., random Poisson noise perturbation for the CT interpretation task, and (3) token projection is utilized to further improve the quality of the global visual representations.
The ResNet50 pre-trained by CoTP attained accuracy (ACC) of 92.35%, sensitivity (SEN) of 92.96%, precision (PRE) of 91.54%, and the area under the receiver-operating characteristics curve (AUC) of 98.90% on the presented Omicron dataset. On the contrary, the ResNet50 without pre-training achieved ACC, SEN, PRE, and AUC of 77.61, 77.90, 76.69, and 85.66%, respectively.
Extensive experiments reveal that a model pre-trained by CoTP greatly outperforms that without pre-training. The CoTP can improve the efficacy of diagnosis and reduce the heavy workload of radiologists for screening of Omicron pneumonia.
基于深度学习的方法可以促进胸部计算机断层扫描(CT)图像中肺炎的诊断并节省关键时间,这些方法通常依赖大量标记数据来学习良好的视觉表示。然而,医学图像难以获取且需要由专业放射科医生进行标记。
为了解决这个问题,提出了一种具有令牌投影的新型对比学习模型,即CoTP,以提高少样本胸部CT图像的诊断质量。具体而言,(1)我们仅使用未标记数据来拟合CoTP,并使用少量标记样本进行微调;(2)我们提出了一个新的奥密克戎数据集并修改了数据增强策略,即用于CT解释任务的随机泊松噪声扰动;(3)利用令牌投影进一步提高全局视觉表示的质量。
在提出的奥密克戎数据集上,由CoTP预训练的ResNet50的准确率(ACC)达到92.35%,灵敏度(SEN)达到9