Khanal Bidur, Shrestha Prashant, Amgain Sanskar, Khanal Bishesh, Bhattarai Binod, Linte Cristian A
Annu Int Conf IEEE Eng Med Biol Soc. 2024 Jul;2024:1-6. doi: 10.1109/EMBC53108.2024.10782929.
Label noise in medical image classification datasets significantly hampers the training of supervised deep learning methods, undermining their generalizability. The test performance of a model tends to decrease as the label noise rate increases. In recent years, several methods have been proposed to mitigate the impact of label noise in medical image classification and to enhance model robustness. Predominantly, these works have employed CNN-based architectures as the backbone of their classifiers for feature extraction. More recently, however, Vision Transformer (ViT)-based backbones have been replacing CNNs, demonstrating improved performance and a greater ability to learn generalizable features, especially on large datasets. Nevertheless, no prior work has rigorously investigated how transformer-based backbones handle the impact of label noise in medical image classification. In this paper, we investigate the architectural robustness of ViT against label noise and compare it to that of CNNs. We use two medical image classification datasets, COVID-DU-Ex and NCT-CRC-HE-100K, both corrupted by injecting label noise at various rates. Additionally, we show that pretraining is crucial for ensuring ViT's improved robustness against label noise in supervised training.
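The abstract's corruption protocol, injecting label noise at various rates, is not specified further here; a minimal sketch of one common variant (symmetric noise, where a chosen fraction of labels is flipped uniformly to a different class) could look as follows. The function name `inject_label_noise` and its signature are illustrative assumptions, not the authors' code.

```python
import numpy as np

def inject_label_noise(labels, noise_rate, num_classes, seed=0):
    """Symmetric label noise: corrupt a fraction `noise_rate` of the labels,
    replacing each selected label with a different, uniformly chosen class."""
    rng = np.random.default_rng(seed)
    noisy = np.asarray(labels).copy()
    n = len(noisy)
    # pick which samples to corrupt, without replacement
    flip_idx = rng.choice(n, size=int(round(noise_rate * n)), replace=False)
    for i in flip_idx:
        # draw from the other classes so the label is guaranteed to change
        choices = [c for c in range(num_classes) if c != noisy[i]]
        noisy[i] = rng.choice(choices)
    return noisy
```

Rounding `noise_rate * n` means the realized corruption fraction matches the nominal rate exactly on evenly divisible dataset sizes; asymmetric (class-conditional) noise would instead flip labels according to a class-to-class confusion matrix.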