Lerch Luc, Huber Lukas S, Kamath Amith, Pöllinger Alexander, Pahud de Mortanges Aurélie, Obmann Verena C, Dammann Florian, Senn Walter, Reyes Mauricio
Medical Image Analysis Group, ARTORG Centre for Biomedical Research, University of Bern, Bern, Switzerland.
Computational Neuroscience Group, Department of Physiology, University of Bern, Bern, Switzerland.
Front Radiol. 2024 Dec 19;4:1420545. doi: 10.3389/fradi.2024.1420545. eCollection 2024.
Successful performance of deep learning models for medical image analysis is highly dependent on the quality of the images being analysed. Factors like differences in imaging equipment and calibration, as well as patient-specific factors such as movements or biological variability (e.g., tissue density), lead to a large variability in the quality of obtained medical images. Consequently, robustness against the presence of noise is a crucial factor for the application of deep learning models in clinical contexts.
We evaluate the effect of various data augmentation strategies on the robustness of a ResNet-18 trained to classify breast ultrasound images and benchmark the performance against trained human radiologists. Additionally, we introduce , a novel, biologically inspired data augmentation strategy for medical image analysis. DreamOn is based on a conditional generative adversarial network (GAN) to generate REM-dream-inspired interpolations of training images.
We find that while available data augmentation approaches substantially improve robustness compared to models trained without any data augmentation, radiologists outperform models on noisy images. Using DreamOn data augmentation, we obtain a substantial improvement in robustness in the high noise regime.
We show that REM-dream-inspired conditional GAN-based data augmentation is a promising approach to improving deep learning model robustness against noise perturbations in medical imaging. Additionally, we highlight a gap in robustness between deep learning models and human experts, emphasizing the imperative for ongoing developments in AI to match human diagnostic expertise.
深度学习模型在医学图像分析中的成功应用高度依赖于所分析图像的质量。成像设备和校准的差异等因素,以及患者特定因素,如运动或生物变异性(如组织密度),导致所获取医学图像的质量存在很大差异。因此,针对噪声的鲁棒性是深度学习模型在临床环境中应用的关键因素。
我们评估了各种数据增强策略对训练用于对乳腺超声图像进行分类的ResNet-18模型鲁棒性的影响,并将其性能与训练有素的人类放射科医生进行基准测试。此外,我们引入了一种新颖的、受生物学启发的医学图像分析数据增强策略。DreamOn基于条件生成对抗网络(GAN)来生成受快速眼动睡眠梦境启发的训练图像插值。
我们发现,虽然与未进行任何数据增强训练的模型相比,现有的数据增强方法显著提高了鲁棒性,但在有噪声的图像上,放射科医生的表现优于模型。使用DreamOn数据增强,我们在高噪声环境下的鲁棒性有了显著提高。
我们表明,受快速眼动睡眠梦境启发的基于条件GAN的数据增强是一种有前途的方法,可提高深度学习模型在医学成像中对噪声干扰的鲁棒性。此外,我们强调了深度学习模型与人类专家在鲁棒性方面的差距,强调了人工智能持续发展以匹配人类诊断专业知识的必要性。