Hassan Mohamed, Vakanski Aleksandar, Zhang Boyu, Xian Min
Department of Computer Science, University of Idaho, Idaho Falls, ID 83402, USA.
IEEE Access. 2025;13:82972-82985. doi: 10.1109/ACCESS.2025.3568641. Epub 2025 May 9.
Effective clinical deployment of deep learning models in healthcare demands high generalization performance to ensure accurate diagnosis and treatment planning. In recent years, significant research has focused on improving the generalization of deep learning models by regularizing the sharpness of the loss landscape. Among the optimization approaches that explicitly minimize sharpness, Sharpness-Aware Minimization (SAM) has shown potential in enhancing generalization performance on general domain image datasets. This success has led to the development of several advanced sharpness-based algorithms aimed at addressing the limitations of SAM, such as Adaptive SAM, Surrogate-Gap SAM, Weighted SAM, and Curvature Regularized SAM. These sharpness-based optimizers have shown improvements in model generalization compared to conventional stochastic gradient descent optimizers and their variants on general domain image datasets, but they have not been thoroughly evaluated on medical images. This work provides a review of recent sharpness-based methods for improving the generalization of deep learning networks and evaluates the methods' performance on three medical image datasets, including breast ultrasound, chest X-ray, and colon histopathology images. Our findings indicate that the initial SAM method successfully enhances the generalization of various deep learning models. While Adaptive SAM improves generalization of convolutional neural networks, it fails to do so for vision transformers. Other sharpness-based optimizers, however, do not demonstrate consistent results. The results reveal that contrary to findings in the non-medical domain, SAM is the only recommended sharpness-based optimizer that consistently improves generalization in medical image analysis, and further research is necessary to refine the variants of SAM to enhance generalization performance in this field.
深度学习模型在医疗保健领域的有效临床应用需要高泛化性能,以确保准确的诊断和治疗规划。近年来,大量研究聚焦于通过正则化损失曲面的锐度来提高深度学习模型的泛化能力。在明确最小化锐度的优化方法中,锐度感知最小化(SAM)在提升通用领域图像数据集的泛化性能方面显示出潜力。这一成功促使了几种先进的基于锐度的算法的发展,旨在解决SAM的局限性,如自适应SAM、替代间隙SAM、加权SAM和曲率正则化SAM。与传统的随机梯度下降优化器及其在通用领域图像数据集上的变体相比,这些基于锐度的优化器在模型泛化方面有所改进,但它们尚未在医学图像上得到充分评估。这项工作对最近基于锐度的提高深度学习网络泛化能力的方法进行了综述,并在包括乳腺超声、胸部X光和结肠组织病理学图像在内的三个医学图像数据集上评估了这些方法的性能。我们的研究结果表明,最初的SAM方法成功地增强了各种深度学习模型的泛化能力。虽然自适应SAM提高了卷积神经网络的泛化能力,但对视觉Transformer却未能如此。然而,其他基于锐度的优化器并未显示出一致的结果。结果表明,与非医学领域的研究结果相反,SAM是唯一被推荐的基于锐度的优化器,它能在医学图像分析中持续提高泛化能力,并且有必要进一步研究以改进SAM的变体,以增强该领域的泛化性能。