Toure Alousseyni, Haman Ismael Adji, Benbakreti Samir, Roumane Ahmed, Benbakreti Soumia, Benouis Mohamed
Department of Specialty, National High School of Telecommunication and ICT, Oran, Algeria.
University of Djillali Liabes, Laboratory of Mathematic, Sidi Bel Abbes, Algeria.
J Clin Ultrasound. 2025 May 1. doi: 10.1002/jcu.24002.
Computer-aided diagnosis using deep neural networks allows for the analysis and processing of images or videos of different pathologies, providing valuable reference data to physicians for the diagnosis or screening of conditions such as skin cancer.
In this work, we highlight the contribution of Convolutional Neural Networks, pre-trained models, and Vision Transformer architectures in the classification of skin melanoma. The experimental aspect will therefore involve the contribution of the classical CNN, as well as models inspired by this CNN, namely, Inception V3, ResNet 50, AlexNet, and EfficientNet in addition to the hybrid architecture.
The conducted experiments entailed the adjustment of multiple hyperparameters, leading to the development of an architecture that achieved optimal results. Additionally, employing a hybrid architecture not only facilitated the amalgamation of the strengths from two models (the top performing pretrained ResNet50 model with the Vision Transformer) but also led to enhanced accuracy. After training the dataset, the proposed models have contributed to progressively improving the results, eventually achieving a classification rate of 95.53% for the hybrid ResNet50-ViT model.
The aim of this research is to equip clinicians with a robust tool for melanoma diagnosis by leveraging the strengths of two models within the ResNet50-ViT hybrid framework.
使用深度神经网络的计算机辅助诊断能够对不同病理的图像或视频进行分析和处理,为医生诊断或筛查皮肤癌等病症提供有价值的参考数据。
在这项工作中,我们重点介绍卷积神经网络、预训练模型和视觉Transformer架构在皮肤黑色素瘤分类中的作用。因此,实验方面将涉及经典卷积神经网络的作用,以及受此卷积神经网络启发的模型,即除混合架构外的Inception V3、ResNet 50、AlexNet和EfficientNet。
所进行的实验需要调整多个超参数,从而开发出一种取得最佳结果的架构。此外,采用混合架构不仅便于融合两种模型(表现最佳的预训练ResNet50模型与视觉Transformer)的优势,还提高了准确率。在对数据集进行训练后,所提出的模型逐步改进了结果,最终混合ResNet50-ViT模型的分类率达到了95.53%。
本研究的目的是通过利用ResNet50-ViT混合框架内两种模型的优势,为临床医生提供一种强大的黑色素瘤诊断工具。