Yang Guang, Luo Suhuai, Greer Peter
School of Information and Physical Sciences, The University of Newcastle, Callaghan, NSW 2308, Australia.
School of Information and Physical Sciences, College of Engineering, Science and Environment, The University of Newcastle, Callaghan, NSW 2308, Australia.
Sensors (Basel). 2025 Apr 15;25(8):2479. doi: 10.3390/s25082479.
Skin cancer is a significant global health concern, with melanoma being the most dangerous form, responsible for the majority of skin cancer-related deaths. Early detection of skin cancer is critical, as it can drastically improve survival rates. While deep learning models have achieved impressive results in skin cancer classification, there remain challenges in accurately distinguishing between benign and malignant lesions. In this study, we introduce a novel multi-scale attention-based performance booster inspired by the Vision Transformer (ViT) architecture, which enhances the accuracy of both ViT and convolutional neural network (CNN) models. By leveraging attention maps to identify discriminative regions within skin lesion images, our method improves the models' focus on diagnostically relevant areas. Additionally, we employ ensemble learning techniques to combine the outputs of several deep learning models using majority voting. Our skin cancer classifier, consisting of ViT and EfficientNet models, achieved a classification accuracy of 95.05% on the ISIC2018 dataset, outperforming individual models. The results demonstrate the effectiveness of integrating attention-based multi-scale learning and ensemble methods in skin cancer classification.
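The ensemble step described above combines the class predictions of several models by majority voting. A minimal sketch of that voting scheme is below; the model names, class labels, and predictions are illustrative placeholders, not values from the paper.

```python
# Minimal sketch of a majority-voting ensemble, as described in the abstract.
# Model names and per-image predictions below are hypothetical examples.
from collections import Counter

def majority_vote(predictions):
    """Combine per-model class predictions by majority vote.

    predictions: a list with one entry per model; each entry is a list of
    predicted class labels, aligned by sample index across models.
    Returns the per-sample label that received the most votes.
    """
    n_samples = len(predictions[0])
    ensemble = []
    for i in range(n_samples):
        votes = Counter(model_preds[i] for model_preds in predictions)
        ensemble.append(votes.most_common(1)[0][0])
    return ensemble

# Three hypothetical classifiers (e.g. a ViT and two EfficientNet variants)
# each predicting an ISIC2018-style lesion class per image.
vit    = ["MEL", "NV", "BCC", "NV"]
eff_b4 = ["MEL", "NV", "NV",  "NV"]
eff_b5 = ["NV",  "NV", "BCC", "MEL"]

print(majority_vote([vit, eff_b4, eff_b5]))  # ['MEL', 'NV', 'BCC', 'NV']
```

With an odd number of models, ties across two classes are still possible when three or more classes split the votes; `Counter.most_common` then breaks the tie by insertion order, and a production system would typically fall back to averaged class probabilities instead.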