• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于混合移位窗口多头自注意力和基于SwiGLU的多层感知器的Swin Transformer增强皮肤癌诊断

Enhancing Skin Cancer Diagnosis Using Swin Transformer with Hybrid Shifted Window-Based Multi-head Self-attention and SwiGLU-Based MLP.

作者信息

Pacal Ishak, Alaftekin Melek, Zengul Ferhat Devrim

机构信息

Department of Computer Engineering, Igdir University, 76000, Igdir, Turkey.

Department of Health Services Administration, The University of Alabama at Birmingham, Birmingham, AL, USA.

出版信息

J Imaging Inform Med. 2024 Dec;37(6):3174-3192. doi: 10.1007/s10278-024-01140-8. Epub 2024 Jun 5.

DOI:10.1007/s10278-024-01140-8
PMID:38839675
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11612041/
Abstract

Skin cancer is one of the most frequently occurring cancers worldwide, and early detection is crucial for effective treatment. Dermatologists often face challenges such as heavy data demands, potential human errors, and strict time limits, which can negatively affect diagnostic outcomes. Deep learning-based diagnostic systems offer quick, accurate testing and enhanced research capabilities, providing significant support to dermatologists. In this study, we enhanced the Swin Transformer architecture by implementing the hybrid shifted window-based multi-head self-attention (HSW-MSA) in place of the conventional shifted window-based multi-head self-attention (SW-MSA). This adjustment enables the model to more efficiently process areas of skin cancer overlap, capture finer details, and manage long-range dependencies, while maintaining memory usage and computational efficiency during training. Additionally, the study replaces the standard multi-layer perceptron (MLP) in the Swin Transformer with a SwiGLU-based MLP, an upgraded version of the gated linear unit (GLU) module, to achieve higher accuracy, faster training speeds, and better parameter efficiency. The modified Swin model-base was evaluated using the publicly accessible ISIC 2019 skin dataset with eight classes and was compared against popular convolutional neural networks (CNNs) and cutting-edge vision transformer (ViT) models. In an exhaustive assessment on the unseen test dataset, the proposed Swin-Base model demonstrated exceptional performance, achieving an accuracy of 89.36%, a recall of 85.13%, a precision of 88.22%, and an F1-score of 86.65%, surpassing all previously reported research and deep learning models documented in the literature.

摘要

皮肤癌是全球最常见的癌症之一,早期检测对于有效治疗至关重要。皮肤科医生经常面临数据需求大、可能出现人为错误以及严格的时间限制等挑战,这些都会对诊断结果产生负面影响。基于深度学习的诊断系统提供快速、准确的检测以及增强的研究能力,为皮肤科医生提供了重要支持。在本研究中,我们通过实现基于混合移位窗口的多头自注意力机制(HSW-MSA)来替代传统的基于移位窗口的多头自注意力机制(SW-MSA),对Swin Transformer架构进行了改进。这种调整使模型能够更有效地处理皮肤癌重叠区域,捕捉更精细的细节,并管理长程依赖关系,同时在训练过程中保持内存使用和计算效率。此外,该研究用基于SwiGLU的多层感知器(MLP)取代了Swin Transformer中的标准多层感知器,SwiGLU是门控线性单元(GLU)模块的升级版,以实现更高的准确率、更快的训练速度和更好的参数效率。使用具有八个类别的公开可用的ISIC 2019皮肤数据集对改进后的Swin模型基础进行了评估,并与流行的卷积神经网络(CNN)和前沿的视觉Transformer(ViT)模型进行了比较。在对未见测试数据集的详尽评估中,所提出的Swin-Base模型表现出色,准确率达到89.36%,召回率为85.13%,精确率为88.22%,F1分数为86.65%,超过了文献中所有先前报道的研究和深度学习模型。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/26c034bdfe74/10278_2024_1140_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/cc211c529022/10278_2024_1140_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/a89b5aad7897/10278_2024_1140_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/c047e3235f06/10278_2024_1140_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/178efe9bad6e/10278_2024_1140_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/c37505f0efa0/10278_2024_1140_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/f921ace24736/10278_2024_1140_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/26c034bdfe74/10278_2024_1140_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/cc211c529022/10278_2024_1140_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/a89b5aad7897/10278_2024_1140_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/c047e3235f06/10278_2024_1140_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/178efe9bad6e/10278_2024_1140_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/c37505f0efa0/10278_2024_1140_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/f921ace24736/10278_2024_1140_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4c40/11612041/26c034bdfe74/10278_2024_1140_Fig7_HTML.jpg

相似文献

1
Enhancing Skin Cancer Diagnosis Using Swin Transformer with Hybrid Shifted Window-Based Multi-head Self-attention and SwiGLU-Based MLP.使用基于混合移位窗口多头自注意力和基于SwiGLU的多层感知器的Swin Transformer增强皮肤癌诊断
J Imaging Inform Med. 2024 Dec;37(6):3174-3192. doi: 10.1007/s10278-024-01140-8. Epub 2024 Jun 5.
2
Enhanced Pneumonia Detection in Chest X-Rays Using Hybrid Convolutional and Vision Transformer Networks.使用混合卷积和视觉Transformer网络增强胸部X光片中的肺炎检测
Curr Med Imaging. 2025;21:e15734056326685. doi: 10.2174/0115734056326685250101113959.
3
SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.SwinCross:用于 PET/CT 图像中头颈部肿瘤分割的跨模态 Swin 变换器。
Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.
4
Swin-GA-RF: genetic algorithm-based Swin Transformer and random forest for enhancing cervical cancer classification.Swin-GA-RF:基于遗传算法的Swin Transformer和随机森林用于增强宫颈癌分类
Front Oncol. 2024 Jul 19;14:1392301. doi: 10.3389/fonc.2024.1392301. eCollection 2024.
5
An explainable deep learning model for diabetic foot ulcer classification using swin transformer and efficient multi-scale attention-driven network.一种基于Swin Transformer和高效多尺度注意力驱动网络的用于糖尿病足溃疡分类的可解释深度学习模型。
Sci Rep. 2025 Feb 3;15(1):4057. doi: 10.1038/s41598-025-87519-1.
6
A deep learning approach to direct immunofluorescence pattern recognition in autoimmune bullous diseases.深度学习方法在自身免疫性大疱性疾病中的直接免疫荧光模式识别。
Br J Dermatol. 2024 Jul 16;191(2):261-266. doi: 10.1093/bjd/ljae142.
7
Enhancing Melanoma Diagnosis with Advanced Deep Learning Models Focusing on Vision Transformer, Swin Transformer, and ConvNeXt.利用聚焦于视觉Transformer、Swin Transformer和ConvNeXt的先进深度学习模型增强黑色素瘤诊断
Dermatopathology (Basel). 2024 Aug 15;11(3):239-252. doi: 10.3390/dermatopathology11030026.
8
Optimizing vitiligo diagnosis with ResNet and Swin transformer deep learning models: a study on performance and interpretability.使用ResNet和Swin变压器深度学习模型优化白癜风诊断:性能与可解释性研究
Sci Rep. 2024 Apr 21;14(1):9127. doi: 10.1038/s41598-024-59436-2.
9
SkinEHDLF a hybrid deep learning approach for accurate skin cancer classification in complex systems.SkinEHDLF:一种用于复杂系统中皮肤癌精确分类的混合深度学习方法。
Sci Rep. 2025 Apr 28;15(1):14913. doi: 10.1038/s41598-025-98205-7.
10
Classification of Mobile-Based Oral Cancer Images Using the Vision Transformer and the Swin Transformer.使用视觉Transformer和Swin Transformer对基于移动设备的口腔癌图像进行分类
Cancers (Basel). 2024 Feb 29;16(5):987. doi: 10.3390/cancers16050987.

引用本文的文献

1
Semi-supervised GAN with hybrid regularization and evolutionary hyperparameter tuning for accurate melanoma detection.具有混合正则化和进化超参数调整的半监督生成对抗网络用于准确的黑色素瘤检测。
Sci Rep. 2025 Aug 30;15(1):31977. doi: 10.1038/s41598-025-17756-x.
2
Harnessing infrared thermography and multi-convolutional neural networks for early breast cancer detection.利用红外热成像和多卷积神经网络进行早期乳腺癌检测。
Sci Rep. 2025 Jul 28;15(1):27464. doi: 10.1038/s41598-025-09330-2.
3
Hierarchical Swin Transformer Ensemble with Explainable AI for Robust and Decentralized Breast Cancer Diagnosis.

本文引用的文献

1
A Comparative Analysis of Deep Learning-Based Approaches for Classifying Dental Implants Decision Support System.基于深度学习的方法在分类牙种植体决策支持系统中的比较分析。
J Imaging Inform Med. 2024 Oct;37(5):2559-2580. doi: 10.1007/s10278-024-01086-x. Epub 2024 Apr 2.
2
DVFNet: A deep feature fusion-based model for the multiclassification of skin cancer utilizing dermoscopy images.DVFNet:一种基于深度特征融合的利用皮肤镜图像进行皮肤癌多分类的模型。
PLoS One. 2024 Mar 20;19(3):e0297667. doi: 10.1371/journal.pone.0297667. eCollection 2024.
3
A robust deep learning model for the classification of dental implant brands.
用于稳健且分散式乳腺癌诊断的具有可解释人工智能的分层Swin变压器集成模型
Bioengineering (Basel). 2025 Jun 13;12(6):651. doi: 10.3390/bioengineering12060651.
4
SkinEHDLF a hybrid deep learning approach for accurate skin cancer classification in complex systems.SkinEHDLF:一种用于复杂系统中皮肤癌精确分类的混合深度学习方法。
Sci Rep. 2025 Apr 28;15(1):14913. doi: 10.1038/s41598-025-98205-7.
5
An intelligent framework for skin cancer detection and classification using fusion of Squeeze-Excitation-DenseNet with Metaheuristic-driven ensemble deep learning models.一种使用挤压激励密集网络(Squeeze-Excitation-DenseNet)与元启发式驱动的集成深度学习模型融合进行皮肤癌检测和分类的智能框架。
Sci Rep. 2025 Mar 3;15(1):7425. doi: 10.1038/s41598-025-92293-1.
6
A robust deep learning framework for multiclass skin cancer classification.一种用于多类皮肤癌分类的强大深度学习框架。
Sci Rep. 2025 Feb 10;15(1):4938. doi: 10.1038/s41598-025-89230-7.
7
Comparison of deep transfer learning models for classification of cervical cancer from pap smear images.用于从巴氏涂片图像中对宫颈癌进行分类的深度迁移学习模型比较
Sci Rep. 2025 Jan 31;15(1):3945. doi: 10.1038/s41598-024-74531-0.
8
Addressing Challenges in Skin Cancer Diagnosis: A Convolutional Swin Transformer Approach.应对皮肤癌诊断中的挑战:一种卷积Swin Transformer方法。
J Imaging Inform Med. 2025 Jun;38(3):1755-1775. doi: 10.1007/s10278-024-01290-9. Epub 2024 Oct 22.
9
Skin cancer classification leveraging multi-directional compact convolutional neural network ensembles and gabor wavelets.利用多方向紧致卷积神经网络集成和 Gabor 小波进行皮肤癌分类。
Sci Rep. 2024 Sep 4;14(1):20637. doi: 10.1038/s41598-024-69954-8.
10
Gray-Scale Extraction of Bone Features from Chest Radiographs Based on Deep Learning Technique for Personal Identification and Classification in Forensic Medicine.基于深度学习技术从胸部X光片中提取骨骼特征用于法医学中的个人识别和分类的灰度提取
Diagnostics (Basel). 2024 Aug 15;14(16):1778. doi: 10.3390/diagnostics14161778.
一种用于牙科种植体品牌分类的强大深度学习模型。
J Stomatol Oral Maxillofac Surg. 2024 Sep;125(5S1):101818. doi: 10.1016/j.jormas.2024.101818. Epub 2024 Mar 8.
4
Biosensors for melanoma skin cancer diagnostics.用于黑色素瘤皮肤癌诊断的生物传感器。
Biosens Bioelectron. 2024 Apr 15;250:116045. doi: 10.1016/j.bios.2024.116045. Epub 2024 Jan 20.
5
Artificial intelligence for skin cancer detection and classification for clinical environment: a systematic review.用于临床环境中皮肤癌检测和分类的人工智能:一项系统综述。
Front Med (Lausanne). 2024 Jan 8;10:1305954. doi: 10.3389/fmed.2023.1305954. eCollection 2023.
6
LeaNet: Lightweight U-shaped architecture for high-performance skin cancer image segmentation.LeaNet:用于高性能皮肤癌图像分割的轻量级 U 形架构。
Comput Biol Med. 2024 Feb;169:107919. doi: 10.1016/j.compbiomed.2024.107919. Epub 2024 Jan 1.
7
Detection and Classification of Melanoma Skin Cancer Using Image Processing Technique.基于图像处理技术的黑色素瘤皮肤癌检测与分类
Diagnostics (Basel). 2023 Oct 26;13(21):3313. doi: 10.3390/diagnostics13213313.
8
MSRNet: Multiclass Skin Lesion Recognition Using Additional Residual Block Based Fine-Tuned Deep Models Information Fusion and Best Feature Selection.MSRNet:基于额外残差块的微调深度模型、信息融合和最佳特征选择的多类皮肤病变识别
Diagnostics (Basel). 2023 Sep 26;13(19):3063. doi: 10.3390/diagnostics13193063.
9
SkinNet-INIO: Multiclass Skin Lesion Localization and Classification Using Fusion-Assisted Deep Neural Networks and Improved Nature-Inspired Optimization Algorithm.SkinNet-INIO:使用融合辅助深度神经网络和改进的自然启发优化算法的多类皮肤病变定位与分类
Diagnostics (Basel). 2023 Sep 6;13(18):2869. doi: 10.3390/diagnostics13182869.
10
A novel framework of multiclass skin lesion recognition from dermoscopic images using deep learning and explainable AI.一种使用深度学习和可解释人工智能从皮肤镜图像中进行多类皮肤病变识别的新型框架。
Front Oncol. 2023 Jun 6;13:1151257. doi: 10.3389/fonc.2023.1151257. eCollection 2023.