Multimodal Masked Autoencoder Based on Adaptive Masking for Vitiligo Stage Classification

Author information

Xiang Fan, Li Zhiming, Jiang Shuying, Li Chunying, Li Shuli, Gao Tianwen, He Kaiqiao, Chen Jianru, Zhang Junpeng, Zhang Junran

Affiliations

Department of Automation, College of Electrical Engineering, Sichuan University, Chengdu, 610065, China.

Department of Dermatology, Xijing Hospital, Fourth Military Medical University, Xi'an, 710032, China.

Publication information

J Imaging Inform Med. 2025 Apr 29. doi: 10.1007/s10278-025-01521-7.

DOI:10.1007/s10278-025-01521-7

PMID:40301294

Abstract

Vitiligo, a prevalent skin condition characterized by depigmentation, is difficult to stage because of its inherent complexity. Multimodal skin images can provide complementary information; in this study, combining clinical images of vitiligo with images obtained under Wood's lamp aids the classification of vitiligo stages. However, the difficulty of annotating multimodal data and the scarcity of such data limit the performance of deep learning models in related classification tasks. To address these issues, a Multimodal Masked Autoencoder (Multi-MAE) based on adaptive masking is proposed, which also enhances the model's ability to extract features from multimodal data. Specifically, an image reconstruction task is constructed to reduce reliance on annotated multimodal data, and a pre-training strategy is employed to alleviate the scarcity of multimodal data. Experimental results show that the proposed model achieves a vitiligo stage classification accuracy of 95.48% on a dataset of unlabeled dermatological images, an improvement of 5.16%, 4.51%, 3.87%, 2.58%, 4.51%, 4.51%, 3.87%, and 2.58% over MobileNet, DenseNet, VGG, ResNet-50, BEIT, MaskFeat, SimMIM, and MAE, respectively. These results verify the effectiveness of the proposed Multi-MAE model in assessing the stable and active stages of vitiligo, making it a suitable clinical aid for evaluating the severity of vitiligo lesions.
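The reconstruction pre-training idea in the abstract can be illustrated with a toy sketch. It is not the authors' implementation: the abstract does not describe how the adaptive masking is computed, so uniform random masking is used here as a stand-in, and the function names (patchify, random_mask, masked_mse), the 8x8 toy images, the patch size, and the 0.75 mask ratio are all illustrative assumptions. The sketch only shows the general masked-autoencoder pattern: patches from the two modalities are pooled, most are hidden, and the loss is measured on the hidden patches alone, so no stage labels are needed.

```python
import random

def patchify(image, patch):
    """Split a square 2D image (list of rows) into flattened patch x patch tiles."""
    n = len(image)
    tiles = []
    for r in range(0, n, patch):
        for c in range(0, n, patch):
            tiles.append([image[r + i][c + j]
                          for i in range(patch) for j in range(patch)])
    return tiles

def random_mask(num_patches, mask_ratio, rng):
    """Choose patch indices to hide (uniform random stand-in, NOT adaptive masking)."""
    num_masked = int(round(num_patches * mask_ratio))
    return set(rng.sample(range(num_patches), num_masked))

def masked_mse(target_patches, predicted_patches, masked):
    """Mean squared error computed only over the hidden patches."""
    total, count = 0.0, 0
    for idx in masked:
        for t, p in zip(target_patches[idx], predicted_patches[idx]):
            total += (t - p) ** 2
            count += 1
    return total / count if count else 0.0

# Toy stand-ins for a clinical image and a Wood's lamp image (assumed 8x8).
rng = random.Random(0)
clinical = [[rng.random() for _ in range(8)] for _ in range(8)]
woods = [[rng.random() for _ in range(8)] for _ in range(8)]

# Pool patches from both modalities into one token sequence.
tokens = patchify(clinical, 4) + patchify(woods, 4)
masked = random_mask(len(tokens), 0.75, rng)
visible = [tokens[i] for i in range(len(tokens)) if i not in masked]

# Placeholder "decoder": predict every patch as the mean of the visible patches.
mean_patch = [sum(col) / len(visible) for col in zip(*visible)]
predicted = [mean_patch for _ in tokens]

loss = masked_mse(tokens, predicted, masked)
print(f"masked {len(masked)} of {len(tokens)} patches, reconstruction MSE = {loss:.4f}")
```

In a real model the placeholder decoder would be a trained network, and the pre-trained encoder would then be fine-tuned on the smaller labeled set for stage classification.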

Similar articles

In-depth study of Wood's lamp examination combined with reflective confocal laser scanning microscopy for the guidance of vitiligo staging and treatment.

J Cosmet Dermatol. 2024 Apr;23(4):1472-1479. doi: 10.1111/jocd.16145. Epub 2023 Dec 29.

BUS-M2AE: Multi-scale Masked Autoencoder for Breast Ultrasound Image Analysis.

Comput Biol Med. 2025 Jun;191:110159. doi: 10.1016/j.compbiomed.2025.110159. Epub 2025 Apr 18.

MMAgentRec, a personalized multi-modal recommendation agent with large language model.

Sci Rep. 2025 Apr 8;15(1):12062. doi: 10.1038/s41598-025-96458-w.

Towards robust multimodal ultrasound classification for liver tumor diagnosis: A generative approach to modality missingness.

Comput Methods Programs Biomed. 2025 Jun;265:108759. doi: 10.1016/j.cmpb.2025.108759. Epub 2025 Mar 30.

Salmon colored fluorescence on Wood's lamp in a pediatric patient with vitiligo: A case report.

J Family Med Prim Care. 2024 Nov;13(11):5381-5384. doi: 10.4103/jfmpc.jfmpc_508_24. Epub 2024 Nov 18.

Three-dimensional semi-supervised lumbar vertebrae region of interest segmentation based on MAE pre-training.

J Xray Sci Technol. 2025 Jan;33(1):270-282. doi: 10.1177/08953996241301685. Epub 2025 Jan 15.

Optimizing vitiligo diagnosis with ResNet and Swin transformer deep learning models: a study on performance and interpretability.

Sci Rep. 2024 Apr 21;14(1):9127. doi: 10.1038/s41598-024-59436-2.

Learning the heterogeneous representation of brain's structure from serial SEM images using a masked autoencoder.

Front Neuroinform. 2023 Jun 8;17:1118419. doi: 10.3389/fninf.2023.1118419. eCollection 2023.

[A multimodal medical image contrastive learning algorithm with domain adaptive denormalization].

Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2023 Jun 25;40(3):482-491. doi: 10.7507/1001-5515.202302050.

Cited by

Brain tumor classification using GAN-augmented data with autoencoders and Swin Transformers.

Front Med (Lausanne). 2025 Aug 22;12:1635796. doi: 10.3389/fmed.2025.1635796. eCollection 2025.
