Suppr超能文献

基于自适应掩码的多模态掩码自动编码器用于白癜风分期分类

Multimodal Masked Autoencoder Based on Adaptive Masking for Vitiligo Stage Classification.

作者信息

Xiang Fan, Li Zhiming, Jiang Shuying, Li Chunying, Li Shuli, Gao Tianwen, He Kaiqiao, Chen Jianru, Zhang Junpeng, Zhang Junran

机构信息

Department of Automation, College of Electrical Engineering, Sichuan University, Chengdu, 610065, China.

Department of Dermatology, Xijing Hospital, Fourth Military Medical University, Xi'an, 710032, China.

出版信息

J Imaging Inform Med. 2025 Apr 29. doi: 10.1007/s10278-025-01521-7.

Abstract

Vitiligo, a prevalent skin condition characterized by depigmentation, presents challenges in staging due to its inherent complexity. Multimodal skin images can provide complementary information, and in this study, the integration of clinical images of vitiligo and those obtained under Wood's lamp is conducive to the classification of vitiligo stages. However, difficulties in annotating multimodal data and the scarcity of multimodal data limit the performance of deep learning models in related classification tasks. To address these issues, a Multimodal Masked Autoencoder (Multi-MAE) based on adaptive masking is proposed in annotating multimodal data and the problem of multimodal data scarcity, and enhances the model's ability to extract characteristics from multimodal data. Specifically, an image reconstruction task is constructed to diminish reliance on annotated multimodal data, and a pre-training strategy is employed to alleviate the scarcity of multimodal data. Experimental results demonstrate that the proposed model achieves a vitiligo stage classification accuracy of 95.48% on a dataset of unlabeled dermatological images, an improvement of 5.16%, 4.51%, 3.87%, 2.58%, 4.51%, 4.51%, 3.87%, and 2.58% over that of MobileNet, DenseNet, VGG, ResNet-50, BEIT, MaskFeat, SimMIM, and MAE, respectively. These results verify the effectiveness of the proposed Multi-MAE model in assessing the stable and active vitiligo stages, making it a suitable clinical aid for evaluating the severity of vitiligo lesions.

摘要

白癜风是一种以色素脱失为特征的常见皮肤病,因其内在复杂性在分期方面存在挑战。多模态皮肤图像可以提供补充信息,在本研究中,白癜风临床图像与伍德灯下获得的图像相结合有助于白癜风分期的分类。然而,多模态数据标注困难以及多模态数据稀缺限制了深度学习模型在相关分类任务中的性能。为解决这些问题,提出了一种基于自适应掩码的多模态掩码自动编码器(Multi-MAE),用于标注多模态数据以及解决多模态数据稀缺问题,并增强模型从多模态数据中提取特征的能力。具体而言,构建图像重建任务以减少对标注多模态数据的依赖,并采用预训练策略来缓解多模态数据的稀缺性。实验结果表明,所提出的模型在未标记皮肤病图像数据集上实现了95.48%的白癜风分期分类准确率,分别比MobileNet、DenseNet、VGG、ResNet-50、BEIT、MaskFeat、SimMIM和MAE提高了5.16%、4.51%、3.87%、2.58%、4.51%、4.51%、3.87%和2.58%。这些结果验证了所提出的Multi-MAE模型在评估白癜风稳定期和进展期方面的有效性,使其成为评估白癜风皮损严重程度的合适临床辅助工具。

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验