Zuo Lihan, Wang Zizhou, Wang Yan
School of Computer and Artificial Intelligence, Southwest Jiaotong University, Chengdu 610000, PR China.
Institute of High Performance Computing, Agency for Science, Technology and Research (A*STAR), Singapore 138632, Singapore.
Artif Intell Med. 2025 Apr;162:103091. doi: 10.1016/j.artmed.2025.103091. Epub 2025 Feb 19.
Skin cancer occurs frequently and has become a major contributor to both cancer incidence and mortality. Accurate and timely diagnosis of skin cancer holds the potential to save lives. Deep learning-based methods have demonstrated significant advancements in the screening of skin cancers. However, most current approaches rely on a single input modality for diagnosis, thereby missing valuable complementary information that could enhance accuracy. Although some multimodal methods exist, they often lack adaptability and fail to fully leverage multimodal information. In this paper, we introduce a novel uncertainty-based hybrid fusion strategy for a multimodal learning algorithm aimed at skin cancer diagnosis. Our approach combines three different modalities: clinical images, dermoscopy images, and metadata, to make the final classification. For the fusion of the two image modalities, we employ an intermediate fusion strategy that considers the similarity between clinical and dermoscopy images to extract features containing both complementary and correlated information: we utilize cosine similarity to capture the correlated information and concatenation to integrate the complementary information. In the fusion of image and metadata modalities, we leverage uncertainty to obtain confident late fusion results, allowing our method to adaptively combine the information from different modalities. We conducted comprehensive experiments on a popular publicly available skin disease diagnosis dataset, and the results demonstrate the effectiveness of our proposed method. Our fusion algorithm could enhance the clinical applicability of automated skin lesion classification, offering a more robust and adaptive way to make automatic diagnoses with the help of an uncertainty mechanism. Code is available at https://github.com/Zuo-Lihan/CosCatNet-Adaptive_Fusion_Algorithm.
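The abstract's intermediate fusion combines a cosine-similarity term (correlated information) with concatenation (complementary information). The following minimal sketch illustrates that general idea on plain feature vectors; the function names, the averaging used for the correlated term, and the list-based representation are illustrative assumptions, not the authors' implementation (which is available at the linked repository).

```python
import math

def cosine_similarity(a, b):
    # standard cosine similarity; small epsilon guards against zero vectors
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb + 1e-12)

def fuse_image_features(clin_feat, derm_feat):
    # correlated information: weight the averaged features by how
    # similar the clinical and dermoscopy representations are
    sim = cosine_similarity(clin_feat, derm_feat)
    correlated = [sim * (c + d) / 2 for c, d in zip(clin_feat, derm_feat)]
    # complementary information: simple concatenation of both vectors
    complementary = list(clin_feat) + list(derm_feat)
    return correlated + complementary
```

For two orthogonal feature vectors the correlated part vanishes and only the concatenated complementary part carries signal, which matches the intuition that dissimilar views contribute mostly complementary information.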
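For the late fusion of image and metadata branches, the abstract says uncertainty is used to combine predictions adaptively. One common way to realize this, shown here purely as an assumed sketch (the authors' exact formulation may differ and is in the repository), is to weight each branch's class probabilities by the inverse of its predictive entropy, so the more confident branch dominates.

```python
import math

def entropy(p):
    # Shannon entropy of a probability vector; higher = more uncertain
    return -sum(q * math.log(q) for q in p if q > 0)

def uncertainty_weighted_fusion(prob_image, prob_meta):
    # assign each branch a weight inversely proportional to its entropy,
    # then take the weighted average of the two class distributions
    w_img = 1.0 / (entropy(prob_image) + 1e-8)
    w_meta = 1.0 / (entropy(prob_meta) + 1e-8)
    total = w_img + w_meta
    return [(w_img * pi + w_meta * pm) / total
            for pi, pm in zip(prob_image, prob_meta)]
```

Because the output is a convex combination of two probability distributions, it remains a valid distribution, and a near-uniform (uncertain) branch is automatically down-weighted relative to a confident one.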