• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于深度学习的多模态融合模型,用于利用智能手机采集的临床图像和元数据进行皮肤病变诊断。

A deep learning based multimodal fusion model for skin lesion diagnosis using smartphone collected clinical images and metadata.

作者信息

Ou Chubin, Zhou Sitong, Yang Ronghua, Jiang Weili, He Haoyang, Gan Wenjun, Chen Wentao, Qin Xinchi, Luo Wei, Pi Xiaobing, Li Jiehua

机构信息

Clinical Research Institute, The First People's Hospital of Foshan, Foshan, China.

R/D Center, Visionwise Medical Technology, Foshan, China.

出版信息

Front Surg. 2022 Oct 4;9:1029991. doi: 10.3389/fsurg.2022.1029991. eCollection 2022.

DOI:10.3389/fsurg.2022.1029991
PMID:36268206
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9577400/
Abstract

INTRODUCTION

Skin cancer is one of the most common types of cancer. An accessible tool to the public can help screening for malign lesion. We aimed to develop a deep learning model to classify skin lesion using clinical images and meta information collected from smartphones.

METHODS

A deep neural network was developed with two encoders for extracting information from image data and metadata. A multimodal fusion module with intra-modality self-attention and inter-modality cross-attention was proposed to effectively combine image features and meta features. The model was trained on tested on a public dataset and compared with other state-of-the-art methods using five-fold cross-validation.

RESULTS

Including metadata is shown to significantly improve a model's performance. Our model outperformed other metadata fusion methods in terms of accuracy, balanced accuracy and area under the receiver-operating characteristic curve, with an averaged value of 0.768±0.022, 0.775±0.022 and 0.947±0.007.

CONCLUSION

A deep learning model using smartphone collected images and metadata for skin lesion diagnosis was successfully developed. The proposed model showed promising performance and could be a potential tool for skin cancer screening.

摘要

引言

皮肤癌是最常见的癌症类型之一。一种公众可使用的工具有助于筛查恶性病变。我们旨在开发一种深度学习模型,利用从智能手机收集的临床图像和元信息对皮肤病变进行分类。

方法

开发了一种深度神经网络,带有两个编码器,用于从图像数据和元数据中提取信息。提出了一种具有模态内自注意力和模态间交叉注意力的多模态融合模块,以有效结合图像特征和元特征。该模型在一个公共数据集上进行训练和测试,并使用五折交叉验证与其他先进方法进行比较。

结果

结果表明,纳入元数据可显著提高模型性能。我们的模型在准确率、平衡准确率和受试者工作特征曲线下面积方面优于其他元数据融合方法,其平均值分别为0.768±0.022、0.775±0.022和0.947±0.007。

结论

成功开发了一种利用智能手机收集的图像和元数据进行皮肤病变诊断的深度学习模型。所提出的模型表现出良好的性能,可能成为皮肤癌筛查的潜在工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/758a2ad25359/fsurg-09-1029991-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/9f2eb42e288e/fsurg-09-1029991-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/af6ae2ab8f72/fsurg-09-1029991-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/71aca4d52ef9/fsurg-09-1029991-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/758a2ad25359/fsurg-09-1029991-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/9f2eb42e288e/fsurg-09-1029991-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/af6ae2ab8f72/fsurg-09-1029991-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/71aca4d52ef9/fsurg-09-1029991-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/832a/9577400/758a2ad25359/fsurg-09-1029991-g004.jpg

相似文献

1
A deep learning based multimodal fusion model for skin lesion diagnosis using smartphone collected clinical images and metadata.一种基于深度学习的多模态融合模型,用于利用智能手机采集的临床图像和元数据进行皮肤病变诊断。
Front Surg. 2022 Oct 4;9:1029991. doi: 10.3389/fsurg.2022.1029991. eCollection 2022.
2
A multimodal transformer to fuse images and metadata for skin disease classification.一种用于融合图像和元数据以进行皮肤病分类的多模态变压器。
Vis Comput. 2022 May 5:1-13. doi: 10.1007/s00371-022-02492-4.
3
Interpretability-Based Multimodal Convolutional Neural Networks for Skin Lesion Diagnosis.基于可解释性的多模态卷积神经网络用于皮肤损伤诊断。
IEEE Trans Cybern. 2022 Dec;52(12):12623-12637. doi: 10.1109/TCYB.2021.3069920. Epub 2022 Nov 18.
4
An Attention-Based Mechanism to Combine Images and Metadata in Deep Learning Models Applied to Skin Cancer Classification.基于注意力机制的深度学习模型融合图像与元数据在皮肤癌分类中的应用
IEEE J Biomed Health Inform. 2021 Sep;25(9):3554-3563. doi: 10.1109/JBHI.2021.3062002. Epub 2021 Sep 3.
5
Impact of metadata in multimodal classification of bone tumours.多模态骨肿瘤分类中元数据的影响。
BMC Musculoskelet Disord. 2024 Oct 19;25(1):822. doi: 10.1186/s12891-024-07934-9.
6
Computer-Aided Diagnosis of Spinal Tuberculosis From CT Images Based on Deep Learning With Multimodal Feature Fusion.基于多模态特征融合深度学习的CT图像脊柱结核计算机辅助诊断
Front Microbiol. 2022 Feb 23;13:823324. doi: 10.3389/fmicb.2022.823324. eCollection 2022.
7
Adversarial multimodal fusion with attention mechanism for skin lesion classification using clinical and dermoscopic images.基于注意力机制的对抗式多模态融合在临床和皮肤镜图像的皮肤损伤分类中的应用。
Med Image Anal. 2022 Oct;81:102535. doi: 10.1016/j.media.2022.102535. Epub 2022 Jul 13.
8
Multimodal skin lesion classification using deep learning.基于深度学习的多模态皮肤损伤分类。
Exp Dermatol. 2018 Nov;27(11):1261-1267. doi: 10.1111/exd.13777. Epub 2018 Sep 27.
9
Smartphone-based machine learning model for real-time assessment of medical kidney biopsy.基于智能手机的医学肾活检实时评估机器学习模型。
J Pathol Inform. 2024 May 31;15:100385. doi: 10.1016/j.jpi.2024.100385. eCollection 2024 Dec.
10
A Deep Learning Approach to Skin Cancer Detection in Dermoscopy Images.一种用于皮肤镜图像中皮肤癌检测的深度学习方法。
J Biomed Phys Eng. 2020 Dec 1;10(6):801-806. doi: 10.31661/jbpe.v0i0.2004-1107. eCollection 2020 Dec.

引用本文的文献

1
Advancements and challenges of artificial intelligence in dermatology: a review of applications and perspectives in China.人工智能在皮肤病学中的进展与挑战:中国应用与前景综述
Front Digit Health. 2025 Aug 13;7:1544520. doi: 10.3389/fdgth.2025.1544520. eCollection 2025.
2
Multimodal Masked Autoencoder Based on Adaptive Masking for Vitiligo Stage Classification.基于自适应掩码的多模态掩码自动编码器用于白癜风分期分类
J Imaging Inform Med. 2025 Apr 29. doi: 10.1007/s10278-025-01521-7.
3
Multimodality Fusion Aspects of Medical Diagnosis: A Comprehensive Review.

本文引用的文献

1
MDFNet: application of multimodal fusion method based on skin image and clinical data to skin cancer classification.MDFNet:基于皮肤图像和临床数据的多模态融合方法在皮肤癌分类中的应用。
J Cancer Res Clin Oncol. 2023 Jul;149(7):3287-3299. doi: 10.1007/s00432-022-04180-1. Epub 2022 Aug 3.
2
A multimodal transformer to fuse images and metadata for skin disease classification.一种用于融合图像和元数据以进行皮肤病分类的多模态变压器。
Vis Comput. 2022 May 5:1-13. doi: 10.1007/s00371-022-02492-4.
3
Morphology-aware multi-source fusion-based intracranial aneurysms rupture prediction.
医学诊断的多模态融合方面:全面综述
Bioengineering (Basel). 2024 Dec 5;11(12):1233. doi: 10.3390/bioengineering11121233.
4
Decoding skin cancer classification: perspectives, insights, and advances through researchers' lens.解读皮肤癌分类:透过研究者视角的观点、见解与进展
Sci Rep. 2024 Dec 18;14(1):30542. doi: 10.1038/s41598-024-81961-3.
5
Integrating artificial intelligence with smartphone-based imaging for cancer detection in vivo.将人工智能与基于智能手机的成像技术相结合用于体内癌症检测。
Biosens Bioelectron. 2025 Mar 1;271:116982. doi: 10.1016/j.bios.2024.116982. Epub 2024 Nov 21.
6
AI in Aesthetic/Cosmetic Dermatology: Current and Future.人工智能在美容皮肤科的现状与未来
J Cosmet Dermatol. 2025 Jan;24(1):e16640. doi: 10.1111/jocd.16640. Epub 2024 Nov 7.
7
Clinical Interpretability of Deep Learning for Predicting Microvascular Invasion in Hepatocellular Carcinoma by Using Attention Mechanism.利用注意力机制的深度学习对肝细胞癌微血管侵犯的临床可解释性预测
Bioengineering (Basel). 2023 Aug 9;10(8):948. doi: 10.3390/bioengineering10080948.
基于形态感知的多源融合的颅内动脉瘤破裂预测。
Eur Radiol. 2022 Aug;32(8):5633-5641. doi: 10.1007/s00330-022-08608-7. Epub 2022 Feb 18.
4
Skin Cancer Detection: A Review Using Deep Learning Techniques.皮肤癌检测:深度学习技术的综述。
Int J Environ Res Public Health. 2021 May 20;18(10):5479. doi: 10.3390/ijerph18105479.
5
Cancer statistics for the year 2020: An overview.2020年癌症统计数据概述。
Int J Cancer. 2021 Apr 5. doi: 10.1002/ijc.33588.
6
An Attention-Based Mechanism to Combine Images and Metadata in Deep Learning Models Applied to Skin Cancer Classification.基于注意力机制的深度学习模型融合图像与元数据在皮肤癌分类中的应用
IEEE J Biomed Health Inform. 2021 Sep;25(9):3554-3563. doi: 10.1109/JBHI.2021.3062002. Epub 2021 Sep 3.
7
PAD-UFES-20: A skin lesion dataset composed of patient data and clinical images collected from smartphones.PAD-UFES-20:一个由从智能手机收集的患者数据和临床图像组成的皮肤病变数据集。
Data Brief. 2020 Aug 25;32:106221. doi: 10.1016/j.dib.2020.106221. eCollection 2020 Oct.
8
Risk factors for development of new skin neoplasms in patients with past history of skin cancer: A survival analysis.皮肤癌病史患者中新皮肤肿瘤发展的风险因素:生存分析。
Sci Rep. 2018 Oct 24;8(1):15744. doi: 10.1038/s41598-018-33763-7.
9
Multimodal skin lesion classification using deep learning.基于深度学习的多模态皮肤损伤分类。
Exp Dermatol. 2018 Nov;27(11):1261-1267. doi: 10.1111/exd.13777. Epub 2018 Sep 27.
10
Features Causing Confusion between Basal Cell Carcinoma and Squamous Cell Carcinoma in Clinical Diagnosis.临床诊断中导致基底细胞癌和鳞状细胞癌混淆的特征。
Ann Dermatol. 2018 Feb;30(1):64-70. doi: 10.5021/ad.2018.30.1.64. Epub 2017 Dec 26.