Dominguez-Morales Juan P, Hernandez-Rodriguez Juan-Carlos, Duran-Lopez Lourdes, Conejo-Mir Julian, Pereyra-Rodriguez Jose-Juan
IEEE J Biomed Health Inform. 2025 Jan;29(1):443-455. doi: 10.1109/JBHI.2024.3465929. Epub 2025 Jan 7.
Melanoma is considered a global public health challenge and is responsible for more than 90% deaths related to skin cancer. Although the diagnosis of early melanoma is the main goal of dermoscopy, the discrimination between dermoscopic images of in situ and invasive melanomas can be a difficult task even for experienced dermatologists. Recent advances in artificial intelligence in the field of medical image analysis show that its application to dermoscopy with the aim of supporting and providing a second opinion to the medical expert could be of great interest. In this work, four datasets from different sources were used to train and evaluate deep learning models on in situ versus invasive melanoma classification and on Breslow thickness prediction. Supervised learning and semi-supervised learning using a multi-teacher ensemble knowledge distillation approach were considered and evaluated using a stratified 5-fold cross-validation scheme. The best models achieved AUCs of 0.80850.0242 and of 0.82320.0666 on the former and latter classification tasks, respectively. The best results were obtained using semi-supervised learning, with the best model achieving 0.8547 and 0.8768 AUC, respectively. An external test set was also evaluated, where semi-supervision achieved higher performance in all the classification tasks. The results obtained show that semi-supervised learning could improve the performance of trained models in different melanoma classification tasks compared to supervised learning. Automatic deep learning-based diagnosis systems could support medical professionals in their decision, serving as a second opinion or as a triage tool for medical centers.
黑色素瘤被视为一项全球公共卫生挑战,且导致了超过90%的皮肤癌相关死亡。尽管早期黑色素瘤的诊断是皮肤镜检查的主要目标,但即使对于经验丰富的皮肤科医生而言,鉴别原位黑色素瘤和侵袭性黑色素瘤的皮肤镜图像也可能是一项艰巨任务。医学图像分析领域人工智能的最新进展表明,将其应用于皮肤镜检查以支持医学专家并提供第二种观点可能会非常有意义。在这项工作中,使用了来自不同来源的四个数据集来训练和评估深度学习模型在原位与侵袭性黑色素瘤分类以及Breslow厚度预测方面的表现。考虑了使用多教师集成知识蒸馏方法的监督学习和半监督学习,并使用分层5折交叉验证方案进行评估。最佳模型在前一个和后一个分类任务上分别实现了0.8085±0.0242和0.8232±0.0666的曲线下面积(AUC)。使用半监督学习获得了最佳结果,最佳模型的AUC分别达到0.8547和0.8768。还评估了一个外部测试集,其中半监督在所有分类任务中都取得了更高的性能。所获得的结果表明,与监督学习相比,半监督学习可以提高训练模型在不同黑色素瘤分类任务中的性能。基于深度学习的自动诊断系统可以在医疗专业人员的决策中提供支持,作为医学中心的第二种观点或分诊工具。