Alfi Iftiaz A, Rahman Md Mahfuzur, Shorfuzzaman Mohammad, Nazir Amril
Department of Electrical and Computer Engineering, North South University, Dhaka 1229, Bangladesh.
Department of Information and Computer Science, College of Computing and Mathematics, King Fahd University of Petroleum and Minerals, Dhahran 31261, Saudi Arabia.
Diagnostics (Basel). 2022 Mar 17;12(3):726. doi: 10.3390/diagnostics12030726.
A skin lesion is a portion of skin that observes abnormal growth compared to other areas of the skin. The ISIC 2018 lesion dataset has seven classes. A miniature dataset version of it is also available with only two classes: malignant and benign. Malignant tumors are tumors that are cancerous, and benign tumors are non-cancerous. Malignant tumors have the ability to multiply and spread throughout the body at a much faster rate. The early detection of the cancerous skin lesion is crucial for the survival of the patient. Deep learning models and machine learning models play an essential role in the detection of skin lesions. Still, due to image occlusions and imbalanced datasets, the accuracies have been compromised so far. In this paper, we introduce an interpretable method for the non-invasive diagnosis of melanoma skin cancer using deep learning and ensemble stacking of machine learning models. The dataset used to train the classifier models contains balanced images of benign and malignant skin moles. Hand-crafted features are used to train the base models (logistic regression, SVM, random forest, KNN, and gradient boosting machine) of machine learning. The prediction of these base models was used to train level one model stacking using cross-validation on the training set. Deep learning models (MobileNet, Xception, ResNet50, ResNet50V2, and DenseNet121) were used for transfer learning, and were already pre-trained on ImageNet data. The classifier was evaluated for each model. The deep learning models were then ensembled with different combinations of models and assessed. Furthermore, shapely adaptive explanations are used to construct an interpretability approach that generates heatmaps to identify the parts of an image that are most suggestive of the illness. This allows dermatologists to understand the results of our model in a way that makes sense to them. For evaluation, we calculated the accuracy, F1-score, Cohen's kappa, confusion matrix, and ROC curves and identified the best model for classifying skin lesions.
皮肤病变是指与皮肤其他区域相比出现异常生长的皮肤部分。ISIC 2018病变数据集有七个类别。它还有一个微型数据集版本,只有两个类别:恶性和良性。恶性肿瘤是癌性肿瘤,良性肿瘤是非癌性的。恶性肿瘤有能力以更快的速度在全身扩散和增殖。癌性皮肤病变的早期检测对患者的生存至关重要。深度学习模型和机器学习模型在皮肤病变检测中起着至关重要的作用。然而,由于图像遮挡和数据集不平衡,到目前为止准确率受到了影响。在本文中,我们介绍了一种使用深度学习和机器学习模型的集成堆叠对黑色素瘤皮肤癌进行无创诊断的可解释方法。用于训练分类器模型的数据集包含良性和恶性皮肤痣的平衡图像。手工特征用于训练机器学习的基础模型(逻辑回归、支持向量机、随机森林、K近邻和梯度提升机)。这些基础模型的预测用于在训练集上使用交叉验证训练一级模型堆叠。深度学习模型(MobileNet、Xception、ResNet50、ResNet50V2和DenseNet)用于迁移学习,并且已经在ImageNet数据上进行了预训练。对每个模型的分类器进行了评估。然后将深度学习模型与不同的模型组合进行集成并评估。此外,使用形状自适应解释来构建一种可解释性方法,该方法生成热图以识别图像中最能提示疾病的部分。这使皮肤科医生能够以对他们有意义的方式理解我们模型的结果。为了进行评估,我们计算了准确率、F1分数、科恩卡帕系数、混淆矩阵和ROC曲线,并确定了用于分类皮肤病变的最佳模型。