Kumar Ayushi, Vatsa Avimanyou
Monroe Township High School, Monroe Township, NJ, United States.
Department of Computer Science, Fairleigh Dickinson University, Teaneck, NJ, United States.
Front Big Data. 2022 Mar 29;5:848614. doi: 10.3389/fdata.2022.848614. eCollection 2022.
Skin cancer is the most common cancer in the USA, and it is a leading cause of death worldwide. Every year, more than five million patients are newly diagnosed in the USA. The deadliest and most serious form of skin cancer is called melanoma. Skin cancer can affect anyone, regardless of skin color, race, gender, and age. The diagnosis of melanoma has been done by visual examination and manual techniques by skilled doctors. It is a time-consuming process and highly prone to error. The skin images captured by dermoscopy eliminate the surface reflection of skin and give a better visualization of deeper levels of the skin. However, the existence of many artifacts and noise such as hair, veins, and water residue make the lesion images very complex. Due to the complexity of images, the border detection, feature extraction, and classification process are challenging. Without a proper mechanism, it is hard to identify and predict melanoma at an early stage. Therefore, there is a need to provide precise details, identify early skin cancer, and classify skin cancer with appropriate sensitivity and precision. This article aims to review and analyze two deep neural network-based classification algorithms (convolutional neural network, CNN; recurrent neural network, RNN) and a decision tree-based algorithm (XG-Boost) on skin lesion images (ISIC dataset) and find which of these provides the best classification performance metric. Also, the performance of algorithms is compared using six different metrics-loss, accuracy, precision, recall, F1 score, and ROC.
皮肤癌是美国最常见的癌症,也是全球主要的死亡原因之一。在美国,每年有超过500万患者被新诊断出患有皮肤癌。最致命、最严重的皮肤癌形式是黑色素瘤。皮肤癌可影响任何人,无论肤色、种族、性别和年龄。黑色素瘤的诊断一直由经验丰富的医生通过视觉检查和手工技术进行。这是一个耗时的过程,而且极易出错。皮肤镜拍摄的皮肤图像消除了皮肤的表面反射,能更好地显示皮肤深层情况。然而,毛发、血管和水渍等许多伪影和噪声的存在,使得病变图像非常复杂。由于图像的复杂性,边界检测、特征提取和分类过程具有挑战性。如果没有适当的机制,很难在早期识别和预测黑色素瘤。因此,需要提供精确的细节,识别早期皮肤癌,并以适当的灵敏度和精度对皮肤癌进行分类。本文旨在回顾和分析基于深度神经网络的两种分类算法(卷积神经网络,CNN;循环神经网络,RNN)以及基于决策树的算法(XG-Boost)对皮肤病变图像(ISIC数据集)的分类情况,并找出哪种算法能提供最佳的分类性能指标。此外,还使用六种不同的指标——损失、准确率、精确率、召回率、F1分数和ROC来比较算法的性能。