Department of Electrical and Computer Engineering, Texas A&M University, College Station, TX, United States.
College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar.
J Med Internet Res. 2021 Nov 24;23(11):e22934. doi: 10.2196/22934.
BACKGROUND: Skin cancer is the most common cancer type affecting humans. Traditional skin cancer diagnosis methods are costly, require a professional physician, and take time. Hence, to aid in diagnosing skin cancer, artificial intelligence (AI) tools are being used, including shallow and deep machine learning-based methodologies that are trained to detect and classify skin cancer using computer algorithms and deep neural networks. OBJECTIVE: The aim of this study was to identify and group the different types of AI-based technologies used to detect and classify skin cancer. The study also examined the reliability of the selected papers by studying the correlation between the data set size and the number of diagnostic classes with the performance metrics used to evaluate the models. METHODS: We conducted a systematic search for papers using Institute of Electrical and Electronics Engineers (IEEE) Xplore, Association for Computing Machinery Digital Library (ACM DL), and Ovid MEDLINE databases following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses Extension for Scoping Reviews (PRISMA-ScR) guidelines. The studies included in this scoping review had to fulfill several selection criteria: being specifically about skin cancer, detecting or classifying skin cancer, and using AI technologies. Study selection and data extraction were independently conducted by two reviewers. Extracted data were narratively synthesized, where studies were grouped based on the diagnostic AI techniques and their evaluation metrics. RESULTS: We retrieved 906 papers from the 3 databases, of which 53 were eligible for this review. Shallow AI-based techniques were used in 14 studies, and deep AI-based techniques were used in 39 studies. The studies used up to 11 evaluation metrics to assess the proposed models, where 39 studies used accuracy as the primary evaluation metric. Overall, studies that used smaller data sets reported higher accuracy. CONCLUSIONS: This paper examined multiple AI-based skin cancer detection models. However, a direct comparison between methods was hindered by the varied use of different evaluation metrics and image types. Performance scores were affected by factors such as data set size, number of diagnostic classes, and techniques. Hence, the reliability of shallow and deep models with higher accuracy scores was questionable since they were trained and tested on relatively small data sets of a few diagnostic classes.
背景:皮肤癌是影响人类的最常见癌症类型。传统的皮肤癌诊断方法成本高、需要专业医生且耗时。因此,为了帮助诊断皮肤癌,人工智能 (AI) 工具被用于包括基于浅层和深层机器学习的方法,这些方法使用计算机算法和深度神经网络来训练以检测和分类皮肤癌。
目的:本研究旨在识别和分组用于检测和分类皮肤癌的不同类型的基于 AI 的技术。该研究还通过研究数据集大小与诊断类别数量与用于评估模型的性能指标之间的相关性,检查了所选论文的可靠性。
方法:我们按照系统评价和荟萃分析扩展的首选报告项目 (PRISMA-ScR) 指南,使用 IEEE Xplore、ACM 数字图书馆 (ACM DL) 和 Ovid MEDLINE 数据库对论文进行了系统搜索。本范围综述中纳入的研究必须满足以下几个选择标准:专门针对皮肤癌、检测或分类皮肤癌、以及使用 AI 技术。两名评审员独立进行研究选择和数据提取。提取的数据进行了叙述性综合,其中根据诊断 AI 技术及其评估指标对研究进行了分组。
结果:我们从 3 个数据库中检索到 906 篇论文,其中 53 篇符合本综述要求。浅层 AI 技术在 14 项研究中使用,深层 AI 技术在 39 项研究中使用。研究使用了多达 11 种评估指标来评估提出的模型,其中 39 项研究使用准确率作为主要评估指标。总体而言,使用较小数据集的研究报告的准确率更高。
结论:本文检查了多种基于 AI 的皮肤癌检测模型。然而,由于不同评估指标和图像类型的使用不同,直接比较方法受到阻碍。性能得分受到数据集中的大小、诊断类别数量和技术等因素的影响。因此,具有较高准确率得分的浅层和深层模型的可靠性是有问题的,因为它们是在少数诊断类别相对较小的数据上进行训练和测试的。
J Med Internet Res. 2021-11-24
J Med Internet Res. 2023-3-31
J Med Internet Res. 2025-3-7
J Med Internet Res. 2024-11-28
Cochrane Database Syst Rev. 2024-10-17
J Med Internet Res. 2025-5-6
J Med Internet Res. 2025-5-5
Diagnostics (Basel). 2025-4-7
An Acad Bras Cienc. 2020
BMC Bioinformatics. 2020-3-11
Annu Int Conf IEEE Eng Med Biol Soc. 2019-7
IEEE Trans Med Imaging. 2019-1-21
Comput Med Imaging Graph. 2018-11-3
Annu Int Conf IEEE Eng Med Biol Soc. 2018-7
Annu Int Conf IEEE Eng Med Biol Soc. 2018-7
J Med Internet Res. 2018-10-17