Servicio de Dermatología, Hospital Clínic de Barcelona, Barcelona, España.
Servicio de Dermatología, Hospital Clínic de Barcelona, Barcelona, España; Institut d'Investigacions Biomediques August Pi I Sunyer (IDIBAPS), Barcelona, España.
Actas Dermosifiliogr (Engl Ed). 2020 May;111(4):313-316. doi: 10.1016/j.ad.2019.09.002. Epub 2020 Apr 2.
Automated image classification is a promising branch of machine learning (ML) useful for skin cancer diagnosis, but little has been determined about its limitations for general usability in current clinical practice.
To determine limitations in the selection of skin cancer images for ML analysis, particularly in melanoma.
Retrospective cohort study design, including 2,849 consecutive high-quality dermoscopy images of skin tumors from 2010 to 2014, for evaluation by a ML system. Each dermoscopy image was assorted according to its eligibility for ML analysis.
Of the 2,849 images chosen from our database, 968 (34%) met the inclusion criteria for analysis by the ML system. Only 64.7% of nevi and 36.6% of melanoma met the inclusion criteria. Of the 528 melanomas, 335 (63.4%) were excluded. An absence of normal surrounding skin (40.5% of all melanomas from our database) and absence of pigmentation (14.2%) were the most common reasons for exclusion from ML analysis.
Only 36.6% of our melanomas were admissible for analysis by state-of-the-art ML systems. We conclude that future ML systems should be trained on larger datasets which include relevant non-ideal images from lesions evaluated in real clinical practice. Fortunately, many of these limitations are being overcome by the scientific community as recent works show.
自动化图像分类是机器学习(ML)的一个有前途的分支,可用于皮肤癌诊断,但对于其在当前临床实践中的普遍可用性的局限性知之甚少。
确定用于 ML 分析的皮肤癌图像选择的局限性,特别是在黑色素瘤方面。
回顾性队列研究设计,包括 2010 年至 2014 年的 2849 例连续高质量皮肤肿瘤皮肤镜图像,由 ML 系统评估。每个皮肤镜图像都根据其是否适合 ML 分析进行分类。
从我们的数据库中选择的 2849 张图像中,有 968 张(34%)符合 ML 系统分析的纳入标准。只有 64.7%的痣和 36.6%的黑色素瘤符合纳入标准。在 528 个黑色素瘤中,有 335 个(63.4%)被排除在外。缺乏正常周围皮肤(我们数据库中所有黑色素瘤的 40.5%)和缺乏色素沉着(14.2%)是最常见的排除 ML 分析的原因。
我们的黑色素瘤中只有 36.6%符合最先进的 ML 系统的分析要求。我们得出结论,未来的 ML 系统应该在更大的数据集上进行训练,这些数据集包括在实际临床实践中评估的病变的相关非理想图像。幸运的是,正如最近的研究工作所示,科学界正在克服许多这些局限性。