Aggarwal Pushkar
College of Medicine, University of Cincinnati, Cincinnati, OH, United States.
JMIR Dermatol. 2021 Oct 12;4(2):e31697. doi: 10.2196/31697.
The performance of deep-learning image recognition models is below par when applied to images with Fitzpatrick classification skin types 4 and 5.
The objective of this research was to assess whether image recognition models perform differently when differentiating between dermatological diseases in individuals with darker skin color (Fitzpatrick skin types 4 and 5) than when differentiating between the same dermatological diseases in Caucasians (Fitzpatrick skin types 1, 2, and 3) when both models are trained on the same number of images.
Two image recognition models were trained, validated, and tested. The goal of each model was to differentiate between melanoma and basal cell carcinoma. Open-source images of melanoma and basal cell carcinoma were acquired from the Hellenic Dermatological Atlas, the Dermatology Atlas, the Interactive Dermatology Atlas, and DermNet NZ.
The image recognition models trained and validated on images with light skin color had higher sensitivity, specificity, positive predictive value, negative predictive value, and F1 score than the image recognition models trained and validated on images of skin of color for differentiation between melanoma and basal cell carcinoma.
A higher number of images of dermatological diseases in individuals with darker skin color than images of dermatological diseases in individuals with light skin color would need to be gathered for artificial intelligence models to perform equally well.
深度学习图像识别模型应用于菲茨帕特里克分类皮肤类型4和5的图像时,其性能低于标准水平。
本研究的目的是评估当两个图像识别模型在相同数量的图像上进行训练时,在区分深色皮肤个体(菲茨帕特里克皮肤类型4和5)的皮肤病与区分白种人(菲茨帕特里克皮肤类型1、2和3)的相同皮肤病时,图像识别模型的表现是否不同。
训练、验证和测试了两个图像识别模型。每个模型的目标是区分黑色素瘤和基底细胞癌。黑色素瘤和基底细胞癌的开源图像取自希腊皮肤病图谱、皮肤病图谱、交互式皮肤病图谱和新西兰皮肤病网。
在浅色皮肤图像上训练和验证的图像识别模型,在区分黑色素瘤和基底细胞癌方面,比在有色皮肤图像上训练和验证的图像识别模型具有更高的灵敏度、特异性、阳性预测值、阴性预测值和F1分数。
为了使人工智能模型表现得同样出色,需要收集比浅色皮肤个体的皮肤病图像更多的深色皮肤个体的皮肤病图像。