L'Oréal Research and Innovation, Clichy, France.
ModiFace - A L'Oréal Group Company, Toronto, Ontario, Canada.
J Eur Acad Dermatol Venereol. 2023 Jan;37(1):176-183. doi: 10.1111/jdv.18541. Epub 2022 Sep 9.
Real-life validation is necessary to ensure our artificial intelligence (AI) skin diagnostic tool is inclusive across a diverse and representative US population of various ages, ancestries and skin phototypes.
To explore the relevance and accuracy of an automated, algorithm-based analysis of facial signs in representative women of different ancestries, ages and phototypes, living in the same country.
In a cross-sectional study of selfie images of 1041 US women, algorithm-based analyses of seven facial signs were automatically graded by an AI-based algorithm and by 50 US dermatologists of various profiles (age, gender, ancestry, geographical location). For automated analysis and dermatologist assessment, the same referential skin atlas was used to standardize the grading scales. The average values and their variability were compared with respect to age, ancestry and phototype.
For five signs, the grading obtained by the automated system were strongly correlated with dermatologists' assessments (r ≥ 0.75); cheek skin pores were moderately correlated (r = 0.63) and pigmentation signs, especially for the darkest skin tones, were weakly correlated (r = 0.40) to the dermatologist assessments. Age and ancestry had no effect on the correlations. In many cases, the automated system performed better than the dermatologist-assessed clinical grading due to 0.3-0.5 grading unit differences among the dermatologist panel that were not related to any individual characteristic (e.g. gender, age, ancestry, location). The use of phototypes, as discontinuous categorical variables, is likely a limiting factor in the assessments of grading, whether obtained by automated analysis or clinical assessment of the images.
The AI-based automatic procedure is accurate and clinically relevant for analysing facial signs in a diverse and inclusive population of US women, as confirmed by a diverse panel of dermatologists, although skin tone requires further improvement.
为确保我们的人工智能(AI)皮肤诊断工具能够涵盖美国不同年龄、种族和皮肤光型的多样化和代表性人群,进行真实世界的验证是必要的。
探索一种基于算法的面部标志自动分析方法在不同种族、年龄和光型的代表性美国女性中的相关性和准确性,这些女性生活在同一个国家。
在一项对 1041 名美国女性自拍图像的横断面研究中,一种基于 AI 的算法自动对七种面部标志进行了基于算法的分析,由 50 名具有不同特征(年龄、性别、种族、地理位置)的美国皮肤科医生进行了评估。对于自动分析和皮肤科医生评估,使用相同的参考皮肤图谱来标准化评分尺度。比较了年龄、种族和光型与自动分析和皮肤科医生评估的平均值及其变异性。
对于五种标志,自动系统的评分与皮肤科医生的评估高度相关(r≥0.75);脸颊皮肤毛孔中度相关(r=0.63),而色素沉着标志,尤其是最暗肤色的色素沉着标志,与皮肤科医生的评估弱相关(r=0.40)。年龄和种族对相关性没有影响。在许多情况下,由于皮肤科医生小组中存在 0.3-0.5 个评分单位的差异,这些差异与任何个体特征(如性别、年龄、种族、位置)无关,自动系统的性能优于皮肤科医生评估的临床分级。使用光型作为不连续的分类变量可能是评估分级的一个限制因素,无论是通过自动分析还是对图像的临床评估。
基于 AI 的自动程序对于分析美国女性多样化和包容性人群中的面部标志是准确且具有临床相关性的,这得到了皮肤科医生小组的证实,尽管肤色仍需进一步改进。