Department of Ophthalmology, Hallym University College of Medicine, Hallym University Sacred Heart Hospital, 22, Gwanpyeong-Ro 170Beon-Gil, Dongan-Gu, Anyang-Si, Gyeonggi-Do, 14068, Republic of Korea.
Department of Ophthalmology, Asan Medical Center, Seoul, Korea.
Sci Rep. 2023 Mar 13;13(1):4103. doi: 10.1038/s41598-023-30699-5.
Artificial intelligence as a screening tool for eyelid lesions will be helpful for early diagnosis of eyelid malignancies and proper decision-making. This study aimed to evaluate the performance of a deep learning model in differentiating eyelid lesions using clinical eyelid photographs in comparison with human ophthalmologists. We included 4954 photographs from 928 patients in this retrospective cross-sectional study. Images were classified into three categories: malignant lesion, benign lesion, and no lesion. Two pre-trained convolutional neural network (CNN) models, DenseNet-161 and EfficientNetV2-M architectures, were fine-tuned to classify images into three or two (malignant versus benign) categories. For a ternary classification, the mean diagnostic accuracies of the CNNs were 82.1% and 83.0% using DenseNet-161 and EfficientNetV2-M, respectively, which were inferior to those of the nine clinicians (87.0-89.5%). For the binary classification, the mean accuracies were 87.5% and 92.5% using DenseNet-161 and EfficientNetV2-M models, which was similar to that of the clinicians (85.8-90.0%). The mean AUC of the two CNN models was 0.908 and 0.950, respectively. Gradient-weighted class activation map successfully highlighted the eyelid tumors on clinical photographs. Deep learning models showed a promising performance in discriminating malignant versus benign eyelid lesions on clinical photographs, reaching the level of human observers.
人工智能作为眼睑病变的筛查工具,将有助于早期诊断眼睑恶性肿瘤并做出适当的决策。本研究旨在评估深度学习模型在使用临床眼睑照片区分眼睑病变方面的性能,与人类眼科医生进行比较。我们在这项回顾性横断面研究中纳入了 928 名患者的 4954 张照片。图像分为三类:恶性病变、良性病变和无病变。我们对两种预先训练好的卷积神经网络(CNN)模型,DenseNet-161 和 EfficientNetV2-M 架构进行了微调,以将图像分为三类或两类(恶性与良性)。对于三分法,CNN 的平均诊断准确率分别为 DenseNet-161 和 EfficientNetV2-M 的 82.1%和 83.0%,低于 9 位临床医生(87.0-89.5%)。对于二分法,DenseNet-161 和 EfficientNetV2-M 模型的平均准确率分别为 87.5%和 92.5%,与临床医生的准确率(85.8-90.0%)相似。两种 CNN 模型的平均 AUC 分别为 0.908 和 0.950。梯度加权类激活图成功突出了临床照片上的眼睑肿瘤。深度学习模型在区分临床照片上的恶性与良性眼睑病变方面表现出有前途的性能,达到了人类观察者的水平。