Chang Melinda Y, Heidary Gena, Beres Shannon, Pineles Stacy L, Gaier Eric D, Gise Ryan, Reid Mark, Avramidis Kleanthis, Rostami Mohammad, Narayanan Shrikanth
Division of Ophthalmology, Children's Hospital Los Angeles, Los Angeles, California.
Roski Eye Institute, Keck School of Medicine, University of Southern California, Los Angeles, California.
Ophthalmol Sci. 2024 Feb 20;4(4):100496. doi: 10.1016/j.xops.2024.100496. eCollection 2024 Jul-Aug.
To develop and test an artificial intelligence (AI) model to aid in differentiating pediatric pseudopapilledema from true papilledema on fundus photographs.
Multicenter retrospective study.
A total of 851 fundus photographs from 235 children (age < 18 years) with pseudopapilledema and true papilledema.
Four pediatric neuro-ophthalmologists at 4 different institutions contributed fundus photographs of children with confirmed diagnoses of papilledema or pseudopapilledema. An AI model to classify fundus photographs as papilledema or pseudopapilledema was developed using a DenseNet backbone and a tribranch convolutional neural network. We performed 10-fold cross-validation and separately analyzed an external test set. The AI model's performance was compared with 2 masked human expert pediatric neuro-ophthalmologists, who performed the same classification task.
Accuracy, sensitivity, and specificity of the AI model compared with human experts.
The area under receiver operating curve of the AI model was 0.77 for the cross-validation set and 0.81 for the external test set. The accuracy of the AI model was 70.0% for the cross-validation set and 73.9% for the external test set. The sensitivity of the AI model was 73.4% for the cross-validation set and 90.4% for the external test set. The AI model's accuracy was significantly higher than human experts on the cross validation set ( < 0.002), and the model's sensitivity was significantly higher on the external test set ( = 0.0002). The specificity of the AI model and human experts was similar (56.4%-67.3%). Moreover, the AI model was significantly more sensitive at detecting mild papilledema than human experts, whereas AI and humans performed similarly on photographs of moderate-to-severe papilledema. On review of the external test set, only 1 child (with nearly resolved pseudotumor cerebri) had both eyes with papilledema incorrectly classified as pseudopapilledema.
When classifying fundus photographs of pediatric papilledema and pseudopapilledema, our AI model achieved > 90% sensitivity at detecting papilledema, superior to human experts. Due to the high sensitivity and low false negative rate, AI may be useful to triage children with suspected papilledema requiring work-up to evaluate for serious underlying neurologic conditions.
Proprietary or commercial disclosure may be found in the Footnotes and Disclosures at the end of this article.
开发并测试一种人工智能(AI)模型,以辅助在眼底照片上鉴别儿童假性视乳头水肿和真性视乳头水肿。
多中心回顾性研究。
来自235名年龄小于18岁的患有假性视乳头水肿和真性视乳头水肿儿童的851张眼底照片。
4个不同机构的4名儿科神经眼科医生提供了确诊为视乳头水肿或假性视乳头水肿儿童的眼底照片。使用DenseNet骨干网络和三叉分支卷积神经网络开发了一种将眼底照片分类为视乳头水肿或假性视乳头水肿的AI模型。我们进行了10倍交叉验证,并单独分析了一个外部测试集。将AI模型的性能与2名进行相同分类任务的蒙面人类专家儿科神经眼科医生进行了比较。
与人类专家相比,AI模型的准确性、敏感性和特异性。
AI模型在交叉验证集上的受试者操作特征曲线下面积为0.77,在外部测试集上为0.81。AI模型在交叉验证集上的准确率为70.0%,在外部测试集上为73.9%。AI模型在交叉验证集上的敏感性为73.4%,在外部测试集上为90.4%。在交叉验证集上,AI模型的准确性显著高于人类专家(P<0.002),在外部测试集上模型的敏感性显著更高(P = 0.0002)。AI模型和人类专家的特异性相似(56.4%-67.3%)。此外,AI模型在检测轻度视乳头水肿方面比人类专家更敏感,而在中度至重度视乳头水肿照片上AI和人类的表现相似。在对外部测试集的回顾中,只有1名儿童(假性脑瘤几乎消退)双眼的视乳头水肿被错误分类为假性视乳头水肿。
在对儿童视乳头水肿和假性视乳头水肿的眼底照片进行分类时,我们的AI模型在检测视乳头水肿方面的敏感性超过90%,优于人类专家。由于高敏感性和低假阴性率,AI可能有助于对疑似视乳头水肿的儿童进行分诊,以便评估是否存在严重的潜在神经系统疾病。
在本文末尾的脚注和披露中可能会找到专有或商业披露信息。