Gupta Ayushi, Al-Kazwini Hussein
Ophthalmology, Royal Free Hospital, London, GBR.
Ophthalmology, East Kent Hospitals University National Health Service (NHS) Foundation Trust, Canterbury, GBR.
Cureus. 2024 Nov 14;16(11):e73660. doi: 10.7759/cureus.73660. eCollection 2024 Nov.
Introduction Artificial intelligence is rapidly advancing in healthcare. Ophthalmology, with its reliance on imaging for diagnosis and management, has the potential to benefit from this technology. Deep learning models are currently used in image analysis in ophthalmology. ChatGPT (OpenAI, San Francisco, CA), a large language model, has recently expanded to include image analysis, creating new opportunities for diagnostic applications. While prior research shows potential in text-based diagnostics for ophthalmology, there is limited literature on AI's diagnostic accuracy in interpreting retinal images. Methods We selected 12 fundus images from key diseases identified by the Royal College of Ophthalmology curricula for medical students, foundation doctors, and trainees. Each image was presented to ChatGPT 4.0 using a standardised prompt to identify the most likely diagnosis. Responses were recorded, and the model's accuracy was assessed by comparing its diagnoses to the confirmed conditions. Results ChatGPT accurately diagnosed four out of 12 diseases (papilloedema, dry age-related macular degeneration (ARMD), glaucoma and vitreous haemorrhage) and provided one partially correct diagnosis (diabetic retinopathy). However, the model struggled with seven cases, including central retinal artery occlusion, central retinal vein occlusion, dry ARMD, rhegmatogenous retinal detachment, tractional retinal detachment, epiretinal membrane and macular hole. Conclusion ChatGPT demonstrates the potential for diagnosis of retinal conditions from fundus photography. However, it currently lacks the accuracy required for clinical application; the model often hallucinates when unsure, which has diagnostic implications. Further work is required to refine these models and expand their diagnostic potential.
引言 人工智能在医疗保健领域正迅速发展。眼科因其诊断和管理依赖成像技术,有潜力从这项技术中受益。深度学习模型目前已用于眼科图像分析。大型语言模型ChatGPT(OpenAI,加利福尼亚州旧金山)最近已扩展到包括图像分析,为诊断应用创造了新机会。虽然先前的研究显示了人工智能在眼科基于文本的诊断中的潜力,但关于人工智能解读视网膜图像的诊断准确性的文献有限。
方法 我们从皇家眼科医学院为医学生、住院医生和实习生制定的课程中确定的关键疾病中选择了12张眼底图像。使用标准化提示将每张图像呈现给ChatGPT 4.0,以确定最可能的诊断。记录回复,并通过将其诊断结果与确诊情况进行比较来评估模型的准确性。
结果 ChatGPT准确诊断出12种疾病中的4种(视乳头水肿、干性年龄相关性黄斑变性(ARMD)、青光眼和玻璃体出血),并提供了1个部分正确的诊断(糖尿病性视网膜病变)。然而,该模型在7个病例中表现不佳,包括视网膜中央动脉阻塞、视网膜中央静脉阻塞、干性ARMD、孔源性视网膜脱离、牵拉性视网膜脱离、视网膜前膜和黄斑裂孔。
结论 ChatGPT展示了通过眼底摄影诊断视网膜疾病的潜力。然而,它目前缺乏临床应用所需的准确性;该模型在不确定时经常会产生幻觉,这具有诊断意义。需要进一步开展工作来优化这些模型并扩大其诊断潜力。