Wu Gloria, Zhao Weichen, Wong Adrial, Lee David A
Department of Ophthalmology, University of California San Francisco School of Medicine, San Francisco, California, USA.
University of California, Davis, Davis, California, USA.
Digit Health. 2024 Feb 14;10:20552076241229933. doi: 10.1177/20552076241229933. eCollection 2024 Jan-Dec.
"Floaters," a common complaint among patients of all ages, was used as a query term because it affects 30% of all people searching for eye care. The American Academy of Ophthalmology website's "floaters" section was used as a source for questions and answers (www.aao.org). Floaters is a visual obstruction that moves with the movement of the eye. They can be associated with retinal detachment, which can lead to vision loss. With the advent of large language model (LLM) chatbots ChatGPT, Bard versus virtual assistants (VA), Google Assistant, and Alexa, we analyzed their responses to "floaters."
Using the "Floaters and Flashes" link in the "EyeHealth A-Z" subsection of "Public & Patients" on AAO.org, we put four questions to ChatGPT, Bard, Google Assistant, and Alexa: (1) What are floaters? (2) What are flashes? (3) Flashes and Migraines? (4) Floaters and Flashes Treatment? The American Academy of Ophthalmology (AAO) keywords were identified if they were highlighted. The Flesch-Kincaid Grade Level formula, approved by the U.S. Department of Education, was used to evaluate the reading comprehension level of the responses.
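For readers who want to reproduce a reading-level score, the Flesch-Kincaid Grade Level can be sketched as below. This is a minimal illustration, not the tool used in the study: the formula's coefficients are the standard published ones, but the function names and the vowel-group syllable counter are assumptions made here, and the syllable heuristic is only approximate, so scores may differ slightly from those of commercial readability tools.

```python
import re


def count_syllables(word: str) -> int:
    """Approximate syllables by counting vowel groups (heuristic, an assumption)."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Discount a likely silent trailing 'e' (e.g. "migraine"), but not "-le" or "-ee".
    if word.endswith("e") and not word.endswith(("le", "ee")) and count > 1:
        count -= 1
    return max(count, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Flesch-Kincaid Grade Level:
    0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59
    """
    # Sentences are split on runs of ., !, or ? (a simplification).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return 0.39 * len(words) / len(sentences) + 11.8 * syllables / len(words) - 15.59


if __name__ == "__main__":
    # Hypothetical sample response, not taken from any chatbot in the study.
    sample = "Floaters are small specks that drift across your vision. See an eye doctor."
    print(round(flesch_kincaid_grade(sample), 1))
```

Longer sentences and more syllables per word both raise the grade, which is why a verbose chatbot answer can score several grades above a patient-education page on the same topic.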
Of the chatbots and virtual assistants, Google Assistant is the only one that uses the term "ophthalmologist." None mentions the urgent or emergency nature of floaters. AAO.org shows a lower reading level than the LLMs and VAs (p = .11). The reading comprehension levels of ChatGPT, Bard, Google Assistant, and Alexa (grades 12.3, 9.7, 13.1, and 8.1, respectively) are higher than that of AAO.org (grade 7.3). Word count is higher for the LLMs than for the VAs (p < .0286).
Currently, ChatGPT, Bard, Google Assistant, and Alexa perform similarly. Factual information is present, but all miss the urgency of diagnosing a retinal detachment. Translational relevance: Both the LLMs and the virtual assistants are free, and our patients will use them to obtain information on "floaters." There may be errors of omission with ChatGPT and a lack of urgency to seek a physician's care.