Patients with floaters: Answers from virtual assistants and large language models.

Authors

Wu Gloria, Zhao Weichen, Wong Adrial, Lee David A

Affiliations

Department of Ophthalmology, University of California San Francisco School of Medicine, San Francisco, California, USA.

University of California, Davis, Davis, California, USA.

Publication information

Digit Health. 2024 Feb 14;10:20552076241229933. doi: 10.1177/20552076241229933. eCollection 2024 Jan-Dec.

DOI:10.1177/20552076241229933

PMID:38362238

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10868475/

Abstract

OBJECTIVES

"Floaters," a common complaint among patients of all ages, was used as the query term because it affects 30% of all people seeking eye care. Questions and answers were drawn from the "floaters" section of the American Academy of Ophthalmology website (www.aao.org). Floaters are visual obstructions that move with the movement of the eye. They can be associated with retinal detachment, which can lead to vision loss. With the advent of the large language model (LLM) chatbots ChatGPT and Bard, alongside the virtual assistants (VA) Google Assistant and Alexa, we analyzed their responses to "floaters."

METHODS

Using the "Floaters and Flashes" link in the "EyeHealth A-Z" subsection of "Public & Patients" on AAO.org, we posed four questions to ChatGPT, Bard, Google Assistant, and Alexa: (1) What are floaters? (2) What are flashes? (3) Flashes and Migraines? (4) Floaters and Flashes Treatment? American Academy of Ophthalmology (AAO) keywords were identified when they were highlighted. The Flesch-Kincaid Grade Level formula, approved by the U.S. Department of Education, was used to evaluate the reading level of the responses.
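The Flesch-Kincaid Grade Level named above is a fixed formula: 0.39 × (words per sentence) + 11.8 × (syllables per word) − 15.59. The following is a minimal sketch of how such a score could be computed, not the authors' tool. The function names are hypothetical, and the syllable counter is a rough vowel-group heuristic assumed for illustration, so scores may differ from those of dedicated readability software.

```python
import re


def count_syllables(word: str) -> int:
    """Approximate syllables as vowel groups (heuristic, an assumption)."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Treat a trailing 'e' as silent, except in '-le' and '-ee' endings.
    if word.endswith("e") and not word.endswith(("le", "ee")) and count > 1:
        count -= 1
    return max(count, 1)


def flesch_kincaid_grade(text: str) -> float:
    """Return the Flesch-Kincaid Grade Level of an English text."""
    # Sentences are split on runs of terminal punctuation.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    # Words are runs of letters, optionally with one internal apostrophe.
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")
    syllables = sum(count_syllables(w) for w in words)
    return (
        0.39 * len(words) / len(sentences)
        + 11.8 * syllables / len(words)
        - 15.59
    )


if __name__ == "__main__":
    sample = "Floaters can signal a retinal detachment."
    print(round(flesch_kincaid_grade(sample), 2))
```

Longer sentences and more syllables per word both raise the grade, which is why short answers with plain vocabulary tend to score lower.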

RESULTS

Of the chatbots and virtual assistants, Google Assistant is the only one that uses the term "ophthalmologist." None mentions the urgent or emergency nature of floaters. AAO.org has a lower reading level than the LLMs and VAs (P = .11). The reading levels of ChatGPT, Bard, Google Assistant, and Alexa (grades 12.3, 9.7, 13.1, and 8.1, respectively) are higher than that of AAO.org (grade 7.3). Word counts are higher for the LLMs than for the VAs (P < .0286).

CONCLUSION

Currently, ChatGPT, Bard, Google Assistant, and Alexa are similar. Factual information is present, but all miss the urgency of diagnosing a retinal detachment. Translational relevance: Both LLMs and virtual assistants are free, and our patients will use them to obtain information about floaters. There may be errors of omission with ChatGPT and a lack of urgency to seek a physician's care.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1bec/10868475/2f8dcd13e844/10.1177_20552076241229933-fig1.jpg
