Samuel A. Cohen, Arthur Brant, Ann Caroline Fisher, Suzann Pershing, Diana Do, Carolyn Pan
Byers Eye Institute, Stanford University School of Medicine, Stanford, CA, USA.
Semin Ophthalmol. 2024 Aug;39(6):472-479. doi: 10.1080/08820538.2024.2326058. Epub 2024 Mar 22.
Patients are using online search modalities to learn about their eye health. While Google remains the most popular search engine, use of large language models (LLMs) such as ChatGPT has increased. Cataract surgery is the most common surgical procedure in the US, yet there are limited data on the quality of the online information returned by cataract surgery-related searches on search engines such as Google and LLM platforms such as ChatGPT. We identified the most common patient frequently asked questions (FAQs) about cataracts and cataract surgery and evaluated the accuracy, safety, and readability of the answers to these questions provided by both Google and ChatGPT. We also demonstrated the utility of ChatGPT for writing operative notes and creating patient education materials.
The top 20 FAQs related to cataracts and cataract surgery were recorded from Google. A panel of ophthalmologists evaluated the responses provided by Google and ChatGPT for accuracy and safety. Evaluators were also asked to distinguish between Google and LLM chatbot answers. Five validated readability indices were used to assess the readability of the responses. ChatGPT was then instructed to generate operative notes, post-operative instructions, and customizable patient education materials according to specific readability criteria.
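The abstract does not name the five readability indices the authors used. As an illustrative sketch only, the following Python snippet scores a response with five commonly used validated indices via the textstat package; the function name readability_report and the specific choice of indices are assumptions, not the authors' protocol.

    # A minimal sketch of scoring answer readability, assuming the
    # textstat package and five commonly used validated indices
    # (not necessarily the ones used in the study).
    import textstat

    def readability_report(text: str) -> dict:
        # Each index estimates the U.S. school grade level needed to
        # comprehend the text. Note: very short texts can yield
        # unstable scores (SMOG in particular assumes longer samples).
        return {
            "flesch_kincaid_grade": textstat.flesch_kincaid_grade(text),
            "gunning_fog": textstat.gunning_fog(text),
            "smog_index": textstat.smog_index(text),
            "coleman_liau_index": textstat.coleman_liau_index(text),
            "automated_readability_index": textstat.automated_readability_index(text),
        }

    sample = ("Cataract surgery replaces the eye's clouded natural lens "
              "with a clear artificial intraocular lens.")
    print(readability_report(sample))

Averaging grade-level estimates across several such indices is one plausible way to arrive at a single reading-level figure like the grade level of 14.8 reported below.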
Responses to the 20 patient FAQs generated by ChatGPT were significantly longer and written at a higher reading level than responses provided by Google (p < .001), with an average grade level of 14.8 (college level). Expert reviewers were able to correctly distinguish between human-reviewed and chatbot-generated responses an average of 31% of the time. Google answers contained incorrect or inappropriate material 27% of the time, compared with 6% of LLM-generated answers (p < .001). When expert reviewers were asked to compare the responses directly, chatbot responses were favored 66% of the time.
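For context, the abstract does not specify which index produced the 14.8 average; the Flesch-Kincaid Grade Level, one widely used readability index, is computed as:

    FKGL = 0.39 × (total words / total sentences) + 11.8 × (total syllables / total words) − 15.59

A score of 14.8 corresponds to a reader roughly two to three years into college, well above the sixth-grade reading level often recommended for patient education materials.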
When comparing the responses to patients' cataract FAQs provided by ChatGPT and Google, practicing ophthalmologists overwhelmingly preferred ChatGPT responses. LLM chatbot responses were less likely to contain inaccurate information. ChatGPT represents a viable eye health information source for patients with higher health literacy. Ophthalmologists may also use ChatGPT to create customizable education materials for patients with varying health literacy.