使用生成式大语言模型对常见外科疾病患者进行教育：ChatGPT与谷歌Gemini的比较分析

Use of generative large language models for patient education on common surgical conditions: a comparative analysis between ChatGPT and Google Gemini.

作者信息

ELSenbawy Omar Mahmoud, Patel Keval Bhavesh, Wannakuwatte Randev Ayodhya, Thota Akhila N

机构信息

Alexandria University, Alexandria, Egypt.

Narendra Modi Medical College, Ahemdabad, Gujarat, India.

出版信息

Updates Surg. 2025 Jan 15. doi: 10.1007/s13304-025-02074-8.

DOI:10.1007/s13304-025-02074-8

PMID:39815048

Abstract

There is a growing importance for patients to easily access information regarding their medical conditions to improve their understanding and participation in health care decisions. Artificial Intelligence (AI) has proven as a fast, efficient, and effective tool in educating patients regarding their health care conditions. The aim of the study is to compare the responses provided by AI tools, ChatGPT and Google Gemini, to assess for conciseness and understandability of information provided for the medical conditions Deep vein thrombosis, decubitus ulcers, and hemorrhoids. A cross-sectional original research design was conducted regarding the responses generated by ChatGPT and Google Gemini for the post-surgical complications of Deep vein thrombosis, decubitus ulcers, and hemorrhoids. Each response was evaluated by the Flesch-Kincaid calculator for total number of words, sentences, average words per sentence, average syllables per word, grade level, and ease score. Additionally, the similarity score was evaluated using QuillBot and reliability using a modified discern score. These results were then analyzed by the unpaired or two sample t-test to compare the averages between the two AI tools to conclude which one was superior. Chat GPT required a higher education level to understand as suggested by the higher grade levels and lower ease scores. The easiest brochure was for deep vein thrombosis which had the lowest ease score and highest grade level. ChatGPT displayed more similarity with information provided on the internet as calculated by the plagiarism calculator-Quill bot. The reliability score via the Modified Discern score showing both AI tools were similar. Although there is a difference in the various scores for each AI tool, based on the P values obtained there is not enough evidence to conclude the superiority of one AI tool over the other.

摘要

患者能够轻松获取有关其医疗状况的信息，对于提高他们对医疗保健决策的理解和参与度变得越来越重要。人工智能（AI）已被证明是一种快速、高效且有效的工具，可用于教育患者了解其医疗保健状况。本研究的目的是比较人工智能工具ChatGPT和谷歌Gemini提供的回答，以评估为深静脉血栓形成、褥疮和痔疮等医疗状况提供的信息的简洁性和易懂性。针对ChatGPT和谷歌Gemini生成的关于深静脉血栓形成、褥疮和痔疮术后并发症的回答，进行了一项横断面原创研究设计。每个回答都通过弗莱什-金凯德计算器评估单词总数、句子数、平均每句单词数、平均每词音节数、年级水平和易读分数。此外，使用QuillBot评估相似度分数，使用修改后的辨别分数评估可靠性。然后通过非配对或双样本t检验分析这些结果，以比较两个人工智能工具的平均值，从而得出哪个工具更优。正如较高的年级水平和较低的易读分数所表明的那样，ChatGPT需要更高的教育水平才能理解。最容易理解的手册是关于深静脉血栓形成的，其易读分数最低，年级水平最高。根据抄袭计算器Quill bot的计算，ChatGPT与互联网上提供的信息显示出更高的相似度。通过修改后的辨别分数得出的可靠性分数表明，两个人工智能工具相似。尽管每个人工智能工具的各种分数存在差异，但根据获得的P值，没有足够的证据得出一个人工智能工具优于另一个的结论。

相似文献

Use of generative large language models for patient education on common surgical conditions: a comparative analysis between ChatGPT and Google Gemini.使用生成式大语言模型对常见外科疾病患者进行教育：ChatGPT与谷歌Gemini的比较分析

Updates Surg. 2025 Jan 15. doi: 10.1007/s13304-025-02074-8.

Analyzing the Effectiveness of AI-Generated Patient Education Materials: A Comparative Study of ChatGPT and Google Gemini.分析人工智能生成的患者教育材料的有效性：ChatGPT与谷歌Gemini的比较研究

Cureus. 2024 Nov 25;16(11):e74398. doi: 10.7759/cureus.74398. eCollection 2024 Nov.

A Cross-Sectional Study Comparing Patient Education Guides Created by ChatGPT and Google Gemini for Common Cardiovascular-Related Conditions.一项比较ChatGPT和谷歌Gemini针对常见心血管相关疾病创建的患者教育指南的横断面研究。

Cureus. 2025 Jan 14;17(1):e77442. doi: 10.7759/cureus.77442. eCollection 2025 Jan.

Comparative Analysis of ChatGPT and Google Gemini in Generating Patient Educational Resources on Cardiac Health: A Focus on Exercise-Induced Arrhythmia, Sleep Habits, and Dietary Habits.ChatGPT与谷歌Gemini在生成心脏健康患者教育资源方面的比较分析：聚焦运动诱发心律失常、睡眠习惯和饮食习惯

Cureus. 2025 Mar 18;17(3):e80771. doi: 10.7759/cureus.80771. eCollection 2025 Mar.

Evaluating Artificial Intelligence (AI)-Generated Patient Education Guides on Epilepsy: A Cross-Sectional Study of ChatGPT and Google Gemini.评估人工智能（AI）生成的癫痫患者教育指南：ChatGPT和谷歌Gemini的横断面研究

Cureus. 2024 Nov 7;16(11):e73212. doi: 10.7759/cureus.73212. eCollection 2024 Nov.

A Cross-Sectional Study Comparing Patient Information Guides Generated by ChatGPT and Google Gemini for Common Radiological Procedures.一项比较ChatGPT和谷歌Gemini生成的常见放射学检查患者信息指南的横断面研究。

Cureus. 2024 Nov 30;16(11):e74876. doi: 10.7759/cureus.74876. eCollection 2024 Nov.

A Cross-Sectional Study Comparing Patient Information Guides for Amyotrophic Lateral Sclerosis, Myasthenia Gravis, and Guillain-Barré Syndrome Produced by ChatGPT-4 and Google Gemini 1.5.一项比较ChatGPT-4和谷歌Gemini 1.5生成的肌萎缩侧索硬化症、重症肌无力和吉兰-巴雷综合征患者信息指南的横断面研究。

Cureus. 2025 Feb 25;17(2):e79646. doi: 10.7759/cureus.79646. eCollection 2025 Feb.

An Observational Study to Evaluate Readability and Reliability of AI-Generated Brochures for Emergency Medical Conditions.一项评估人工智能生成的急诊医疗状况手册可读性和可靠性的观察性研究。

Cureus. 2024 Aug 31;16(8):e68307. doi: 10.7759/cureus.68307. eCollection 2024 Aug.

Performance of Artificial Intelligence Chatbots in Responding to Patient Queries Related to Traumatic Dental Injuries: A Comparative Study.人工智能聊天机器人在回应与创伤性牙损伤相关的患者咨询中的表现：一项比较研究。

Dent Traumatol. 2025 Jun;41(3):338-347. doi: 10.1111/edt.13020. Epub 2024 Nov 22.

Assessing the quality and readability of patient education materials on chemotherapy cardiotoxicity from artificial intelligence chatbots: An observational cross-sectional study.评估人工智能聊天机器人提供的关于化疗心脏毒性的患者教育材料的质量和可读性：一项观察性横断面研究。

Medicine (Baltimore). 2025 Apr 11;104(15):e42135. doi: 10.1097/MD.0000000000042135.

引用本文的文献

Utilisation of AI-driven chatbots for perioperative health information seeking: a descriptive qualitative study of orthopaedic patients and family members.利用人工智能驱动的聊天机器人获取围手术期健康信息：一项针对骨科患者及其家属的描述性定性研究

BMJ Open. 2025 Sep 4;15(9):e099824. doi: 10.1136/bmjopen-2025-099824.

Comparison of the readability of ChatGPT and Bard in medical communication: a meta-analysis.ChatGPT与Bard在医学交流中的可读性比较：一项荟萃分析。

BMC Med Inform Decis Mak. 2025 Sep 1;25(1):325. doi: 10.1186/s12911-025-03035-2.

本文引用的文献

Will Artificial Intelligence Be "Better" Than Humans in the Management of Syncope?在晕厥管理方面，人工智能会比人类“更出色”吗？

JACC Adv. 2024 Jul 31;3(9):101072. doi: 10.1016/j.jacadv.2024.101072. eCollection 2024 Sep.

Evaluation of the Quality and Reliability of YouTube Videos Created by Orthodontists as an Information Source for Clear Aligners.正畸医生制作的YouTube视频作为隐形矫治器信息来源的质量和可靠性评估

Turk J Orthod. 2024 Mar 28;37(1):44-49. doi: 10.4274/TurkJOrthod.2023.2022.127.

Comparison of large language models in management advice for melanoma: Google's AI BARD, BingAI and ChatGPT.大语言模型在黑色素瘤管理建议方面的比较：谷歌的人工智能BARD、必应人工智能和ChatGPT。

Skin Health Dis. 2023 Nov 28;4(1):e313. doi: 10.1002/ski2.313. eCollection 2024 Feb.

Artificial Intelligence Revolutionizing the Field of Medical Education.人工智能变革医学教育领域。

Cureus. 2023 Nov 28;15(11):e49604. doi: 10.7759/cureus.49604. eCollection 2023 Nov.

Investigating the impact of innovative AI chatbot on post-pandemic medical education and clinical assistance: a comprehensive analysis.探讨创新型人工智能聊天机器人对后疫情时代医学教育和临床辅助的影响：全面分析。

ANZ J Surg. 2024 Feb;94(1-2):68-77. doi: 10.1111/ans.18666. Epub 2023 Aug 21.

Application of Artificial Intelligence in Medical Education: Current Scenario and Future Perspectives.人工智能在医学教育中的应用：现状与未来展望

J Adv Med Educ Prof. 2023 Jul;11(3):133-140. doi: 10.30476/JAMP.2023.98655.1803.

Artificial intelligence in healthcare: Complementing, not replacing, doctors and healthcare providers.医疗保健领域的人工智能：辅助医生和医疗服务提供者，而非取而代之。

Digit Health. 2023 Jul 2;9:20552076231186520. doi: 10.1177/20552076231186520. eCollection 2023 Jan-Dec.

Artificial intelligence in healthcare and education.人工智能在医疗和教育领域的应用。

Br Dent J. 2023 May;234(10):761-764. doi: 10.1038/s41415-023-5845-2. Epub 2023 May 26.

Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性？

Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.

AI-generated research paper fabrication and plagiarism in the scientific community.科学界中人工智能生成的研究论文造假与抄袭现象。

Patterns (N Y). 2023 Mar 10;4(3):100706. doi: 10.1016/j.patter.2023.100706.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

使用生成式大语言模型对常见外科疾病患者进行教育：ChatGPT与谷歌Gemini的比较分析

Use of generative large language models for patient education on common surgical conditions: a comparative analysis between ChatGPT and Google Gemini.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献