S Adithya, Aggarwal Shreyas, Sridhar Janani, Vs Kavya, John Victoria P, Singh Chaihthanya
Medical School, Ramaiah Medical College, Bangalore, IND.
Geriatrics, Prince Charles Hospital, Cwm Taf Morgannwg University Health Board, Merthyr Tydfil, GBR.
Cureus. 2024 Aug 31;16(8):e68307. doi: 10.7759/cureus.68307. eCollection 2024 Aug.
Introduction: This study assesses the readability of AI-generated brochures for common emergency medical conditions such as heart attack, anaphylaxis, and syncope. The aim was to compare patient information guides for these conditions generated by ChatGPT and Google Gemini.

Methodology: Brochures for each condition were created with both AI tools. Readability was assessed using the Flesch-Kincaid Calculator, which evaluates word count, sentence count, and reading ease. Reliability was measured using the Modified DISCERN score. Similarity between the AI outputs was determined using Quillbot. Statistical analysis was performed with R (v4.3.2).

Results: ChatGPT and Gemini produced brochures with no statistically significant differences in word count (p = 0.2119), sentence count (p = 0.1276), readability (p = 0.3796), or reliability (p = 0.7407). However, ChatGPT provided more detailed content, with 32.4% more words (582.80 vs. 440.20) and 51.6% more sentences (67.00 vs. 44.20), while Gemini's brochures were slightly easier to read, with a higher reading ease score (50.62 vs. 41.88). Reliability varied by topic: ChatGPT scored higher for heart attack (4 vs. 3) and choking (3 vs. 2), whereas Google Gemini scored higher for anaphylaxis (4 vs. 3) and drowning (4 vs. 3), highlighting the need for topic-specific evaluation.

Conclusions: AI-generated brochures from ChatGPT and Gemini are comparable as patient information on emergency medical conditions: the study found no statistically significant difference in readability or reliability between the responses generated by the two AI tools.
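For readers unfamiliar with the readability metric cited above, the sketch below illustrates the standard Flesch Reading Ease formula (206.835 - 1.015 x words/sentences - 84.6 x syllables/words), where higher scores indicate easier text. This is an illustration only, not the calculator used in the study; the naive vowel-group syllable heuristic and function names are assumptions for the example.

```python
import re

def count_syllables(word: str) -> int:
    # Naive heuristic: count runs of consecutive vowels (assumption, not the study's method).
    return max(1, len(re.findall(r"[aeiouy]+", word.lower())))

def flesch_reading_ease(text: str) -> float:
    # Flesch Reading Ease = 206.835 - 1.015*(words/sentences) - 84.6*(syllables/words).
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    syllables = sum(count_syllables(w) for w in words)
    return 206.835 - 1.015 * (len(words) / len(sentences)) - 84.6 * (syllables / len(words))

# Example: short, plain sentences score high on the 0-100 scale
# (the study reports mean scores of about 50.6 for Gemini vs. 41.9 for ChatGPT).
print(round(flesch_reading_ease("Call emergency services. Keep the person calm and still."), 1))
```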