用于“分娩硬膜外麻醉”患者教育的人工智能聊天机器人与传统医学资源的比较：准确性、情感基调及可读性评估

Artificial intelligence chatbots versus traditional medical resources for patient education on "Labor Epidurals": an evaluation of accuracy, emotional tone, and readability.

作者信息

Gondode Prakash Gyandev, Singh Ram, Mehta Swati, Singh Sneha, Kumar Subodh, Nayak Sudhansu Sekhar

机构信息

Department of Anaesthesiology Pain Medicine and Critical Care, All India Institute of Medical Sciences, New Delhi, India.

出版信息

Int J Obstet Anesth. 2025 Feb;61:104302. doi: 10.1016/j.ijoa.2024.104302. Epub 2024 Nov 26.

DOI:10.1016/j.ijoa.2024.104302

PMID:39657284

Abstract

BACKGROUND

Labor epidural analgesia is a widely used method for pain relief in childbirth, yet information accessibility for expectant mothers remains a challenge. Artificial intelligence (AI) chatbots like Chat Generative Pre-Trained Transformer (ChatGPT) and Google Gemini offer potential solutions for improving patient education. This study evaluates the accuracy, readability, and emotional tone of AI chatbot responses compared to the American Society of Anesthesiologists (ASA) online materials on labor epidurals.

METHODS

Eight common questions about labor epidurals were posed to ChatGPT and Gemini. Seven obstetric anaesthesiologists evaluated the generated responses for accuracy and completeness on a 1-10 Likert scale, comparing them with ASA-sourced content. Statistical analysis (one-way ANOVA, Tukey HSD), sentiment analysis and readability metrics (Flesch Reading ease) were used to assess differences.

RESULTS

ASA materials scored highest for accuracy (8.80 ± 0.40) and readability, followed by Gemini and ChatGPT. Completeness scores showed ASA and Gemini performing significantly better than ChatGPT (P <0.001). ASA materials were the most accessible, while Gemini content was more complex. Sentiment analysis indicated a neutral tone for ASA and Gemini, with ChatGPT displaying a less consistent tone.

CONCLUSION

AI chatbots exhibit promise in patient education for labor epidurals but require improvements in readability and tone consistency to enhance engagement. Further refinement of AI chatbots may support more accessible, patient-centred healthcare information.

摘要

背景

分娩硬膜外镇痛是一种广泛应用于分娩疼痛缓解的方法，但对孕妇而言，信息的可获取性仍是一项挑战。像聊天生成预训练变换器（ChatGPT）和谷歌双子星（Google Gemini）这样的人工智能（AI）聊天机器人为改善患者教育提供了潜在的解决方案。本研究将AI聊天机器人的回复与美国麻醉医师协会（ASA）关于分娩硬膜外麻醉的在线资料进行比较，评估其准确性、可读性和情感基调。

方法

向ChatGPT和双子星提出八个关于分娩硬膜外麻醉的常见问题。七名产科麻醉医师在1-10李克特量表上评估生成的回复的准确性和完整性，并将其与ASA来源的内容进行比较。采用统计分析（单因素方差分析、Tukey HSD检验）、情感分析和可读性指标（弗莱什阅读简易度）来评估差异。

结果

ASA资料在准确性（8.80±0.40）和可读性方面得分最高，其次是双子星和ChatGPT。完整性得分显示，ASA和双子星的表现明显优于ChatGPT（P<0.001）。ASA资料最易理解，而双子星的内容则更复杂。情感分析表明，ASA和双子星的语气中性，ChatGPT的语气则不太一致。

结论

AI聊天机器人在分娩硬膜外麻醉的患者教育方面显示出前景，但需要提高可读性和语气一致性以增强参与度。进一步优化AI聊天机器人可能有助于提供更易获取、以患者为中心的医疗保健信息。

相似文献

Artificial intelligence chatbots versus traditional medical resources for patient education on "Labor Epidurals": an evaluation of accuracy, emotional tone, and readability.用于“分娩硬膜外麻醉”患者教育的人工智能聊天机器人与传统医学资源的比较：准确性、情感基调及可读性评估

Int J Obstet Anesth. 2025 Feb;61:104302. doi: 10.1016/j.ijoa.2024.104302. Epub 2024 Nov 26.

Readability, quality and accuracy of generative artificial intelligence chatbots for commonly asked questions about labor epidurals: a comparison of ChatGPT and Bard.生成式人工智能聊天机器人针对分娩硬膜外麻醉常见问题的可读性、质量和准确性：ChatGPT与Bard的比较

Int J Obstet Anesth. 2025 Feb;61:104317. doi: 10.1016/j.ijoa.2024.104317. Epub 2024 Dec 20.

Comparative Analysis of Accuracy, Readability, Sentiment, and Actionability: Artificial Intelligence Chatbots (ChatGPT and Google Gemini) versus Traditional Patient Information Leaflets for Local Anesthesia in Eye Surgery.准确性、可读性、情感倾向和可操作性的比较分析：人工智能聊天机器人（ChatGPT和谷歌Gemini）与眼科手术局部麻醉传统患者信息手册的对比

Br Ir Orthopt J. 2024 Aug 19;20(1):183-192. doi: 10.22599/bioj.377. eCollection 2024.

Assessing the quality and readability of patient education materials on chemotherapy cardiotoxicity from artificial intelligence chatbots: An observational cross-sectional study.评估人工智能聊天机器人提供的关于化疗心脏毒性的患者教育材料的质量和可读性：一项观察性横断面研究。

Medicine (Baltimore). 2025 Apr 11;104(15):e42135. doi: 10.1097/MD.0000000000042135.

Performance of Artificial Intelligence Chatbots in Responding to Patient Queries Related to Traumatic Dental Injuries: A Comparative Study.人工智能聊天机器人在回应与创伤性牙损伤相关的患者咨询中的表现：一项比较研究。

Dent Traumatol. 2025 Jun;41(3):338-347. doi: 10.1111/edt.13020. Epub 2024 Nov 22.

End-of-life Care Patient Information Leaflets-A Comparative Evaluation of Artificial Intelligence-generated Content for Readability, Sentiment, Accuracy, Completeness, and Suitability: ChatGPT vs Google Gemini.临终关怀患者信息手册——人工智能生成内容在可读性、情感倾向、准确性、完整性和适用性方面的比较评估：ChatGPT与谷歌Gemini对比

Indian J Crit Care Med. 2024 Jun;28(6):561-568. doi: 10.5005/jp-journals-10071-24725.

Assessment of readability, reliability, and quality of ChatGPT®, BARD®, Gemini®, Copilot®, Perplexity® responses on palliative care.评估 ChatGPT®、BARD®、 Gemini®、Copilot®、Perplexity® 在姑息治疗方面的可读性、可靠性和质量。

Medicine (Baltimore). 2024 Aug 16;103(33):e39305. doi: 10.1097/MD.0000000000039305.

Evaluación de la fiabilidad y legibilidad de las respuestas de los chatbots como recurso de información al paciente para las exploraciones PET-TC más communes.评估聊天机器人回复作为常见PET-CT检查患者信息资源的可靠性和可读性。

Rev Esp Med Nucl Imagen Mol (Engl Ed). 2025 Jan-Feb;44(1):500065. doi: 10.1016/j.remnie.2024.500065. Epub 2024 Sep 28.

Comparing patient education tools for chronic pain medications: Artificial intelligence chatbot versus traditional patient information leaflets.比较慢性疼痛药物的患者教育工具：人工智能聊天机器人与传统患者信息手册。

Indian J Anaesth. 2024 Jul;68(7):631-636. doi: 10.4103/ija.ija_204_24. Epub 2024 Jun 7.

Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.生成式人工智能聊天机器人可能会为患者关于常见血管外科问题提供恰当的信息性回复。

Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.

引用本文的文献

Harnessing Generative Artificial Intelligence in Pediatric Anesthesia: Enhancing Learning, Patient Care, and Family Communication.在儿科麻醉中利用生成式人工智能：加强学习、患者护理和医患沟通。

Paediatr Anaesth. 2025 Sep;35(9):691-694. doi: 10.1111/pan.70005. Epub 2025 Jun 24.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

用于“分娩硬膜外麻醉”患者教育的人工智能聊天机器人与传统医学资源的比较：准确性、情感基调及可读性评估

Artificial intelligence chatbots versus traditional medical resources for patient education on "Labor Epidurals": an evaluation of accuracy, emotional tone, and readability.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献