Suppr
超能文献

聊天机器人作为美容面部整形手术的患者教育资源：对 ChatGPT 和 Google Bard 回复的评估。

Chatbots as Patient Education Resources for Aesthetic Facial Plastic Surgery: Evaluation of ChatGPT and Google Bard Responses.

机构信息

Department of Otolaryngology - Head and Neck Surgery, Thomas Jefferson University Hospitals, Philadelphia, Pennsylvania, USA.

Sidney Kimmel Medical College, Philadelphia, Pennsylvania, USA.

出版信息

Facial Plast Surg Aesthet Med. 2024 Nov-Dec;26(6):665-673. doi: 10.1089/fpsam.2023.0368. Epub 2024 Jul 1.

DOI:10.1089/fpsam.2023.0368

PMID:38946595

Abstract

ChatGPT and Google Bard™ are popular artificial intelligence chatbots with utility for patients, including those undergoing aesthetic facial plastic surgery. To compare the accuracy and readability of chatbot-generated responses to patient education questions regarding aesthetic facial plastic surgery using a response accuracy scale and readability testing. ChatGPT and Google Bard™ were asked 28 identical questions using four prompts: none, patient friendly, eighth-grade level, and references. Accuracy was assessed using Global Quality Scale (range: 1-5). Flesch-Kincaid grade level was calculated, and chatbot-provided references were analyzed for veracity. Although 59.8% of responses were good quality (Global Quality Scale ≥4), ChatGPT generated more accurate responses than Google Bard™ on patient-friendly prompting ( < 0.001). Google Bard™ responses were of a significantly lower grade level than ChatGPT for all prompts ( < 0.05). Despite eighth-grade prompting, response grade level for both chatbots was high: ChatGPT (10.5 ± 1.8) and Google Bard™ (9.6 ± 1.3). Prompting for references yielded 108/108 of chatbot-generated references. Forty-one (38.0%) citations were legitimate. Twenty (18.5%) provided accurately reported information from the reference. Although ChatGPT produced more accurate responses and at a higher education level than Google Bard™, both chatbots provided responses above recommended grade levels for patients and failed to provide accurate references.

摘要

ChatGPT 和 Google Bard™ 是广受欢迎的人工智能聊天机器人，对患者具有实用价值，包括那些正在接受美容面部整形手术的患者。为了比较聊天机器人对美容面部整形手术患者教育问题生成的回复的准确性和可读性，使用回复准确性量表和可读性测试。使用四个提示词（无提示、患者友好型、八年级水平和参考文献）向 ChatGPT 和 Google Bard™ 询问了 28 个相同的问题。准确性使用全球质量量表（范围：1-5）进行评估。计算弗莱什-金凯德年级水平，并分析聊天机器人提供的参考文献的真实性。尽管 59.8%的回复质量良好（全球质量量表≥4），但在患者友好型提示下，ChatGPT 生成的回复比 Google Bard™更准确（<0.001）。对于所有提示词，Google Bard™的回复等级都明显低于 ChatGPT（<0.05）。尽管提示词为八年级水平，但两个聊天机器人的回复等级都很高：ChatGPT（10.5±1.8）和 Google Bard™（9.6±1.3）。提示参考文献生成了 108/108 个聊天机器人生成的参考文献。41 个（38.0%）引述是合法的。20 个（18.5%）提供了准确报告的参考文献信息。尽管 ChatGPT 生成的回复比 Google Bard™更准确，且教育水平更高，但两个聊天机器人提供的回复都高于患者推荐的等级水平，且未能提供准确的参考文献。

相似文献

Chatbots as Patient Education Resources for Aesthetic Facial Plastic Surgery: Evaluation of ChatGPT and Google Bard Responses.

Facial Plast Surg Aesthet Med. 2024 Nov-Dec;26(6):665-673. doi: 10.1089/fpsam.2023.0368. Epub 2024 Jul 1.

Generative artificial intelligence chatbots may provide appropriate informational responses to common vascular surgery questions by patients.

Vascular. 2025 Feb;33(1):229-237. doi: 10.1177/17085381241240550. Epub 2024 Mar 18.

Artificial intelligence chatbots as sources of patient education material for cataract surgery: ChatGPT-4 versus Google Bard.

BMJ Open Ophthalmol. 2024 Oct 17;9(1):e001824. doi: 10.1136/bmjophth-2024-001824.

Reliability and readability analysis of ChatGPT-4 and Google Bard as a patient information source for the most commonly applied radionuclide treatments in cancer patients.

Rev Esp Med Nucl Imagen Mol (Engl Ed). 2024 Jul-Aug;43(4):500021. doi: 10.1016/j.remnie.2024.500021. Epub 2024 May 29.

Performance of Artificial Intelligence Chatbots on Glaucoma Questions Adapted From Patient Brochures.

Cureus. 2024 Mar 23;16(3):e56766. doi: 10.7759/cureus.56766. eCollection 2024 Mar.

Artificial intelligence chatbots as sources of patient education material for obstructive sleep apnoea: ChatGPT versus Google Bard.

Eur Arch Otorhinolaryngol. 2024 Feb;281(2):985-993. doi: 10.1007/s00405-023-08319-9. Epub 2023 Nov 2.

Appropriateness and readability of Google Bard and ChatGPT-3.5 generated responses for surgical treatment of glaucoma.

Rom J Ophthalmol. 2024 Jul-Sep;68(3):243-248. doi: 10.22336/rjo.2024.45.

The promising role of chatbots in keratorefractive surgery patient education.

J Fr Ophtalmol. 2025 Feb;48(2):104381. doi: 10.1016/j.jfo.2024.104381. Epub 2024 Dec 13.

Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.

Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.

Readability, quality and accuracy of generative artificial intelligence chatbots for commonly asked questions about labor epidurals: a comparison of ChatGPT and Bard.

Int J Obstet Anesth. 2025 Feb;61:104317. doi: 10.1016/j.ijoa.2024.104317. Epub 2024 Dec 20.

引用本文的文献

Comparison of the readability of ChatGPT and Bard in medical communication: a meta-analysis.

BMC Med Inform Decis Mak. 2025 Sep 1;25(1):325. doi: 10.1186/s12911-025-03035-2.

Evaluating large language models in patient education on facial plastic surgery: a standardized protocol.

Int J Surg Protoc. 2025 Jun 11;29(3):108-112. doi: 10.1097/SP9.0000000000000052. eCollection 2025 Sep.

Application of ChatGPT-assisted problem-based learning teaching method in clinical medical education.

BMC Med Educ. 2025 Jan 11;25(1):50. doi: 10.1186/s12909-024-06321-1.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

Suppr超能文献

聊天机器人作为美容面部整形手术的患者教育资源：对 ChatGPT 和 Google Bard 回复的评估。

Chatbots as Patient Education Resources for Aesthetic Facial Plastic Surgery: Evaluation of ChatGPT and Google Bard Responses.

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译