Can AI Answer My Questions? Utilizing Artificial Intelligence in the Perioperative Assessment for Abdominoplasty Patients.

Author information

Lim Bryan, Seth Ishith, Cuomo Roberto, Kenney Peter Sinkjær, Ross Richard J, Sofiadellis Foti, Pentangelo Paola, Ceccaroni Alessandra, Alfano Carmine, Rozen Warren Matthew

Affiliations

Department of Plastic Surgery, Peninsula Health, Melbourne, Victoria, 3199, Australia.

Plastic Surgery Unit, Department of Medicine, Surgery and Neuroscience, University of Siena, Siena, Italy.

Publication information

Aesthetic Plast Surg. 2024 Nov;48(22):4712-4724. doi: 10.1007/s00266-024-04157-0. Epub 2024 Jun 19.

DOI:10.1007/s00266-024-04157-0

PMID:38898239

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11645314/

Abstract

BACKGROUND

Abdominoplasty is a common operation, used for a range of cosmetic and functional issues, often in the context of divarication of recti, significant weight loss, and after pregnancy. Despite this, patient-surgeon communication gaps can hinder informed decision-making. The integration of large language models (LLMs) in healthcare offers potential for enhancing patient information. This study evaluated the feasibility of using LLMs for answering perioperative queries.

METHODS

This study assessed the efficacy of four leading LLMs (OpenAI's ChatGPT-3.5, Anthropic's Claude, Google's Gemini, and Bing's CoPilot) using fifteen unique prompts. Readability of all outputs was assessed using the Flesch-Kincaid Grade Level, the Flesch Reading Ease score, and the Coleman-Liau index. Quality was evaluated using the DISCERN score and a Likert scale. Scores were assigned by two plastic surgical residents, then reviewed and discussed by five specialist plastic surgeons until consensus was reached.
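As an illustration of the three readability measures named above, the following is a minimal sketch using their standard published formulas. It is not the authors' tooling: the abstract does not say which software was used, and the function names `count_syllables` and `readability`, the regex-based sentence and word splitting, and the vowel-group syllable heuristic are all assumptions made for this example.

```python
import re


def count_syllables(word: str) -> int:
    """Rough syllable estimate (assumption): count runs of consecutive vowels."""
    lowered = word.lower()
    count = len(re.findall(r"[aeiouy]+", lowered))
    # Treat a trailing silent 'e' as non-syllabic, except in '-le' endings.
    if lowered.endswith("e") and not lowered.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def readability(text: str) -> dict:
    """Compute three readability scores from simple regex-based counts."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    letters = sum(len(w) for w in words)
    syllables = sum(count_syllables(w) for w in words)

    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)
    letters_per_100_words = letters / len(words) * 100
    sentences_per_100_words = len(sentences) / len(words) * 100

    return {
        # Higher score means easier text.
        "flesch_reading_ease": 206.835
        - 1.015 * words_per_sentence
        - 84.6 * syllables_per_word,
        # Approximate US school grade needed to understand the text.
        "flesch_kincaid_grade": 0.39 * words_per_sentence
        + 11.8 * syllables_per_word
        - 15.59,
        # Grade estimate based on letters rather than syllables.
        "coleman_liau_index": 0.0588 * letters_per_100_words
        - 0.296 * sentences_per_100_words
        - 15.8,
    }


if __name__ == "__main__":
    sample = "The cat sat. The dog ran."
    for name, value in readability(sample).items():
        print(f"{name}: {value:.2f}")
```

Because the syllable count is heuristic, absolute scores from this sketch may differ slightly from those of dedicated readability tools; it is better suited to comparing texts against each other than to reporting exact grade levels.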

RESULTS

ChatGPT-3.5 required the highest reading level for comprehension, followed by Gemini, Claude, then CoPilot. Claude provided the most appropriate and actionable advice. CoPilot was the most patient-friendly, enhancing engagement and the comprehensiveness of information. ChatGPT-3.5 and Gemini offered adequate, though unremarkable, advice and used more professional language. CoPilot was the only model to include visual aids and hyperlinks, although these were not very helpful or acceptable, and it was limited in its ability to respond to certain queries.

CONCLUSION

ChatGPT-3.5, Gemini, Claude, and Bing's CoPilot differed in readability and reliability. LLMs offer unique advantages for patient care but require careful selection. Future research should integrate LLM strengths and address weaknesses for optimal patient education.

LEVEL OF EVIDENCE V

This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors at www.springer.com/00266.
