面颈部除皱术后并发症：在 16 个模拟患者就诊场景下，对大型语言模型和人工智能（ChatGPT）性能的实施和评估。

Complications Following Facelift and Neck Lift: Implementation and Assessment of Large Language Model and Artificial Intelligence (ChatGPT) Performance Across 16 Simulated Patient Presentations.

机构信息

Division of Plastic, Reconstructive, and Aesthetic Surgery, McGill University Health Centre, Montreal, QC, Canada.

Manhattan Eye, Ear and Throat Hospital, New York, NY, USA.

出版信息

Aesthetic Plast Surg. 2023 Dec;47(6):2407-2414. doi: 10.1007/s00266-023-03538-1. Epub 2023 Aug 17.

DOI:10.1007/s00266-023-03538-1

PMID:37589944

Abstract

INTRODUCTION

ChatGPT represents a potential resource for patient guidance and education, with the possibility for quality improvement in healthcare delivery. The present study evaluates the role of ChatGPT as an interactive patient resource, and assesses its performance in identifying, triaging, and guiding patients with concerns of postoperative complications following facelift and neck lift surgery.

METHODS

Sixteen patient profiles were generated to simulate postoperative patient presentations, with complications of varying acuity and severity. ChatGPT was assessed for its accuracy in generating a differential diagnosis, soliciting a history, providing the most-likely diagnosis, the appropriate disposition, treatments/interventions to begin from home, and red-flag symptoms necessitating an urgent presentation to the emergency department.

RESULTS

Overall accuracy in providing a complete differential diagnosis in response to simulated presentations was 85%, with an accuracy of 88% in identifying the most-likely diagnosis after history-taking. However, appropriate patient dispositions were suggested in only 56% of cases. Relevant home treatments/interventions were suggested with an 82% accuracy, and red-flag symptoms with a 73% accuracy. A detailed analysis, stratified according to latency of postoperative presentation (<48 h, 48 h-1 week, or >1 week), and according to acuity of complications, is presented herein.

CONCLUSIONS

ChatGPT overestimated the urgency of indicated patient dispositions in 44% of cases, concerning for potential unnecessary increase in healthcare resource utilization. Imperfect performance, and the tool's tendency for overinclusion in its responses, risk increasing patient anxiety and straining physician-patient relationships. While artificial intelligence has great potential in triaging postoperative patient concerns, and improving efficiency and resource utilization, ChatGPT's performance, in its current form, demonstrates a need for further refinement before its safe and effective implementation in facial aesthetic surgical practice.

LEVEL OF EVIDENCE IV

This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266 .

摘要

简介

ChatGPT 代表了一种潜在的患者指导和教育资源，有可能改善医疗保健服务的质量。本研究评估了 ChatGPT 作为交互式患者资源的作用，并评估了其在识别、分诊和指导接受面部提升和颈部提升手术后出现并发症的患者方面的表现。

方法

生成了 16 个患者档案，以模拟术后患者的表现，模拟了不同严重程度和严重程度的并发症。评估了 ChatGPT 在生成鉴别诊断、询问病史、提供最可能诊断、适当处置、建议在家开始的治疗/干预措施以及需要紧急到急诊就诊的红色症状方面的准确性。

结果

总体而言，在模拟演示中提供完整鉴别诊断的准确率为 85%，在询问病史后识别最可能诊断的准确率为 88%。然而，只有 56%的情况下建议了适当的患者处置。建议的相关家庭治疗/干预的准确率为 82%，红色症状的准确率为 73%。根据术后表现的潜伏期（<48 小时、48 小时-1 周或>1 周）和并发症的严重程度进行了详细分析，并在此处呈现。

结论

ChatGPT 在 44%的情况下高估了指示性患者处置的紧迫性，这可能导致不必要地增加医疗资源的利用。不完善的表现和工具在响应中过度包容的倾向，可能会增加患者的焦虑并影响医患关系。虽然人工智能在分诊术后患者的担忧和提高效率和资源利用方面具有巨大潜力，但 ChatGPT 的表现表明，在其安全有效地应用于面部美容手术实践之前，需要进一步改进。

证据等级 IV：本杂志要求作者为每篇文章分配一个证据等级。有关这些循证医学评级的完整描述，请参阅目录或在线作者指南 www.springer.com/00266 。

相似文献

Complications Following Facelift and Neck Lift: Implementation and Assessment of Large Language Model and Artificial Intelligence (ChatGPT) Performance Across 16 Simulated Patient Presentations.面颈部除皱术后并发症：在 16 个模拟患者就诊场景下，对大型语言模型和人工智能（ChatGPT）性能的实施和评估。

Aesthetic Plast Surg. 2023 Dec;47(6):2407-2414. doi: 10.1007/s00266-023-03538-1. Epub 2023 Aug 17.

Complications Following Body Contouring: Performance Validation of Bard, a Novel AI Large Language Model, in Triaging and Managing Postoperative Patient Concerns.身体塑形术后并发症：新型 AI 大语言模型 Bard 在分诊和处理术后患者问题方面的性能验证。

Aesthetic Plast Surg. 2024 Mar;48(5):953-976. doi: 10.1007/s00266-023-03819-9. Epub 2024 Jan 25.

Utility and Comparative Performance of Current Artificial Intelligence Large Language Models as Postoperative Medical Support Chatbots in Aesthetic Surgery.当前人工智能大语言模型作为美容外科术后医疗支持聊天机器人的效用和比较性能。

Aesthet Surg J. 2024 Jul 15;44(8):889-896. doi: 10.1093/asj/sjae025.

Exploring the Unknown: Evaluating ChatGPT's Performance in Uncovering Novel Aspects of Plastic Surgery and Identifying Areas for Future Innovation.探索未知：评估 ChatGPT 在揭示整形外科新方面的表现，并确定未来创新的领域。

Aesthetic Plast Surg. 2024 Jul;48(13):2580-2589. doi: 10.1007/s00266-024-03952-z. Epub 2024 Mar 25.

Aesthetic Surgery Advice and Counseling from Artificial Intelligence: A Rhinoplasty Consultation with ChatGPT.人工智能提供的美容外科建议和咨询：ChatGPT 参与的隆鼻咨询。

Aesthetic Plast Surg. 2023 Oct;47(5):1985-1993. doi: 10.1007/s00266-023-03338-7. Epub 2023 Apr 24.

Can ChatGPT be the Plastic Surgeon's New Digital Assistant? A Bibliometric Analysis and Scoping Review of ChatGPT in Plastic Surgery Literature.ChatGPT能否成为整形外科医生的新型数字助手？整形外科文献中ChatGPT的文献计量分析与范围综述

Aesthetic Plast Surg. 2024 Apr;48(8):1644-1652. doi: 10.1007/s00266-023-03709-0. Epub 2023 Oct 18.

Assessing Improvement of Patient Satisfaction Following Facelift Surgery Using the FACE-Q Scales: A Prospective and Multicenter Study.采用 FACE-Q 量表评估面部提升术后患者满意度的改善：一项前瞻性、多中心研究。

Aesthetic Plast Surg. 2019 Apr;43(2):370-375. doi: 10.1007/s00266-018-1277-9. Epub 2018 Nov 28.

Evaluation of the Artificial Intelligence Chatbot on Breast Reconstruction and Its Efficacy in Surgical Research: A Case Study.评估人工智能聊天机器人在乳房重建中的应用及其在外科研究中的疗效：案例研究。

Aesthetic Plast Surg. 2023 Dec;47(6):2360-2369. doi: 10.1007/s00266-023-03443-7. Epub 2023 Jun 14.

Comparative Performance of Current Patient-Accessible Artificial Intelligence Large Language Models in the Preoperative Education of Patients in Facial Aesthetic Surgery.当前患者可使用的人工智能大语言模型在面部美容手术患者术前教育中的比较性能

Aesthet Surg J Open Forum. 2024 Aug 13;6:ojae058. doi: 10.1093/asjof/ojae058. eCollection 2024.

Exploring the Potential of ChatGPT-4 in Responding to Common Questions About Abdominoplasty: An AI-Based Case Study of a Plastic Surgery Consultation.探讨 ChatGPT-4 在回答腹部整形常见问题方面的潜力：基于人工智能的整形外科咨询案例研究。

Aesthetic Plast Surg. 2024 Apr;48(8):1571-1583. doi: 10.1007/s00266-023-03660-0. Epub 2023 Sep 28.

引用本文的文献

A systematic review and meta-analysis of diagnostic performance comparison between generative AI and physicians.生成式人工智能与医生诊断性能比较的系统评价与荟萃分析

NPJ Digit Med. 2025 Mar 22;8(1):175. doi: 10.1038/s41746-025-01543-z.

Analyzing evaluation methods for large language models in the medical field: a scoping review.分析医学领域大语言模型的评价方法：范围综述。

BMC Med Inform Decis Mak. 2024 Nov 29;24(1):366. doi: 10.1186/s12911-024-02709-7.

Preoperative Patient Guidance and Education in Aesthetic Breast Plastic Surgery: A Novel Proposed Application of Artificial Intelligence Large Language Models.美容乳房整形手术中的术前患者指导与教育：人工智能大语言模型的一种新型拟用应用

Aesthet Surg J Open Forum. 2024 Aug 13;6:ojae062. doi: 10.1093/asjof/ojae062. eCollection 2024.

Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini.大型语言模型在整形手术中的术中决策支持：ChatGPT-4 和 Gemini 的比较。

Medicina (Kaunas). 2024 Jun 8;60(6):957. doi: 10.3390/medicina60060957.

Artificial Intelligence as a Triage Tool during the Perioperative Period: Pilot Study of Accuracy and Accessibility for Clinical Application.人工智能作为围手术期的分诊工具：临床应用准确性和可及性的初步研究

Plast Reconstr Surg Glob Open. 2024 Feb 2;12(2):e5580. doi: 10.1097/GOX.0000000000005580. eCollection 2024 Feb.

本文引用的文献

Harvesting the Power of Artificial Intelligence for Surgery: Uses, Implications, and Ethical Considerations.挖掘人工智能在手术中的力量：用途、影响及伦理考量。

Am Surg. 2023 Dec;89(12):5102-5104. doi: 10.1177/00031348231175454. Epub 2023 May 6.

Exploring the Potential of Artificial Intelligence in Surgery: Insights from a Conversation with ChatGPT.探索人工智能在手术中的潜力：与ChatGPT对话的见解

Ann Surg Oncol. 2023 Jul;30(7):3875-3878. doi: 10.1245/s10434-023-13347-0. Epub 2023 Apr 5.

ChatGPT: Is this version good for healthcare and research?ChatGPT：这个版本对医疗保健和研究有帮助吗？

Diabetes Metab Syndr. 2023 Apr;17(4):102744. doi: 10.1016/j.dsx.2023.102744. Epub 2023 Mar 15.

Application of ChatGPT in Cosmetic Plastic Surgery: Ally or Antagonist?ChatGPT在美容整形手术中的应用：盟友还是对手？

Aesthet Surg J. 2023 Jun 14;43(7):NP587-NP590. doi: 10.1093/asj/sjad042.

How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试（USMLE）中的表现如何？大语言模型对医学教育和知识评估的影响。

JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.

Machine Learning in Medicine.医学中的机器学习

N Engl J Med. 2019 Apr 4;380(14):1347-1358. doi: 10.1056/NEJMra1814259.

Introduction to artificial intelligence in medicine.医学人工智能导论。

Minim Invasive Ther Allied Technol. 2019 Apr;28(2):73-81. doi: 10.1080/13645706.2019.1575882. Epub 2019 Feb 27.

Machine learning for medical diagnosis: history, state of the art and perspective.用于医学诊断的机器学习：历史、现状与展望。

Artif Intell Med. 2001 Aug;23(1):89-109. doi: 10.1016/s0933-3657(01)00077-x.

Artificial intelligence applications in the intensive care unit.人工智能在重症监护病房的应用。

Crit Care Med. 2001 Feb;29(2):427-35. doi: 10.1097/00003246-200102000-00038.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

面颈部除皱术后并发症：在 16 个模拟患者就诊场景下，对大型语言模型和人工智能（ChatGPT）性能的实施和评估。

Complications Following Facelift and Neck Lift: Implementation and Assessment of Large Language Model and Artificial Intelligence (ChatGPT) Performance Across 16 Simulated Patient Presentations.

机构信息

出版信息

INTRODUCTION

METHODS

RESULTS

CONCLUSIONS

LEVEL OF EVIDENCE IV

简介

方法

结果

结论

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献