Suppr超能文献

面颈部除皱术后并发症:在 16 个模拟患者就诊场景下,对大型语言模型和人工智能(ChatGPT)性能的实施和评估。

Complications Following Facelift and Neck Lift: Implementation and Assessment of Large Language Model and Artificial Intelligence (ChatGPT) Performance Across 16 Simulated Patient Presentations.

机构信息

Division of Plastic, Reconstructive, and Aesthetic Surgery, McGill University Health Centre, Montreal, QC, Canada.

Manhattan Eye, Ear and Throat Hospital, New York, NY, USA.

出版信息

Aesthetic Plast Surg. 2023 Dec;47(6):2407-2414. doi: 10.1007/s00266-023-03538-1. Epub 2023 Aug 17.

Abstract

INTRODUCTION

ChatGPT represents a potential resource for patient guidance and education, with the possibility for quality improvement in healthcare delivery. The present study evaluates the role of ChatGPT as an interactive patient resource, and assesses its performance in identifying, triaging, and guiding patients with concerns of postoperative complications following facelift and neck lift surgery.

METHODS

Sixteen patient profiles were generated to simulate postoperative patient presentations, with complications of varying acuity and severity. ChatGPT was assessed for its accuracy in generating a differential diagnosis, soliciting a history, providing the most-likely diagnosis, the appropriate disposition, treatments/interventions to begin from home, and red-flag symptoms necessitating an urgent presentation to the emergency department.

RESULTS

Overall accuracy in providing a complete differential diagnosis in response to simulated presentations was 85%, with an accuracy of 88% in identifying the most-likely diagnosis after history-taking. However, appropriate patient dispositions were suggested in only 56% of cases. Relevant home treatments/interventions were suggested with an 82% accuracy, and red-flag symptoms with a 73% accuracy. A detailed analysis, stratified according to latency of postoperative presentation (<48 h, 48 h-1 week, or >1 week), and according to acuity of complications, is presented herein.

CONCLUSIONS

ChatGPT overestimated the urgency of indicated patient dispositions in 44% of cases, concerning for potential unnecessary increase in healthcare resource utilization. Imperfect performance, and the tool's tendency for overinclusion in its responses, risk increasing patient anxiety and straining physician-patient relationships. While artificial intelligence has great potential in triaging postoperative patient concerns, and improving efficiency and resource utilization, ChatGPT's performance, in its current form, demonstrates a need for further refinement before its safe and effective implementation in facial aesthetic surgical practice.

LEVEL OF EVIDENCE IV

This journal requires that authors assign a level of evidence to each article. For a full description of these Evidence-Based Medicine ratings, please refer to the Table of Contents or the online Instructions to Authors www.springer.com/00266 .

摘要

简介

ChatGPT 代表了一种潜在的患者指导和教育资源,有可能改善医疗保健服务的质量。本研究评估了 ChatGPT 作为交互式患者资源的作用,并评估了其在识别、分诊和指导接受面部提升和颈部提升手术后出现并发症的患者方面的表现。

方法

生成了 16 个患者档案,以模拟术后患者的表现,模拟了不同严重程度和严重程度的并发症。评估了 ChatGPT 在生成鉴别诊断、询问病史、提供最可能诊断、适当处置、建议在家开始的治疗/干预措施以及需要紧急到急诊就诊的红色症状方面的准确性。

结果

总体而言,在模拟演示中提供完整鉴别诊断的准确率为 85%,在询问病史后识别最可能诊断的准确率为 88%。然而,只有 56%的情况下建议了适当的患者处置。建议的相关家庭治疗/干预的准确率为 82%,红色症状的准确率为 73%。根据术后表现的潜伏期(<48 小时、48 小时-1 周或>1 周)和并发症的严重程度进行了详细分析,并在此处呈现。

结论

ChatGPT 在 44%的情况下高估了指示性患者处置的紧迫性,这可能导致不必要地增加医疗资源的利用。不完善的表现和工具在响应中过度包容的倾向,可能会增加患者的焦虑并影响医患关系。虽然人工智能在分诊术后患者的担忧和提高效率和资源利用方面具有巨大潜力,但 ChatGPT 的表现表明,在其安全有效地应用于面部美容手术实践之前,需要进一步改进。

证据等级 IV:本杂志要求作者为每篇文章分配一个证据等级。有关这些循证医学评级的完整描述,请参阅目录或在线作者指南 www.springer.com/00266

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验