• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

急诊科中基于语音指令的ChatGPT支持的患者分诊:一项前瞻性多中心研究。

ChatGPT-supported patient triage with voice commands in the emergency department: A prospective multicenter study.

作者信息

Pasli Sinan, Yadigaroğlu Metin, Kirimli Esma Nilay, Beşer Muhammet Fatih, Unutmaz İhsan, Ayhan Asu Özden, Karakurt Büşra, Şahin Abdul Samet, Hiçyilmaz Halil İbrahim, Imamoğlu Melih

机构信息

Karadeniz Technical University, School of Medicine, Department of Emergency Medicine, Trabzon, Turkey.

Samsun University, School of Medicine, Department of Emergency Medicine, Samsun, Turkey.

出版信息

Am J Emerg Med. 2025 Apr 17;94:63-70. doi: 10.1016/j.ajem.2025.04.040.

DOI:10.1016/j.ajem.2025.04.040
PMID:40273640
Abstract

BACKGROUND

Triage aims to prioritize patients according to their medical urgency by accurately evaluating their clinical conditions, managing waiting times efficiently, and improving the overall effectiveness of emergency care. This study aims to assess ChatGPT's performance in patient triage across four emergency departments with varying dynamics and to provide a detailed analysis of its strengths and weaknesses.

METHODS

In this multicenter, prospective study, we compared the triage decisions made by ChatGPT-4o and the triage personnel with the gold standard decisions determined by an emergency medicine (EM) specialist. In the hospitals where we conducted the study, triage teams routinely direct patients to the appropriate ED areas based on the Emergency Severity Index (ESI) system and the hospital's local triage protocols. During the study period, the triage team collected patient data, including chief complaints, comorbidities, and vital signs, and used this information to make the initial triage decisions. An independent physician simultaneously entered the same data into ChatGPT using voice commands. At the same time, an EM specialist, present in the triage room throughout the study period, reviewed the same patient data and determined the gold standard triage decisions, strictly adhering to both the hospital's local protocols and the ESI system. Before initiating the study, we customized ChatGPT for each hospital by designing prompts that incorporated both the general principles of the ESI triage system and the specific triage rules of each hospital. The model's overall, hospital-based, and area-based performance was evaluated, with Cohen's Kappa, F1 score, and performance analyses conducted.

RESULTS

This study included 6657 patients. The overall agreement between triage personnel and GPT-4o with the gold standard was nearly perfect (Cohen's kappa = 0.782 and 0.833, respectively). The overall F1 score was 0.863 for the triage team, while GPT-4 achieved an F1 score of 0.897, demonstrating superior performance. ROC curve analysis showed the lowest performance in the yellow zone of a tertiary hospital (AUC = 0.75) and in the red zone of another tertiary hospital (AUC = 0.78). However, overall, AUC values greater than 0.90 were observed, indicating high accuracy.

CONCLUSION

ChatGPT generally outperformed triage personnel in patient triage across emergency departments with varying conditions, demonstrating high agreement with the gold standard decision. However, in tertiary hospitals, its performance was relatively lower in triaging patients with more complex symptoms, particularly those requiring triage to the yellow and red zones.

摘要

背景

分诊旨在通过准确评估患者的临床状况、有效管理等待时间以及提高急诊护理的整体效率,根据患者的医疗紧急程度对其进行优先排序。本研究旨在评估ChatGPT在四个动态各异的急诊科进行患者分诊的表现,并对其优势和劣势进行详细分析。

方法

在这项多中心前瞻性研究中,我们将ChatGPT-4o做出的分诊决策与分诊人员的决策,同由急诊医学(EM)专家确定的金标准决策进行了比较。在我们开展研究的医院中,分诊团队通常根据急诊严重程度指数(ESI)系统和医院的本地分诊协议,将患者引导至适当的急诊科区域。在研究期间,分诊团队收集患者数据,包括主要症状、合并症和生命体征,并利用这些信息做出初步分诊决策。一名独立医生同时使用语音指令将相同数据输入ChatGPT。与此同时,在整个研究期间都在分诊室的一名EM专家审查相同的患者数据,并严格遵循医院的本地协议和ESI系统,确定金标准分诊决策。在启动研究之前,我们通过设计包含ESI分诊系统的一般原则和每家医院具体分诊规则的提示,为每家医院定制了ChatGPT。对该模型的整体、基于医院和基于区域的表现进行了评估,并进行了 Cohen's Kappa分析、F1分数分析和性能分析。

结果

本研究纳入了6657名患者。分诊人员与GPT-4o与金标准之间的总体一致性近乎完美(Cohen's kappa分别为0.782和0.833)。分诊团队的总体F1分数为0.863,而GPT-4的F1分数为0.897,表现更优。ROC曲线分析显示,在一家三级医院的黄色区域(AUC = 0.75)和另一家三级医院的红色区域(AUC = 0.78)表现最差。然而,总体而言,观察到AUC值大于0.90,表明准确性较高。

结论

在不同条件的急诊科进行患者分诊时,ChatGPT的表现总体上优于分诊人员,与金标准决策高度一致。然而,在三级医院,对于症状较为复杂的患者,尤其是那些需要分诊到黄色和红色区域的患者,其表现相对较低。

相似文献

1
ChatGPT-supported patient triage with voice commands in the emergency department: A prospective multicenter study.急诊科中基于语音指令的ChatGPT支持的患者分诊:一项前瞻性多中心研究。
Am J Emerg Med. 2025 Apr 17;94:63-70. doi: 10.1016/j.ajem.2025.04.040.
2
Are Current Survival Prediction Tools Useful When Treating Subsequent Skeletal-related Events From Bone Metastases?当前的生存预测工具在治疗骨转移后的骨骼相关事件时有用吗?
Clin Orthop Relat Res. 2024 Sep 1;482(9):1710-1721. doi: 10.1097/CORR.0000000000003030. Epub 2024 Mar 22.
3
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
4
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
5
Systematic review and validation of prediction rules for identifying children with serious infections in emergency departments and urgent-access primary care.系统评价和验证预测规则,以识别急诊科和紧急初级保健中严重感染的儿童。
Health Technol Assess. 2012;16(15):1-100. doi: 10.3310/hta16150.
6
Comparison of ChatGPT and Internet Research for Clinical Research and Decision-Making in Occupational Medicine: Randomized Controlled Trial.ChatGPT与互联网搜索用于职业医学临床研究和决策的比较:随机对照试验
JMIR Form Res. 2025 May 20;9:e63857. doi: 10.2196/63857.
7
Diagnostic Performance of ChatGPT-4o in Detecting Hip Fractures on Pelvic X-rays.ChatGPT-4o在骨盆X光片检测髋部骨折中的诊断性能
Cureus. 2025 Jun 24;17(6):e86654. doi: 10.7759/cureus.86654. eCollection 2025 Jun.
8
Performance of the artificial intelligence-based Swiss medical assessment system versus Manchester triage system in the emergency department: A retrospective analysis.基于人工智能的瑞士医疗评估系统与曼彻斯特分诊系统在急诊科的性能比较:一项回顾性分析。
Am J Emerg Med. 2025 Aug;94:46-49. doi: 10.1016/j.ajem.2025.04.023. Epub 2025 Apr 19.
9
Home treatment for mental health problems: a systematic review.心理健康问题的居家治疗:一项系统综述
Health Technol Assess. 2001;5(15):1-139. doi: 10.3310/hta5150.
10
Impact of a Symptom Checker App on Patient-Physician Interaction Among Self-Referred Walk-In Patients in the Emergency Department: Multicenter, Parallel-Group, Randomized, Controlled Trial.症状检查应用程序对急诊科自行前来就诊患者中患者与医生互动的影响:多中心、平行组、随机对照试验
J Med Internet Res. 2025 Apr 2;27:e64028. doi: 10.2196/64028.

引用本文的文献

1
Performance of ChatGPT, Gemini and DeepSeek for non-critical triage support using real-world conversations in emergency department.ChatGPT、Gemini和DeepSeek在急诊科使用真实对话进行非关键分诊支持方面的表现。
BMC Emerg Med. 2025 Sep 1;25(1):176. doi: 10.1186/s12873-025-01337-2.
2
Potential of ChatGPT in youth mental health emergency triage: Comparative analysis with clinicians.ChatGPT在青少年心理健康紧急分诊中的潜力:与临床医生的比较分析
PCN Rep. 2025 Jul 15;4(3):e70159. doi: 10.1002/pcn5.70159. eCollection 2025 Sep.