• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

大语言模型在颌面创伤病例分诊中的作用:一项初步研究。

The Role of Large Language Models (LLMs) in Providing Triage for Maxillofacial Trauma Cases: A Preliminary Study.

作者信息

Frosolini Andrea, Catarzi Lisa, Benedetti Simone, Latini Linda, Chisci Glauco, Franz Leonardo, Gennaro Paolo, Gabriele Guido

机构信息

Maxillofacial Surgery Unit, Department of Medical Biotechnologies, University of Siena, 53100 Siena, Italy.

Phoniatris and Audiology Unit, Department of Neuroscience DNS, University of Padova, 35122 Treviso, Italy.

出版信息

Diagnostics (Basel). 2024 Apr 18;14(8):839. doi: 10.3390/diagnostics14080839.

DOI:10.3390/diagnostics14080839
PMID:38667484
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11048758/
Abstract

BACKGROUND

In the evolving field of maxillofacial surgery, integrating advanced technologies like Large Language Models (LLMs) into medical practices, especially for trauma triage, presents a promising yet largely unexplored potential. This study aimed to evaluate the feasibility of using LLMs for triaging complex maxillofacial trauma cases by comparing their performance against the expertise of a tertiary referral center.

METHODS

Utilizing a comprehensive review of patient records in a tertiary referral center over a year-long period, standardized prompts detailing patient demographics, injury characteristics, and medical histories were created. These prompts were used to assess the triage suggestions of ChatGPT 4.0 and Google GEMINI against the center's recommendations, supplemented by evaluating the AI's performance using the QAMAI and AIPI questionnaires.

RESULTS

The results in 10 cases of major maxillofacial trauma indicated moderate agreement rates between LLM recommendations and the referral center, with some variances in the suggestion of appropriate examinations (70% ChatGPT and 50% GEMINI) and treatment plans (60% ChatGPT and 45% GEMINI). Notably, the study found no statistically significant differences in several areas of the questionnaires, except in the diagnosis accuracy (GEMINI: 3.30, ChatGPT: 2.30; = 0.032) and relevance of the recommendations (GEMINI: 2.90, ChatGPT: 3.50; = 0.021). A Spearman correlation analysis highlighted significant correlations within the two questionnaires, specifically between the QAMAI total score and AIPI treatment scores (rho = 0.767, = 0.010).

CONCLUSIONS

This exploratory investigation underscores the potential of LLMs in enhancing clinical decision making for maxillofacial trauma cases, indicating a need for further research to refine their application in healthcare settings.

摘要

背景

在不断发展的颌面外科领域,将大语言模型(LLMs)等先进技术融入医疗实践,尤其是用于创伤分诊,具有很大的潜力,但在很大程度上尚未得到充分探索。本研究旨在通过将大语言模型的表现与三级转诊中心的专业知识进行比较,评估其用于分诊复杂颌面创伤病例的可行性。

方法

通过对一家三级转诊中心长达一年的患者记录进行全面回顾,创建了详细描述患者人口统计学、损伤特征和病史的标准化提示。这些提示用于评估ChatGPT 4.0和谷歌GEMINI的分诊建议与该中心的建议,并通过使用QAMAI和AIPI问卷评估人工智能的表现进行补充。

结果

10例主要颌面创伤病例的结果表明,大语言模型的建议与转诊中心之间的一致率中等,在适当检查建议(ChatGPT为70%,GEMINI为50%)和治疗计划(ChatGPT为60%,GEMINI为45%)方面存在一些差异。值得注意的是,该研究发现,除了诊断准确性(GEMINI:3.30,ChatGPT:2.30;P = 0.032)和建议的相关性(GEMINI:2.90,ChatGPT:3.50;P = 0.021)外,问卷的几个方面没有统计学上的显著差异。Spearman相关性分析突出了两份问卷之间的显著相关性,特别是QAMAI总分与AIPI治疗分数之间(rho = 0.767,P = 0.010)。

结论

这项探索性研究强调了大语言模型在增强颌面创伤病例临床决策方面的潜力,表明需要进一步研究以完善其在医疗环境中的应用。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4380/11048758/29ff240b6e03/diagnostics-14-00839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4380/11048758/29ff240b6e03/diagnostics-14-00839-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/4380/11048758/29ff240b6e03/diagnostics-14-00839-g001.jpg

相似文献

1
The Role of Large Language Models (LLMs) in Providing Triage for Maxillofacial Trauma Cases: A Preliminary Study.大语言模型在颌面创伤病例分诊中的作用:一项初步研究。
Diagnostics (Basel). 2024 Apr 18;14(8):839. doi: 10.3390/diagnostics14080839.
2
Triage Performance Across Large Language Models, ChatGPT, and Untrained Doctors in Emergency Medicine: Comparative Study.分诊表现比较:大型语言模型、ChatGPT 和未经训练的急诊医生:一项对比研究。
J Med Internet Res. 2024 Jun 14;26:e53297. doi: 10.2196/53297.
3
Reliability of large language models for advanced head and neck malignancies management: a comparison between ChatGPT 4 and Gemini Advanced.大型语言模型在高级头颈部恶性肿瘤管理中的可靠性:ChatGPT 4 与 Gemini Advanced 之间的比较。
Eur Arch Otorhinolaryngol. 2024 Sep;281(9):5001-5006. doi: 10.1007/s00405-024-08746-2. Epub 2024 May 25.
4
Exploring the performance of large language models on hepatitis B infection-related questions: A comparative study.探索大语言模型在乙型肝炎感染相关问题上的表现:一项比较研究。
World J Gastroenterol. 2025 Jan 21;31(3):101092. doi: 10.3748/wjg.v31.i3.101092.
5
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
6
Large Language Models for Intraoperative Decision Support in Plastic Surgery: A Comparison between ChatGPT-4 and Gemini.大型语言模型在整形手术中的术中决策支持:ChatGPT-4 和 Gemini 的比较。
Medicina (Kaunas). 2024 Jun 8;60(6):957. doi: 10.3390/medicina60060957.
7
Performance of Large Language Models ChatGPT and Gemini on Workplace Management Questions in Radiology.大型语言模型ChatGPT和Gemini在放射学工作场所管理问题上的表现
Diagnostics (Basel). 2025 Feb 19;15(4):497. doi: 10.3390/diagnostics15040497.
8
ChatGPT vs. Gemini: Comparative accuracy and efficiency in Lung-RADS score assignment from radiology reports.ChatGPT与Gemini:在根据放射学报告进行Lung-RADS评分分配中的准确性和效率比较
Clin Imaging. 2025 May;121:110455. doi: 10.1016/j.clinimag.2025.110455. Epub 2025 Mar 13.
9
ChatGPT vs Gemini: Comparative Accuracy and Efficiency in CAD-RADS Score Assignment from Radiology Reports.ChatGPT与Gemini:放射学报告中CAD-RADS评分分配的比较准确性和效率
J Imaging Inform Med. 2024 Nov 11. doi: 10.1007/s10278-024-01328-y.
10
Comparative performance analysis of large language models: ChatGPT-3.5, ChatGPT-4 and Google Gemini in glucocorticoid-induced osteoporosis.大型语言模型的性能比较分析:ChatGPT-3.5、ChatGPT-4 和 Google Gemini 在糖皮质激素诱导性骨质疏松症中的表现。
J Orthop Surg Res. 2024 Sep 18;19(1):574. doi: 10.1186/s13018-024-04996-2.

引用本文的文献

1
Artificial intelligence in maxillofacial trauma: expert ally or unreliable assistant?人工智能在颌面创伤中的应用:专家助手还是不可靠的帮手?
Med Oral Patol Oral Cir Bucal. 2025 Sep 1;30(5):e751-e757. doi: 10.4317/medoral.27229.
2
Evaluating GPT-4o in infectious disease diagnostics and management: A comparative study with residents and specialists on accuracy, completeness, and clinical support potential.评估GPT-4o在传染病诊断和管理中的应用:与住院医师和专科医生就准确性、完整性和临床支持潜力进行的比较研究。
Digit Health. 2025 Jul 7;11:20552076251355797. doi: 10.1177/20552076251355797. eCollection 2025 Jan-Dec.
3
Can Large Language Models Serve as Reliable Tools for Information in Dentistry? A Systematic Review.

本文引用的文献

1
Fine-tuning large language models for rare disease concept normalization.微调大型语言模型以实现罕见病概念规范化。
J Am Med Inform Assoc. 2024 Sep 1;31(9):2076-2083. doi: 10.1093/jamia/ocae133.
2
ChatGPT's Response Consistency: A Study on Repeated Queries of Medical Examination Questions.ChatGPT的回答一致性:关于医学考试问题重复查询的研究
Eur J Investig Health Psychol Educ. 2024 Mar 8;14(3):657-668. doi: 10.3390/ejihpe14030043.
3
The quality and readability of patient information provided by ChatGPT: can AI reliably explain common ENT operations?
大语言模型能否作为牙科领域可靠的信息工具?一项系统综述。
Int Dent J. 2025 May 16;75(4):100835. doi: 10.1016/j.identj.2025.04.015.
4
Translation and Validation of the Gothenburg Trismus Questionnaire-2 into Italian Language.《哥德堡牙关紧闭问卷-2》翻译成意大利语并进行效度验证。
J Clin Med. 2025 Apr 24;14(9):2949. doi: 10.3390/jcm14092949.
5
Decoding wisdom: Evaluating ChatGPT's accuracy and reproducibility in analyzing orthopantomographic images for third molar assessment.解读智慧:评估ChatGPT在分析全景图像以进行第三磨牙评估时的准确性和可重复性。
Comput Struct Biotechnol J. 2025 Apr 11;28:141-147. doi: 10.1016/j.csbj.2025.04.010. eCollection 2025.
6
Evaluating the Efficacy of Large Language Models in Generating Medical Documentation: A Comparative Study of ChatGPT-4, ChatGPT-4o, and Claude.评估大语言模型在生成医学文档方面的功效:ChatGPT-4、ChatGPT-4o和Claude的比较研究
Aesthetic Plast Surg. 2025 Apr 14. doi: 10.1007/s00266-025-04842-8.
7
Management of Dupuytren's Disease: A Multi-Centric Comparative Analysis Between Experienced Hand Surgeons Versus Artificial Intelligence.掌腱膜挛缩症的治疗:经验丰富的手外科医生与人工智能之间的多中心比较分析
Diagnostics (Basel). 2025 Feb 28;15(5):587. doi: 10.3390/diagnostics15050587.
8
Chat Generative Pre-Trained Transformer (ChatGPT) in Oral and Maxillofacial Surgery: A Narrative Review on Its Research Applications and Limitations.口腔颌面外科中的聊天生成预训练变换器(ChatGPT):关于其研究应用和局限性的叙述性综述
J Clin Med. 2025 Feb 18;14(4):1363. doi: 10.3390/jcm14041363.
9
Evolutionary patterns and research frontiers of artificial intelligence in age-related macular degeneration: a bibliometric analysis.年龄相关性黄斑变性中人工智能的进化模式与研究前沿:一项文献计量分析
Quant Imaging Med Surg. 2025 Jan 2;15(1):813-830. doi: 10.21037/qims-24-1406. Epub 2024 Dec 30.
10
Authorship transparency and equity in otolaryngology and maxillofacial surgery: current practices and the potential impact of large language models.耳鼻咽喉头颈外科学与颌面外科学中的作者身份透明度与公平性:当前实践及大语言模型的潜在影响
Eur Arch Otorhinolaryngol. 2025 Mar;282(3):1641-1647. doi: 10.1007/s00405-024-09151-5. Epub 2024 Dec 19.
ChatGPT 提供的患者信息的质量和可读性:人工智能能可靠地解释常见的耳鼻喉科手术吗?
Eur Arch Otorhinolaryngol. 2024 Nov;281(11):6147-6153. doi: 10.1007/s00405-024-08598-w. Epub 2024 Mar 26.
4
Utilizing large language models in breast cancer management: systematic review.利用大型语言模型进行乳腺癌管理:系统评价。
J Cancer Res Clin Oncol. 2024 Mar 19;150(3):140. doi: 10.1007/s00432-024-05678-6.
5
Accuracy of Treatment Recommendations by Pragmatic Evidence Search and Artificial Intelligence: An Exploratory Study.实用证据检索与人工智能辅助治疗建议的准确性:一项探索性研究
Diagnostics (Basel). 2024 Mar 1;14(5):527. doi: 10.3390/diagnostics14050527.
6
Exploring AI-chatbots' capability to suggest surgical planning in ophthalmology: ChatGPT versus Google Gemini analysis of retinal detachment cases.探索 AI 聊天机器人在眼科手术规划方面的建议能力:ChatGPT 与 Google Gemini 对视网膜脱离病例的分析比较。
Br J Ophthalmol. 2024 Sep 20;108(10):1457-1469. doi: 10.1136/bjo-2023-325143.
7
ChatGPT as an information tool in rhinology. Can we trust each other today?ChatGPT作为鼻科学中的一种信息工具。如今我们能相互信任吗?
Eur Arch Otorhinolaryngol. 2024 Jun;281(6):3253-3259. doi: 10.1007/s00405-024-08581-5. Epub 2024 Mar 4.
8
Google DeepMind's gemini AI versus ChatGPT: a comparative analysis in ophthalmology.谷歌深度思维公司的Gemini人工智能与ChatGPT:眼科领域的比较分析
Eye (Lond). 2024 Jun;38(8):1412-1417. doi: 10.1038/s41433-024-02958-w. Epub 2024 Feb 14.
9
Accuracy of ChatGPT-3.5 and -4 in providing scientific references in otolaryngology-head and neck surgery.ChatGPT-3.5和-4在提供耳鼻咽喉头颈外科学术参考文献方面的准确性。
Eur Arch Otorhinolaryngol. 2024 Apr;281(4):2159-2165. doi: 10.1007/s00405-023-08441-8. Epub 2024 Jan 11.
10
Beyond the Scalpel: Assessing ChatGPT's potential as an auxiliary intelligent virtual assistant in oral surgery.超越手术刀:评估ChatGPT作为口腔外科辅助智能虚拟助手的潜力。
Comput Struct Biotechnol J. 2023 Dec 6;24:46-52. doi: 10.1016/j.csbj.2023.11.058. eCollection 2024 Dec.