Breaking Bones, Breaking Barriers: ChatGPT, DeepSeek, and Gemini in Hand Fracture Management.

Authors

Marcaccini Gianluca, Seth Ishith, Xie Yi, Susini Pietro, Pozzi Mirco, Cuomo Roberto, Rozen Warren M

Affiliations

Plastic Surgery Unit, Department of Medicine, Surgery and Neuroscience, University of Siena, 53100 Siena, Italy.

Department of Plastic and Reconstructive Surgery, Peninsula Health, Frankston, VIC 3199, Australia.

Publication information

J Clin Med. 2025 Mar 14;14(6):1983. doi: 10.3390/jcm14061983.

Abstract

Background: Hand fracture management requires precise diagnostic accuracy and complex decision-making. Advances in artificial intelligence (AI) suggest that large language models (LLMs) may assist or even rival traditional clinical approaches. This study evaluates the effectiveness of ChatGPT-4o, DeepSeek-V3, and Gemini 1.5 in diagnosing and recommending treatment strategies for hand fractures compared to experienced surgeons.

Methods: A retrospective analysis of 58 anonymized hand fracture cases was conducted. Clinical details, including fracture site, displacement, and soft-tissue involvement, were provided to the AI models, which generated management plans. Their recommendations were compared to actual surgeon decisions, assessing accuracy, precision, recall, and F1 score.

Results: ChatGPT-4o demonstrated the highest accuracy (98.28%) and recall (91.74%), effectively identifying most correct interventions but occasionally proposing extraneous options (precision 58.48%). DeepSeek-V3 showed moderate accuracy (63.79%), with balanced precision (61.17%) and recall (57.89%), sometimes omitting correct treatments. Gemini 1.5 performed poorly (accuracy 18.97%), with low precision and recall, indicating substantial limitations in clinical decision support.

Conclusions: AI models can enhance clinical workflows, particularly in radiographic interpretation and triage, but their limitations highlight the irreplaceable role of human expertise in complex hand trauma management. ChatGPT-4o demonstrated promising accuracy but requires refinement. Ethical concerns regarding AI-driven medical decisions, including bias and transparency, must be addressed before widespread clinical implementation.
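The abstract reports accuracy, precision, recall, and F1 score but does not say how they were computed. The sketch below shows one plausible scheme, assuming each case is compared as a set of proposed treatments against the set the surgeon actually used. The function name `set_metrics`, the micro-averaging, the "any overlap counts as correct" accuracy rule, and the toy cases are all assumptions made for illustration; none of them comes from the study. A per-case accuracy of this kind would at least be consistent with the reported figures, since 98.28%, 63.79%, and 18.97% equal 57, 37, and 11 out of 58 cases.

```python
from typing import Dict, Iterable, Set, Tuple


def set_metrics(cases: Iterable[Tuple[Set[str], Set[str]]]) -> Dict[str, float]:
    """Micro-averaged precision/recall/F1 plus a per-case accuracy.

    Each case is (recommended, actual): treatments the model proposed and
    treatments the surgeon chose. These definitions are illustrative
    assumptions, not the study's documented method.
    """
    tp = fp = fn = hits = total = 0
    for recommended, actual in cases:
        tp += len(recommended & actual)  # proposed and actually used
        fp += len(recommended - actual)  # proposed but not used (extraneous)
        fn += len(actual - recommended)  # used but not proposed (omitted)
        hits += 1 if recommended & actual else 0  # assumed rule: any overlap
        total += 1
    precision = tp / (tp + fp) if tp + fp else 0.0
    recall = tp / (tp + fn) if tp + fn else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    accuracy = hits / total if total else 0.0
    return {"accuracy": accuracy, "precision": precision, "recall": recall, "f1": f1}


# Hypothetical toy cases, invented for this example only.
toy_cases = [
    ({"K-wire fixation", "splinting"}, {"K-wire fixation"}),  # one extra option
    ({"splinting"}, {"splinting"}),                           # exact match
    ({"ORIF", "splinting"}, {"K-wire fixation"}),             # miss
]
m = set_metrics(toy_cases)
print({name: round(value, 4) for name, value in m.items()})
```

Under these assumptions, a model that lists several options per case can score high on accuracy and recall while precision stays low, which matches the pattern described for ChatGPT-4o.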
