Bridging the Gap: Can Large Language Models Match Human Expertise in Writing Neurosurgical Operative Notes?

Authors

Ali Abdullah, Kumar Rohit Prem, Polavarapu Hanish, Lavadi Raj Swaroop, Mahavadi Anil, Legarreta Andrew D, Hudson Joseph S, Shah Manan, Paul David, Mooney James, Dietz Nicholas, Fields Daryl P, Hamilton D Kojo, Agarwal Nitin

Affiliations

Department of Neurological Surgery, University of Pittsburgh School of Medicine, Pittsburgh, Pennsylvania, USA.

Department of Neurosurgery, SUNY Upstate Medical University, Syracuse, New York, USA.

Publication information

World Neurosurg. 2024 Dec;192:e34-e41. doi: 10.1016/j.wneu.2024.08.062. Epub 2024 Aug 15.

DOI: 10.1016/j.wneu.2024.08.062
PMID: 39153569

Abstract

BACKGROUND

Proper documentation is essential for patient care. The popularity of artificial intelligence (AI) offers the potential for improvements in neurosurgical note-writing. This study aimed to assess how AI can optimize documentation in neurosurgical procedures.

METHODS

Thirty-six operative notes were included. All identifiable data were removed. Essential information, such as perioperative data and diagnosis, was sourced from these notes. ChatGPT 4.0 was trained to draft notes from surgical vignettes using each surgeon's note template. One hundred forty-four surveys with a surgeon or AI note were shared with 3 surgeons to evaluate accuracy, content, and organization using a 5-point scale. Accuracy was defined as the factual correctness; content, as the comprehensiveness; and organization, as the arrangement of the note. Flesch-Kincaid Grade Level (FKGL) and Flesch Reading Ease (FRE) scores quantified each note's readability.

RESULTS

The mean AI accuracy was not different from the mean surgeon accuracy (4.44 vs. 4.33; P = 0.512), whereas the mean AI content was lower than the mean surgeon content (3.73 vs. 4.42; P < 0.001). The mean AI note FKGL was greater than the mean surgeon FKGL (13.13 vs. 9.99; P < 0.001), and the mean AI FRE was lower than the mean surgeon FRE (21.42 vs. 41.70; P < 0.001).

CONCLUSIONS

AI notes were on par with surgeon notes in terms of accuracy and organization but lacked in content. Additionally, AI notes used language at an advanced reading level. These findings support the potential for ChatGPT to enhance the efficiency of neurosurgery documentation.
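
The Methods quantify readability with FKGL and FRE, but the abstract does not say which tool computed them. As a rough sketch only, the standard published formulas for both scores can be applied as below. The function names are hypothetical, and the syllable counter is a crude vowel-group heuristic assumed for illustration, so scores will differ somewhat from those of dedicated readability tools.

```python
import re


def count_syllables(word: str) -> int:
    """Approximate syllables as vowel groups (heuristic, an assumption)."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    # Treat a trailing 'e' as silent, except in endings such as "-le" or "-ee".
    if word.endswith("e") and not word.endswith(("le", "ee")) and count > 1:
        count -= 1
    return max(count, 1)


def readability(text: str) -> tuple[float, float]:
    """Return (FRE, FKGL) for an English text."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z']+", text)
    if not sentences or not words:
        raise ValueError("text must contain at least one sentence and one word")

    syllables = sum(count_syllables(w) for w in words)
    words_per_sentence = len(words) / len(sentences)
    syllables_per_word = syllables / len(words)

    # Flesch Reading Ease: higher means easier to read.
    fre = 206.835 - 1.015 * words_per_sentence - 84.6 * syllables_per_word
    # Flesch-Kincaid Grade Level: approximate US school grade.
    fkgl = 0.39 * words_per_sentence + 11.8 * syllables_per_word - 15.59
    return fre, fkgl


if __name__ == "__main__":
    fre, fkgl = readability("The patient underwent decompressive laminectomy.")
    print(f"FRE={fre:.2f}, FKGL={fkgl:.2f}")
```

Both scores depend only on sentence length and syllables per word, so longer sentences and more polysyllabic terms raise FKGL and lower FRE. That is the direction reported for the AI notes relative to the surgeon notes (FKGL 13.13 vs. 9.99; FRE 21.42 vs. 41.70).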

Similar articles

1. Bridging the Gap: Can Large Language Models Match Human Expertise in Writing Neurosurgical Operative Notes?
World Neurosurg. 2024 Dec;192:e34-e41. doi: 10.1016/j.wneu.2024.08.062. Epub 2024 Aug 15.
2. Can Artificial Intelligence Improve the Readability of Patient Education Materials?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
3. Artificial Intelligence Shows Limited Success in Improving Readability Levels of Spanish-language Orthopaedic Patient Education Materials.
Clin Orthop Relat Res. 2025 Feb 11. doi: 10.1097/CORR.0000000000003413.
4. Development and Validation of a Large Language Model-Powered Chatbot for Neurosurgery: Mixed Methods Study on Enhancing Perioperative Patient Education.
J Med Internet Res. 2025 Jul 15;27:e74299. doi: 10.2196/74299.
5. A structured evaluation of LLM-generated step-by-step instructions in cadaveric brachial plexus dissection.
BMC Med Educ. 2025 Jul 1;25(1):903. doi: 10.1186/s12909-025-07493-0.
6. Enhancing Magnetic Resonance Imaging (MRI) Report Comprehension in Spinal Trauma: Readability Analysis of AI-Generated Explanations for Thoracolumbar Fractures.
JMIR AI. 2025 Jul 1;4:e69654. doi: 10.2196/69654.
7. "Dr. AI Will See You Now": How Do ChatGPT-4 Treatment Recommendations Align With Orthopaedic Clinical Practice Guidelines?
Clin Orthop Relat Res. 2024 Dec 1;482(12):2098-2106. doi: 10.1097/CORR.0000000000003234. Epub 2024 Sep 6.
8. Artificial Intelligence to Improve Clinical Coding Practice in Scandinavia: Crossover Randomized Controlled Trial.
J Med Internet Res. 2025 Jul 3;27:e71904. doi: 10.2196/71904.
9. Using Artificial Intelligence ChatGPT to Access Medical Information about Chemical Eye Injuries: A Comparative Study.
JMIR Form Res. 2025 Jun 30. doi: 10.2196/73642.
10. Improving Patient Communication by Simplifying AI-Generated Dental Radiology Reports With ChatGPT: Comparative Study.
J Med Internet Res. 2025 Jun 9;27:e73337. doi: 10.2196/73337.

Cited by

1. Employing large language models safely and effectively as a practicing neurosurgeon.
Acta Neurochir (Wien). 2025 Apr 9;167(1):101. doi: 10.1007/s00701-025-06515-6.