• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

泌尿外科住院医师的非技术技能:一项针对ChatGPT4人工智能与顾问互动进行基准测试的双盲研究。

Non-technical Skills for Urology Trainees: A Double-Blinded Study of ChatGPT4 AI Benchmarking Against Consultant Interaction.

作者信息

Pears Matthew, Wadhwa Karan, Payne Stephen R, Hanchanale Vishwanath, Elmamoun Mamoun Hamid, Jain Sunjay, Konstantinidis Stathis Th, Rochester Mark, Doherty Ruth, Spearpoint Kenneth, Ng Oliver, Dick Lachlan, Yule Steven, Biyani Chandra Shekhar

机构信息

School of Health Sciences, University of Nottingham, Nottingham, UK.

Department of Urology, Broomfield Hospital, Chelmsford, UK.

出版信息

J Healthc Inform Res. 2024 Nov 14;9(1):103-118. doi: 10.1007/s41666-024-00180-7. eCollection 2025 Mar.

DOI:10.1007/s41666-024-00180-7
PMID:39897101
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11782744/
Abstract

Non-technical skills (NTS) are crucial in healthcare, encompassing cognitive and social skills that support technical ability. Traditional NTS training is evolving with the emergence of artificial intelligence (AI) models that can intelligently converse with their users, known as large language models (LLMs). This study investigated the capabilities and limitations of a popular model named generative pre-trained transformer 4 (GPT-4) in NTS training, comparing its performance to that of human evaluators. Urology trainees identified NTS events in simulated scenarios and discussed them in blinded feedback sessions with AI and human consultants. Experts assessed the blinded interaction data, providing quantitative ratings and qualitative evaluations using annotated transcripts. Wilcoxon signed-rank tests compared pre- and post-intervention ratings, whilst Mann-Whitney tests compared post-intervention ratings between AI and human feedback. Thematic analysis identified strengths, limitations, and differences between AI and human feedback approaches. The AI model demonstrated significant strengths in reinforcing knowledge gathering ( = 0.04), providing accurate and evidence-based feedback ( = 0.013), conveying empathy ( = 0.021), and tailoring explanations to complexity ( = 0.002). However, human feedback excelled in language terminology ( = 0.003), complexity ( = 0.020), and fact-based feedback ( = 0.025). The study highlights the potential for AI to augment assessment of NTS training in healthcare. A blended approach utilising AI and human expertise may boost training efficacy.

摘要

非技术技能(NTS)在医疗保健领域至关重要,它涵盖了支持技术能力的认知和社交技能。随着能够与用户进行智能对话的人工智能(AI)模型(即大语言模型,LLMs)的出现,传统的NTS培训正在不断发展。本研究调查了一种名为生成式预训练变换器4(GPT-4)的流行模型在NTS培训中的能力和局限性,并将其性能与人类评估者的性能进行了比较。泌尿外科实习生在模拟场景中识别NTS事件,并在与AI和人类顾问的盲态反馈会议中进行讨论。专家们评估了盲态交互数据,使用注释转录本提供定量评分和定性评估。Wilcoxon符号秩检验比较了干预前后的评分,而Mann-Whitney检验比较了AI和人类反馈之间的干预后评分。主题分析确定了AI和人类反馈方法的优势、局限性和差异。AI模型在加强知识收集(=0.04)、提供准确且基于证据的反馈(=0.013)、表达同理心(=0.021)以及根据复杂性调整解释(=0.002)方面表现出显著优势。然而,人类反馈在语言术语(=0.003)、复杂性(=0.020)和基于事实的反馈(=0.025)方面表现更出色。该研究强调了AI在增强医疗保健领域NTS培训评估方面的潜力。采用AI和人类专业知识的混合方法可能会提高培训效果。

相似文献

1
Non-technical Skills for Urology Trainees: A Double-Blinded Study of ChatGPT4 AI Benchmarking Against Consultant Interaction.泌尿外科住院医师的非技术技能:一项针对ChatGPT4人工智能与顾问互动进行基准测试的双盲研究。
J Healthc Inform Res. 2024 Nov 14;9(1):103-118. doi: 10.1007/s41666-024-00180-7. eCollection 2025 Mar.
2
Using Natural Language Processing to Explore Patient Perspectives on AI Avatars in Support Materials for Patients With Breast Cancer: Survey Study.使用自然语言处理技术探索乳腺癌患者在支持材料中对人工智能化身的看法:调查研究
J Med Internet Res. 2025 Jun 20;27:e70971. doi: 10.2196/70971.
3
Redefining Mentorship in Medical Education with Artificial Intelligence: A Delphi Study on the Feasibility and Implications.利用人工智能重新定义医学教育中的导师指导:关于可行性和影响的德尔菲研究
Teach Learn Med. 2025 Jun 18:1-11. doi: 10.1080/10401334.2025.2521001.
4
Clinical Management of Wasp Stings Using Large Language Models: Cross-Sectional Evaluation Study.使用大语言模型对黄蜂蜇伤进行临床管理:横断面评估研究
J Med Internet Res. 2025 Jun 4;27:e67489. doi: 10.2196/67489.
5
Beyond Traditional Simulation: An Exploratory Study on the Effectiveness and Acceptability of ChatGPT‑4o Advanced Voice Mode for Communication Skills Practice Among Medical Students.超越传统模拟:关于ChatGPT-4o高级语音模式对医学生沟通技能练习的有效性和可接受性的探索性研究。
Cureus. 2025 May 19;17(5):e84381. doi: 10.7759/cureus.84381. eCollection 2025 May.
6
Safety and User Experience of a Generative Artificial Intelligence Digital Mental Health Intervention: Exploratory Randomized Controlled Trial.生成式人工智能数字心理健康干预的安全性与用户体验:探索性随机对照试验
J Med Internet Res. 2025 May 23;27:e67365. doi: 10.2196/67365.
7
Enhancing professional communication training in higher education through artificial intelligence(AI)-integrated exercises: study protocol for a randomised controlled trial.通过人工智能(AI)集成练习加强高等教育中的专业沟通培训:一项随机对照试验的研究方案
BMC Med Educ. 2025 May 30;25(1):804. doi: 10.1186/s12909-025-07307-3.
8
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.用于计算局部麻醉药最大安全剂量的3种对话式生成人工智能模型的性能:比较分析
JMIR AI. 2025 May 13;4:e66796. doi: 10.2196/66796.
9
Using Generative Artificial Intelligence in Health Economics and Outcomes Research: A Primer on Techniques and Breakthroughs.在卫生经济学与结果研究中使用生成式人工智能:技术与突破入门
Pharmacoecon Open. 2025 Apr 29. doi: 10.1007/s41669-025-00580-4.
10
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.

本文引用的文献

1
The effects of COVID-19 on training within urology: Lessons learned in virtual learning, human factors, non-technical skills and reflective practice.新型冠状病毒肺炎对泌尿外科培训的影响:虚拟学习、人为因素、非技术技能及反思性实践方面的经验教训
J Clin Urol. 2021 Jan;14(1):29-35. doi: 10.1177/2051415820950109.
2
Revolutionizing healthcare: the role of artificial intelligence in clinical practice.人工智能在临床实践中的应用:医疗保健的革命。
BMC Med Educ. 2023 Sep 22;23(1):689. doi: 10.1186/s12909-023-04698-z.
3
A Pilot Study Evaluating a Virtual Reality-Based Nontechnical Skills Training Application for Urology Trainees: Usability, Acceptability, and Impact.一项评估虚拟现实非技术技能培训应用于泌尿科学员的初步研究:可用性、可接受性和影响。
J Surg Educ. 2023 Dec;80(12):1836-1842. doi: 10.1016/j.jsurg.2023.08.012. Epub 2023 Sep 17.
4
'Bingo'-style cue identification techniques: enhancing non-technical skills in urology trainees.
Br J Surg. 2023 Oct 10;110(11):1549-1550. doi: 10.1093/bjs/znad277.
5
Evaluation of the reboot coaching workshops among urology trainees: A mixed method approach.泌尿外科住院医师重启辅导工作坊的评估:一种混合方法研究。
BJUI Compass. 2023 May 2;4(5):533-542. doi: 10.1002/bco2.249. eCollection 2023 Sep.
6
The potential role of ChatGPT and artificial intelligence in anatomy education: a conversation with ChatGPT.ChatGPT与人工智能在解剖学教育中的潜在作用:与ChatGPT的对话
Surg Radiol Anat. 2023 Oct;45(10):1321-1329. doi: 10.1007/s00276-023-03229-1. Epub 2023 Aug 16.
7
The First Months of Life of ChatGPT and Its Impact in Healthcare: A Bibliometric Analysis of the Current Literature.ChatGPT 的头几个月及其在医疗保健领域的影响:对当前文献的计量分析。
Ann Biomed Eng. 2024 May;52(5):1107-1110. doi: 10.1007/s10439-023-03325-8. Epub 2023 Jul 23.
8
Artificial Intelligence Supporting the Training of Communication Skills in the Education of Health Care Professions: Scoping Review.人工智能支持医疗保健专业教育中的沟通技巧培训:范围综述。
J Med Internet Res. 2023 Jun 19;25:e43311. doi: 10.2196/43311.
9
ChatGPT goes to the operating room: evaluating GPT-4 performance and its potential in surgical education and training in the era of large language models.ChatGPT走进手术室:在大语言模型时代评估GPT-4在外科教育与培训中的表现及其潜力。
Ann Surg Treat Res. 2023 May;104(5):269-273. doi: 10.4174/astr.2023.104.5.269. Epub 2023 Apr 28.
10
Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum.比较医生和人工智能聊天机器人对发布在公共社交媒体论坛上的患者问题的回复。
JAMA Intern Med. 2023 Jun 1;183(6):589-596. doi: 10.1001/jamainternmed.2023.1838.