研究生医学教育中的现代人工智能与大语言模型:态度、应用及实践的范围综述
Modern artificial intelligence and large language models in graduate medical education: a scoping review of attitudes, applications & practice.
作者信息
Verghese Basil George, Iyer Charoo, Borse Tanvi, Cooper Shiamak, White Jacob, Sheehy Ryan
机构信息
Education for Health Professions Program, School of Education, Johns Hopkins University, 2800 N Charles St, Baltimore, MD, 21218, USA.
Internal Medicine Residency Program, Rochester, NY, USA.
出版信息
BMC Med Educ. 2025 May 20;25(1):730. doi: 10.1186/s12909-025-07321-5.
BACKGROUND
Artificial intelligence (AI) holds transformative potential for graduate medical education (GME), yet, a comprehensive exploration of AI's applications, perceptions, and limitations in GME is lacking.
OBJECTIVE
To map the current literature on AI in GME, identifying prevailing perceptions, applications, and research gaps to inform future research, policy discussions, and educational practices through a scoping review.
METHODS
Following the Joanna Briggs Institute guidelines and the PRISMA-ScR checklist a comprehensive search of multiple databases up to February 2024 was performed to include studies addressing AI interventions in GME.
RESULTS
Out of 1734 citations, 102 studies met the inclusion criteria, conducted across 16 countries, predominantly from North America (72), Asia (14), and Europe (6). Radiology had the highest number of publications (21), followed by general surgery (11) and emergency medicine (8). The majority of studies were published in 2023. Several key thematic areas emerged from the literature. Initially, perceptions of AI in graduate medical education (GME) were mixed, but have increasingly shifted toward a more favorable outlook, particularly as the benefits of AI integration in education become more apparent. In assessments, AI demonstrated the ability to differentiate between skill levels and offer meaningful feedback. It has also been effective in evaluating narrative comments to assess resident performance. In the domain of recruitment, AI tools have been applied to analyze letters of recommendation, applications, and personal statements, helping identify potential biases and improve equity in candidate selection. Furthermore, large language models consistently outperformed average candidates on board certification and in-training examinations, indicating their potential utility in standardized assessments. Finally, AI tools showed promise in enhancing clinical decision-making by supporting trainees with improved diagnostic accuracy and efficiency.
CONCLUSIONS
This scoping review provides a comprehensive overview of applications and limitations of AI in GME but is limited with potential biases, study heterogeneity, and evolving nature of AI.
背景
人工智能(AI)对研究生医学教育(GME)具有变革潜力,但目前缺乏对AI在GME中的应用、认知及局限性的全面探索。
目的
通过范围综述梳理当前关于GME中AI的文献,识别普遍的认知、应用及研究差距,为未来研究、政策讨论和教育实践提供参考。
方法
遵循乔安娜·布里格斯研究所指南和PRISMA-ScR清单,对截至2024年2月的多个数据库进行全面检索,纳入涉及GME中AI干预措施的研究。
结果
在1734条引用文献中,102项研究符合纳入标准,研究在16个国家开展,主要来自北美(72项)、亚洲(14项)和欧洲(6项)。放射学领域的出版物数量最多(21篇),其次是普通外科(11篇)和急诊医学(8篇)。大多数研究发表于2023年。文献中出现了几个关键主题领域。最初,人们对AI在研究生医学教育(GME)中的看法不一,但随着AI融入教育的益处日益明显,这种看法越来越倾向于积极。在评估方面,AI显示出区分技能水平并提供有意义反馈的能力。它在评估叙述性评论以评估住院医师表现方面也很有效。在招聘领域,AI工具已被用于分析推荐信、申请材料和个人陈述,有助于识别潜在偏见并提高候选人选拔的公平性。此外,大语言模型在委员会认证和在职考试中始终优于普通考生,表明它们在标准化评估中具有潜在效用。最后,AI工具在支持受训人员提高诊断准确性和效率以增强临床决策能力方面显示出前景。
结论
本范围综述全面概述了AI在GME中的应用和局限性,但受潜在偏见、研究异质性和AI不断发展的性质所限。
相似文献
JBI Database System Rev Implement Rep. 2016-4
J Med Internet Res. 2025-1-23
本文引用的文献
J Neurol Surg B Skull Base. 2024-1-23
J Am Med Inform Assoc. 2024-2-16
Am J Pharm Educ. 2024-1
IEEE Trans Vis Comput Graph. 2024-1
J Med Internet Res. 2023-10-19