
The Growing Role of Artificial Intelligence in Surgical Education: ChatGPT Undertakes the Australian Generic Surgical Sciences Examination.

Authors

Guo Allen Ao, Canagasingham Ashan, Rasiah Krishan, Chalasani Venu, Mundy Julie, Chung Amanda

Affiliations

Department of Urology, Royal North Shore Hospital, Sydney, Australia.

North Shore Urology Research Group, Sydney, Australia.

Publication information

ANZ J Surg. 2025 Jul-Aug;95(7-8):1350-1355. doi: 10.1111/ans.70186. Epub 2025 May 30.

DOI:10.1111/ans.70186
PMID:40444677
Abstract

BACKGROUND

Large language models have developed rapidly in recent years. Models such as ChatGPT may play an important role in enhancing future medical education.

METHODS

To evaluate the accuracy and performance of ChatGPT in the Generic Surgical Sciences Examination, we constructed a sample examination. Questions were sourced from a bank of past questions and formatted to mirror the structure and layout of the examination. ChatGPT's performance was assessed against a predefined answer key recorded in advance.

RESULTS

ChatGPT scored 468 of a maximum 644 marks (72.7%) across all sections tested. It performed best in the physiology section (77.9%), followed by pathology (75.0%), and scored lowest in anatomy (66.3%). When scoring was analyzed by question type, ChatGPT performed best on type "A" questions (multiple choice), scoring 75%, followed closely by type "X" questions (true or false), where it scored 73.2%. However, ChatGPT scored only 43.8% on type "B" questions (establishing a relationship between two statements).
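As a quick arithmetic check on the figures above, the following minimal sketch recomputes the overall percentage from the reported marks and ranks the reported section and question-type scores. The variable and function names are illustrative assumptions, not part of the study, and only numbers stated in the abstract are used.

```python
# Figures as reported in the abstract; all names here are illustrative.
total_marks = 468
max_marks = 644

# Overall percentage, rounded to one decimal place as in the abstract.
overall_pct = round(100 * total_marks / max_marks, 1)

# Reported percentages by section and by question type.
section_pct = {"physiology": 77.9, "pathology": 75.0, "anatomy": 66.3}
type_pct = {
    "A (multiple choice)": 75.0,
    "X (true or false)": 73.2,
    "B (relationship between two statements)": 43.8,
}


def ranked(scores):
    """Return the keys of `scores` ordered from highest to lowest value."""
    return sorted(scores, key=scores.get, reverse=True)


print(overall_pct)
print(ranked(section_pct))
print(ranked(type_pct))
```

The recomputed overall figure agrees with the reported 72.7%. The per-section and per-type mark totals are not given in the abstract, so those percentages are taken as stated rather than rederived.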

CONCLUSION

Our results demonstrate that ChatGPT completed the Generic Surgical Sciences Examination with accuracy exceeding the threshold required for a pass. However, the large language model struggled with certain question types and sections. Overall, further research into the utility of ChatGPT in surgical education is required, and caution should be exercised in its use, as it remains in its infancy.
