Comparative assessment of three AI platforms in answering USMLE Step 1 anatomy questions or identifying anatomical structures on radiographs.

Author information

Al-Khater Khulood Mohammed Khalid

Affiliation

Department of Anatomy, College of Medicine, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia.

Publication information

Clin Anat. 2025 Mar;38(2):186-199. doi: 10.1002/ca.24243. Epub 2024 Nov 18.

Abstract

The application of artificial intelligence (AI) in education has gained great attention recently. Integration of AI tools in anatomy teaching is currently engaging researchers and academics worldwide. Several AI chatbots have been developed, the most popular being ChatGPT (OpenAI: San Francisco, California, USA). Since its first public release in November 2022, several research papers have pointed to its potential role in anatomy education. However, it is not yet known whether it will prove superior to other available AI tools in this role. This article sheds some light on the current status of research concerning AI applications in anatomy education and compares the performances of three well-known chatbots (ChatGPT, Gemini, and Claude) in answering anatomy questions. A total of 23 questions were used as prompts for each chatbot: 10 knowledge-based questions, 10 analysis-based USMLE Step 1-type questions, and three radiographs. ChatGPT was the most accurate of the three, scoring 100% accuracy. However, in terms of comprehensiveness, Claude was the best; it gave very organized anatomical responses. Gemini performed less well than the other two, with an accuracy of 60% and less scientific explanations. On the basis of these findings, this study recommends the incorporation of Claude and ChatGPT in anatomy education, but not Gemini, at least in its current state.
