• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能聊天机器人的认知领域评估:ChatGPT与Gemini对解剖学教育理解的比较研究

Cognitive Domain Assessment of Artificial Intelligence Chatbots: A Comparative Study Between ChatGPT and Gemini's Understanding of Anatomy Education.

作者信息

Ganapathy Arthi, Kaushal Parul

机构信息

Department of Anatomy, Teaching Block, All India Institute of Medical Sciences, New Delhi, 110029 India.

出版信息

Med Sci Educ. 2025 Feb 15;35(3):1295-1304. doi: 10.1007/s40670-025-02303-0. eCollection 2025 Jun.

DOI:10.1007/s40670-025-02303-0
PMID:40625950
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12228882/
Abstract

PURPOSE

The integration of AI chatbots into education has gained traction, particularly in medical fields such as anatomy. This study aims to evaluate and compare the responses of ChatGPT 4o mini and Gemini across different cognitive domains of anatomy education.

METHODS

A cross-sectional study was conducted to assess the responses of these two AI chatbots to a set of anatomy questions selected from the Manual on Competency-Based Undergraduate Curriculum. Questions were categorized into knowledge, comprehension and application levels of cognitive domain. Responses were scored against an answer key prepared by anatomy experts. Relevant comparative statistical analysis was performed.

RESULTS

The overall performance of ChatGPT 4o mini (76.15%) was significantly superior to Gemini (72.84%). In application-level questions, ChatGPT 4o mini outperformed Gemini. Conversely, Gemini scored higher in comprehension-level questions (76.88% vs. 73.66%). Both chatbots exhibited factual inaccuracies and limitations in contextually accurate responses, particularly in application-level questions.

CONCLUSION

Both ChatGPT 4o mini and Gemini demonstrate potential as educational tools in anatomy, with strengths and limitations varying by cognitive domain. While AI chatbots can supplement traditional learning methods, they require ongoing refinement and validation. To ensure the responsible integration of AI into medical education, close attention must be devoted to faculty and student training, setting up relevant IT environment and ethical issues. Future research should focus on expanding question pools, incorporating user feedback and comparing with traditional educational approaches to enhance their effectiveness.

摘要

目的

将人工智能聊天机器人整合到教育中已越来越受到关注,尤其是在解剖学等医学领域。本研究旨在评估和比较ChatGPT 4o mini和Gemini在解剖学教育不同认知领域的回答。

方法

进行了一项横断面研究,以评估这两个人工智能聊天机器人对从基于能力的本科课程手册中选出的一组解剖学问题的回答。问题被分为认知领域的知识、理解和应用水平。根据解剖学专家准备的答案对回答进行评分。进行了相关的比较统计分析。

结果

ChatGPT 4o mini的总体表现(76.15%)显著优于Gemini(72.84%)。在应用水平的问题上,ChatGPT 4o mini的表现优于Gemini。相反,Gemini在理解水平的问题上得分更高(76.88%对73.66%)。两个聊天机器人都存在事实不准确以及在上下文准确回答方面的局限性,尤其是在应用水平的问题上。

结论

ChatGPT 4o mini和Gemini在解剖学教育中都显示出作为教育工具的潜力,其优势和局限性因认知领域而异。虽然人工智能聊天机器人可以补充传统学习方法,但它们需要不断完善和验证。为确保人工智能在医学教育中的合理整合,必须密切关注教师和学生培训、建立相关的信息技术环境以及伦理问题。未来的研究应侧重于扩大问题库、纳入用户反馈并与传统教育方法进行比较,以提高其有效性。

相似文献

1
Cognitive Domain Assessment of Artificial Intelligence Chatbots: A Comparative Study Between ChatGPT and Gemini's Understanding of Anatomy Education.人工智能聊天机器人的认知领域评估:ChatGPT与Gemini对解剖学教育理解的比较研究
Med Sci Educ. 2025 Feb 15;35(3):1295-1304. doi: 10.1007/s40670-025-02303-0. eCollection 2025 Jun.
2
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能:ChatGPT与谷歌Gemini的较量
Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.
3
Psychological First Aid by AI: Proof-of-Concept and Comparative Performance of ChatGPT-4 and Gemini in Different Disaster Scenarios.人工智能提供的心理急救:ChatGPT-4和Gemini在不同灾难场景下的概念验证及性能比较
J Clin Psychol. 2025 Aug;81(8):726-738. doi: 10.1002/jclp.23808. Epub 2025 May 9.
4
Accuracy and Reliability of Artificial Intelligence Chatbots as Public Information Sources in Implant Dentistry.人工智能聊天机器人作为种植牙科公共信息来源的准确性和可靠性
Int J Oral Maxillofac Implants. 2025 Jun 25;0(0):1-23. doi: 10.11607/jomi.11280.
5
Performance of 3 Conversational Generative Artificial Intelligence Models for Computing Maximum Safe Doses of Local Anesthetics: Comparative Analysis.用于计算局部麻醉药最大安全剂量的3种对话式生成人工智能模型的性能:比较分析
JMIR AI. 2025 May 13;4:e66796. doi: 10.2196/66796.
6
Thyroid Eye Disease and Artificial Intelligence: A Comparative Study of ChatGPT-3.5, ChatGPT-4o, and Gemini in Patient Information Delivery.甲状腺眼病与人工智能:ChatGPT-3.5、ChatGPT-4o和Gemini在患者信息传递方面的比较研究
Ophthalmic Plast Reconstr Surg. 2024 Dec 24. doi: 10.1097/IOP.0000000000002882.
7
Comparative analysis of LLMs performance in medical embryology: A cross-platform study of ChatGPT, Claude, Gemini, and Copilot.大语言模型在医学胚胎学中的性能比较分析:ChatGPT、Claude、Gemini和Copilot的跨平台研究
Anat Sci Educ. 2025 May 11. doi: 10.1002/ase.70044.
8
A structured evaluation of LLM-generated step-by-step instructions in cadaveric brachial plexus dissection.对大语言模型生成的尸体臂丛神经解剖分步指导的结构化评估。
BMC Med Educ. 2025 Jul 1;25(1):903. doi: 10.1186/s12909-025-07493-0.
9
A Cross-Sectional Comparison of Patient Information Guides Generated by ChatGPT Versus Google Gemini for Alzheimer's Disease, Parkinsonism, and Migraine.ChatGPT与谷歌Gemini生成的针对阿尔茨海默病、帕金森症和偏头痛的患者信息指南的横断面比较
Cureus. 2025 May 20;17(5):e84507. doi: 10.7759/cureus.84507. eCollection 2025 May.
10
Evaluating the validity and consistency of artificial intelligence chatbots in responding to patients' frequently asked questions in prosthodontics.评估人工智能聊天机器人在回答患者口腔修复学常见问题时的有效性和一致性。
J Prosthet Dent. 2025 Apr 7. doi: 10.1016/j.prosdent.2025.03.009.