Exploring the Possible Use of AI Chatbots in Public Health Education: Feasibility Study.

Authors

Baglivo Francesco, De Angelis Luigi, Casigliani Virginia, Arzilli Guglielmo, Privitera Gaetano Pierpaolo, Rizzo Caterina

Affiliations

Department of Translational Research and New Technologies in Medicine and Surgery, University of Pisa, Pisa (PI), Italy.

Training Office, National Institute of Health, Rome, Italy.

Publication information

JMIR Med Educ. 2023 Nov 1;9:e51421. doi: 10.2196/51421.

DOI:10.2196/51421
PMID:37910155
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC10652189/
Abstract

BACKGROUND

Artificial intelligence (AI) is a rapidly developing field with the potential to transform various aspects of health care and public health, including medical training. During the "Hygiene and Public Health" course for fifth-year medical students, a practical training session was conducted on vaccination using AI chatbots as an educational supportive tool. Before receiving specific training on vaccination, the students were given a web-based test extracted from the Italian National Medical Residency Test. After completing the test, a critical correction of each question was performed assisted by AI chatbots.

OBJECTIVE

The main aim of this study was to identify whether AI chatbots can be considered educational support tools for training in public health. The secondary objective was to assess the performance of different AI chatbots on complex multiple-choice medical questions in the Italian language.

METHODS

A test composed of 15 multiple-choice questions on vaccination was extracted from the Italian National Medical Residency Test using targeted keywords and administered to medical students via Google Forms and to different AI chatbot models (Bing Chat, ChatGPT, Chatsonic, Google Bard, and YouChat). The correction of the test was conducted in the classroom, focusing on the critical evaluation of the explanations provided by the chatbot. A Mann-Whitney U test was conducted to compare the performances of medical students and AI chatbots. Student feedback was collected anonymously at the end of the training experience.

RESULTS

In total, 36 medical students and 5 AI chatbot models completed the test. The students achieved an average score of 8.22 (SD 2.65) out of 15, while the AI chatbots scored an average of 12.22 (SD 2.77). The results indicated a statistically significant difference in performance between the 2 groups (U=49.5, P<.001), with a large effect size (r=0.69). When divided by question type (direct, scenario-based, and negative), significant differences were observed in direct (P<.001) and scenario-based (P<.001) questions, but not in negative questions (P=.48). The students reported a high level of satisfaction (7.9/10) with the educational experience, expressing a strong desire to repeat the experience (7.6/10).

CONCLUSIONS

This study demonstrated the efficacy of AI chatbots in answering complex medical questions related to vaccination and providing valuable educational support. Their performance significantly surpassed that of medical students in direct and scenario-based questions. The responsible and critical use of AI chatbots can enhance medical education, making it an essential aspect to integrate into the educational system.
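
The Mann-Whitney U comparison reported in the Results can be illustrated with a short sketch. This is not the authors' analysis code. It is a minimal standard-library version that assumes the normal approximation with tie correction and no continuity correction, and it assumes the common effect-size convention r = |Z| / sqrt(N), which the abstract does not spell out. The function names and the score lists in the usage example are hypothetical and are not the study data.

```python
import math


def rank_with_ties(values):
    """Return 1-based ranks; tied values share the average of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j to cover the whole run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1  # mean of ranks i+1 .. j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def mann_whitney_u(a, b):
    """Two-sided Mann-Whitney U test using the normal approximation.

    Returns (U, Z, p, r), where U is the smaller of the two U statistics
    and r = |Z| / sqrt(N) is an assumed effect-size convention.
    Raises ZeroDivisionError if every observation is identical.
    """
    n1, n2 = len(a), len(b)
    n = n1 + n2
    pooled = list(a) + list(b)
    ranks = rank_with_ties(pooled)

    u1 = sum(ranks[:n1]) - n1 * (n1 + 1) / 2
    u = min(u1, n1 * n2 - u1)

    # Tie correction for the variance of U.
    counts = {}
    for v in pooled:
        counts[v] = counts.get(v, 0) + 1
    tie_term = sum(t ** 3 - t for t in counts.values())
    sigma = math.sqrt(n1 * n2 / 12 * ((n + 1) - tie_term / (n * (n - 1))))

    z = (u - n1 * n2 / 2) / sigma
    p = math.erfc(abs(z) / math.sqrt(2))  # two-sided p-value
    r = abs(z) / math.sqrt(n)
    return u, z, p, r


if __name__ == "__main__":
    # Hypothetical scores out of 15, for illustration only.
    student_scores = [6, 8, 9, 7, 10, 8, 5, 11]
    chatbot_scores = [12, 14, 9, 13, 15]
    u, z, p, r = mann_whitney_u(student_scores, chatbot_scores)
    print(f"U={u}, Z={z:.2f}, p={p:.4f}, r={r:.2f}")
```

With groups as small as the ones in this study, an exact test may be preferable to the normal approximation; the sketch only shows how U, Z, and r relate to one another.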

Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d74f/10652189/aa1183aee140/mededu_v9i1e51421_fig1.jpg
Figure 2: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d74f/10652189/c58d5ae4d88a/mededu_v9i1e51421_fig2.jpg
Figure 3: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d74f/10652189/d6660a573202/mededu_v9i1e51421_fig3.jpg