人工智能与公共卫生：评估ChatGPT对疫苗接种谣言和误解的回应

Artificial Intelligence and Public Health: Evaluating ChatGPT Responses to Vaccination Myths and Misconceptions.

作者信息

Deiana Giovanna, Dettori Marco, Arghittu Antonella, Azara Antonio, Gabutti Giovanni, Castiglia Paolo

机构信息

Department of Biomedical Sciences, University of Sassari, 07100 Sassari, Italy.

Department of Medical, Surgical and Experimental Sciences, University Hospital of Sassari, 07100 Sassari, Italy.

出版信息

Vaccines (Basel). 2023 Jul 7;11(7):1217. doi: 10.3390/vaccines11071217.

DOI:10.3390/vaccines11071217

PMID:37515033

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10386180/

Abstract

Artificial intelligence (AI) tools, such as ChatGPT, are the subject of intense debate regarding their possible applications in contexts such as health care. This study evaluates the Correctness, Clarity, and Exhaustiveness of the answers provided by ChatGPT on the topic of vaccination. The World Health Organization's 11 "myths and misconceptions" about vaccinations were administered to both the free (GPT-3.5) and paid version (GPT-4.0) of ChatGPT. The AI tool's responses were evaluated qualitatively and quantitatively, in reference to those myth and misconceptions provided by WHO, independently by two expert Raters. The agreement between the Raters was significant for both versions ( of K < 0.05). Overall, ChatGPT responses were easy to understand and 85.4% accurate although one of the questions was misinterpreted. Qualitatively, the GPT-4.0 responses were superior to the GPT-3.5 responses in terms of Correctness, Clarity, and Exhaustiveness (Δ = 5.6%, 17.9%, 9.3%, respectively). The study shows that, if appropriately questioned, AI tools can represent a useful aid in the health care field. However, when consulted by non-expert users, without the support of expert medical advice, these tools are not free from the risk of eliciting misleading responses. Moreover, given the existing social divide in information access, the improved accuracy of answers from the paid version raises further ethical issues.

摘要

诸如ChatGPT之类的人工智能（AI）工具在医疗保健等领域的可能应用引发了激烈的争论。本研究评估了ChatGPT在疫苗接种主题上提供答案的正确性、清晰度和详尽性。世界卫生组织关于疫苗接种的11条“误解和错误观念”被用于询问ChatGPT的免费版本（GPT - 3.5）和付费版本（GPT - 4.0）。两位专家评分员独立参照世界卫生组织提供的那些误解和错误观念，对人工智能工具的回答进行了定性和定量评估。两个版本评分员之间的一致性都很显著（K < 0.05）。总体而言，ChatGPT的回答易于理解，尽管其中一个问题被误解了，但准确率仍达85.4%。定性地说，GPT - 4.0的回答在正确性、清晰度和详尽性方面优于GPT - 3.5的回答（分别相差5.6%、17.9%、9.3%）。该研究表明，如果提问恰当，人工智能工具在医疗保健领域可以是一种有用的辅助手段。然而，当非专业用户在没有专家医疗建议支持的情况下咨询这些工具时，它们存在引发误导性回答的风险。此外，鉴于现有的信息获取社会差距，付费版本回答准确性的提高引发了进一步的伦理问题。

相似文献

Artificial Intelligence and Public Health: Evaluating ChatGPT Responses to Vaccination Myths and Misconceptions.人工智能与公共卫生：评估ChatGPT对疫苗接种谣言和误解的回应

Vaccines (Basel). 2023 Jul 7;11(7):1217. doi: 10.3390/vaccines11071217.

Assessing the Accuracy of Generative Conversational Artificial Intelligence in Debunking Sleep Health Myths: Mixed Methods Comparative Study With Expert Analysis.评估生成式对话人工智能在破除睡眠健康误区方面的准确性：采用专家分析的混合方法比较研究

JMIR Form Res. 2024 Apr 16;8:e55762. doi: 10.2196/55762.

Using ChatGPT to evaluate cancer myths and misconceptions: artificial intelligence and cancer information.利用 ChatGPT 评估癌症谣言和误解：人工智能与癌症信息。

JNCI Cancer Spectr. 2023 Mar 1;7(2). doi: 10.1093/jncics/pkad015.

Accuracy of ChatGPT on Medical Questions in the National Medical Licensing Examination in Japan: Evaluation Study.ChatGPT在日本国家医师资格考试医学问题上的准确性：评估研究

JMIR Form Res. 2023 Oct 13;7:e48023. doi: 10.2196/48023.

Evaluating the effectiveness of artificial intelligence-based tools in detecting and understanding sleep health misinformation: Comparative analysis using Google Bard and OpenAI ChatGPT-4.评估基于人工智能的工具在检测和理解睡眠健康错误信息方面的有效性：使用 Google Bard 和 OpenAI ChatGPT-4 的比较分析。

J Sleep Res. 2024 Dec;33(6):e14210. doi: 10.1111/jsr.14210. Epub 2024 Apr 5.

Performance of ChatGPT on the Peruvian National Licensing Medical Examination: Cross-Sectional Study.ChatGPT在秘鲁国家医学执照考试中的表现：横断面研究

JMIR Med Educ. 2023 Sep 28;9:e48039. doi: 10.2196/48039.

Debunking Palliative Care Myths: Assessing the Performance of Artificial Intelligence Chatbots (ChatGPT vs. Google Gemini).揭穿姑息治疗的神话：评估人工智能聊天机器人的表现（ChatGPT与谷歌Gemini对比）

Indian J Palliat Care. 2024 Jul-Sep;30(3):284-287. doi: 10.25259/IJPC_44_2024. Epub 2024 Aug 9.

Decoding dietary myths: The role of ChatGPT in modern nutrition.解读饮食误区：ChatGPT 在现代营养学中的作用。

Clin Nutr ESPEN. 2024 Apr;60:285-288. doi: 10.1016/j.clnesp.2024.02.022. Epub 2024 Feb 23.

Evaluating the accuracy and reliability of AI chatbots in disseminating the content of current resuscitation guidelines: a comparative analysis between the ERC 2021 guidelines and both ChatGPTs 3.5 and 4.评估 AI 聊天机器人在传播最新复苏指南内容方面的准确性和可靠性：ERC 2021 指南与 ChatGPT 3.5 和 4 之间的比较分析

Scand J Trauma Resusc Emerg Med. 2024 Sep 26;32(1):95. doi: 10.1186/s13049-024-01266-2.

Vaccination hesitancy: agreement between WHO and ChatGPT-4.0 or Gemini Advanced.疫苗接种犹豫：世界卫生组织与ChatGPT-4.0或Gemini Advanced之间的一致性

Ann Ig. 2025 May-Jun;37(3):390-396. doi: 10.7416/ai.2024.2657. Epub 2024 Oct 7.

引用本文的文献

Assessing the accuracy, repeatability, and consistency of ChatGPT 4o in treatment planning for tooth-supported fixed prostheses: a comparative analysis of simple and complex clinical cases.评估ChatGPT 4o在牙支持固定修复体治疗计划中的准确性、可重复性和一致性：简单与复杂临床病例的对比分析

Clin Oral Investig. 2025 Sep 2;29(9):433. doi: 10.1007/s00784-025-06521-z.

ChatGPT and human dietitian responses to diet-related questions on an online Q&A platform: A comparative study.ChatGPT与人类营养师在在线问答平台上对饮食相关问题的回答：一项比较研究。

Digit Health. 2025 Aug 21;11:20552076251361381. doi: 10.1177/20552076251361381. eCollection 2025 Jan-Dec.

Evaluation of the accuracy of ChatGPT-4 and Gemini's responses to the World Dental Federation's frequently asked questions on oral health.评估ChatGPT-4和Gemini对世界牙科联盟关于口腔健康常见问题的回答的准确性。

BMC Oral Health. 2025 Aug 2;25(1):1293. doi: 10.1186/s12903-025-06624-9.

Assessing the accuracy and explainability of using ChatGPT to evaluate the quality of health news.评估使用ChatGPT评估健康新闻质量的准确性和可解释性。

BMC Public Health. 2025 Jun 2;25(1):2038. doi: 10.1186/s12889-025-23206-0.

Evaluating the influence of prompt formulation on the reliability and repeatability of ChatGPT in implant-supported prostheses.评估提示词制定对ChatGPT在种植体支持式修复体方面的可靠性和可重复性的影响。

PLoS One. 2025 May 30;20(5):e0323086. doi: 10.1371/journal.pone.0323086. eCollection 2025.

Online Health Information-Seeking in the Era of Large Language Models: Cross-Sectional Web-Based Survey Study.大语言模型时代的在线健康信息搜索：基于网络的横断面调查研究

J Med Internet Res. 2025 Mar 31;27:e68560. doi: 10.2196/68560.

Generative AI Decision-Making Attributes in Complex Health Services: A Rapid Review.复杂医疗服务中的生成式人工智能决策属性：快速综述

Cureus. 2025 Jan 30;17(1):e78257. doi: 10.7759/cureus.78257. eCollection 2025 Jan.

Is there any room for ChatGPT AI bot in speech-language pathology?在言语语言病理学领域，ChatGPT人工智能聊天机器人有立足之地吗？

Eur Arch Otorhinolaryngol. 2025 Jun;282(6):3267-3280. doi: 10.1007/s00405-025-09295-y. Epub 2025 Mar 1.

Large Language Models for Chatbot Health Advice Studies: A Systematic Review.用于聊天机器人健康建议研究的大语言模型：一项系统综述。

JAMA Netw Open. 2025 Feb 3;8(2):e2457879. doi: 10.1001/jamanetworkopen.2024.57879.

A comparison of the persuasiveness of human and ChatGPT generated pro-vaccine messages for HPV.人类与ChatGPT生成的HPV疫苗支持信息的说服力比较。

Front Public Health. 2025 Jan 16;12:1515871. doi: 10.3389/fpubh.2024.1515871. eCollection 2024.

本文引用的文献

Large language models encode clinical knowledge.大语言模型编码临床知识。

Nature. 2023 Aug;620(7972):172-180. doi: 10.1038/s41586-023-06291-2. Epub 2023 Jul 12.

Embracing Large Language Models for Medical Applications: Opportunities and Challenges.拥抱用于医学应用的大语言模型：机遇与挑战。

Cureus. 2023 May 21;15(5):e39305. doi: 10.7759/cureus.39305. eCollection 2023 May.

ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations.医学领域的ChatGPT：其应用、优势、局限性、未来前景及伦理考量概述

Front Artif Intell. 2023 May 4;6:1169595. doi: 10.3389/frai.2023.1169595. eCollection 2023.

Ethics of large language models in medicine and medical research.医学及医学研究中大型语言模型的伦理问题。

Lancet Digit Health. 2023 Jun;5(6):e333-e335. doi: 10.1016/S2589-7500(23)00083-3. Epub 2023 Apr 27.

First Year of Special Issue "New Insights in Vaccination and Public Health": Opinions and Considerations.《疫苗接种与公共卫生新见解》特刊创刊第一年：观点与思考

Vaccines (Basel). 2023 Mar 6;11(3):600. doi: 10.3390/vaccines11030600.

ChatGPT Utility in Healthcare Education, Research, and Practice: Systematic Review on the Promising Perspectives and Valid Concerns.ChatGPT在医学教育、研究与实践中的应用：对其前景与合理担忧的系统评价

Healthcare (Basel). 2023 Mar 19;11(6):887. doi: 10.3390/healthcare11060887.

Determinants of COVID-19 vaccine fatigue.新冠疫苗疲劳的决定因素。

Nat Med. 2023 May;29(5):1164-1171. doi: 10.1038/s41591-023-02282-y. Epub 2023 Mar 27.

Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine.注意力并非全部所需：在医疗保健和医学中使用大型语言模型所涉及的复杂伦理问题。

EBioMedicine. 2023 Apr;90:104512. doi: 10.1016/j.ebiom.2023.104512. Epub 2023 Mar 15.

On the cusp: Considering the impact of artificial intelligence language models in healthcare.处于临界点：思考人工智能语言模型在医疗保健领域的影响。

Med. 2023 Mar 10;4(3):139-140. doi: 10.1016/j.medj.2023.02.008.

Coronavirus Disease-2019 Vaccine Hesitancy.新型冠状病毒病 2019 疫苗犹豫

Pediatr Clin North Am. 2023 Apr;70(2):243-257. doi: 10.1016/j.pcl.2022.12.001. Epub 2022 Dec 8.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验