Comparison of the Audiological Knowledge of Three Chatbots: ChatGPT, Bing Chat, and Bard.

Authors

Jedrzejczak W Wiktor, Kochanek Krzysztof

Affiliations

Institute of Physiology and Pathology of Hearing, Warsaw, Poland.

World Hearing Center, Kajetany, Poland.

Publication information

Audiol Neurootol. 2024;29(6):457-463. doi: 10.1159/000538983. Epub 2024 May 6.

DOI:10.1159/000538983

PMID:38710158

Abstract

INTRODUCTION

The purpose of this study was to evaluate three chatbots - OpenAI ChatGPT, Microsoft Bing Chat (currently Copilot), and Google Bard (currently Gemini) - in terms of their responses to a defined set of audiological questions.

METHODS

Each chatbot was presented with the same 10 questions. The authors rated the responses on a Likert scale ranging from 1 to 5. Additional features, such as the number of inaccuracies or errors and the provision of references, were also examined.

RESULTS

Most responses given by all three chatbots were rated as satisfactory or better. However, all chatbots generated at least a few errors or inaccuracies. ChatGPT achieved the highest overall score, while Bard was the worst. Bard was also the only chatbot unable to provide a response to one of the questions. ChatGPT was the only chatbot that did not provide information about its sources.

CONCLUSIONS

Chatbots are an intriguing tool that can be used to access basic information in a specialized area like audiology. Nevertheless, one needs to be careful, as correct information is not infrequently mixed in with errors that are hard to pick up unless the user is well versed in the field.

Similar articles

Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.

Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.

Performance of Artificial Intelligence Chatbots on Glaucoma Questions Adapted From Patient Brochures.

Cureus. 2024 Mar 23;16(3):e56766. doi: 10.7759/cureus.56766. eCollection 2024 Mar.

Accuracy of Prospective Assessments of 4 Large Language Model Chatbot Responses to Patient Questions About Emergency Care: Experimental Comparative Study.

J Med Internet Res. 2024 Nov 4;26:e60291. doi: 10.2196/60291.

The promising role of chatbots in keratorefractive surgery patient education.

J Fr Ophtalmol. 2025 Feb;48(2):104381. doi: 10.1016/j.jfo.2024.104381. Epub 2024 Dec 13.

Comparison of artificial intelligence large language model chatbots in answering frequently asked questions in anaesthesia.

BJA Open. 2024 May 8;10:100280. doi: 10.1016/j.bjao.2024.100280. eCollection 2024 Jun.

Evaluation and Comparison of the Knowledge Levels of Current Artificial Intelligence Programs on Retinal/Vitreous Diseases and Treatment Methods.

J Curr Ophthalmol. 2024 Oct 16;36(1):78-81. doi: 10.4103/joco.joco_192_23. eCollection 2024 Jan-Mar.

Efficacy of AI Chats to Determine an Emergency: A Comparison Between OpenAI's ChatGPT, Google Bard, and Microsoft Bing AI Chat.

Cureus. 2023 Sep 18;15(9):e45473. doi: 10.7759/cureus.45473. eCollection 2023 Sep.

The Performance of Chatbots and the AAPOS Website as a Tool for Amblyopia Education.

J Pediatr Ophthalmol Strabismus. 2024 Sep-Oct;61(5):325-331. doi: 10.3928/01913913-20240409-01. Epub 2024 May 30.

Assessment of readability, reliability, and quality of ChatGPT®, BARD®, Gemini®, Copilot®, Perplexity® responses on palliative care.

Medicine (Baltimore). 2024 Aug 16;103(33):e39305. doi: 10.1097/MD.0000000000039305.

Cited by

Artificial intelligence in healthcare education: evaluating the accuracy of ChatGPT, Copilot, and Google Gemini in cardiovascular pharmacology.

Front Med (Lausanne). 2025 Feb 19;12:1495378. doi: 10.3389/fmed.2025.1495378. eCollection 2025.

Performance of ChatGPT in Pediatric Audiology as Rated by Students and Experts.

J Clin Med. 2025 Jan 28;14(3):875. doi: 10.3390/jcm14030875.

Quality of Chatbot Responses to the Most Popular Questions Regarding Erectile Dysfunction.

Urol Res Pract. 2025 Jan 3;50(4):253-260. doi: 10.5152/tud.2025.24098.

Comparative analysis of BERT-based and generative large language models for detecting suicidal ideation: a performance evaluation study.

Cad Saude Publica. 2024 Nov 25;40(10):e00028824. doi: 10.1590/0102-311XEN028824. eCollection 2024.

Artificial Intelligence in Audiology: A Scoping Review of Current Applications and Future Directions.

Sensors (Basel). 2024 Nov 6;24(22):7126. doi: 10.3390/s24227126.

PICOT questions and search strategies formulation: A novel approach using artificial intelligence automation.

J Nurs Scholarsh. 2025 Jan;57(1):5-16. doi: 10.1111/jnu.13036. Epub 2024 Nov 24.

Optimizing athletic performance through advanced nutrition strategies: can AI and digital platforms have a role in ultraendurance sports?

Biol Sport. 2024 Oct;41(4):305-313. doi: 10.5114/biolsport.2024.141063. Epub 2024 Jul 23.

Accuracy and Repeatability of ChatGPT Based on a Set of Multiple-Choice Questions on Objective Tests of Hearing.

Cureus. 2024 May 8;16(5):e59857. doi: 10.7759/cureus.59857. eCollection 2024 May.

ChatGPT for Tinnitus Information and Support: Response Accuracy and Retest after Three and Six Months.

Brain Sci. 2024 May 7;14(5):465. doi: 10.3390/brainsci14050465.

Exploring the Performance of ChatGPT-4 in the Taiwan Audiologist Qualification Examination: Preliminary Observational Study Highlighting the Potential of AI Chatbots in Hearing Care.

JMIR Med Educ. 2024 Apr 26;10:e55595. doi: 10.2196/55595.
