Jedrzejczak W Wiktor, Kochanek Krzysztof
Institute of Physiology and Pathology of Hearing, Warsaw, Poland.
World Hearing Center, Kajetany, Poland.
Audiol Neurootol. 2024;29(6):457-463. doi: 10.1159/000538983. Epub 2024 May 6.
The purpose of this study was to evaluate three chatbots - OpenAI ChatGPT, Microsoft Bing Chat (currently Copilot), and Google Bard (currently Gemini) - in terms of their responses to a defined set of audiological questions.
Each chatbot was presented with the same 10 questions. The authors rated the responses on a Likert scale ranging from 1 to 5. Additional features, such as the number of inaccuracies or errors and the provision of references, were also examined.
Most responses given by all three chatbots were rated as satisfactory or better. However, all chatbots generated at least a few errors or inaccuracies. ChatGPT achieved the highest overall score, while Bard scored the lowest. Bard was also the only chatbot unable to provide a response to one of the questions. ChatGPT was the only chatbot that did not provide information about its sources.
Chatbots are an intriguing tool for accessing basic information in a specialized area such as audiology. Nevertheless, users need to be careful, as correct information is often mixed with errors that are hard to detect unless the user is well versed in the field.