Sensoy Eyupcan, Citirik Mehmet
Department of Ophthalmology, Ankara Etlik City Hospital, Ankara, Turkey.
Oman J Ophthalmol. 2024 Oct 24;17(3):348-351. doi: 10.4103/ojo.ojo_19_24. eCollection 2024 Sep-Dec.
This study aims to evaluate the knowledge levels of chat generative pretrained transformer (ChatGPT), Bing, and Bard programs, which are three different artificial intelligence chatbots offered to the market free of charge by various manufacturers, regarding neuro-ophthalmological diseases, to examine their usability, and to investigate the existence of their superiority to each other.
Forty questions related to neuro-ophthalmological diseases were obtained from the study questions' section of the American Academy and Ophthalmology 2022-2023 Basic and Clinical Science Course Neuro-ophthalmology Book. The questions were posed to the ChatGPT, Bing, and Bard artificial intelligence chatbots. The answers were evaluated as correct or incorrect. The statistical relationship between the correct and incorrect answer rates offered by the artificial intelligence programs was tested.
The correct answer rates were given by the artificial intelligence programs to the questions asked: ChatGPT - 52.5%; Bing - 55%; and Bard - 65%. There was no statistically significant difference between the correct answer rates of the three artificial intelligence programs ( = 0.489, Pearson's Chi-square test).
Although information about neuro-ophthalmological diseases can be accessed quickly and accurately using up-to-date artificial intelligence programs, the answers given may not always be correct. Care should always be taken when evaluating the answers to the questions.
本研究旨在评估由不同制造商免费提供给市场的三种不同人工智能聊天机器人,即聊天生成预训练变换器(ChatGPT)、必应(Bing)和巴德(Bard)程序,关于神经眼科疾病的知识水平,检验它们的可用性,并调查它们之间是否存在优越性。
从美国眼科学会2022 - 2023年基础与临床科学课程神经眼科学书籍的研究问题部分获取了40个与神经眼科疾病相关的问题。将这些问题抛给ChatGPT、必应和巴德人工智能聊天机器人。答案被评估为正确或错误。测试了人工智能程序给出的正确和错误答案率之间的统计关系。
人工智能程序对所提问题给出的正确答案率如下:ChatGPT - 52.5%;必应 - 55%;巴德 - 65%。这三个人工智能程序的正确答案率之间没有统计学显著差异( = 0.489,Pearson卡方检验)。
尽管使用最新的人工智能程序可以快速准确地获取有关神经眼科疾病的信息,但给出的答案可能并不总是正确的。在评估问题答案时应始终谨慎。