检验人工智能程序在神经眼科疾病方面的能力并分析它们的相对优势。

Examining the competence of artificial intelligence programs in neuro-ophthalmological disorders and analyzing their comparative superiority.

作者信息

Sensoy Eyupcan, Citirik Mehmet

机构信息

Department of Ophthalmology, Ankara Etlik City Hospital, Ankara, Turkey.

出版信息

Oman J Ophthalmol. 2024 Oct 24;17(3):348-351. doi: 10.4103/ojo.ojo_19_24. eCollection 2024 Sep-Dec.

DOI:10.4103/ojo.ojo_19_24

PMID:39651501

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11620313/

Abstract

BACKGROUND

This study aims to evaluate the knowledge levels of chat generative pretrained transformer (ChatGPT), Bing, and Bard programs, which are three different artificial intelligence chatbots offered to the market free of charge by various manufacturers, regarding neuro-ophthalmological diseases, to examine their usability, and to investigate the existence of their superiority to each other.

MATERIALS AND METHODS

Forty questions related to neuro-ophthalmological diseases were obtained from the study questions' section of the American Academy and Ophthalmology 2022-2023 Basic and Clinical Science Course Neuro-ophthalmology Book. The questions were posed to the ChatGPT, Bing, and Bard artificial intelligence chatbots. The answers were evaluated as correct or incorrect. The statistical relationship between the correct and incorrect answer rates offered by the artificial intelligence programs was tested.

RESULTS

The correct answer rates were given by the artificial intelligence programs to the questions asked: ChatGPT - 52.5%; Bing - 55%; and Bard - 65%. There was no statistically significant difference between the correct answer rates of the three artificial intelligence programs ( = 0.489, Pearson's Chi-square test).

CONCLUSION

Although information about neuro-ophthalmological diseases can be accessed quickly and accurately using up-to-date artificial intelligence programs, the answers given may not always be correct. Care should always be taken when evaluating the answers to the questions.

摘要

背景

本研究旨在评估由不同制造商免费提供给市场的三种不同人工智能聊天机器人，即聊天生成预训练变换器（ChatGPT）、必应（Bing）和巴德（Bard）程序，关于神经眼科疾病的知识水平，检验它们的可用性，并调查它们之间是否存在优越性。

材料与方法

从美国眼科学会2022 - 2023年基础与临床科学课程神经眼科学书籍的研究问题部分获取了40个与神经眼科疾病相关的问题。将这些问题抛给ChatGPT、必应和巴德人工智能聊天机器人。答案被评估为正确或错误。测试了人工智能程序给出的正确和错误答案率之间的统计关系。

结果

人工智能程序对所提问题给出的正确答案率如下：ChatGPT - 52.5%；必应 - 55%；巴德 - 65%。这三个人工智能程序的正确答案率之间没有统计学显著差异（ = 0.489，Pearson卡方检验）。

结论

尽管使用最新的人工智能程序可以快速准确地获取有关神经眼科疾病的信息，但给出的答案可能并不总是正确的。在评估问题答案时应始终谨慎。

Suppr 超能文献

文献检索

文件翻译

深度研究

Suppr 超能文献

文献检索

文件翻译

深度研究

检验人工智能程序在神经眼科疾病方面的能力并分析它们的相对优势。

Examining the competence of artificial intelligence programs in neuro-ophthalmological disorders and analyzing their comparative superiority.

作者信息

机构信息

出版信息

BACKGROUND

MATERIALS AND METHODS

RESULTS

CONCLUSION

背景

材料与方法

结果

结论

相似文献

本文引用的文献

检验人工智能程序在神经眼科疾病方面的能力并分析它们的相对优势。

Examining the competence of artificial intelligence programs in neuro-ophthalmological disorders and analyzing their comparative superiority.

作者信息

机构信息

出版信息

BACKGROUND

MATERIALS AND METHODS

RESULTS

CONCLUSION

背景

材料与方法

结果

结论

相似文献

本文引用的文献