研究人工智能程序在评估眼部炎症、葡萄膜疾病及治疗方式相关知识水平方面的比较优势。

Investigating the comparative superiority of artificial intelligence programs in assessing knowledge levels regarding ocular inflammation, uvea diseases, and treatment modalities.

作者信息

Sensoy Eyupcan, Citirik Mehmet

机构信息

Department of Ophthalmology, Ankara Etlik City Hospital, Ankara, Turkey.

出版信息

Taiwan J Ophthalmol. 2024 Sep 13;14(3):409-413. doi: 10.4103/tjo.TJO-D-23-00166. eCollection 2024 Jul-Sep.

DOI:10.4103/tjo.TJO-D-23-00166

PMID:39430359

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11488809/

Abstract

PURPOSE

The purpose of the study was to evaluate the knowledge level of the Chat Generative Pretrained Transformer (ChatGPT), Bard, and Bing artificial intelligence (AI) chatbots regarding ocular inflammation, uveal diseases, and treatment modalities, and to investigate their relative performance compared to one another.

MATERIALS AND METHODS

Thirty-six questions related to ocular inflammation, uveal diseases, and treatment modalities were posed to the ChatGPT, Bard, and Bing AI chatbots, and both correct and incorrect responses were recorded. The accuracy rates were compared using the Chi-squared test.

RESULTS

The ChatGPT provided correct answers to 52.8% of the questions, while Bard answered 38.9% correctly, and Bing answered 44.4% correctly. All three AI programs provided identical responses to 20 (55.6%) of the questions, with 45% of these responses being correct and 55% incorrect. No significant difference was observed between the correct and incorrect responses from the three AI chatbots ( = 0.654).

CONCLUSION

AI chatbots should be developed to provide widespread access to accurate information about ocular inflammation, uveal diseases, and treatment modalities. Future research could explore ways to enhance the performance of these chatbots.

摘要

目的

本研究旨在评估聊天生成预训练变换器（ChatGPT）、巴德（Bard）和必应（Bing）人工智能（AI）聊天机器人关于眼部炎症、葡萄膜疾病及治疗方式的知识水平，并调查它们彼此之间的相对性能。

材料与方法

向ChatGPT、巴德和必应AI聊天机器人提出36个与眼部炎症、葡萄膜疾病及治疗方式相关的问题，记录正确和错误的回答。使用卡方检验比较准确率。

结果

ChatGPT对52.8%的问题给出了正确答案，而巴德的正确回答率为38.9%，必应的正确回答率为44.4%。三个AI程序对20个（55.6%）问题给出了相同回答，其中45%的回答正确，55%错误。三个AI聊天机器人的正确和错误回答之间未观察到显著差异（ = 0.654）。

结论

应开发AI聊天机器人，以便广泛获取有关眼部炎症、葡萄膜疾病及治疗方式的准确信息。未来的研究可以探索提高这些聊天机器人性能的方法。

相似文献

Investigating the comparative superiority of artificial intelligence programs in assessing knowledge levels regarding ocular inflammation, uvea diseases, and treatment modalities.研究人工智能程序在评估眼部炎症、葡萄膜疾病及治疗方式相关知识水平方面的比较优势。

Taiwan J Ophthalmol. 2024 Sep 13;14(3):409-413. doi: 10.4103/tjo.TJO-D-23-00166. eCollection 2024 Jul-Sep.

Evaluation of Current Artificial Intelligence Programs on the Knowledge of Glaucoma.当前人工智能程序对青光眼知识的评估

Klin Monbl Augenheilkd. 2024 Oct;241(10):1140-1144. doi: 10.1055/a-2327-8484. Epub 2024 Jul 24.

Performance of Artificial Intelligence Chatbots on Glaucoma Questions Adapted From Patient Brochures.人工智能聊天机器人对改编自患者手册的青光眼问题的回答情况。

Cureus. 2024 Mar 23;16(3):e56766. doi: 10.7759/cureus.56766. eCollection 2024 Mar.

A comparative study on the knowledge levels of artificial intelligence programs in diagnosing ophthalmic pathologies and intraocular tumors evaluated their superiority and potential utility.一项关于人工智能程序在诊断眼科疾病和眼内肿瘤方面知识水平的比较研究评估了它们的优越性和潜在效用。

Int Ophthalmol. 2023 Dec;43(12):4905-4909. doi: 10.1007/s10792-023-02893-x. Epub 2023 Oct 26.

Exploring Artificial Intelligence Programs' Understanding of Lens, Cataract, and Refractive Surgery Information.探索人工智能程序对晶状体、白内障和屈光手术信息的理解。

Middle East Afr J Ophthalmol. 2024 Sep 13;30(3):173-176. doi: 10.4103/meajo.meajo_199_23. eCollection 2023 Jul-Sep.

Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性：公众需谨慎。

Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.

Assessing the proficiency of artificial intelligence programs in the diagnosis and treatment of cornea, conjunctiva, and eyelid diseases and exploring the advantages of each other benefits.评估人工智能程序在角膜、结膜和眼睑疾病的诊断和治疗中的熟练程度，并探索彼此的优势互补。

Cont Lens Anterior Eye. 2024 Apr;47(2):102125. doi: 10.1016/j.clae.2024.102125. Epub 2024 Mar 4.

Assessing the Accuracy of Information on Medication Abortion: A Comparative Analysis of ChatGPT and Google Bard AI.评估药物流产信息的准确性：ChatGPT与谷歌巴德人工智能的比较分析

Cureus. 2024 Jan 2;16(1):e51544. doi: 10.7759/cureus.51544. eCollection 2024 Jan.

Performance of ChatGPT-4 and Bard chatbots in responding to common patient questions on prostate cancer Lu-PSMA-617 therapy.ChatGPT-4和Bard聊天机器人在回答关于前列腺癌Lu-PSMA-617疗法常见患者问题方面的表现

Front Oncol. 2024 Jul 12;14:1386718. doi: 10.3389/fonc.2024.1386718. eCollection 2024.

Comparison of the Audiological Knowledge of Three Chatbots: ChatGPT, Bing Chat, and Bard.三款聊天机器人的听力学知识比较：ChatGPT、必应聊天和巴德

Audiol Neurootol. 2024;29(6):457-463. doi: 10.1159/000538983. Epub 2024 May 6.

引用本文的文献

Reply to Comment on "Evaluation and Comparison of the Knowledge Levels of Current Artificial Intelligence Programs on Retinal/Vitreous Diseases and Treatment Methods".对《当前人工智能程序在视网膜/玻璃体疾病及治疗方法方面的知识水平评估与比较》评论的回复

J Curr Ophthalmol. 2025 Jun 5;36(3):311. doi: 10.4103/joco.joco_266_24. eCollection 2024 Jul-Sep.

本文引用的文献

ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports.ChatGPT 让医学文献通俗易懂：简化放射学报告的探索性案例研究。

Eur Radiol. 2024 May;34(5):2817-2825. doi: 10.1007/s00330-023-10213-1. Epub 2023 Oct 5.

The Potential Role of Large Language Models in Uveitis Care: Perspectives After ChatGPT and Bard Launch.大语言模型在葡萄膜炎护理中的潜在作用：ChatGPT和Bard发布后的观点

Ocul Immunol Inflamm. 2024 Sep;32(7):1435-1439. doi: 10.1080/09273948.2023.2242462. Epub 2023 Aug 10.

Performance of Generative Large Language Models on Ophthalmology Board-Style Questions.生成式大型语言模型在眼科 Board 式问题中的表现。

Am J Ophthalmol. 2023 Oct;254:141-149. doi: 10.1016/j.ajo.2023.05.024. Epub 2023 Jun 18.

ChatGPT - Reshaping medical education and clinical management.ChatGPT——重塑医学教育与临床管理。

Pak J Med Sci. 2023 Mar-Apr;39(2):605-607. doi: 10.12669/pjms.39.2.7653.

The future of ChatGPT in academic research and publishing: A commentary for clinical and translational medicine.ChatGPT在学术研究与出版领域的未来：一篇针对临床与转化医学的评论

Clin Transl Med. 2023 Mar;13(3):e1207. doi: 10.1002/ctm2.1207.

Performance of ChatGPT on USMLE: Potential for AI-assisted medical education using large language models.ChatGPT在美国医师执照考试中的表现：使用大语言模型进行人工智能辅助医学教育的潜力。

PLOS Digit Health. 2023 Feb 9;2(2):e0000198. doi: 10.1371/journal.pdig.0000198. eCollection 2023 Feb.

How Does ChatGPT Perform on the United States Medical Licensing Examination (USMLE)? The Implications of Large Language Models for Medical Education and Knowledge Assessment.ChatGPT在美国医师执照考试（USMLE）中的表现如何？大语言模型对医学教育和知识评估的影响。

JMIR Med Educ. 2023 Feb 8;9:e45312. doi: 10.2196/45312.

The current state of artificial intelligence in ophthalmology.人工智能在眼科学中的应用现状。

Surv Ophthalmol. 2019 Mar-Apr;64(2):233-240. doi: 10.1016/j.survophthal.2018.09.002. Epub 2018 Sep 22.

Deep learning applications in ophthalmology.深度学习在眼科中的应用。

Curr Opin Ophthalmol. 2018 May;29(3):254-260. doi: 10.1097/ICU.0000000000000470.

Electronic Health Records: Then, Now, and in the Future.电子健康记录：过去、现在与未来。

Yearb Med Inform. 2016 May 20;Suppl 1(Suppl 1):S48-61. doi: 10.15265/IYS-2016-s006.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验