

Acceptability and readability of ChatGPT-4 based responses for frequently asked questions about strabismus and amblyopia.

Authors

Guven S, Ayyildiz B

Affiliations

Kayseri City Hospital, Department of Ophthalmology, Kayseri, Turkey.


Publication information

J Fr Ophtalmol. 2025 Mar;48(3):104400. doi: 10.1016/j.jfo.2024.104400. Epub 2024 Dec 20.

Abstract

PURPOSE

To evaluate the acceptability and readability of ChatGPT-4 responses to common inquiries about strabismus and amblyopia.

MATERIALS AND METHODS

A series of commonly asked questions was compiled, covering topics such as the definition, prevalence, diagnostic approaches, surgical and non-surgical treatment alternatives, postoperative guidelines, surgery-related risks, and visual prognosis associated with strabismus and amblyopia. Each question was asked three times on the online ChatGPT-4 platform, in both English and French, with data collected on February 18, 2024. The responses generated by ChatGPT-4 were evaluated by two independent pediatric ophthalmologists, who classified each as "acceptable," "unacceptable," or "incomplete." Additionally, the online readability assessment tool "readable" was used for the readability analysis.

RESULTS

The majority of responses (97%) to questions regarding strabismus and amblyopia met the criteria for acceptability. The remaining 3% were classified as incomplete, and no unacceptable responses were observed. The mean Flesch-Kincaid Grade Level and Flesch Reading Ease Score were 14.53±1.8 and 23.63±8.2, respectively. The means for the other readability indices, the Coleman-Liau index, the Gunning Fog index, and the SMOG index, were 15.75±1.4, 16.96±2.4, and 16.05±1.6, respectively.
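For readers unfamiliar with the two Flesch metrics reported above: both are simple functions of average sentence length and average syllables per word. The sketch below shows the standard formulas; it is illustrative only and is not the "readable" tool used in the study, and the word, sentence, and syllable counts in the example are hypothetical.

```python
def flesch_kincaid_grade(words: int, sentences: int, syllables: int) -> float:
    """Flesch-Kincaid Grade Level:
    0.39 * (words/sentences) + 11.8 * (syllables/words) - 15.59.
    Higher values mean harder text; ~13+ corresponds to college level."""
    return 0.39 * (words / sentences) + 11.8 * (syllables / words) - 15.59


def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease:
    206.835 - 1.015 * (words/sentences) - 84.6 * (syllables/words).
    Lower values mean harder text; scores below ~30 are very difficult."""
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


# Hypothetical example: a 100-word passage with 5 sentences and 180 syllables.
grade = flesch_kincaid_grade(100, 5, 180)   # 13.45, i.e. college level
ease = flesch_reading_ease(100, 5, 180)     # 34.255, i.e. difficult
```

A grade level around 14.5 with a reading-ease score around 23.6, as reported here, places the ChatGPT-4 responses well above the reading level usually recommended for patient-education materials.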

CONCLUSIONS

ChatGPT-4 consistently produced acceptable responses to the majority of the questions asked (97%). Nevertheless, the readability of these responses proved challenging for the average layperson, requiring a college-level education for comprehension. Further improvement, particularly in readability, is needed to enhance the advisory capacity of this AI software in providing eye- and health-related guidance for patients, physicians, and the general public.

