Duran Gökhan Serhat, Yurdakurban Ebru, Topsakal Kübra Gülnur
Department of Orthodontics, Gulhane Faculty of Dental Medicine, University of Health Sciences, Ankara, Türkiye.
Cleft Palate Craniofac J. 2025 Apr;62(4):588-595. doi: 10.1177/10556656231222387. Epub 2023 Dec 21.
ObjectiveTo assess the quality, reliability, readability, and similarity of the data that a recently created NLP-based artificial intelligence model ChatGPT 4 provides to users in Cleft Lip and Palate (CLP)-related information.DesignIn the evaluation of the responses provided by the OpenAI ChatGPT to the CLP-related 50 questions, several tools were utilized, including the Ensuring Quality Information for Patients (EQIP) tool, Reliability Scoring System (Adapted from DISCERN), Flesh Reading Ease Formula (FRES) and Flesch-Kinkaid Reading Grade Level (FKRGL) formulas, Global Quality Scale (GQS), and Similarity Index with plagiarism-detection tool. Jamovi (The Jamovi Project, 2022, version 2.3; Sydney, Australia) software was used for all statistical analyses.ResultsBased on the reliability and GQS values, ChatGPT demonstrated high reliability and good quality attributable to CLP. Furthermore, according to the FRES results, ChatGPT's readability is difficult, and the similarity index values of this software exhibit an acceptable level of similarity ratio. There is no significant difference in EQIP, Reliability Score System, FRES, FKGRL, GQS, and Similarity Index values among the two categories.ConclusionOpenAI ChatGPT provides a highly reliable, high-quality, but challenging to read, and acceptable similarity rate in providing information related to CLP. Ensuring that information obtained through these models is verified and assessed by a qualified medical expert is crucial.
目的
评估最近创建的基于自然语言处理的人工智能模型ChatGPT 4在唇腭裂(CLP)相关信息方面向用户提供的数据的质量、可靠性、可读性和相似度。
设计
在评估OpenAI ChatGPT对50个CLP相关问题的回答时,使用了多种工具,包括患者质量信息保障(EQIP)工具、可靠性评分系统(改编自DISCERN)、弗莱什易读性公式(FRES)和弗莱什-金凯德阅读年级水平(FKRGL)公式、全球质量量表(GQS)以及带有抄袭检测工具的相似度指数。所有统计分析均使用Jamovi软件(The Jamovi Project,2022,版本2.3;澳大利亚悉尼)。
结果
基于可靠性和GQS值,ChatGPT在CLP方面表现出高可靠性和良好质量。此外,根据FRES结果,ChatGPT的可读性较差,该软件的相似度指数值显示出可接受的相似度水平。两类之间在EQIP、可靠性评分系统、FRES、FKGRL、GQS和相似度指数值方面没有显著差异。
结论
OpenAI ChatGPT在提供与CLP相关的信息时,提供了高度可靠、高质量但可读性差且相似度可接受的信息。确保通过这些模型获得的信息由合格的医学专家进行验证和评估至关重要。