Gupta Anuj, Basha Adil, Sontam Tarun R, Hlavinka William J, Croen Brett J, Abdou Cherry, Abdullah Mohammed, Hamilton Rita
Texas A&M School of Medicine, Dallas, Texas, USA.
Department of Orthopedic Surgery, University of Pennsylvania Health System, Philadelphia, Pennsylvania, USA.
Proc (Bayl Univ Med Cent). 2025 Feb 28;38(3):221-226. doi: 10.1080/08998280.2025.2470033. eCollection 2025.
This study assessed the comprehensiveness and readability of medical information about complex regional pain syndrome provided by ChatGPT, an artificial intelligence (AI) chatbot, and Google using standardized scoring systems.
A Google search was conducted using the term "complex regional pain syndrome," and the first 10 frequently asked questions (FAQs) and their answers were recorded. ChatGPT was then presented with these Google-generated FAQs, and its responses were evaluated alongside Google's answers using multiple metrics. Finally, ChatGPT was asked to generate its own set of 10 FAQs and answers.
ChatGPT's answers were significantly longer than Google's in response to both independently generated questions (330.0 ± 51.3 words, P < 0.0001) and Google-generated questions (289.7 ± 40.6 words, P < 0.0001). ChatGPT's answers to Google-generated questions were more difficult to read based on the Flesch-Kincaid Reading Ease Score (13.6 ± 10.8, P = 0.017).
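For context on the readability result, the following is a minimal sketch of the standard Flesch Reading Ease formula, on the assumption that this is the score reported here. The function name and the example counts are hypothetical and are not taken from the study. Higher scores indicate easier text, and scores below about 30 are conventionally interpreted as very difficult to read.

```python
def flesch_reading_ease(words: int, sentences: int, syllables: int) -> float:
    """Flesch Reading Ease from raw counts (standard formula, assumed here)."""
    if words <= 0 or sentences <= 0:
        raise ValueError("words and sentences must be positive")
    # Longer sentences and more syllables per word both lower the score.
    return 206.835 - 1.015 * (words / sentences) - 84.6 * (syllables / words)


# Hypothetical passage: 100 words, 5 sentences, 150 syllables.
example_score = flesch_reading_ease(words=100, sentences=5, syllables=150)
```

The sketch takes word, sentence, and syllable counts as inputs rather than raw text, because syllable counting depends on the tool used and the abstract does not say which one the authors applied.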
Our findings suggest that ChatGPT is a promising tool for patient education regarding complex regional pain syndrome based on its ability to generate a variety of question topics with responses from credible sources. That said, challenges such as readability and ethical considerations must be addressed prior to its widespread use for health information.