Croen Brett J, Abdullah Mohammed S, Berns Ellis, Rapaport Sarah, Hahn Alexander K, Barrett Caitlin C, Sobel Andrew D
Department of Orthopaedic Surgery, Penn Medicine, Philadelphia, PA, USA.
Department of Orthopaedic Surgery, University of Connecticut, Farmington, USA.
Hand (N Y). 2024 Apr 25:15589447241247332. doi: 10.1177/15589447241247332.
ChatGPT, an artificial intelligence technology, has the potential to be a useful patient aid, though the accuracy and appropriateness of its responses and recommendations on common hand surgical pathologies and procedures must be understood. Comparing the sources referenced and characteristics of responses from ChatGPT and an established search engine (Google) on carpal tunnel surgery will allow for an understanding of the utility of ChatGPT for patient education.
A Google search of "carpal tunnel release surgery" was performed and "frequently asked questions (FAQs)" were recorded with their answer and source. ChatGPT was then asked to provide answers to the Google FAQs. The FAQs were compared, and answer content was compared using word count, readability analyses, and content source.
There was 40% concordance among questions asked by the programs. Google answered each question with one source per answer, whereas ChatGPT's answers were created from two sources per answer. ChatGPT's answers were significantly longer than Google's and multiple readability analysis algorithms found ChatGPT responses to be statistically significantly more difficult to read and at a higher grade level than Google's. ChatGPT always recommended "contacting your surgeon."
A comparison of ChatGPT's responses to Google's FAQ responses revealed that ChatGPT's answers were more in-depth, from multiple sources, and from a higher proportion of academic Web sites. However, ChatGPT answers were found to be more difficult to understand. Further study is needed to understand if the differences in the responses between programs correlate to a difference in patient comprehension.
人工智能技术ChatGPT有潜力成为有用的患者辅助工具,不过必须了解其对常见手部外科病理和手术的回答及建议的准确性和适当性。比较ChatGPT和成熟搜索引擎(谷歌)关于腕管手术的引用来源及回答特点,将有助于了解ChatGPT在患者教育方面的效用。
在谷歌上搜索“腕管松解手术”,记录“常见问题解答(FAQs)”及其答案和来源。然后让ChatGPT回答谷歌的常见问题。对这些常见问题进行比较,并使用字数统计、可读性分析和内容来源对答案内容进行比较。
两个程序提出的问题中有40%一致。谷歌每个问题的回答都只有一个来源,而ChatGPT的回答每个答案由两个来源生成。ChatGPT的答案明显比谷歌的长,多种可读性分析算法发现ChatGPT的回答在统计学上比谷歌的更难读懂,且阅读难度级别更高。ChatGPT总是建议“联系你的外科医生”。
将ChatGPT的回答与谷歌的常见问题解答回答进行比较发现,ChatGPT的答案更深入,来源多样,且来自学术网站的比例更高。然而,发现ChatGPT的答案更难理解。需要进一步研究以了解两个程序回答的差异是否与患者理解的差异相关。