基于大语言人工智能模型的腕管松解术患者教育材料评估

Evaluation of Patient Education Materials From Large-Language Artificial Intelligence Models on Carpal Tunnel Release.

作者信息

Croen Brett J, Abdullah Mohammed S, Berns Ellis, Rapaport Sarah, Hahn Alexander K, Barrett Caitlin C, Sobel Andrew D

机构信息

Department of Orthopaedic Surgery, Penn Medicine, Philadelphia, PA, USA.

Department of Orthopaedic Surgery, University of Connecticut, Farmington, USA.

出版信息

Hand (N Y). 2024 Apr 25:15589447241247332. doi: 10.1177/15589447241247332.

DOI:10.1177/15589447241247332

PMID:38660977

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11571324/

Abstract

BACKGROUND

ChatGPT, an artificial intelligence technology, has the potential to be a useful patient aid, though the accuracy and appropriateness of its responses and recommendations on common hand surgical pathologies and procedures must be understood. Comparing the sources referenced and characteristics of responses from ChatGPT and an established search engine (Google) on carpal tunnel surgery will allow for an understanding of the utility of ChatGPT for patient education.

METHODS

A Google search of "carpal tunnel release surgery" was performed and "frequently asked questions (FAQs)" were recorded with their answer and source. ChatGPT was then asked to provide answers to the Google FAQs. The FAQs were compared, and answer content was compared using word count, readability analyses, and content source.

RESULTS

There was 40% concordance among questions asked by the programs. Google answered each question with one source per answer, whereas ChatGPT's answers were created from two sources per answer. ChatGPT's answers were significantly longer than Google's and multiple readability analysis algorithms found ChatGPT responses to be statistically significantly more difficult to read and at a higher grade level than Google's. ChatGPT always recommended "contacting your surgeon."

CONCLUSION

A comparison of ChatGPT's responses to Google's FAQ responses revealed that ChatGPT's answers were more in-depth, from multiple sources, and from a higher proportion of academic Web sites. However, ChatGPT answers were found to be more difficult to understand. Further study is needed to understand if the differences in the responses between programs correlate to a difference in patient comprehension.

摘要

背景

人工智能技术ChatGPT有潜力成为有用的患者辅助工具，不过必须了解其对常见手部外科病理和手术的回答及建议的准确性和适当性。比较ChatGPT和成熟搜索引擎（谷歌）关于腕管手术的引用来源及回答特点，将有助于了解ChatGPT在患者教育方面的效用。

方法

在谷歌上搜索“腕管松解手术”，记录“常见问题解答（FAQs）”及其答案和来源。然后让ChatGPT回答谷歌的常见问题。对这些常见问题进行比较，并使用字数统计、可读性分析和内容来源对答案内容进行比较。

结果

两个程序提出的问题中有40%一致。谷歌每个问题的回答都只有一个来源，而ChatGPT的回答每个答案由两个来源生成。ChatGPT的答案明显比谷歌的长，多种可读性分析算法发现ChatGPT的回答在统计学上比谷歌的更难读懂，且阅读难度级别更高。ChatGPT总是建议“联系你的外科医生”。

结论

将ChatGPT的回答与谷歌的常见问题解答回答进行比较发现，ChatGPT的答案更深入，来源多样，且来自学术网站的比例更高。然而，发现ChatGPT的答案更难理解。需要进一步研究以了解两个程序回答的差异是否与患者理解的差异相关。

相似文献

Evaluation of Patient Education Materials From Large-Language Artificial Intelligence Models on Carpal Tunnel Release.基于大语言人工智能模型的腕管松解术患者教育材料评估

Hand (N Y). 2024 Apr 25:15589447241247332. doi: 10.1177/15589447241247332.

Evaluating ChatGPT's Utility in Biologic Therapy for Systemic Lupus Erythematosus: Comparative Study of ChatGPT and Google Web Search.评估ChatGPT在系统性红斑狼疮生物治疗中的效用：ChatGPT与谷歌网络搜索的比较研究

JMIR Form Res. 2025 Aug 28;9:e76458. doi: 10.2196/76458.

Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients?来自大语言模型或网络资源的关于肌肉骨骼恶性肿瘤的信息对患者来说是否处于合适的阅读水平？

Clin Orthop Relat Res. 2025 Feb 1;483(2):306-315. doi: 10.1097/CORR.0000000000003263. Epub 2024 Sep 25.

Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能：ChatGPT与谷歌Gemini的较量

Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.

Prescription of Controlled Substances: Benefits and Risks管制药品的处方：益处与风险

Comparison of Responses from ChatGPT-4, Google Gemini, and Google Search to Common Patient Questions About Ankle Sprains: A Readability Analysis.ChatGPT-4、谷歌Gemini和谷歌搜索对关于脚踝扭伤的常见患者问题的回答比较：可读性分析

J Am Acad Orthop Surg. 2025 Jul 3;33(16):924-930. doi: 10.5435/JAAOS-D-25-00260.

Assessing ChatGPT's Educational Potential in Lung Cancer Radiotherapy From Clinician and Patient Perspectives: Content Quality and Readability Analysis.从临床医生和患者角度评估ChatGPT在肺癌放疗中的教育潜力：内容质量与可读性分析

JMIR Cancer. 2025 Aug 13;11:e69783. doi: 10.2196/69783.

Is ChatGPT a more academic source than google searches for patient questions about hip arthroscopy? An analysis of the most frequently asked questions.对于患者关于髋关节镜检查的问题，ChatGPT 比谷歌搜索是更具学术性的信息来源吗？对最常见问题的分析。

J ISAKOS. 2025 Jun;12:100892. doi: 10.1016/j.jisako.2025.100892. Epub 2025 May 3.

Endoscopic release for carpal tunnel syndrome.腕管综合征的内镜下松解术

Cochrane Database Syst Rev. 2014 Jan 31;2014(1):CD008265. doi: 10.1002/14651858.CD008265.pub2.

Burn Patient Education in the Modern Age: A Comparative Analysis of ChatGPT and Google Performance Answering Common Questions on Burn Injury and Management.现代烧伤患者教育：ChatGPT与谷歌在回答烧伤损伤及处理常见问题方面的性能对比分析

J Burn Care Res. 2025 Jan 6. doi: 10.1093/jbcr/irae211.

引用本文的文献

The Emergence of Applied Artificial Intelligence in the Realm of Value Based Musculoskeletal Care.基于价值的肌肉骨骼护理领域中应用人工智能的出现。

Curr Rev Musculoskelet Med. 2025 Jun 14. doi: 10.1007/s12178-025-09982-7.

The Availability and Readability of Multilingual Online Patient Materials for Legg-Calve-Perthes Disease.关于Legg-Calve-Perthes病的多语言在线患者资料的可获取性与可读性

J Pediatr Soc North Am. 2025 Mar 6;11:100169. doi: 10.1016/j.jposna.2025.100169. eCollection 2025 May.

Readability, reliability and quality of responses generated by ChatGPT, gemini, and perplexity for the most frequently asked questions about pain.ChatGPT、Gemini和Perplexity针对最常见疼痛问题生成的回答的可读性、可靠性和质量。

Medicine (Baltimore). 2025 Mar 14;104(11):e41780. doi: 10.1097/MD.0000000000041780.

ChatGPT-3.5 and -4.0 Do Not Reliably Create Readable Patient Education Materials for Common Orthopaedic Upper- and Lower-Extremity Conditions.ChatGPT-3.5和-4.0不能可靠地为常见的骨科上肢和下肢疾病创建可读性强的患者教育材料。

Arthrosc Sports Med Rehabil. 2024 Oct 10;7(1):101027. doi: 10.1016/j.asmr.2024.101027. eCollection 2025 Feb.

Large language models in patient education: a scoping review of applications in medicine.用于患者教育的大语言模型：医学应用的范围综述

Front Med (Lausanne). 2024 Oct 29;11:1477898. doi: 10.3389/fmed.2024.1477898. eCollection 2024.

本文引用的文献

Head-to-Head Comparison of ChatGPT Versus Google Search for Medical Knowledge Acquisition.ChatGPT与谷歌搜索在医学知识获取方面的直接比较

Otolaryngol Head Neck Surg. 2024 Jun;170(6):1484-1491. doi: 10.1002/ohn.465. Epub 2023 Aug 2.

Assessing ChatGPT Responses to Common Patient Questions Regarding Total Hip Arthroplasty.评估 ChatGPT 对全髋关节置换术常见患者问题的回答。

J Bone Joint Surg Am. 2023 Oct 4;105(19):1519-1526. doi: 10.2106/JBJS.23.00209. Epub 2023 Jul 17.

High Rates of Fabricated and Inaccurate References in ChatGPT-Generated Medical Content.ChatGPT生成的医学内容中虚假和不准确参考文献的高比例。

Cureus. 2023 May 19;15(5):e39238. doi: 10.7759/cureus.39238. eCollection 2023 May.

The promise and peril of using a large language model to obtain clinical information: ChatGPT performs strongly as a fertility counseling tool with limitations.使用大型语言模型获取临床信息的前景与风险：ChatGPT 在作为生育咨询工具方面表现强劲，但存在一定局限性。

Fertil Steril. 2023 Sep;120(3 Pt 2):575-583. doi: 10.1016/j.fertnstert.2023.05.151. Epub 2023 May 20.

ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations.医学领域的ChatGPT：其应用、优势、局限性、未来前景及伦理考量概述

Front Artif Intell. 2023 May 4;6:1169595. doi: 10.3389/frai.2023.1169595. eCollection 2023.

Aesthetic Surgery Advice and Counseling from Artificial Intelligence: A Rhinoplasty Consultation with ChatGPT.人工智能提供的美容外科建议和咨询：ChatGPT 参与的隆鼻咨询。

Aesthetic Plast Surg. 2023 Oct;47(5):1985-1993. doi: 10.1007/s00266-023-03338-7. Epub 2023 Apr 24.

Using a Google Web Search Analysis to Assess the Utility of ChatGPT in Total Joint Arthroplasty.利用谷歌网页搜索分析评估 ChatGPT 在全关节置换中的效用。

J Arthroplasty. 2023 Jul;38(7):1195-1202. doi: 10.1016/j.arth.2023.04.007. Epub 2023 Apr 10.

ChatGPT - Reshaping medical education and clinical management.ChatGPT——重塑医学教育与临床管理。

Pak J Med Sci. 2023 Mar-Apr;39(2):605-607. doi: 10.12669/pjms.39.2.7653.

Examining the Public's Most Frequently Asked Questions About Carpal Tunnel Syndrome and Appraising Online Information About Treatment.检查公众对腕管综合征最常问的问题，并评估有关治疗的在线信息。

Hand (N Y). 2024 Jul;19(5):768-775. doi: 10.1177/15589447221142895. Epub 2022 Dec 23.

Quality and reliability evaluation of online videos on carpal tunnel syndrome: a YouTube video-based study.基于YouTube视频的腕管综合征在线视频质量与可靠性评估研究

BMJ Open. 2022 Apr 15;12(4):e059239. doi: 10.1136/bmjopen-2021-059239.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验