GPT-4 as a Source of Patient Information for Anterior Cervical Discectomy and Fusion: A Comparative Analysis Against Google Web Search.

Authors

Mastrokostas Paul G, Mastrokostas Leonidas E, Emara Ahmed K, Wellington Ian J, Ginalis Elizabeth, Houten John K, Khalsa Amrit S, Saleh Ahmed, Razi Afshin E, Ng Mitchell K

Affiliations

College of Medicine, State University of New York (SUNY) Downstate, Brooklyn, NY, USA.

Brooklyn College of the City University of New York, Brooklyn, NY, USA.

Publication information

Global Spine J. 2024 Nov;14(8):2389-2398. doi: 10.1177/21925682241241241. Epub 2024 Mar 21.

DOI:10.1177/21925682241241241

PMID:38513636

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11529100/

Abstract

STUDY DESIGN

Comparative study.

OBJECTIVES

This study aims to compare Google and GPT-4 in terms of (1) question types, (2) response readability, (3) source quality, and (4) numerical response accuracy for the top 10 most frequently asked questions (FAQs) about anterior cervical discectomy and fusion (ACDF).

METHODS

"Anterior cervical discectomy and fusion" was searched on Google and GPT-4 on December 18, 2023. Top 10 FAQs were classified according to the Rothwell system. Source quality was evaluated using benchmark criteria and readability was assessed using Flesch Reading Ease and Flesch-Kincaid grade level. Differences in scores, Flesch-Kincaid grade level, Flesch Reading Ease, and word count between platforms were analyzed using Student's t-tests. Statistical significance was set at the .05 level.

RESULTS

Frequently asked questions from Google were varied, while GPT-4 focused on technical details and indications/management. GPT-4 showed a higher Flesch-Kincaid grade level (12.96 vs 9.28, P = .003), lower Flesch Reading Ease score (37.07 vs 54.85, P = .005), and higher scores for source quality (3.333 vs 1.800, P = .016). Numerically, 6 out of 10 responses varied between platforms, with GPT-4 providing broader recovery timelines for ACDF.
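
For readers unfamiliar with the metrics, the two readability scores and the two-sample Student's t-test named in the Methods can be sketched in Python as below. This is a minimal illustration, not the authors' tooling: the abstract does not say which software produced the scores, the syllable counter is a crude vowel-group heuristic, and all function names are hypothetical. Scores from this sketch would therefore not necessarily match the reported values.

```python
import re
from math import sqrt
from statistics import mean, variance


def count_syllables(word: str) -> int:
    """Rough heuristic (an assumption): count vowel groups, minus a silent final 'e'."""
    word = word.lower()
    count = len(re.findall(r"[aeiouy]+", word))
    if word.endswith("e") and not word.endswith("le") and count > 1:
        count -= 1
    return max(count, 1)


def text_counts(text: str) -> tuple[int, int, int]:
    """Return (sentences, words, syllables) using simple regex tokenization."""
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]
    words = re.findall(r"[A-Za-z]+(?:'[A-Za-z]+)?", text)
    syllables = sum(count_syllables(w) for w in words)
    return len(sentences), len(words), syllables


def flesch_reading_ease(text: str) -> float:
    """Standard Flesch Reading Ease formula; higher means easier to read."""
    s, w, syl = text_counts(text)
    return 206.835 - 1.015 * (w / s) - 84.6 * (syl / w)


def flesch_kincaid_grade(text: str) -> float:
    """Standard Flesch-Kincaid formula; approximates a US school grade level."""
    s, w, syl = text_counts(text)
    return 0.39 * (w / s) + 11.8 * (syl / w) - 15.59


def student_t(a: list[float], b: list[float]) -> tuple[float, int]:
    """Two-sample Student's t statistic (pooled variance) and degrees of freedom."""
    n1, n2 = len(a), len(b)
    df = n1 + n2 - 2
    pooled_var = ((n1 - 1) * variance(a) + (n2 - 1) * variance(b)) / df
    t = (mean(a) - mean(b)) / sqrt(pooled_var * (1 / n1 + 1 / n2))
    return t, df
```

Applied to the study's design, each platform would contribute one score per FAQ answer (10 per platform), and `student_t` would compare the two lists; converting the statistic to a P value additionally requires the t distribution, which is left out here to keep the sketch within the standard library.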

CONCLUSIONS

This study demonstrates GPT-4's ability to elevate patient education by providing high-quality, diverse information tailored to those with advanced literacy levels. As AI technology evolves, refining these tools for accuracy and user-friendliness remains crucial, catering to patients' varying literacy levels and information needs in spine surgery.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ea0a/11529100/492e687ee635/10.1177_21925682241241241-fig1.jpg
