ChatGPT对前交叉韧带手术常见问题回答的准确性评估

Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery.

作者信息

Villarreal-Espinosa Juan Bernardo, Berreta Rodrigo Saad, Allende Felicitas, Garcia José Rafael, Ayala Salvador, Familiari Filippo, Chahla Jorge

机构信息

Department of Orthopedics, Rush University Medical Center, Chicago, IL, USA.

Magna Graecia University of Catanzaro, Catanzaro, Italy.

出版信息

Knee. 2024 Dec;51:84-92. doi: 10.1016/j.knee.2024.08.014. Epub 2024 Sep 5.

DOI:10.1016/j.knee.2024.08.014

PMID:39241674

Abstract

BACKGROUND

The emergence of artificial intelligence (AI) has allowed users to have access to large sources of information in a chat-like manner. Thereby, we sought to evaluate ChatGPT-4 response's accuracy to the 10 patient most frequently asked questions (FAQs) regarding anterior cruciate ligament (ACL) surgery.

METHODS

A list of the top 10 FAQs pertaining to ACL surgery was created after conducting a search through all Sports Medicine Fellowship Institutions listed on the Arthroscopy Association of North America (AANA) and American Orthopaedic Society of Sports Medicine (AOSSM) websites. A Likert scale was used to grade response accuracy by two sports medicine fellowship-trained surgeons. Cohen's kappa was used to assess inter-rater agreement. Reproducibility of the responses over time was also assessed.

RESULTS

Five of the 10 responses received a 'completely accurate' grade by two-fellowship trained surgeons with three additional replies receiving a 'completely accurate' status by at least one. Moreover, inter-rater reliability accuracy assessment revealed a moderate agreement between fellowship-trained attending physicians (weighted kappa = 0.57, 95% confidence interval 0.15-0.99). Additionally, 80% of the responses were reproducible over time.

CONCLUSION

ChatGPT can be considered an accurate additional tool to answer general patient questions regarding ACL surgery. None the less, patient-surgeon interaction should not be deferred and must continue to be the driving force for information retrieval. Thus, the general recommendation is to address any questions in the presence of a qualified specialist.

摘要

背景

人工智能（AI）的出现使用户能够以类似聊天的方式获取大量信息。因此，我们试图评估ChatGPT-4对关于前交叉韧带（ACL）手术的10个患者最常问问题（FAQ）的回答准确性。

方法

在搜索北美关节镜协会（AANA）和美国运动医学骨科协会（AOSSM）网站上列出的所有运动医学进修机构后，创建了一份与ACL手术相关的前10个常见问题列表。由两名接受过运动医学进修培训的外科医生使用李克特量表对回答准确性进行评分。使用科恩kappa系数评估评分者间的一致性。还评估了回答随时间的可重复性。

结果

10个回答中有5个被两名接受过进修培训的外科医生评为“完全准确”，另外3个回答至少被一名医生评为“完全准确”。此外，评分者间可靠性准确性评估显示，接受过进修培训的主治医生之间存在中等程度的一致性（加权kappa系数=0.57，95%置信区间0.15-0.99）。此外，80%的回答随时间具有可重复性。

结论

ChatGPT可被视为回答患者关于ACL手术一般问题的准确辅助工具。尽管如此，患者与外科医生的互动不应被推迟，并且必须继续作为信息检索的驱动力。因此，一般建议是在有资质的专家在场的情况下解答任何问题。

相似文献

Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery.ChatGPT对前交叉韧带手术常见问题回答的准确性评估

Knee. 2024 Dec;51:84-92. doi: 10.1016/j.knee.2024.08.014. Epub 2024 Sep 5.

Understanding How ChatGPT May Become a Clinical Administrative Tool Through an Investigation on the Ability to Answer Common Patient Questions Concerning Ulnar Collateral Ligament Injuries.通过对ChatGPT回答有关尺侧副韧带损伤常见患者问题能力的调查，了解其如何成为临床管理工具。

Orthop J Sports Med. 2024 Jul 31;12(7):23259671241257516. doi: 10.1177/23259671241257516. eCollection 2024 Jul.

ChatGPT-4 Performs Clinical Information Retrieval Tasks Using Consistently More Trustworthy Resources Than Does Google Search for Queries Concerning the Latarjet Procedure.对于有关拉塔热手术的查询，ChatGPT-4在执行临床信息检索任务时，使用的资源始终比谷歌搜索更可靠。

Arthroscopy. 2025 Mar;41(3):588-597. doi: 10.1016/j.arthro.2024.05.025. Epub 2024 Jun 25.

ChatGPT Provides Unsatisfactory Responses to Frequently Asked Questions Regarding Anterior Cruciate Ligament Reconstruction.ChatGPT 对前交叉韧带重建相关常见问题的回答不尽如人意。

Arthroscopy. 2024 Jul;40(7):2067-2079.e1. doi: 10.1016/j.arthro.2024.01.017. Epub 2024 Feb 2.

ChatGPT Responses to Common Questions About Anterior Cruciate Ligament Reconstruction Are Frequently Satisfactory.ChatGPT 对前交叉韧带重建常见问题的回答通常令人满意。

Arthroscopy. 2024 Jul;40(7):2058-2066. doi: 10.1016/j.arthro.2023.12.009. Epub 2024 Jan 1.

Can ChatGPT 4.0 reliably answer patient frequently asked questions about boxer's fractures?ChatGPT 4.0能否可靠地回答患者关于拳击骨折的常见问题？

Hand Surg Rehabil. 2025 Apr;44(2):102082. doi: 10.1016/j.hansur.2025.102082. Epub 2025 Jan 9.

Early Operative Versus Delayed or Nonoperative Treatment of Anterior Cruciate Ligament Injuries in Pediatric Patients.小儿前交叉韧带损伤的早期手术治疗与延迟或非手术治疗对比

J Athl Train. 2016 May;51(5):425-7. doi: 10.4085/1062-6050.51.5.11. Epub 2016 May 31.

Can ChatGPT reliably answer the most common patient questions regarding total shoulder arthroplasty?ChatGPT能否可靠地回答患者关于全肩关节置换术最常见的问题？

J Shoulder Elbow Surg. 2025 May;34(5):e254-e264. doi: 10.1016/j.jse.2024.08.025. Epub 2024 Oct 16.

ChatGPT-3.5 and -4 provide mostly accurate information when answering patients' questions relating to femoroacetabular impingement syndrome and arthroscopic hip surgery.ChatGPT-3.5和ChatGPT-4在回答患者有关股骨髋臼撞击综合征和关节镜髋关节手术的问题时，提供的信息大多是准确的。

J ISAKOS. 2025 Feb;10:100376. doi: 10.1016/j.jisako.2024.100376. Epub 2024 Dec 12.

Is ChatGPT a more academic source than google searches for patient questions about hip arthroscopy? An analysis of the most frequently asked questions.对于患者关于髋关节镜检查的问题，ChatGPT 比谷歌搜索是更具学术性的信息来源吗？对最常见问题的分析。

J ISAKOS. 2025 Jun;12:100892. doi: 10.1016/j.jisako.2025.100892. Epub 2025 May 3.

引用本文的文献

ChatGPT-4 Responses on Ankle Cartilage Surgery Often Diverge from Expert Consensus: A Comparative Analysis.ChatGPT-4对踝关节软骨手术的回答往往与专家共识存在分歧：一项比较分析。

Foot Ankle Orthop. 2025 Aug 13;10(3):24730114251352494. doi: 10.1177/24730114251352494. eCollection 2025 Jul.

Exploring ChatGPT's Efficacy in Orthopaedic Arthroplasty Questions Compared to Adult Reconstruction Surgeons.与成人重建外科医生相比，探究ChatGPT在骨科关节置换问题方面的效能。

Arthroplast Today. 2025 Jul 14;34:101772. doi: 10.1016/j.artd.2025.101772. eCollection 2025 Aug.

Artificial Intelligence in Pediatric Orthopedics: A Comprehensive Review.小儿骨科中的人工智能：全面综述

Medicina (Kaunas). 2025 May 22;61(6):954. doi: 10.3390/medicina61060954.

A custom ChatGPT can accurately answer questions from an international expert osteotomy consensus statement.定制的ChatGPT可以准确回答来自国际专家截骨术共识声明的问题。

Eur J Orthop Surg Traumatol. 2025 Jun 16;35(1):247. doi: 10.1007/s00590-025-04373-7.

Artificial intelligence-generated responses to frequently asked questions on coccydynia: Evaluating the accuracy and consistency of GPT-4o's performance.人工智能对尾骨痛常见问题的回答：评估GPT-4o表现的准确性和一致性。

Arch Rheumatol. 2025 Mar 17;40(1):63-71. doi: 10.46497/ArchRheumatol.2025.10966. eCollection 2025 Mar.

Enhancing Patient Comprehension of Glomerular Disease Treatments Using ChatGPT.使用ChatGPT提高患者对肾小球疾病治疗的理解

Healthcare (Basel). 2024 Dec 31;13(1):57. doi: 10.3390/healthcare13010057.

High accuracy but limited readability of large language model-generated responses to frequently asked questions about Kienböck's disease.大语言模型生成的对月骨缺血性坏死常见问题解答的回复准确性高但可读性有限。

BMC Musculoskelet Disord. 2024 Nov 4;25(1):879. doi: 10.1186/s12891-024-07983-0.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

ChatGPT对前交叉韧带手术常见问题回答的准确性评估

Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery.

作者信息

机构信息

出版信息

BACKGROUND

METHODS

RESULTS

CONCLUSION

背景

方法

结果

结论

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献