Accuracy assessment of ChatGPT responses to frequently asked questions regarding anterior cruciate ligament surgery.

Author information

Villarreal-Espinosa Juan Bernardo, Berreta Rodrigo Saad, Allende Felicitas, Garcia José Rafael, Ayala Salvador, Familiari Filippo, Chahla Jorge

Affiliations

Department of Orthopedics, Rush University Medical Center, Chicago, IL, USA.

Magna Graecia University of Catanzaro, Catanzaro, Italy.

Publication information

Knee. 2024 Dec;51:84-92. doi: 10.1016/j.knee.2024.08.014. Epub 2024 Sep 5.

Abstract

BACKGROUND

The emergence of artificial intelligence (AI) has given users access to large sources of information in a chat-like manner. Therefore, we sought to evaluate the accuracy of ChatGPT-4's responses to the 10 questions most frequently asked by patients (FAQs) regarding anterior cruciate ligament (ACL) surgery.

METHODS

A list of the top 10 FAQs pertaining to ACL surgery was compiled by searching all Sports Medicine Fellowship Institutions listed on the Arthroscopy Association of North America (AANA) and American Orthopaedic Society for Sports Medicine (AOSSM) websites. Two sports medicine fellowship-trained surgeons graded the accuracy of each response on a Likert scale. Cohen's kappa was used to assess inter-rater agreement. Reproducibility of the responses over time was also assessed.
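
As an illustration of the agreement statistic named above, the following is a minimal sketch of a weighted Cohen's kappa for two raters grading on an ordinal scale. The abstract does not state the weighting scheme or the number of Likert levels, so the linear weights, the four-level scale, and the ratings in the usage example are all assumptions made for illustration; they are not the study's data or method.

```python
def weighted_kappa(rater_a, rater_b, categories, weights="linear"):
    """Weighted Cohen's kappa for two raters on an ordinal scale.

    `categories` lists the scale levels in order. `weights` is either
    "linear" or "quadratic" (assumed here; the source does not say which).
    """
    if len(rater_a) != len(rater_b) or not rater_a:
        raise ValueError("rating lists must be non-empty and of equal length")
    k = len(categories)
    if k < 2:
        raise ValueError("at least two categories are required")

    index = {c: i for i, c in enumerate(categories)}
    n = len(rater_a)

    # Observed joint proportions of (rater A grade, rater B grade).
    observed = [[0.0] * k for _ in range(k)]
    for a, b in zip(rater_a, rater_b):
        observed[index[a]][index[b]] += 1.0 / n

    # Marginal proportions for each rater.
    row = [sum(observed[i]) for i in range(k)]
    col = [sum(observed[i][j] for i in range(k)) for j in range(k)]

    def disagreement(i, j):
        d = abs(i - j) / (k - 1)
        return d if weights == "linear" else d * d

    # Weighted observed disagreement vs. disagreement expected by chance.
    num = sum(disagreement(i, j) * observed[i][j]
              for i in range(k) for j in range(k))
    den = sum(disagreement(i, j) * row[i] * col[j]
              for i in range(k) for j in range(k))
    if den == 0:
        raise ValueError("kappa is undefined when both raters use a single category")
    return 1.0 - num / den


# Hypothetical grades for 10 responses on an assumed 4-level scale
# (4 = completely accurate). These are NOT the study's ratings.
surgeon_1 = [4, 4, 3, 4, 2, 4, 3, 4, 4, 3]
surgeon_2 = [4, 3, 3, 4, 3, 4, 4, 4, 4, 2]
print(weighted_kappa(surgeon_1, surgeon_2, categories=[1, 2, 3, 4]))
```

Weighted kappa gives partial credit when the two graders differ by only one level, which suits an ordinal Likert scale better than the unweighted form, where any disagreement counts in full.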

RESULTS

Five of the 10 responses were graded 'completely accurate' by both fellowship-trained surgeons, and three additional responses were graded 'completely accurate' by at least one of them. Assessment of inter-rater reliability showed moderate agreement between the two fellowship-trained attending physicians (weighted kappa = 0.57, 95% confidence interval 0.15-0.99). Additionally, 80% of the responses were reproducible over time.

CONCLUSION

ChatGPT can be considered an accurate additional tool for answering general patient questions regarding ACL surgery. Nonetheless, patient-surgeon interaction should not be deferred and must remain the driving force for information retrieval. Thus, the general recommendation is to address any questions in the presence of a qualified specialist.
