• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

探索 ChatGPT 作为提供骨科信息的补充工具的潜力。

Exploring the potential of ChatGPT as a supplementary tool for providing orthopaedic information.

机构信息

Department of Orthopaedic Surgery, UPMC Freddie Fu Sports Medicine Center, University of Pittsburgh, Pittsburgh, USA.

Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska Academy, University of Gothenburg, Göteborgsvägen 31, 431 80, Mölndal, Sweden.

出版信息

Knee Surg Sports Traumatol Arthrosc. 2023 Nov;31(11):5190-5198. doi: 10.1007/s00167-023-07529-2. Epub 2023 Aug 8.

DOI:10.1007/s00167-023-07529-2
PMID:37553552
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10598178/
Abstract

PURPOSE

To investigate the potential use of large language models (LLMs) in orthopaedics by presenting queries pertinent to anterior cruciate ligament (ACL) surgery to generative pre-trained transformer (ChatGPT, specifically using its GPT-4 model of March 14th 2023). Additionally, this study aimed to evaluate the depth of the LLM's knowledge and investigate its adaptability to different user groups. It was hypothesized that the ChatGPT would be able to adapt to different target groups due to its strong language understanding and processing capabilities.

METHODS

ChatGPT was presented with 20 questions and response was requested for two distinct target audiences: patients and non-orthopaedic medical doctors. Two board-certified orthopaedic sports medicine surgeons and two expert orthopaedic sports medicine surgeons independently evaluated the responses generated by ChatGPT. Mean correctness, completeness, and adaptability to the target audiences (patients and non-orthopaedic medical doctors) were determined. A three-point response scale facilitated nuanced assessment.

RESULTS

ChatGPT exhibited fair accuracy, with average correctness scores of 1.69 and 1.66 (on a scale from 0, incorrect, 1, partially correct, to 2, correct) for patients and medical doctors, respectively. Three of the 20 questions (15.0%) were deemed incorrect by any of the four orthopaedic sports medicine surgeon assessors. Moreover, overall completeness was calculated to be 1.51 and 1.64 for patients and medical doctors, respectively, while overall adaptiveness was determined to be 1.75 and 1.73 for patients and doctors, respectively.

CONCLUSION

Overall, ChatGPT was successful in generating correct responses in approximately 65% of the cases related to ACL surgery. The findings of this study imply that LLMs offer potential as a supplementary tool for acquiring orthopaedic knowledge. However, although ChatGPT can provide guidance and effectively adapt to diverse target audiences, it cannot supplant the expertise of orthopaedic sports medicine surgeons in diagnostic and treatment planning endeavours due to its limited understanding of orthopaedic domains and its potential for erroneous responses.

LEVEL OF EVIDENCE

V.

摘要

目的

通过向生成式预训练转换器(ChatGPT,具体使用其 2023 年 3 月 14 日的 GPT-4 模型)提出与前交叉韧带(ACL)手术相关的查询,来探讨大型语言模型(LLM)在骨科中的潜在应用。此外,本研究旨在评估 LLM 的知识深度,并研究其对不同用户群体的适应性。假设 ChatGPT 由于其强大的语言理解和处理能力,能够适应不同的目标群体。

方法

向 ChatGPT 提出了 20 个问题,并要求其针对两个不同的目标群体(患者和非骨科医生)做出回答。两位经过董事会认证的骨科运动医学外科医生和两位专家级骨科运动医学外科医生独立评估了 ChatGPT 生成的回复。确定了针对目标群体(患者和非骨科医生)的正确性、完整性和适应性的平均值。采用三分制响应量表进行细致的评估。

结果

ChatGPT 的准确性一般,对于患者和医生的平均正确率分别为 1.69 和 1.66(0 为错误,1 为部分正确,2 为正确)。有 3 个问题(15.0%)被任何 4 位骨科运动医学外科医生评估者判定为错误。此外,患者和医生的总体完整性分别计算为 1.51 和 1.64,而患者和医生的总体适应性分别为 1.75 和 1.73。

结论

总体而言,ChatGPT 在大约 65%的 ACL 手术相关问题上成功生成了正确的回复。本研究的结果表明,LLM 作为获取骨科知识的辅助工具具有潜力。然而,尽管 ChatGPT 可以提供指导并有效地适应不同的目标群体,但由于其对骨科领域的理解有限以及可能产生错误回复,它不能替代骨科运动医学外科医生在诊断和治疗计划方面的专业知识。

证据等级

V。

相似文献

1
Exploring the potential of ChatGPT as a supplementary tool for providing orthopaedic information.探索 ChatGPT 作为提供骨科信息的补充工具的潜力。
Knee Surg Sports Traumatol Arthrosc. 2023 Nov;31(11):5190-5198. doi: 10.1007/s00167-023-07529-2. Epub 2023 Aug 8.
2
ChatGPT can yield valuable responses in the context of orthopaedic trauma surgery.ChatGPT在骨科创伤手术领域能够给出有价值的回答。
J Exp Orthop. 2024 Jun 17;11(3):e12047. doi: 10.1002/jeo2.12047. eCollection 2024 Jul.
3
Can Artificial Intelligence Pass the American Board of Orthopaedic Surgery Examination? Orthopaedic Residents Versus ChatGPT.人工智能能通过美国骨科医师学会考试吗?骨科住院医师与ChatGPT的对比。
Clin Orthop Relat Res. 2023 Aug 1;481(8):1623-1630. doi: 10.1097/CORR.0000000000002704. Epub 2023 May 23.
4
Application of generative language models to orthopaedic practice.生成式语言模型在骨科实践中的应用。
BMJ Open. 2024 Mar 14;14(3):e076484. doi: 10.1136/bmjopen-2023-076484.
5
Comparison of ChatGPT-3.5, ChatGPT-4, and Orthopaedic Resident Performance on Orthopaedic Assessment Examinations.ChatGPT-3.5、ChatGPT-4 和骨科住院医师在骨科评估考试中的表现比较。
J Am Acad Orthop Surg. 2023 Dec 1;31(23):1173-1179. doi: 10.5435/JAAOS-D-23-00396. Epub 2023 Sep 4.
6
Artificial intelligence in orthopaedics: can Chat Generative Pre-trained Transformer (ChatGPT) pass Section 1 of the Fellowship of the Royal College of Surgeons (Trauma & Orthopaedics) examination?人工智能在骨科领域的应用:ChatGPT 能否通过皇家外科学院(创伤与骨科)研究员资格 Section 1 考试?
Postgrad Med J. 2023 Sep 21;99(1176):1110-1114. doi: 10.1093/postmj/qgad053.
7
Application of ChatGPT for Orthopedic Surgeries and Patient Care.ChatGPT 在骨科手术和患者护理中的应用。
Clin Orthop Surg. 2024 Jun;16(3):347-356. doi: 10.4055/cios23181. Epub 2024 May 13.
8
Artificial Intelligence Large Language Models Address Anterior Cruciate Ligament Reconstruction: Superior Clarity and Completeness by Gemini Compared With ChatGPT-4 in Response to American Academy of Orthopaedic Surgeons Clinical Practice Guidelines.人工智能大语言模型助力前交叉韧带重建:与ChatGPT-4相比,Gemini在回应美国矫形外科医师学会临床实践指南时具有更高的清晰度和完整性。
Arthroscopy. 2025 Jun;41(6):2002-2008. doi: 10.1016/j.arthro.2024.09.020. Epub 2024 Sep 21.
9
Examining the role of ChatGPT in the management of distal radius fractures: insights into its accuracy and consistency.探讨 ChatGPT 在桡骨远端骨折管理中的作用:探究其准确性和一致性。
ANZ J Surg. 2024 Jul-Aug;94(7-8):1391-1396. doi: 10.1111/ans.19143. Epub 2024 Jul 5.
10
Triage Performance Across Large Language Models, ChatGPT, and Untrained Doctors in Emergency Medicine: Comparative Study.分诊表现比较:大型语言模型、ChatGPT 和未经训练的急诊医生:一项对比研究。
J Med Internet Res. 2024 Jun 14;26:e53297. doi: 10.2196/53297.

引用本文的文献

1
Lost in Translation: Preoperative Orthopaedic Education Materials Significantly Exceed Recommended Reading Levels.翻译失误:术前骨科教育材料的阅读水平显著超过推荐标准。
JB JS Open Access. 2025 Aug 7;10(3). doi: 10.2106/JBJS.OA.25.00143. eCollection 2025 Jul-Sep.
2
The assessment of ChatGPT-4's performance compared to expert's consensus on chronic lateral ankle instability.与专家共识相比,ChatGPT-4在慢性外侧踝关节不稳方面的性能评估。
J Exp Orthop. 2025 Aug 5;12(3):e70393. doi: 10.1002/jeo2.70393. eCollection 2025 Jul.
3
Evaluating if ChatGPT Can Answer Common Patient Questions Compared to OrthoInfo Regarding Lateral Epicondylitis.评估与OrthoInfo相比,ChatGPT能否回答有关外侧上髁炎的常见患者问题。
Iowa Orthop J. 2025;45(1):19-32.
4
A custom ChatGPT can accurately answer questions from an international expert osteotomy consensus statement.定制的ChatGPT可以准确回答来自国际专家截骨术共识声明的问题。
Eur J Orthop Surg Traumatol. 2025 Jun 16;35(1):247. doi: 10.1007/s00590-025-04373-7.
5
Evaluating Large Language Models for Preoperative Patient Education in Superior Capsular Reconstruction: Comparative Study of Claude, GPT, and Gemini.评估大语言模型在肩胛下肌上囊重建术前患者教育中的应用:Claude、GPT和Gemini的比较研究
JMIR Perioper Med. 2025 Jun 12;8:e70047. doi: 10.2196/70047.
6
Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT-4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information.评估DeepResearch和DeepThink在前交叉韧带手术患者教育中的作用:ChatGPT-4o在全面性方面表现出色,DeepSeek R1在骨科信息的清晰度和可读性方面领先。
Knee Surg Sports Traumatol Arthrosc. 2025 Jun 1. doi: 10.1002/ksa.12711.
7
Editorial - Current capacities and future possibilities of large language models in orthopaedic surgery.社论——骨科手术中大型语言模型的当前能力与未来可能性
J Exp Orthop. 2025 May 26;12(2):e70273. doi: 10.1002/jeo2.70273. eCollection 2025 Apr.
8
Evaluation of ChatGPT Responses About Sexual Activity After Total Hip Arthroplasty: A Comparative Study with Observers of Different Experience Levels.评估ChatGPT对全髋关节置换术后性活动的回答:与不同经验水平观察者的对比研究。
J Clin Med. 2025 Apr 24;14(9):2942. doi: 10.3390/jcm14092942.
9
Exploring the role of artificial intelligence in Turkish orthopedic progression exams.探索人工智能在土耳其骨科进展考试中的作用。
Acta Orthop Traumatol Turc. 2025 Mar 17;59(1):18-26. doi: 10.5152/j.aott.2025.24090.
10
Are Large Language Model-Based Chatbots Effective in Providing Reliable Medical Advice for Achilles Tendinopathy? An International Multispecialist Evaluation.基于大语言模型的聊天机器人在为跟腱病提供可靠医学建议方面是否有效?一项国际多专家评估。
Orthop J Sports Med. 2025 Apr 30;13(4):23259671251332596. doi: 10.1177/23259671251332596. eCollection 2025 Apr.

本文引用的文献

1
Can Artificial Intelligence Pass the American Board of Orthopaedic Surgery Examination? Orthopaedic Residents Versus ChatGPT.人工智能能通过美国骨科医师学会考试吗?骨科住院医师与ChatGPT的对比。
Clin Orthop Relat Res. 2023 Aug 1;481(8):1623-1630. doi: 10.1097/CORR.0000000000002704. Epub 2023 May 23.
2
Comparing Physician and Artificial Intelligence Chatbot Responses to Patient Questions Posted to a Public Social Media Forum.比较医生和人工智能聊天机器人对发布在公共社交媒体论坛上的患者问题的回复。
JAMA Intern Med. 2023 Jun 1;183(6):589-596. doi: 10.1001/jamainternmed.2023.1838.
3
ChatGPT: Is this version good for healthcare and research?ChatGPT:这个版本对医疗保健和研究有帮助吗?
Diabetes Metab Syndr. 2023 Apr;17(4):102744. doi: 10.1016/j.dsx.2023.102744. Epub 2023 Mar 15.
4
Assessing the performance of ChatGPT in answering questions regarding cirrhosis and hepatocellular carcinoma.评估 ChatGPT 在回答肝硬化和肝细胞癌相关问题方面的表现。
Clin Mol Hepatol. 2023 Jul;29(3):721-732. doi: 10.3350/cmh.2023.0089. Epub 2023 Mar 22.
5
Expanding Cosmetic Plastic Surgery Research With ChatGPT.利用 ChatGPT 拓展美容整形外科学研究。
Aesthet Surg J. 2023 Jul 15;43(8):930-937. doi: 10.1093/asj/sjad069.
6
The exciting potential for ChatGPT in obstetrics and gynecology.ChatGPT 在妇产科领域的令人兴奋的潜力。
Am J Obstet Gynecol. 2023 Jun;228(6):696-705. doi: 10.1016/j.ajog.2023.03.009. Epub 2023 Mar 15.
7
Attention is not all you need: the complicated case of ethically using large language models in healthcare and medicine.注意力并非全部所需:在医疗保健和医学中使用大型语言模型所涉及的复杂伦理问题。
EBioMedicine. 2023 Apr;90:104512. doi: 10.1016/j.ebiom.2023.104512. Epub 2023 Mar 15.
8
Consulting ChatGPT: Ethical dilemmas in language model artificial intelligence.咨询ChatGPT:语言模型人工智能中的伦理困境。
J Am Acad Dermatol. 2024 Apr;90(4):879-880. doi: 10.1016/j.jaad.2023.02.052. Epub 2023 Mar 11.
9
A deeper dive into ChatGPT: history, use and future perspectives for orthopaedic research.深入探究ChatGPT:骨科研究的历史、应用及未来展望
Knee Surg Sports Traumatol Arthrosc. 2023 Apr;31(4):1190-1192. doi: 10.1007/s00167-023-07372-5. Epub 2023 Mar 9.
10
What ChatGPT and generative AI mean for science.ChatGPT和生成式人工智能对科学意味着什么。
Nature. 2023 Feb;614(7947):214-216. doi: 10.1038/d41586-023-00340-6.