• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

人工智能大语言模型在膝关节骨关节炎个性化康复计划中的作用:一项观察性研究。

The Role of Artificial Intelligence Large Language Models in Personalized Rehabilitation Programs for Knee Osteoarthritis: An Observational Study.

作者信息

Gürses Ömer Alperen, Özüdoğru Anıl, Tuncay Figen, Kararti Caner

机构信息

School of Physical Therapy and Rehabilitation, Department of Physiotherapy and Rehabilitation, Kırşehir Ahi Evran University, Merkez, Kırşehir, 40100, Türkiye.

Faculty of Medicine, Department of Physical Medicine and Rehabilitation, Kırşehir Ahi Evran University, Merkez, Kırşehir, 40100, Türkiye.

出版信息

J Med Syst. 2025 Jun 3;49(1):73. doi: 10.1007/s10916-025-02207-x.

DOI:10.1007/s10916-025-02207-x
PMID:40459660
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12134017/
Abstract

BACKGROUND

Large language models (LLMs) can contribute to treatment options and outcomes by assisting physiotherapists for conditions like osteoarthritis.

AIMS

The objective of this early-stage cross-sectional study is to assess the alignment of large language models with physiotherapists in designing physiotherapy and rehabilitation programs for knee osteoarthritis.

METHODS

Forty patients diagnosed with knee osteoarthritis were assessed using standardized clinical criteria. For each patient, individualized rehabilitation programs were created by three physiotherapists and by ChatGPT-4o and Gemini Advanced using structured prompts. The presence or absence of 50 clinically relevant rehabilitation parameters was recorded for each program. Chi-square tests were used to evaluate agreement rates between the LLMs and the physiotherapist-generated Consensus programs.

RESULTS

ChatGPT-4o achieved a 74% agreement rate with the physiotherapists' Consensus programs, while Gemini Advanced achieved 70%. Although both models showed high compatibility with general rehabilitation components, they demonstrated notable limitations in exercise specificity, including frequency, sets, and progression criteria. ChatGPT-4o performed as well as or better than Gemini in most phases, particularly in Phase 3, while Gemini showed lower consistency in balance and stabilization parameters.

CONCLUSIONS

ChatGPT-4o and Gemini Advanced demonstrate promising potential in generating personalized rehabilitation programs for knee osteoarthritis. While their outputs generally align with expert recommendations, notable gaps remain in clinical reasoning and the provision of detailed exercise parameters. These findings underscore the importance of ongoing model refinement and the necessity of expert supervision for safe and effective clinical integration.

摘要

背景

大语言模型(LLMs)可以通过协助物理治疗师治疗骨关节炎等病症,为治疗方案和结果做出贡献。

目的

这项早期横断面研究的目的是评估大语言模型在为膝关节骨关节炎设计物理治疗和康复计划方面与物理治疗师的契合度。

方法

使用标准化临床标准对40名被诊断为膝关节骨关节炎的患者进行评估。对于每位患者,由三名物理治疗师以及ChatGPT-4o和Gemini Advanced使用结构化提示创建个性化康复计划。记录每个计划中50个临床相关康复参数的有无。使用卡方检验评估大语言模型与物理治疗师生成的共识计划之间的一致率。

结果

ChatGPT-4o与物理治疗师的共识计划达成了74%的一致率,而Gemini Advanced达成了70%。尽管两个模型在一般康复组成部分方面都显示出高度兼容性,但它们在运动特异性方面表现出明显局限性,包括频率、组数和进展标准。ChatGPT-4o在大多数阶段的表现与Gemini相当或更好,特别是在第3阶段,而Gemini在平衡和稳定参数方面的一致性较低。

结论

ChatGPT-4o和Gemini Advanced在为膝关节骨关节炎生成个性化康复计划方面显示出有前景的潜力。虽然它们的输出总体上与专家建议一致,但在临床推理和提供详细运动参数方面仍存在明显差距。这些发现强调了持续改进模型的重要性以及专家监督对于安全有效的临床整合的必要性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afd1/12134017/b10fef16ae12/10916_2025_2207_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afd1/12134017/b10fef16ae12/10916_2025_2207_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/afd1/12134017/b10fef16ae12/10916_2025_2207_Fig1_HTML.jpg

相似文献

1
The Role of Artificial Intelligence Large Language Models in Personalized Rehabilitation Programs for Knee Osteoarthritis: An Observational Study.人工智能大语言模型在膝关节骨关节炎个性化康复计划中的作用:一项观察性研究。
J Med Syst. 2025 Jun 3;49(1):73. doi: 10.1007/s10916-025-02207-x.
2
Comparing Artificial Intelligence-Generated and Clinician-Created Personalized Self-Management Guidance for Patients With Knee Osteoarthritis: Blinded Observational Study.比较人工智能生成与临床医生创建的针对膝骨关节炎患者的个性化自我管理指导:盲法观察研究。
J Med Internet Res. 2025 May 7;27:e67830. doi: 10.2196/67830.
3
Enhancing responses from large language models with role-playing prompts: a comparative study on answering frequently asked questions about total knee arthroplasty.通过角色扮演提示增强大语言模型的回答:关于全膝关节置换术常见问题解答的比较研究
BMC Med Inform Decis Mak. 2025 May 23;25(1):196. doi: 10.1186/s12911-025-03024-5.
4
Comparative Analysis of ChatGPT-4o and Gemini Advanced Performance on Diagnostic Radiology In-Training Exams.ChatGPT-4o与Gemini在放射诊断学培训考试中的性能对比分析
Cureus. 2025 Mar 20;17(3):e80874. doi: 10.7759/cureus.80874. eCollection 2025 Mar.
5
Comparative analysis of ChatGPT-4o mini, ChatGPT-4o and Gemini Advanced in the treatment of postmenopausal osteoporosis.ChatGPT-4o mini、ChatGPT-4o与Gemini Advanced在绝经后骨质疏松症治疗中的对比分析。
BMC Musculoskelet Disord. 2025 Apr 16;26(1):369. doi: 10.1186/s12891-025-08601-3.
6
ChatGPT-4o outperforms gemini advanced in assisting multidisciplinary decision-making for advanced gastric cancer.ChatGPT-4o在协助晚期胃癌的多学科决策方面优于Gemini Advanced。
Eur J Surg Oncol. 2025 Apr 24;51(8):110096. doi: 10.1016/j.ejso.2025.110096.
7
Evaluation of a Novel e-Learning Program for Physiotherapists to Manage Knee Osteoarthritis via Telehealth: Qualitative Study Nested in the PEAK (Physiotherapy Exercise and Physical Activity for Knee Osteoarthritis) Randomized Controlled Trial.通过远程医疗管理膝骨关节炎的新型电子学习计划对物理治疗师的评估:嵌套在 PEAK(膝骨关节炎的物理治疗运动和体育活动)随机对照试验中的定性研究。
J Med Internet Res. 2021 Apr 30;23(4):e25872. doi: 10.2196/25872.
8
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
9
Evaluating the Potential of Large Language Models for Vestibular Rehabilitation Education: A Comparison of ChatGPT, Google Gemini, and Clinicians.评估大语言模型用于前庭康复教育的潜力:ChatGPT、谷歌Gemini与临床医生的比较
Phys Ther. 2025 Apr 2;105(4). doi: 10.1093/ptj/pzaf010.
10
Performance of three artificial intelligence (AI)-based large language models in standardized testing; implications for AI-assisted dental education.三种基于人工智能(AI)的大语言模型在标准化测试中的表现;对人工智能辅助牙科教育的启示。
J Periodontal Res. 2025 Feb;60(2):121-133. doi: 10.1111/jre.13323. Epub 2024 Jul 18.

引用本文的文献

1
Artificial intelligence in personalized rehabilitation: current applications and a SWOT analysis.个性化康复中的人工智能:当前应用及SWOT分析
Front Digit Health. 2025 Jul 24;7:1606088. doi: 10.3389/fdgth.2025.1606088. eCollection 2025.

本文引用的文献

1
Evaluating large language model workflows in clinical decision support for triage and referral and diagnosis.评估用于分诊、转诊和诊断的临床决策支持中的大语言模型工作流程。
NPJ Digit Med. 2025 May 9;8(1):263. doi: 10.1038/s41746-025-01684-1.
2
Evaluating the Potential of Large Language Models for Vestibular Rehabilitation Education: A Comparison of ChatGPT, Google Gemini, and Clinicians.评估大语言模型用于前庭康复教育的潜力:ChatGPT、谷歌Gemini与临床医生的比较
Phys Ther. 2025 Apr 2;105(4). doi: 10.1093/ptj/pzaf010.
3
Assessing the performance of AI chatbots in answering patients' common questions about low back pain.
评估人工智能聊天机器人回答患者关于腰痛常见问题的表现。
Ann Rheum Dis. 2025 Jan;84(1):143-149. doi: 10.1136/ard-2024-226202. Epub 2025 Jan 2.
4
Large language models' performances regarding common patient questions about osteoarthritis: A comparative analysis of ChatGPT-3.5, ChatGPT-4.0, and Perplexity.大语言模型在关于骨关节炎的常见患者问题上的表现:ChatGPT-3.5、ChatGPT-4.0和Perplexity的比较分析
J Sport Health Sci. 2024 Nov 28;14:101016. doi: 10.1016/j.jshs.2024.101016.
5
Performance of ChatGPT-3.5 and ChatGPT-4o in the Japanese National Dental Examination.ChatGPT-3.5和ChatGPT-4o在日本国家牙科考试中的表现。
J Dent Educ. 2025 Apr;89(4):459-466. doi: 10.1002/jdd.13766. Epub 2024 Nov 13.
6
Is the information provided by large language models valid in educating patients about adolescent idiopathic scoliosis? An evaluation of content, clarity, and empathy : The perspective of the European Spine Study Group.大语言模型提供的信息在对患者进行青少年特发性脊柱侧凸教育方面是否有效?内容、清晰度和同理心的评估:欧洲脊柱研究小组的观点
Spine Deform. 2025 Mar;13(2):361-372. doi: 10.1007/s43390-024-00955-3. Epub 2024 Nov 4.
7
ChatGPT-4 and wearable device assisted Intelligent Exercise Therapy for co-existing Sarcopenia and Osteoarthritis (GAISO): a feasibility study and design for a randomized controlled PROBE non-inferiority trial.ChatGPT-4 和可穿戴设备辅助的肌少症和骨关节炎并存的智能运动治疗(GAISO):一项随机对照 PROBE 非劣效试验的可行性研究和设计。
J Orthop Surg Res. 2024 Oct 8;19(1):635. doi: 10.1186/s13018-024-05134-8.
8
Comparative Study to Evaluate the Accuracy of Differential Diagnosis Lists Generated by Gemini Advanced, Gemini, and Bard for a Case Report Series Analysis: Cross-Sectional Study.评估Gemini Advanced、Gemini和Bard生成的鉴别诊断列表准确性的比较研究:用于病例报告系列分析的横断面研究。
JMIR Med Inform. 2024 Oct 2;12:e63010. doi: 10.2196/63010.
9
Comparative performance of artificial intelligence models in rheumatology board-level questions: evaluating Google Gemini and ChatGPT-4o.人工智能模型在风湿病委员会级问题中的比较性能:评估 Google Gemini 和 ChatGPT-4o。
Clin Rheumatol. 2024 Nov;43(11):3507-3513. doi: 10.1007/s10067-024-07154-5. Epub 2024 Sep 28.
10
Currently Available Large Language Models Do Not Provide Musculoskeletal Treatment Recommendations That Are Concordant With Evidence-Based Clinical Practice Guidelines.目前可用的大语言模型并未提供与循证临床实践指南相一致的肌肉骨骼治疗建议。
Arthroscopy. 2025 Feb;41(2):263-275.e6. doi: 10.1016/j.arthro.2024.07.040. Epub 2024 Aug 22.