Impact of artificial intelligence in managing musculoskeletal pathologies in physiatry: a qualitative observational study evaluating the potential use of ChatGPT versus Copilot for patient information and clinical advice on low back pain.

Authors

Ah-Yan Christophe, Boissonnault Ève, Boudier-Revéret Mathieu, Mares Christopher

Affiliations

Department of Physical Medicine and Rehabilitation, University of Montreal, Montreal, QC, Canada.

Department of Physical Medicine and Rehabilitation, Centre Hospitalier de l'Université de Montréal, Montreal, QC, Canada.

Publication information

J Yeungnam Med Sci. 2025;42:11. doi: 10.12701/jyms.2024.01151. Epub 2024 Nov 29.

DOI:10.12701/jyms.2024.01151
PMID:39610054
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11812099/
Abstract

BACKGROUND

The self-management of low back pain (LBP) through patient information interventions offers significant benefits in terms of cost, reduced work absenteeism, and overall healthcare utilization. Using a large language model (LLM), such as ChatGPT (OpenAI) or Copilot (Microsoft), could potentially enhance these outcomes further. Thus, it is important to evaluate how well ChatGPT and Copilot provide medical advice for LBP and to assess the impact of clinical context on the quality of their responses.

METHODS

This was a qualitative comparative observational study. It was conducted within the Department of Physical Medicine and Rehabilitation, University of Montreal in Montreal, QC, Canada. ChatGPT and Copilot were used to answer 27 common questions related to LBP, with and without a specific clinical context. The responses were evaluated by physiatrists for validity, safety, and usefulness using a 4-point Likert scale (4, most favorable).

RESULTS

Both ChatGPT and Copilot demonstrated good performance across all measures. Validity scores were 3.33 for ChatGPT and 3.18 for Copilot, safety scores were 3.19 for ChatGPT and 3.13 for Copilot, and usefulness scores were 3.60 for ChatGPT and 3.57 for Copilot. The inclusion of clinical context did not significantly change the results.
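
As a reading aid, the reported means can be tabulated to show the size of the gap between the two models on each measure. The sketch below uses only the six mean scores stated above; the names `reported_means` and `score_gaps` are illustrative and not from the study. The abstract reports no variances or test statistics, so the gaps are purely descriptive and say nothing about statistical significance.

```python
# Mean scores on the 4-point Likert scale, as reported in the abstract.
reported_means = {
    "validity": {"ChatGPT": 3.33, "Copilot": 3.18},
    "safety": {"ChatGPT": 3.19, "Copilot": 3.13},
    "usefulness": {"ChatGPT": 3.60, "Copilot": 3.57},
}


def score_gaps(means):
    """Return ChatGPT minus Copilot for each measure, rounded to 2 decimals."""
    return {
        measure: round(scores["ChatGPT"] - scores["Copilot"], 2)
        for measure, scores in means.items()
    }


gaps = score_gaps(reported_means)
for measure, gap in gaps.items():
    # Descriptive difference only; no inference about significance.
    print(f"{measure}: ChatGPT - Copilot = {gap:+.2f}")
```

On these figures the largest gap is on validity (0.15 points) and the smallest on usefulness (0.03 points), both small relative to the 4-point scale.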

CONCLUSION

LLMs, such as ChatGPT and Copilot, can provide reliable medical advice on LBP, irrespective of the detailed clinical context, supporting their potential to aid in patient self-management.

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8c44/11812099/03ce7a70ccc5/jyms-2024-01151f1.jpg

Similar articles

1
Impact of artificial intelligence in managing musculoskeletal pathologies in physiatry: a qualitative observational study evaluating the potential use of ChatGPT versus Copilot for patient information and clinical advice on low back pain.
J Yeungnam Med Sci. 2025;42:11. doi: 10.12701/jyms.2024.01151. Epub 2024 Nov 29.
2
Proficiency, Clarity, and Objectivity of Large Language Models Versus Specialists' Knowledge on COVID-19's Impacts in Pregnancy: Cross-Sectional Pilot Study.
JMIR Form Res. 2025 Feb 5;9:e56126. doi: 10.2196/56126.
3
Assessing the Responses of Large Language Models (ChatGPT-4, Gemini, and Microsoft Copilot) to Frequently Asked Questions in Breast Imaging: A Study on Readability and Accuracy.
Cureus. 2024 May 9;16(5):e59960. doi: 10.7759/cureus.59960. eCollection 2024 May.
4
Evaluating LLM-based generative AI tools in emergency triage: A comparative study of ChatGPT Plus, Copilot Pro, and triage nurses.
Am J Emerg Med. 2025 Mar;89:174-181. doi: 10.1016/j.ajem.2024.12.024. Epub 2024 Dec 19.
5
Evaluating the reliability of the responses of large language models to keratoconus-related questions.
Clin Exp Optom. 2024 Oct 24:1-8. doi: 10.1080/08164622.2024.2419524.
6
Using large language models (ChatGPT, Copilot, PaLM, Bard, and Gemini) in Gross Anatomy course: Comparative analysis.
Clin Anat. 2025 Mar;38(2):200-210. doi: 10.1002/ca.24244. Epub 2024 Nov 21.
7
Microsoft Copilot Provides More Accurate and Reliable Information About Anterior Cruciate Ligament Injury and Repair Than ChatGPT and Google Gemini; However, No Resource Was Overall the Best.
Arthrosc Sports Med Rehabil. 2024 Nov 19;7(2):101043. doi: 10.1016/j.asmr.2024.101043. eCollection 2025 Apr.
8
Can Artificial Intelligence Language Models Effectively Address Dental Trauma Questions?
Dent Traumatol. 2025 Apr 1. doi: 10.1111/edt.13063.
9
AI in Home Care-Evaluation of Large Language Models for Future Training of Informal Caregivers: Observational Comparative Case Study.
J Med Internet Res. 2025 Apr 28;27:e70703. doi: 10.2196/70703.
10
Can AI Answer My Questions? Utilizing Artificial Intelligence in the Perioperative Assessment for Abdominoplasty Patients.
Aesthetic Plast Surg. 2024 Nov;48(22):4712-4724. doi: 10.1007/s00266-024-04157-0. Epub 2024 Jun 19.

Cited by

1
To Self-Treat or Not to Self-Treat: Evaluating the Diagnostic, Advisory and Referral Effectiveness of ChatGPT Responses to the Most Common Musculoskeletal Disorders.
Diagnostics (Basel). 2025 Jul 21;15(14):1834. doi: 10.3390/diagnostics15141834.
