评估ChatGPT对物理治疗临床问题的回答及参考文献的准确性。

Evaluation of the accuracy of ChatGPT's responses to and references for clinical questions in physical therapy.

作者信息

Sawamura Shogo, Bito Takanobu, Ando Takahiro, Masuda Kento, Kameyama Sakiko, Ishida Hiroyasu

机构信息

Department of Rehabilitation, Heisei College of Health Sciences: 180 Kurono, Gifu City, Gifu 501-1131, Japan.

Department of Rehabilitation, Gifu University Hospital, Japan.

出版信息

J Phys Ther Sci. 2024 May;36(5):234-239. doi: 10.1589/jpts.36.234. Epub 2024 May 1.

DOI:10.1589/jpts.36.234

PMID:38694019

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11060764/

Abstract

[Purpose] This study evaluated the accuracy of ChatGPT's responses to and references for five clinical questions in physical therapy based on the and assessed this language model's potential as a tool for supporting clinical decision-making in the rehabilitation field. [Participants and Methods] Five clinical questions from the "Stroke", "Musculoskeletal disorders", and "Internal disorders" sections of the , released by the Japanese Society of Physical Therapy, were presented to ChatGPT. ChatGPT was instructed to provide responses in Japanese accompanied by references such as PubMed IDs or digital object identifiers. The accuracy of the generated content and references was evaluated by two assessors with expertise in their respective sections by using a 4-point scale, and comments were provided for point deductions. The inter-rater agreement was evaluated using weighted kappa coefficients. [Results] ChatGPT demonstrated adequate accuracy in generating content for clinical questions in physical therapy. However, the accuracy of the references was poor, with a significant number of references being non-existent or misinterpreted. [Conclusion] ChatGPT has limitations in reference selection and reliability. While ChatGPT can offer accurate responses to clinical questions in physical therapy, it should be used with caution because it is not a completely reliable model.

摘要

[目的] 本研究基于《物理治疗临床实践指南》评估了ChatGPT对物理治疗中五个临床问题的回答及参考文献的准确性，并评估了该语言模型作为支持康复领域临床决策工具的潜力。[参与者与方法] 向ChatGPT提出了日本物理治疗学会发布的《物理治疗临床实践指南》中“中风”“肌肉骨骼疾病”和“内科疾病”部分的五个临床问题。要求ChatGPT用日语提供回答，并附上诸如PubMed ID或数字对象标识符等参考文献。由两名在各自领域具有专业知识的评估人员使用4分制对生成内容和参考文献的准确性进行评估，并对扣分情况给出评论。使用加权kappa系数评估评分者间的一致性。[结果] ChatGPT在生成物理治疗临床问题的内容方面表现出足够的准确性。然而，参考文献的准确性较差，大量参考文献不存在或被错误解读。[结论] ChatGPT在参考文献选择和可靠性方面存在局限性。虽然ChatGPT可以对物理治疗中的临床问题提供准确回答，但由于它不是一个完全可靠的模型，应谨慎使用。

相似文献

Evaluation of the accuracy of ChatGPT's responses to and references for clinical questions in physical therapy.

J Phys Ther Sci. 2024 May;36(5):234-239. doi: 10.1589/jpts.36.234. Epub 2024 May 1.

ChatGPT's performance in German OB/GYN exams - paving the way for AI-enhanced medical education and clinical practice.

Front Med (Lausanne). 2023 Dec 13;10:1296615. doi: 10.3389/fmed.2023.1296615. eCollection 2023.

Performance of ChatGPT on the Chinese Postgraduate Examination for Clinical Medicine: Survey Study.

JMIR Med Educ. 2024 Feb 9;10:e48514. doi: 10.2196/48514.

ChatGPT's adherence to otolaryngology clinical practice guidelines.

Eur Arch Otorhinolaryngol. 2024 Jul;281(7):3829-3834. doi: 10.1007/s00405-024-08634-9. Epub 2024 Apr 22.

Evaluating ChatGPT's effectiveness and tendencies in Japanese internal medicine.

J Eval Clin Pract. 2024 Sep;30(6):1017-1023. doi: 10.1111/jep.14011. Epub 2024 May 19.

Exploring the role of an artificial intelligence chatbot on appendicitis management: an experimental study on ChatGPT.

ANZ J Surg. 2024 Mar;94(3):342-352. doi: 10.1111/ans.18736. Epub 2023 Oct 19.

ChatGPT's Performance in Cardiac Arrest and Bradycardia Simulations Using the American Heart Association's Advanced Cardiovascular Life Support Guidelines: Exploratory Study.

J Med Internet Res. 2024 Apr 22;26:e55037. doi: 10.2196/55037.

ChatGPT versus NASS clinical guidelines for degenerative spondylolisthesis: a comparative analysis.

Eur Spine J. 2024 Nov;33(11):4182-4203. doi: 10.1007/s00586-024-08198-6. Epub 2024 Mar 15.

ChatGPT in radiology: A systematic review of performance, pitfalls, and future perspectives.

Diagn Interv Imaging. 2024 Jul-Aug;105(7-8):251-265. doi: 10.1016/j.diii.2024.04.003. Epub 2024 Apr 27.

Exploring the Boundaries of Reality: Investigating the Phenomenon of Artificial Intelligence Hallucination in Scientific Writing Through ChatGPT References.

Cureus. 2023 Apr 11;15(4):e37432. doi: 10.7759/cureus.37432. eCollection 2023 Apr.

引用本文的文献

Artificial intelligence in maxillofacial trauma: expert ally or unreliable assistant?

Med Oral Patol Oral Cir Bucal. 2025 Sep 1;30(5):e751-e757. doi: 10.4317/medoral.27229.

A cross-sectional study on ChatGPT's alignment with clinical practice guidelines in musculoskeletal rehabilitation.

BMC Musculoskelet Disord. 2025 Apr 24;26(1):411. doi: 10.1186/s12891-025-08650-8.

本文引用的文献

ChatGPT makes medicine easy to swallow: an exploratory case study on simplified radiology reports.

Eur Radiol. 2024 May;34(5):2817-2825. doi: 10.1007/s00330-023-10213-1. Epub 2023 Oct 5.

Assessing the Utility of ChatGPT Throughout the Entire Clinical Workflow: Development and Usability Study.

J Med Internet Res. 2023 Aug 22;25:e48659. doi: 10.2196/48659.

Chatbot vs Medical Student Performance on Free-Response Clinical Reasoning Examinations.

JAMA Intern Med. 2023 Sep 1;183(9):1028-1030. doi: 10.1001/jamainternmed.2023.2909.

Practical Applications of ChatGPT in Undergraduate Medical Education.

J Med Educ Curric Dev. 2023 May 24;10:23821205231178449. doi: 10.1177/23821205231178449. eCollection 2023 Jan-Dec.

Capacity of ChatGPT to Identify Guideline-Based Treatments for Advanced Solid Tumors.

Cureus. 2023 Apr 21;15(4):e37938. doi: 10.7759/cureus.37938. eCollection 2023 Apr.

ChatGPT in medicine: an overview of its applications, advantages, limitations, future prospects, and ethical considerations.

Front Artif Intell. 2023 May 4;6:1169595. doi: 10.3389/frai.2023.1169595. eCollection 2023.

Translating radiology reports into plain language using ChatGPT and GPT-4 with prompt learning: results, limitations, and potential.

Vis Comput Ind Biomed Art. 2023 May 18;6(1):9. doi: 10.1186/s42492-023-00136-5.

Evaluation of ChatGPT's Capabilities in Medical Report Generation.

Cureus. 2023 Apr 14;15(4):e37589. doi: 10.7759/cureus.37589. eCollection 2023 Apr.

User Intentions to Use ChatGPT for Self-Diagnosis and Health-Related Purposes: Cross-sectional Survey Study.

JMIR Hum Factors. 2023 May 17;10:e47564. doi: 10.2196/47564.

Exploring the Boundaries of Reality: Investigating the Phenomenon of Artificial Intelligence Hallucination in Scientific Writing Through ChatGPT References.

Cureus. 2023 Apr 11;15(4):e37432. doi: 10.7759/cureus.37432. eCollection 2023 Apr.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

评估ChatGPT对物理治疗临床问题的回答及参考文献的准确性。

Evaluation of the accuracy of ChatGPT's responses to and references for clinical questions in physical therapy.

作者信息

Sawamura Shogo, Bito Takanobu, Ando Takahiro, Masuda Kento, Kameyama Sakiko, Ishida Hiroyasu

机构信息

Department of Rehabilitation, Heisei College of Health Sciences: 180 Kurono, Gifu City, Gifu 501-1131, Japan.

Department of Rehabilitation, Gifu University Hospital, Japan.

出版信息

J Phys Ther Sci. 2024 May;36(5):234-239. doi: 10.1589/jpts.36.234. Epub 2024 May 1.

DOI:10.1589/jpts.36.234

PMID:38694019

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11060764/

Abstract

摘要

评估ChatGPT对物理治疗临床问题的回答及参考文献的准确性。

Evaluation of the accuracy of ChatGPT's responses to and references for clinical questions in physical therapy.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

评估ChatGPT对物理治疗临床问题的回答及参考文献的准确性。

Evaluation of the accuracy of ChatGPT's responses to and references for clinical questions in physical therapy.

作者信息

机构信息

出版信息