Özcan Fatma, Örücü Atar Merve, Köroğlu Özlem, Yılmaz Bilge
Department of Physical Medicine and Rehabilitation, University of Health Sciences, Gaziler Physical Medicine and Rehabilitation Training and Research Hospital, Ankara, Turkey.
J Spinal Cord Med. 2025 Sep;48(5):852-857. doi: 10.1080/10790268.2024.2361551. Epub 2024 Jun 11.
The use of artificial intelligence chatbots to obtain information about patients' diseases is increasing. This study aimed to determine the reliability and usability of ChatGPT for spinal cord injury-related questions.
Three raters simultaneously evaluated ChatGPT's answers to a total of 47 questions, grouped under the three most frequently searched keywords in Google Trends ('general information', 'complications', and 'treatment'), rating each answer for reliability and usability on a 7-point Likert scale.
Inter-rater Cronbach α scores indicated substantial agreement for both reliability and usability (α ranged from 0.558 to 0.839 for reliability and from 0.373 to 0.772 for usability). The highest mean reliability score was for 'complications' (mean 5.38), and the lowest was for the 'general information' section (mean 4.20). The 'treatment' section had the highest mean usability score (mean 5.87), while the lowest was again recorded for the 'general information' section (mean 4.80).
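For readers unfamiliar with the statistic, the inter-rater consistency reported above follows the standard Cronbach's α formula. The minimal sketch below uses synthetic 7-point Likert ratings from three hypothetical raters, not the study's data, purely to illustrate how such a coefficient is computed.

```python
import numpy as np

def cronbach_alpha(scores: np.ndarray) -> float:
    """Cronbach's alpha for a (questions x raters) matrix of Likert scores."""
    k = scores.shape[1]                          # number of raters
    item_vars = scores.var(axis=0, ddof=1)       # variance of each rater's scores
    total_var = scores.sum(axis=1).var(ddof=1)   # variance of the summed scores per question
    return (k / (k - 1)) * (1 - item_vars.sum() / total_var)

# Illustrative (synthetic) ratings: five questions scored by three raters on a 1-7 scale
ratings = np.array([
    [5, 6, 5],
    [4, 4, 5],
    [6, 6, 7],
    [3, 4, 4],
    [5, 5, 6],
])
print(round(cronbach_alpha(ratings), 3))
```

Values closer to 1 indicate that the raters scored the answers more consistently; the study's reported ranges (0.558–0.839 and 0.373–0.772) were interpreted by the authors as substantial agreement.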
The answers given by ChatGPT to questions related to spinal cord injury were reliable and useful. Nevertheless, it should be kept in mind that ChatGPT may provide incorrect or incomplete information, especially in the 'general information' section, which may mislead patients and their relatives.