• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估大语言模型在强直性脊柱炎/脊柱关节炎患者健康教育中的表现:一项在中国进行的横断面单盲研究。

Evaluating the performance of large language models in health education for patients with ankylosing spondylitis/spondyloarthritis: a cross-sectional, single-blind study in China.

作者信息

Ren Yong, Kang Yue-Ning, Cao Shuang-Yan, Meng Fanxuan, Zhang Jingyu, Liao Ruyi, Li Xiaomin, Chen Yuling, Wen Ya, Wu Jiayun, Xia Wenqi, Xu Liling, Wen Shenghui, Liu Huifen, Li Yuanqing, Gu Jieruo, Lv Qing

机构信息

Pazhou Lab, Guangzhou, Guangdong, China.

The Seventh Affiliated Hospital of Sun Yat-sen University, Shenzhen, Guangdong, China.

出版信息

BMJ Open. 2025 Mar 21;15(3):e097528. doi: 10.1136/bmjopen-2024-097528.

DOI:10.1136/bmjopen-2024-097528
PMID:40118477
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11931893/
Abstract

OBJECTIVES

To evaluate the potential of large language models (LLMs) in health education for patients with ankylosing spondylitis (AS)/spondyloarthritis (SpA), focusing on the accuracy of information transmission, patient acceptance and performance differences between different models.

DESIGN

Cross-sectional, single-blind study.

SETTING

Multiple centres in China.

PARTICIPANTS

182 volunteers, including 4 rheumatologists and 178 patients with AS/SpA.

PRIMARY AND SECONDARY OUTCOME MEASURES

Scientificity, precision and accessibility of the content of the answers provided by LLMs; patient acceptance of the answers.

RESULTS

LLMs performed well in terms of scientificity, precision and accessibility, with ChatGPT-4o and Kimi models outperforming traditional guidelines. Most patients with AS/SpA showed a higher level of understanding and acceptance of the responses from LLMs.

CONCLUSIONS

LLMs have significant potential in medical knowledge transmission and patient education, making them promising tools for future medical practice.

摘要

目的

评估大语言模型(LLMs)在强直性脊柱炎(AS)/脊柱关节炎(SpA)患者健康教育中的潜力,重点关注信息传递的准确性、患者接受度以及不同模型之间的性能差异。

设计

横断面单盲研究。

地点

中国多个中心。

参与者

182名志愿者,包括4名风湿病学家和178名AS/SpA患者。

主要和次要结局指标

大语言模型提供答案内容的科学性、精确性和可及性;患者对答案的接受度。

结果

大语言模型在科学性、精确性和可及性方面表现良好,ChatGPT-4o和Kimi模型优于传统指南。大多数AS/SpA患者对大语言模型的回答表现出更高的理解和接受程度。

结论

大语言模型在医学知识传播和患者教育方面具有巨大潜力,使其成为未来医学实践中有前景的工具。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/9d4bcea80a18/bmjopen-15-3-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/bf12ceba00df/bmjopen-15-3-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/d00e16cf4b2b/bmjopen-15-3-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/9d4bcea80a18/bmjopen-15-3-g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/bf12ceba00df/bmjopen-15-3-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/d00e16cf4b2b/bmjopen-15-3-g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/809f/11931893/9d4bcea80a18/bmjopen-15-3-g003.jpg

相似文献

1
Evaluating the performance of large language models in health education for patients with ankylosing spondylitis/spondyloarthritis: a cross-sectional, single-blind study in China.评估大语言模型在强直性脊柱炎/脊柱关节炎患者健康教育中的表现:一项在中国进行的横断面单盲研究。
BMJ Open. 2025 Mar 21;15(3):e097528. doi: 10.1136/bmjopen-2024-097528.
2
Evaluating the Effectiveness of Large Language Models in Providing Patient Education for Chinese Patients With Ocular Myasthenia Gravis: Mixed Methods Study.评估大语言模型为中国重症肌无力性眼病患者提供患者教育的有效性:混合方法研究
J Med Internet Res. 2025 Apr 10;27:e67883. doi: 10.2196/67883.
3
Patient- and clinician-based evaluation of large language models for patient education in prostate cancer radiotherapy.基于患者和临床医生的大语言模型在前列腺癌放疗患者教育中的评估
Strahlenther Onkol. 2025 Mar;201(3):333-342. doi: 10.1007/s00066-024-02342-3. Epub 2025 Jan 10.
4
Large Language Models as a Consulting Hotline for Patients With Breast Cancer and Specialists in China: Cross-Sectional Questionnaire Study.在中国,大型语言模型作为乳腺癌患者和专科医生的咨询热线:横断面问卷调查研究
JMIR Med Inform. 2025 May 27;13:e66429. doi: 10.2196/66429.
5
Evaluating text and visual diagnostic capabilities of large language models on questions related to the Breast Imaging Reporting and Data System Atlas 5 edition.评估大语言模型在与《乳腺影像报告和数据系统》第5版相关问题上的文本和视觉诊断能力。
Diagn Interv Radiol. 2025 Mar 3;31(2):111-129. doi: 10.4274/dir.2024.242876. Epub 2024 Sep 9.
6
Do large language model chatbots perform better than established patient information resources in answering patient questions? A comparative study on melanoma.在回答患者问题方面,大型语言模型聊天机器人的表现是否优于成熟的患者信息资源?一项关于黑色素瘤的比较研究。
Br J Dermatol. 2025 Jan 24;192(2):306-315. doi: 10.1093/bjd/ljae377.
7
Is the information provided by large language models valid in educating patients about adolescent idiopathic scoliosis? An evaluation of content, clarity, and empathy : The perspective of the European Spine Study Group.大语言模型提供的信息在对患者进行青少年特发性脊柱侧凸教育方面是否有效?内容、清晰度和同理心的评估:欧洲脊柱研究小组的观点
Spine Deform. 2025 Mar;13(2):361-372. doi: 10.1007/s43390-024-00955-3. Epub 2024 Nov 4.
8
Performance of ChatGPT on Nursing Licensure Examinations in the United States and China: Cross-Sectional Study.ChatGPT 在中美护理执照考试中的表现:横断面研究。
JMIR Med Educ. 2024 Oct 3;10:e52746. doi: 10.2196/52746.
9
[Medical care situation of patients with ankylosing spondylitis and psoriatic arthritis in Germany : Medical care situation of patients with spondyloarthritis (SpA): ankylosing spondylitis (AS) and psoriatic arthritis (PsA) from the perspective of rheumatologists in private practice and hospitals in Germany-Results of the research project "SpA Loop-Life of Outpatients"].德国强直性脊柱炎和银屑病关节炎患者的医疗状况:脊柱关节炎(SpA)患者的医疗状况:从德国私人诊所和医院的风湿病学家角度看强直性脊柱炎(AS)和银屑病关节炎(PsA)——“SpA Loop - 门诊患者生活”研究项目的结果
Z Rheumatol. 2019 May;78(4):372-381. doi: 10.1007/s00393-019-0619-6.
10
Evaluating large language models as patient education tools for inflammatory bowel disease: A comparative study.评估大型语言模型作为炎症性肠病患者教育工具的效果:一项比较研究。
World J Gastroenterol. 2025 Feb 14;31(6):102090. doi: 10.3748/wjg.v31.i6.102090.

引用本文的文献

1
Evaluation of the Performance of Large Language Models in the Management of Axial Spondyloarthropathy: Analysis of EULAR 2022 Recommendations.大型语言模型在轴性脊柱关节炎管理中的性能评估:对欧洲抗风湿病联盟2022年建议的分析
Diagnostics (Basel). 2025 Jun 7;15(12):1455. doi: 10.3390/diagnostics15121455.
2
Assessing the causal relationship between obesity and ankylosing spondylitis: A two-sample Mendelian randomization study.评估肥胖与强直性脊柱炎之间的因果关系:一项两样本孟德尔随机化研究。
Medicine (Baltimore). 2025 May 23;104(21):e42559. doi: 10.1097/MD.0000000000042559.

本文引用的文献

1
The role of deep learning in diagnostic imaging of spondyloarthropathies: a systematic review.深度学习在脊柱关节炎诊断成像中的作用:一项系统综述。
Eur Radiol. 2025 Jun;35(6):3661-3672. doi: 10.1007/s00330-024-11261-x. Epub 2024 Dec 10.
2
Large Language Models in Rheumatologic Diagnosis: A Multimodal Performance Analysis.大语言模型在风湿病诊断中的应用:多模态性能分析
J Rheumatol. 2025 Feb 1;52(2):187-188. doi: 10.3899/jrheum.2024-0975.
3
Advancing rheumatology with natural language processing: insights and prospects from a systematic review.
利用自然语言处理推动风湿病学发展:系统评价的见解与展望
Rheumatol Adv Pract. 2024 Sep 19;8(4):rkae120. doi: 10.1093/rap/rkae120. eCollection 2024.
4
Assessing Accuracy of ChatGPT on Addressing Helicobacter pylori Infection-Related Questions: A National Survey and Comparative Study.评估 ChatGPT 在解答与幽门螺杆菌感染相关问题方面的准确性:一项全国性调查和对比研究。
Helicobacter. 2024 Jul-Aug;29(4):e13116. doi: 10.1111/hel.13116.
5
Evaluation and mitigation of the limitations of large language models in clinical decision-making.评估和缓解大型语言模型在临床决策中的局限性。
Nat Med. 2024 Sep;30(9):2613-2622. doi: 10.1038/s41591-024-03097-1. Epub 2024 Jul 4.
6
A multimodal generative AI copilot for human pathology.用于人体病理学的多模态生成式人工智能副驾。
Nature. 2024 Oct;634(8033):466-473. doi: 10.1038/s41586-024-07618-3. Epub 2024 Jun 12.
7
Use of Artificial Intelligence Chatbots in Interpretation of Pathology Reports.人工智能聊天机器人在病理报告解读中的应用。
JAMA Netw Open. 2024 May 1;7(5):e2412767. doi: 10.1001/jamanetworkopen.2024.12767.
8
Evaluating large language models as agents in the clinic.评估大型语言模型作为临床中的智能体。
NPJ Digit Med. 2024 Apr 3;7(1):84. doi: 10.1038/s41746-024-01083-y.
9
Large language models: rheumatologists' newest colleagues?大型语言模型:风湿病学家的最新同事?
Nat Rev Rheumatol. 2024 Feb;20(2):75-76. doi: 10.1038/s41584-023-01070-9.
10
Accuracy of ChatGPT in Common Gastrointestinal Diseases: Impact for Patients and Providers.ChatGPT 在常见胃肠道疾病中的准确性:对患者和提供者的影响。
Clin Gastroenterol Hepatol. 2024 Jun;22(6):1323-1325.e3. doi: 10.1016/j.cgh.2023.11.008. Epub 2023 Nov 19.