• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于慢性阻塞性肺疾病患者教育和信息提供的人工智能聊天机器人评估。

Evaluation of AI chatbots for patient education and information on chronic obstructive pulmonary disease.

作者信息

Merç Pınar, Pirinççi Cansu Şahbaz, Cihan Emine

机构信息

Gulhane Faculty of Physiotherapy and Rehabilitation, University of Health Sciences, Ankara, Turkiye.

Department of Therapy and Rehabilitation, Vocational School of Health Sciences, Selcuk University, Konya, Turkiye.

出版信息

Heart Lung. 2025 Sep 10;75:21-25. doi: 10.1016/j.hrtlng.2025.09.002.

DOI:10.1016/j.hrtlng.2025.09.002
PMID:40939398
Abstract

BACKGROUND

Chronic obstructive pulmonary disease (COPD) is a chronic and progressive disease that affects patients' quality of life and functional capacity. With its widespread use and ease of access, AI chatbots stand out as an alternative source of patient-centered information and education.

OBJECTIVES

To evaluate the readability and accuracy of information provided by ChatGPT, Gemini, and DeepSeek in COPD.

METHODS

Ten most frequently asked questions and answers regarding COPD in English were provided using three AI chatbots (ChatGPT-4 Turbo, Gemini 2.0 Flash, DeepSeek R1). Readability was assessed using the Flesch-Kincaid Grade Level (FKGL), while information quality was analyzed by five physiotherapists based on the guidelines. Responses were graded using a 4-point system from "excellent response requiring no explanation" to "unsatisfactory requiring significant explanation." Statistical analyses were performed on SPSS.

RESULTS

Overall, all three AI chatbots responded to questions with similar quality, with Gemini 2.0 providing a statistically higher quality response to question 4 (p < 0.05). In terms of readability of the answers, DeepSeek was found to have better readability on Q5 (12.01), Q8 (9.24), Q9 (13.1) and Q10 (8.73) compared to ChatGPT (Q5:13.9, Q8:11.92, Q9:17.15, Q10:9.88) and Gemini (Q5:18.22, Q8:15.47, Q9:17.42, Q10:9.38). Gemini was observed to produce more complex and academic level answers on more questions (Q4, Q5, Q8).

CONCLUSIONS

ChatGPT, Gemini, and DeepSeek provided evidence-based answers to frequently asked patient questions about COPD. DeepSeek showed better readability performance for many questions. AI chatbots may serve as a valuable clinical tool for COPD patient education and disease management in the future.

摘要

背景

慢性阻塞性肺疾病(COPD)是一种慢性进行性疾病,会影响患者的生活质量和功能能力。由于其广泛使用且易于获取,人工智能聊天机器人成为以患者为中心的信息和教育的替代来源。

目的

评估ChatGPT、Gemini和DeepSeek提供的关于COPD信息的可读性和准确性。

方法

使用三个人工智能聊天机器人(ChatGPT-4 Turbo、Gemini 2.0 Flash、DeepSeek R1)提供十个关于COPD的最常见问答(英文)。使用弗莱什-金凯德年级水平(FKGL)评估可读性,同时由五名物理治疗师根据指南分析信息质量。回答使用从“无需解释的优秀回答”到“需要大量解释的不满意回答”的4分制进行评分。在SPSS上进行统计分析。

结果

总体而言,所有三个人工智能聊天机器人对问题的回答质量相似,Gemini 2.0对问题4的回答质量在统计学上更高(p < 0.05)。在答案的可读性方面,与ChatGPT(问题5:13.9,问题8:11.92,问题9:17.15,问题10:9.88)和Gemini(问题5:18.22,问题8:15.47,问题9:17.42,问题10:9.38)相比,DeepSeek在问题5(12.01)、问题8(9.24)、问题9(13.1)和问题10(8.73)上的可读性更好。观察到Gemini在更多问题(问题4、问题5、问题8)上给出更复杂和学术水平的答案。

结论

ChatGPT、Gemini和DeepSeek为患者关于COPD的常见问题提供了基于证据的答案。DeepSeek在许多问题上表现出更好的可读性。未来,人工智能聊天机器人可能成为COPD患者教育和疾病管理的有价值临床工具。

相似文献

1
Evaluation of AI chatbots for patient education and information on chronic obstructive pulmonary disease.用于慢性阻塞性肺疾病患者教育和信息提供的人工智能聊天机器人评估。
Heart Lung. 2025 Sep 10;75:21-25. doi: 10.1016/j.hrtlng.2025.09.002.
2
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能:ChatGPT与谷歌Gemini的较量
Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.
3
Performance of Advanced Artificial Intelligence Models in Pulp Therapy for Immature Permanent Teeth: A Comparison of ChatGPT-4 Omni, DeepSeek, and Gemini Advanced in Accuracy, Completeness, Response Time, and Readability.先进人工智能模型在年轻恒牙牙髓治疗中的表现:ChatGPT-4 Omni、DeepSeek和Gemini Advanced在准确性、完整性、响应时间和可读性方面的比较
J Endod. 2025 Aug 22. doi: 10.1016/j.joen.2025.08.011.
4
Evaluating the readability, quality, and reliability of responses generated by ChatGPT, Gemini, and Perplexity on the most commonly asked questions about Ankylosing spondylitis.评估ChatGPT、Gemini和Perplexity针对强直性脊柱炎最常见问题生成的回答的可读性、质量和可靠性。
PLoS One. 2025 Jun 18;20(6):e0326351. doi: 10.1371/journal.pone.0326351. eCollection 2025.
5
ChatGPT-4.0 or DeepSeek-V3? Comparative analysis of answers to the most frequently asked questions by total knee replacement candidate patients.ChatGPT-4.0还是DeepSeek-V3?全膝关节置换候选患者常见问题答案的比较分析。
Medicine (Baltimore). 2025 Aug 22;104(34):e43951. doi: 10.1097/MD.0000000000043951.
6
Readability, reliability and quality of responses generated by ChatGPT, gemini, and perplexity for the most frequently asked questions about pain.ChatGPT、Gemini和Perplexity针对最常见疼痛问题生成的回答的可读性、可靠性和质量。
Medicine (Baltimore). 2025 Mar 14;104(11):e41780. doi: 10.1097/MD.0000000000041780.
7
Evaluating artificial intelligence chatbots' responses to gynecomastia inquiries: Comparative study of information quality, readability, and guideline consistency.评估人工智能聊天机器人对男性乳房发育症咨询的回复:信息质量、可读性和指南一致性的比较研究
Digit Health. 2025 Aug 26;11:20552076251367645. doi: 10.1177/20552076251367645. eCollection 2025 Jan-Dec.
8
Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT-4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information.评估DeepResearch和DeepThink在前交叉韧带手术患者教育中的作用:ChatGPT-4o在全面性方面表现出色,DeepSeek R1在骨科信息的清晰度和可读性方面领先。
Knee Surg Sports Traumatol Arthrosc. 2025 Jun 1. doi: 10.1002/ksa.12711.
9
Evaluation of information provided by artificial intelligence chatbots on extraoral maxillofacial prostheses.人工智能聊天机器人提供的关于口腔外颌面修复体信息的评估
J Prosthet Dent. 2025 Sep 8. doi: 10.1016/j.prosdent.2025.08.028.
10
Multicriteria Assessment of Text Quality in Large Language Model-Generated Gynecomastia Materials: DeepSeek Versus OpenAI Versus Claude.大语言模型生成的男性乳腺增生症材料中文本质量的多标准评估:DeepSeek 与 OpenAI 与 Claude 的比较
J Craniofac Surg. 2025 Sep 10. doi: 10.1097/SCS.0000000000011930.