• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

评估DeepResearch和DeepThink在前交叉韧带手术患者教育中的作用:ChatGPT-4o在全面性方面表现出色,DeepSeek R1在骨科信息的清晰度和可读性方面领先。

Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT-4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information.

作者信息

Gültekin Onur, Inoue Jumpei, Yilmaz Baris, Cerci Mehmet Halis, Kilinc Bekir Eray, Yilmaz Hüsnü, Prill Robert, Kayaalp Mahmut Enes

机构信息

Department of Orthopaedics and Traumatology, Istanbul Fatih Sultan Mehmet Training and Research Hospital, University of Health Sciences, Istanbul, Turkey.

Department of Orthopaedic Surgery, Nagoya Tokushukai General Hospital, Kasugai, Aichi, Japan.

出版信息

Knee Surg Sports Traumatol Arthrosc. 2025 Jun 1. doi: 10.1002/ksa.12711.

DOI:10.1002/ksa.12711
PMID:40450565
Abstract

PURPOSE

This study compares ChatGPT-4o, equipped with its deep research feature, and DeepSeek R1, equipped with its deepthink feature-both enabling real-time online data access-in generating responses to frequently asked questions (FAQs) about anterior cruciate ligament (ACL) surgery. The aim is to evaluate and compare their performance in terms of accuracy, clarity, completeness, consistency and readibility for evidence-based patient education.

METHODS

A list of ten FAQs about ACL surgery was compiled after reviewing the Sports Medicine Fellowship Institution's webpages. These questions were posed to ChatGPT and DeepSeek in research-enabled modes. Orthopaedic sports surgeons evaluated the responses for accuracy, clarity, completeness, and consistency using a 4-point Likert scale. Inter-rater reliability of the evaluations was assessed using intraclass correlation coefficients (ICCs). In addition, a readability analysis was conducted using the Flesch-Kincaid Grade Level (FKGL) and Flesch Reading Ease Score (FRES) metrics via an established online calculator to objectively measure textual complexity. Paired t tests were used to compare the mean scores of the two models for each criterion, with significance set at p < 0.05.

RESULTS

Both models demonstrated high accuracy (mean scores of 3.9/4) and consistency (4/4). Significant differences were observed in clarity and completeness: ChatGPT provided more comprehensive responses (mean completeness 4.0 vs. 3.2, p < 0.001), while DeepSeek's answers were clearer and more accessible to laypersons (mean clarity 3.9 vs. 3.0, p < 0.001). DeepSeek had lower FKGL (8.9 vs. 14.2, p < 0.001) and higher FRES (61.3 vs. 32.7, p < 0.001), indicating greater ease of reading for a general audience. ICC analysis indicated substantial inter-rater agreement (composite ICC = 0.80).

CONCLUSION

ChatGPT-4o, leveraging its deep research feature, and DeepSeek R1, utilizing its deepthink feature, both deliver high-quality, accurate information for ACL surgery patient education. While ChatGPT excels in comprehensiveness, DeepSeek outperforms in clarity and readability, suggesting that integrating the strengths of both models could optimize patient education outcomes.

LEVEL OF EVIDENCE

Level V.

摘要

目的

本研究比较了具备深度研究功能的ChatGPT-4o和具备深度思考功能(均支持实时在线数据访问)的DeepSeek R1在生成关于前交叉韧带(ACL)手术常见问题(FAQ)的回答方面的表现。目的是评估和比较它们在基于证据的患者教育方面的准确性、清晰度、完整性、一致性和可读性。

方法

在查阅运动医学 fellowship 机构的网页后,编制了一份关于ACL手术的十个常见问题列表。这些问题以研究启用模式向ChatGPT和DeepSeek提出。骨科运动外科医生使用4点李克特量表评估回答的准确性、清晰度、完整性和一致性。使用组内相关系数(ICC)评估评估者间的可靠性。此外,通过一个既定的在线计算器,使用弗莱什-金凯德年级水平(FKGL)和弗莱什阅读简易度得分(FRES)指标进行可读性分析,以客观测量文本复杂性。使用配对t检验比较两个模型在每个标准上的平均得分,显著性设定为p < 0.05。

结果

两个模型都表现出高准确性(平均得分3.9/4)和一致性(4/4)。在清晰度和完整性方面观察到显著差异:ChatGPT提供了更全面的回答(平均完整性4.0对3.2,p < 0.001),而DeepSeek的回答对非专业人士来说更清晰、更容易理解(平均清晰度3.9对3.0,p < 0.001)。DeepSeek的FKGL较低(8.9对14.2,p < 0.001),FRES较高(61.3对32.7,p < 0.001),表明对普通读者来说阅读更容易。ICC分析表明评估者间有实质性的一致性(综合ICC = 0.80)。

结论

利用其深度研究功能的ChatGPT-4o和利用其深度思考功能的DeepSeek R1都为ACL手术患者教育提供了高质量、准确的信息。虽然ChatGPT在全面性方面表现出色,但DeepSeek在清晰度和可读性方面表现更优,这表明整合两个模型的优势可以优化患者教育效果。

证据水平

V级。

相似文献

1
Evaluating DeepResearch and DeepThink in anterior cruciate ligament surgery patient education: ChatGPT-4o excels in comprehensiveness, DeepSeek R1 leads in clarity and readability of orthopaedic information.评估DeepResearch和DeepThink在前交叉韧带手术患者教育中的作用:ChatGPT-4o在全面性方面表现出色,DeepSeek R1在骨科信息的清晰度和可读性方面领先。
Knee Surg Sports Traumatol Arthrosc. 2025 Jun 1. doi: 10.1002/ksa.12711.
2
Evaluating ChatGPT and DeepSeek in postdural puncture headache management: a comparative study with international consensus guidelines.评估ChatGPT和DeepSeek在硬膜穿刺后头痛管理中的应用:与国际共识指南的对比研究
BMC Neurol. 2025 Jul 1;25(1):264. doi: 10.1186/s12883-025-04280-8.
3
Evaluation of ChatGPT-4 as an Online Outpatient Assistant in Puerperal Mastitis Management: Content Analysis of an Observational Study.评估ChatGPT-4作为产褥期乳腺炎管理在线门诊助手的效果:一项观察性研究的内容分析
JMIR Med Inform. 2025 Jul 24;13:e68980. doi: 10.2196/68980.
4
Is Information About Musculoskeletal Malignancies From Large Language Models or Web Resources at a Suitable Reading Level for Patients?来自大语言模型或网络资源的关于肌肉骨骼恶性肿瘤的信息对患者来说是否处于合适的阅读水平?
Clin Orthop Relat Res. 2025 Feb 1;483(2):306-315. doi: 10.1097/CORR.0000000000003263. Epub 2024 Sep 25.
5
A Comparative Study of ChatGPT-4o and DeepSeek Responses to Mandibular Angle Osteotomy Questions.ChatGPT-4o与DeepSeek对下颌角截骨术问题回答的比较研究
J Craniofac Surg. 2025 Jul 31. doi: 10.1097/SCS.0000000000011698.
6
American Academy of Orthopaedic Surgeons OrthoInfo provides more readable information regarding rotator cuff injury than ChatGPT.美国矫形外科医师学会的OrthoInfo提供了比ChatGPT更具可读性的关于肩袖损伤的信息。
J ISAKOS. 2025 Feb 12;12:100841. doi: 10.1016/j.jisako.2025.100841.
7
Can Artificial Intelligence Improve the Readability of Patient Education Materials?人工智能能否提高患者教育材料的可读性?
Clin Orthop Relat Res. 2023 Nov 1;481(11):2260-2267. doi: 10.1097/CORR.0000000000002668. Epub 2023 Apr 28.
8
Artificial Intelligence in Peripheral Artery Disease Education: A Battle Between ChatGPT and Google Gemini.外周动脉疾病教育中的人工智能:ChatGPT与谷歌Gemini的较量
Cureus. 2025 Jun 1;17(6):e85174. doi: 10.7759/cureus.85174. eCollection 2025 Jun.
9
Bridging Health Literacy Gaps in Spine Care: Using ChatGPT-4o to Improve Patient-Education Materials.弥合脊柱护理中的健康素养差距:利用ChatGPT-4o改进患者教育材料。
J Bone Joint Surg Am. 2025 Jun 19. doi: 10.2106/JBJS.24.01484.
10
A structured evaluation of LLM-generated step-by-step instructions in cadaveric brachial plexus dissection.对大语言模型生成的尸体臂丛神经解剖分步指导的结构化评估。
BMC Med Educ. 2025 Jul 1;25(1):903. doi: 10.1186/s12909-025-07493-0.

本文引用的文献

1
Fake no more: The redemption of ChatGPT in literature reviews.不再虚假:ChatGPT在文献综述中的救赎
Account Res. 2025 Feb 16:1-3. doi: 10.1080/08989621.2025.2465619.
2
DeepSeek versus ChatGPT: Multimodal artificial intelligence revolutionizing scientific discovery. From language editing to autonomous content generation-Redefining innovation in research and practice.深度求索与ChatGPT:多模态人工智能正在革新科学发现。从语言编辑到自主内容生成——重新定义研究与实践中的创新。
Knee Surg Sports Traumatol Arthrosc. 2025 May;33(5):1553-1556. doi: 10.1002/ksa.12628. Epub 2025 Feb 12.
3
Enhancement of the Performance of Large Language Models in Diabetes Education through Retrieval-Augmented Generation: Comparative Study.
通过检索增强生成提高大语言模型在糖尿病教育中的性能:比较研究
J Med Internet Res. 2024 Nov 8;26:e58041. doi: 10.2196/58041.
4
Reviewing the Potential Role of Artificial Intelligence in Delivering Personalized and Interactive Pain Medicine Education for Chronic Pain Patients.审视人工智能在为慢性疼痛患者提供个性化和交互式疼痛医学教育方面的潜在作用。
J Pain Res. 2024 Mar 6;17:923-929. doi: 10.2147/JPR.S439452. eCollection 2024.
5
ChatGPT Provides Unsatisfactory Responses to Frequently Asked Questions Regarding Anterior Cruciate Ligament Reconstruction.ChatGPT 对前交叉韧带重建相关常见问题的回答不尽如人意。
Arthroscopy. 2024 Jul;40(7):2067-2079.e1. doi: 10.1016/j.arthro.2024.01.017. Epub 2024 Feb 2.
6
Assessment of Quality and Readability of Information Provided by ChatGPT in Relation to Anterior Cruciate Ligament Injury.ChatGPT提供的关于前交叉韧带损伤信息的质量和可读性评估
J Pers Med. 2024 Jan 18;14(1):104. doi: 10.3390/jpm14010104.
7
Embrace responsible ChatGPT usage to overcome language barriers in academic writing.以负责任的方式使用ChatGPT,以克服学术写作中的语言障碍。
Knee Surg Sports Traumatol Arthrosc. 2024 Jan;32(1):5-9. doi: 10.1002/ksa.12014. Epub 2023 Dec 31.
8
The ability of artificial intelligence tools to formulate orthopaedic clinical decisions in comparison to human clinicians: An analysis of ChatGPT 3.5, ChatGPT 4, and Bard.与人类临床医生相比,人工智能工具制定骨科临床决策的能力:对ChatGPT 3.5、ChatGPT 4和Bard的分析。
J Orthop. 2023 Dec 1;50:1-7. doi: 10.1016/j.jor.2023.11.063. eCollection 2024 Apr.
9
ChatGPT's potential to support home care for patients in the early period after orthopedic interventions and enhance public health.ChatGPT 在支持骨科干预后早期患者的家庭护理和增强公众健康方面的潜力。
Jt Dis Relat Surg. 2024 Jan 1;35(1):169-176. doi: 10.52312/jdrs.2023.1402. Epub 2023 Nov 30.
10
ChatGPT and large language models in orthopedics: from education and surgery to research.骨科领域的ChatGPT和大语言模型:从教育、手术到研究
J Exp Orthop. 2023 Dec 1;10(1):128. doi: 10.1186/s40634-023-00700-1.