• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

DeepSeek与泪道引流障碍:炒作还是它比ChatGPT表现更好?

DeepSeek and lacrimal drainage disorders: hype or is it performing better than ChatGPT?

作者信息

Ali Mohammad Javed

机构信息

Govindram Seksaria Institute of Dacryology, L.V. Prasad Eye Institute, Hyderabad, India.

出版信息

Orbit. 2025 May 8:1-7. doi: 10.1080/01676830.2025.2501656.

DOI:10.1080/01676830.2025.2501656
PMID:40336348
Abstract

PURPOSE

This study aimed to report the performance of the large language model DeepSeek (DeepSeek TM, Hangzhou, China) and perform a head-to-head comparison with ChatGPT (OpenAI, San Francisco, USA) in the context of lacrimal drainage disorders.

METHODS

Questions and statements were used to construct prompts to include common and uncommon aspects of lacrimal drainage disorders. Prompts avoided covering new knowledge beyond February 2024. Prompts were presented at least twice to the latest versions of DeepSeek and ChatGPT [Accessed February 15-18, 2025]. A set of assessed prompts for ChatGPT from 2023 (ChatGPT-2023) was utilized in this study. The responses of DeepSeek and ChatGPT were analyzed for evidence-based content, updated knowledge, specific responses, speed, and factual inaccuracies. The responses of the current ChatGPT were also compared with those of 2023 to assess the improvement of the artificial intelligence chatbot. Three lacrimal surgeons graded the responses into three categories: correct, partially correct, and factually incorrect. They also compared the overall quality of the response between DeepSeek and ChatGPT based on the overall content, organization, and clarity of the answers.

RESULTS

25 prompts were presented to the latest versions [February 2025] of DeepSeek and ChatGPT. There was no significant difference in the speed of response. The agreement among the three observers was high (96%) in grading the responses. In terms of the accuracy of the responses, both AI models were similar. DeepSeek's responses were graded as correct in 60% (15/25), partially correct in 36% (9/25), and factually incorrect in 4% (1/25). ChatGPT-2025 responses were graded as correct in 56% (14/25), partially correct in 40% (10/25), and factually incorrect in 4% (1/25). Compared to 2023, ChatGPT-2025 gave responses which were more specific, more accurate, less generic with lesser recycling of phrases. When confronted with inaccuracies, both admitted and corrected the mistakes in subsequent responses. Both the AI models demonstrated the capability of challenging incorrect prompts and premises.

CONCLUSION

DeepSeek was not superior but comparable to ChatGPT in the context of lacrimal drainage disorders. Each had unique advantages and could complement each other. They need to be specifically trained and re-trained for individual medical subspecialties.

摘要

目的

本研究旨在报告大语言模型DeepSeek(DeepSeek TM,中国杭州)的性能,并在泪道疾病背景下与ChatGPT(美国旧金山OpenAI)进行直接比较。

方法

使用问题和陈述来构建提示,涵盖泪道疾病的常见和罕见方面。提示避免涉及2024年2月以后的新知识。提示至少向DeepSeek和ChatGPT的最新版本展示两次[访问时间为2025年2月15日至18日]。本研究使用了一组2023年针对ChatGPT的评估提示(ChatGPT - 2023)。分析DeepSeek和ChatGPT的回答,评估其基于证据的内容、更新知识、具体回答、速度和事实错误。还将当前ChatGPT的回答与2023年的回答进行比较,以评估人工智能聊天机器人的改进情况。三位泪道外科医生将回答分为三类:正确、部分正确和事实错误。他们还根据回答的整体内容、组织和清晰度,比较了DeepSeek和ChatGPT回答的整体质量。

结果

向DeepSeek和ChatGPT的最新版本[2025年2月]展示了25个提示。回答速度没有显著差异。三位观察者在对回答进行评分时的一致性很高(96%)。在回答的准确性方面,两个人工智能模型相似。DeepSeek的回答被评为正确的占60%(15/25),部分正确的占36%(9/),事实错误的占4%(1/25)。ChatGPT - 2025的回答被评为正确的占56%(14/25),部分正确的占40%(10/25),事实错误的占4%(1/25)。与2023年相比,ChatGPT - 2025的回答更具体、更准确、更不笼统,短语重复使用较少。当面对不准确之处时,两者都承认并在后续回答中纠正了错误。两个人工智能模型都展示了挑战错误提示和前提的能力。

结论

在泪道疾病背景下,DeepSeek并不优于ChatGPT,但与之相当。两者各有独特优势,可以相互补充。它们需要针对各个医学亚专业进行专门训练和再训练。

相似文献

1
DeepSeek and lacrimal drainage disorders: hype or is it performing better than ChatGPT?DeepSeek与泪道引流障碍:炒作还是它比ChatGPT表现更好?
Orbit. 2025 May 8:1-7. doi: 10.1080/01676830.2025.2501656.
2
ChatGPT and Lacrimal Drainage Disorders: Performance and Scope of Improvement.ChatGPT 与泪液排出障碍:性能与改善范围。
Ophthalmic Plast Reconstr Surg. 2023;39(3):221-225. doi: 10.1097/IOP.0000000000002418. Epub 2023 May 10.
3
ChatGPT and Ophthalmology: Exploring Its Potential with Discharge Summaries and Operative Notes.ChatGPT 与眼科学:从出院小结和手术记录探索其潜力。
Semin Ophthalmol. 2023 Jul;38(5):503-507. doi: 10.1080/08820538.2023.2209166. Epub 2023 May 3.
4
Performance of DeepSeek, Qwen 2.5 MAX, and ChatGPT Assisting in Diagnosis of Corneal Eye Diseases, Glaucoma, and Neuro-Ophthalmology Diseases Based on Clinical Case Reports.基于临床病例报告,DeepSeek、通义千问2.5 MAX和ChatGPT在角膜眼病、青光眼和神经眼科疾病诊断中的性能表现。
medRxiv. 2025 Mar 17:2025.03.14.25323836. doi: 10.1101/2025.03.14.25323836.
5
Can deepseek and ChatGPT be used in the diagnosis of oral pathologies?DeepSeek和ChatGPT能用于口腔病理学诊断吗?
BMC Oral Health. 2025 Apr 25;25(1):638. doi: 10.1186/s12903-025-06034-x.
6
Accuracy and Readability of Artificial Intelligence Chatbot Responses to Vasectomy-Related Questions: Public Beware.人工智能聊天机器人对输精管切除术相关问题回答的准确性和可读性:公众需谨慎。
Cureus. 2024 Aug 28;16(8):e67996. doi: 10.7759/cureus.67996. eCollection 2024 Aug.
7
Artificial intelligence performance in answering multiple-choice oral pathology questions: a comparative analysis.人工智能在回答口腔病理学选择题方面的表现:一项对比分析。
BMC Oral Health. 2025 Apr 15;25(1):573. doi: 10.1186/s12903-025-05926-2.
8
DeepSeek in Healthcare: Revealing Opportunities and Steering Challenges of a New Open-Source Artificial Intelligence Frontier.医疗保健领域的DeepSeek:揭示新开源人工智能前沿的机遇与导向挑战
Cureus. 2025 Feb 18;17(2):e79221. doi: 10.7759/cureus.79221. eCollection 2025 Feb.
9
Unlocking Health Literacy: The Ultimate Guide to Hypertension Education From ChatGPT Versus Google Gemini.解锁健康素养:ChatGPT与谷歌Gemini高血压教育终极指南
Cureus. 2024 May 8;16(5):e59898. doi: 10.7759/cureus.59898. eCollection 2024 May.
10
Evaluating ChatGPT-3.5 and ChatGPT-4.0 Responses on Hyperlipidemia for Patient Education.评估ChatGPT-3.5和ChatGPT-4.0关于高脂血症的患者教育回复。
Cureus. 2024 May 25;16(5):e61067. doi: 10.7759/cureus.61067. eCollection 2024 May.