• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

眼科大语言模型研究的新兴趋势回顾与未来发展预测。

Review of emerging trends and projection of future developments in large language models research in ophthalmology.

机构信息

University of Cambridge, Cambridge, UK.

Yong Loo Lin School of Medicine, National University of Singapore, Singapore.

出版信息

Br J Ophthalmol. 2024 Sep 20;108(10):1362-1370. doi: 10.1136/bjo-2023-324734.

DOI:10.1136/bjo-2023-324734
PMID:38164563
Abstract

BACKGROUND

Large language models (LLMs) are fast emerging as potent tools in healthcare, including ophthalmology. This systematic review offers a twofold contribution: it summarises current trends in ophthalmology-related LLM research and projects future directions for this burgeoning field.

METHODS

We systematically searched across various databases (PubMed, Europe PMC, Scopus and Web of Science) for articles related to LLM use in ophthalmology, published between 1 January 2022 and 31 July 2023. Selected articles were summarised, and categorised by type (editorial, commentary, original research, etc) and their research focus (eg, evaluating ChatGPT's performance in ophthalmology examinations or clinical tasks).

FINDINGS

We identified 32 articles meeting our criteria, published between January and July 2023, with a peak in June (n=12). Most were original research evaluating LLMs' proficiency in clinically related tasks (n=9). Studies demonstrated that ChatGPT-4.0 outperformed its predecessor, ChatGPT-3.5, in ophthalmology exams. Furthermore, ChatGPT excelled in constructing discharge notes (n=2), evaluating diagnoses (n=2) and answering general medical queries (n=6). However, it struggled with generating scientific articles or abstracts (n=3) and answering specific subdomain questions, especially those regarding specific treatment options (n=2). ChatGPT's performance relative to other LLMs (Google's Bard, Microsoft's Bing) varied by study design. Ethical concerns such as data hallucination (n=27), authorship (n=5) and data privacy (n=2) were frequently cited.

INTERPRETATION

While LLMs hold transformative potential for healthcare and ophthalmology, concerns over accountability, accuracy and data security remain. Future research should focus on application programming interface integration, comparative assessments of popular LLMs, their ability to interpret image-based data and the establishment of standardised evaluation frameworks.

摘要

背景

大型语言模型(LLM)在医疗保健领域,包括眼科领域,迅速成为强大的工具。本系统综述有两个贡献:总结眼科相关 LLM 研究的当前趋势,并为这个新兴领域规划未来方向。

方法

我们系统地在多个数据库(PubMed、Europe PMC、Scopus 和 Web of Science)中搜索了 2022 年 1 月 1 日至 2023 年 7 月 31 日期间发表的与 LLM 在眼科中的使用相关的文章。选择的文章进行了总结,并按类型(社论、评论、原始研究等)和研究重点(例如,评估 ChatGPT 在眼科检查或临床任务中的表现)进行了分类。

结果

我们确定了符合标准的 32 篇文章,发表时间为 2023 年 1 月至 7 月,6 月达到高峰(n=12)。大多数是评估 LLM 在临床相关任务中的熟练程度的原始研究(n=9)。研究表明,ChatGPT-4.0 在眼科考试中优于其前身 ChatGPT-3.5。此外,ChatGPT 在构建出院记录(n=2)、评估诊断(n=2)和回答一般医学查询(n=6)方面表现出色。然而,它在生成科学文章或摘要(n=3)以及回答特定子领域问题方面存在困难,特别是涉及特定治疗方案的问题(n=2)。ChatGPT 的性能相对于其他 LLM(谷歌的 Bard、微软的 Bing)因研究设计而异。数据幻觉(n=27)、作者身份(n=5)和数据隐私(n=2)等伦理问题经常被提及。

解释

虽然 LLM 对医疗保健和眼科有变革性的潜力,但对问责制、准确性和数据安全的担忧仍然存在。未来的研究应侧重于应用程序编程接口集成、对流行 LLM 的比较评估、它们解释基于图像数据的能力以及建立标准化评估框架。

相似文献

1
Review of emerging trends and projection of future developments in large language models research in ophthalmology.眼科大语言模型研究的新兴趋势回顾与未来发展预测。
Br J Ophthalmol. 2024 Sep 20;108(10):1362-1370. doi: 10.1136/bjo-2023-324734.
2
Accuracy of large language models in answering ophthalmology board-style questions: A meta-analysis.大语言模型回答眼科考试式问题的准确性:一项荟萃分析。
Asia Pac J Ophthalmol (Phila). 2024 Sep-Oct;13(5):100106. doi: 10.1016/j.apjo.2024.100106. Epub 2024 Oct 5.
3
Evidence-based potential of generative artificial intelligence large language models in orthodontics: a comparative study of ChatGPT, Google Bard, and Microsoft Bing.生成式人工智能大语言模型在正畸学中的循证潜力:ChatGPT、谷歌巴德和微软必应的比较研究
Eur J Orthod. 2024 Apr 13. doi: 10.1093/ejo/cjae017.
4
Utility of artificial intelligence-based large language models in ophthalmic care.人工智能大型语言模型在眼科护理中的应用。
Ophthalmic Physiol Opt. 2024 May;44(3):641-671. doi: 10.1111/opo.13284. Epub 2024 Feb 25.
5
Evaluation of the Performance of Generative AI Large Language Models ChatGPT, Google Bard, and Microsoft Bing Chat in Supporting Evidence-Based Dentistry: Comparative Mixed Methods Study.评估生成式 AI 大语言模型 ChatGPT、Google Bard 和 Microsoft Bing Chat 在支持循证牙科方面的性能:比较混合方法研究。
J Med Internet Res. 2023 Dec 28;25:e51580. doi: 10.2196/51580.
6
A Systematic Review of ChatGPT and Other Conversational Large Language Models in Healthcare.ChatGPT及其他对话式大语言模型在医疗保健领域的系统评价
medRxiv. 2024 Apr 27:2024.04.26.24306390. doi: 10.1101/2024.04.26.24306390.
7
Evaluating the efficacy of leading large language models in the Japanese national dental hygienist examination: A comparative analysis of ChatGPT, Bard, and Bing Chat.评估领先的大语言模型在日本国家牙科保健员考试中的功效:ChatGPT、Bard和必应聊天的比较分析。
J Dent Sci. 2024 Oct;19(4):2262-2267. doi: 10.1016/j.jds.2024.02.019. Epub 2024 Feb 29.
8
Performance of artificial intelligence in bariatric surgery: comparative analysis of ChatGPT-4, Bing, and Bard in the American Society for Metabolic and Bariatric Surgery textbook of bariatric surgery questions.人工智能在减重手术中的表现:ChatGPT-4、Bing 和 Bard 在《美国代谢与减重外科学会减重手术教科书》减重手术问题中的比较分析。
Surg Obes Relat Dis. 2024 Jul;20(7):609-613. doi: 10.1016/j.soard.2024.04.014. Epub 2024 May 8.
9
Performance of Generative Large Language Models on Ophthalmology Board-Style Questions.生成式大型语言模型在眼科 Board 式问题中的表现。
Am J Ophthalmol. 2023 Oct;254:141-149. doi: 10.1016/j.ajo.2023.05.024. Epub 2023 Jun 18.
10
Eyes on AI: ChatGPT's Transformative Potential Impact on Ophthalmology.关注人工智能:ChatGPT对眼科的变革性潜在影响。
Cureus. 2023 Jun 21;15(6):e40765. doi: 10.7759/cureus.40765. eCollection 2023 Jun.

引用本文的文献

1
Impact of large language model (ChatGPT) in healthcare: an umbrella review and evidence synthesis.大语言模型(ChatGPT)在医疗保健领域的影响:一项综述与证据综合
J Biomed Sci. 2025 May 7;32(1):45. doi: 10.1186/s12929-025-01131-z.
2
Large Language Models in Ophthalmology: A Review of Publications from Top Ophthalmology Journals.眼科领域的大语言模型:顶级眼科期刊出版物综述
Ophthalmol Sci. 2024 Dec 17;5(3):100681. doi: 10.1016/j.xops.2024.100681. eCollection 2025 May-Jun.
3
User-Centred Design and Development of a Smartphone Application () for Digital Phenotyping in Ophthalmology.
以用户为中心的眼科数字表型智能手机应用程序()的设计与开发。
Healthcare (Basel). 2024 Dec 18;12(24):2550. doi: 10.3390/healthcare12242550.
4
Foundation models in ophthalmology: opportunities and challenges.眼科领域的基础模型:机遇与挑战。
Curr Opin Ophthalmol. 2025 Jan 1;36(1):90-98. doi: 10.1097/ICU.0000000000001091. Epub 2024 Nov 4.
5
Evaluating the performance of ChatGPT-3.5 and ChatGPT-4 on the Taiwan plastic surgery board examination.评估ChatGPT-3.5和ChatGPT-4在台湾整形外科医师资格考试中的表现。
Heliyon. 2024 Jul 18;10(14):e34851. doi: 10.1016/j.heliyon.2024.e34851. eCollection 2024 Jul 30.
6
A review of ophthalmology education in the era of generative artificial intelligence.眼科教育在生成式人工智能时代的回顾。
Asia Pac J Ophthalmol (Phila). 2024 Jul-Aug;13(4):100089. doi: 10.1016/j.apjo.2024.100089. Epub 2024 Aug 10.
7
Artificial Intelligence, Medical Knowledge, and Empowering Patients.人工智能、医学知识与赋能患者
Mayo Clin Proc Digit Health. 2024 Mar;2(1):160-162. doi: 10.1016/j.mcpdig.2024.01.008. Epub 2024 Feb 28.