Large Language Model Architectures in Health Care: Scoping Review of Research Perspectives.

Authors

Leiser Florian, Guse Richard, Sunyaev Ali

Affiliations

Research Group Critical Information Infrastructures, Institute of Applied Informatics and Formal Description Methods, Karlsruhe Institute of Technology, Karlsruhe, Germany.

Chair of Information Infrastructures, School of Computation, Information and Technology, Technical University of Munich, Campus Heilbronn, Heilbronn, Germany.

Publication information

J Med Internet Res. 2025 Jun 19;27:e70315. doi: 10.2196/70315.

DOI: 10.2196/70315
PMID: 40536801

Abstract

BACKGROUND

Large language models (LLMs) can support health care professionals in their daily work, for example, when writing and filing reports or communicating diagnoses. With the rise of LLMs, current research investigates how LLMs could be applied in medical practice and their benefits for physicians in clinical workflows. However, most studies neglect the importance of selecting suitable LLM architectures.

OBJECTIVE

In this literature review, we aim to provide insights on the different LLM model architecture families (ie, Bidirectional Encoder Representations from Transformers [BERT]-based or generative pretrained transformer [GPT]-based models) used in previous research. We report on the suitability and benefits of different LLM model architecture families for various research foci.

METHODS

To this end, we conduct a scoping review to identify which LLMs are used in health care. Our search included manuscripts from PubMed, arXiv, and medRxiv. We used open and selective coding to assess the 114 identified manuscripts regarding 11 dimensions related to usage and technical facets and the research focus of the manuscripts.

RESULTS

We identified 4 research foci that emerged previously in manuscripts, with LLM performance being the main focus. We found that GPT-based models are used for communicative purposes such as examination preparation or patient interaction. In contrast, BERT-based models are used for medical tasks such as knowledge discovery and model improvements.

CONCLUSIONS

Our study suggests that GPT-based models are better suited for communicative purposes such as report generation or patient interaction. BERT-based models seem to be better suited for innovative applications such as classification or knowledge discovery. This could be due to the architectural differences where GPT processes language unidirectionally and BERT bidirectionally, allowing more in-depth understanding of the text. In addition, BERT-based models seem to allow more straightforward extensions of their models for domain-specific tasks that generally lead to better results. In summary, health care professionals should consider the benefits and differences of the LLM architecture families when selecting a suitable model for their intended purpose.
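
The unidirectional versus bidirectional contrast mentioned in the conclusions can be illustrated with attention masks. The following is a minimal sketch and is not taken from the reviewed study: the function name `attention_mask` and the example tokens are hypothetical, and it assumes the common setup in which GPT-style decoders use a causal mask (each position sees only itself and earlier positions) while BERT-style encoders let every position attend to the whole sequence.

```python
def attention_mask(seq_len, causal):
    """Build a seq_len x seq_len mask (hypothetical helper).

    mask[i][j] == 1 means position i may attend to position j.
    causal=True  -> GPT-style: only positions j <= i are visible.
    causal=False -> BERT-style: every position is visible.
    """
    return [
        [1 if (not causal or j <= i) else 0 for j in range(seq_len)]
        for i in range(seq_len)
    ]


# Illustrative tokens only; real models operate on subword IDs.
tokens = ["patient", "reports", "chest", "pain"]

gpt_style = attention_mask(len(tokens), causal=True)
bert_style = attention_mask(len(tokens), causal=False)

for name, mask in (("GPT-style (causal)", gpt_style),
                   ("BERT-style (bidirectional)", bert_style)):
    print(name)
    for tok, row in zip(tokens, mask):
        print(f"  {tok:>8}: {row}")
```

In the causal mask, the row for "patient" can see only itself, whereas in the bidirectional mask it can also see "chest" and "pain". This is one plausible reading of why the authors associate bidirectional models with a more in-depth understanding of a given text.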
