• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

中风护理中生成式大语言模型的当前现状与未来方向:范围综述

Current Landscape and Future Directions Regarding Generative Large Language Models in Stroke Care: Scoping Review.

作者信息

Zhu XingCe, Dai Wei, Evans Richard, Geng Xueyu, Mu Aruhan, Liu Zhiyong

机构信息

School of Medicine and Health Management, Tongji Medical College, Huazhong University of Science and Technology, Wuhan, China.

Faculty of Computer Science, Dalhousie University, Halifax, NS, Canada.

出版信息

JMIR Med Inform. 2025 Aug 7;13:e76636. doi: 10.2196/76636.

DOI:10.2196/76636
PMID:40773746
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12371286/
Abstract

BACKGROUND

Stroke has a major impact on global health, causing long-term disability and straining health care resources. Generative large language models (gLLMs) have emerged as promising tools to help address these challenges, but their applications and reported performance in stroke care require comprehensive mapping and synthesis.

OBJECTIVE

The aim of this scoping review was to consolidate a fragmented evidence base and examine the current landscape, shortcomings, and future directions in the design, reporting, and evaluation of gLLM-based interventions in stroke care.

METHODS

In this scoping review, which adhered to the PRISMA-ScR (Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews) guidelines and the Population, Concept, and Context (PCC) framework, we searched 6 major scientific databases in December 2024 for gLLM-based interventions across the stroke care pathway, mapping their key characteristics and outcomes.

RESULTS

A total of 25 studies met the predefined eligibility criteria and were included for analysis. Retrospective designs predominated (n=16, 64%). Key applications of gLLMs included clinical decision-making support (n=10, 40%), administrative assistance (n=9, 36%), direct patient interaction (n=5, 20%), and automated literature review (n=1, 4%). Implementations mainly used generative pretrained transformer models accessed through task-prompted chat interfaces. In total, 5 key challenges were identified from the included studies during the implementation of gLLM-based interventions: ensuring factual alignment, maintaining system robustness, enhancing interpretability, optimizing efficiency, and facilitating clinical adoption.

CONCLUSIONS

The application of gLLMs in stroke care, while promising, remains relatively new, with most interventions reflecting early-stage or relatively simple implementations. Against this backdrop, critical gaps in research and clinical translation persist. To support the development of clinically impactful and trustworthy applications, we propose an actionable framework that prioritizes real-world evidence, mandates transparent technical reporting, broadens evaluation beyond output accuracy, strengthens validation of advanced task adaptation strategies, and investigates mechanisms for safe and effective human-gLLM interaction.

摘要

背景

中风对全球健康有重大影响,会导致长期残疾并使医疗保健资源紧张。生成式大语言模型(gLLMs)已成为帮助应对这些挑战的有前景的工具,但其在中风护理中的应用及报告的性能需要全面梳理和综合分析。

目的

本范围综述的目的是整合零散的证据基础,审视基于gLLM的中风护理干预措施在设计、报告和评估方面的现状、不足及未来方向。

方法

在本遵循PRISMA-ScR(系统评价和Meta分析扩展版的首选报告项目)指南及人群、概念和背景(PCC)框架的范围综述中,我们于2024年12月在6个主要科学数据库中搜索了中风护理路径中基于gLLM的干预措施,梳理其关键特征和结果。

结果

共有25项研究符合预先设定的纳入标准并被纳入分析。回顾性设计占主导(n = 16,64%)。gLLMs的关键应用包括临床决策支持(n = 10,40%)、行政协助(n = 9,36%)、直接患者互动(n = 5,20%)和自动文献综述(n = 1,4%)。实施主要使用通过任务提示聊天界面访问现成的生成式预训练变换器模型。在基于gLLM的干预措施实施过程中,从纳入研究中总共识别出5个关键挑战:确保事实一致性、维持系统稳健性、增强可解释性、优化效率以及促进临床应用。

结论

gLLMs在中风护理中的应用虽前景广阔,但仍相对较新,大多数干预措施反映的是早期或相对简单的实施情况。在此背景下,研究和临床转化方面仍存在重大差距。为支持开发具有临床影响力和可信度的应用,我们提出一个可操作的框架,该框架优先考虑真实世界证据,要求进行透明的技术报告,拓宽评估范围使其超越输出准确性,加强对高级任务适应策略的验证,并研究安全有效的人机gLLM交互机制。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05dc/12371286/584d002fdf9a/medinform_v13i1e76636_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05dc/12371286/e5d3b5f4e8a7/medinform_v13i1e76636_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05dc/12371286/584d002fdf9a/medinform_v13i1e76636_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05dc/12371286/e5d3b5f4e8a7/medinform_v13i1e76636_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/05dc/12371286/584d002fdf9a/medinform_v13i1e76636_fig2.jpg

相似文献

1
Current Landscape and Future Directions Regarding Generative Large Language Models in Stroke Care: Scoping Review.中风护理中生成式大语言模型的当前现状与未来方向:范围综述
JMIR Med Inform. 2025 Aug 7;13:e76636. doi: 10.2196/76636.
2
Applications of Large Language Models in the Field of Suicide Prevention: Scoping Review.大语言模型在自杀预防领域的应用:范围综述
J Med Internet Res. 2025 Jan 23;27:e63126. doi: 10.2196/63126.
3
AI in Medical Questionnaires: Innovations, Diagnosis, and Implications.医学问卷中的人工智能:创新、诊断及影响
J Med Internet Res. 2025 Jun 23;27:e72398. doi: 10.2196/72398.
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Health Care Social Robots in the Age of Generative AI: Protocol for a Scoping Review.生成式人工智能时代的医疗保健社交机器人:一项范围综述的方案
JMIR Res Protoc. 2025 Apr 14;14:e63017. doi: 10.2196/63017.
6
Large Language Models in Medical Diagnostics: Scoping Review With Bibliometric Analysis.医学诊断中的大语言模型:基于文献计量分析的综述
J Med Internet Res. 2025 Jun 9;27:e72062. doi: 10.2196/72062.
7
Are Artificial Intelligence Models Reliable for Clinical Application in Pediatric Fracture Detection on Radiographs? A Systematic Review and Meta-analysis.人工智能模型在儿科骨折X线片检测中的临床应用是否可靠?一项系统评价和荟萃分析。
Clin Orthop Relat Res. 2025 Aug 20. doi: 10.1097/CORR.0000000000003660.
8
Use of Artificial Intelligence in Adolescents' Mental Health Care: Systematic Scoping Review of Current Applications and Future Directions.人工智能在青少年心理健康护理中的应用:当前应用及未来方向的系统综述
JMIR Ment Health. 2025 Jun 6;12:e70438. doi: 10.2196/70438.
9
Falls prevention interventions for community-dwelling older adults: systematic review and meta-analysis of benefits, harms, and patient values and preferences.社区居住的老年人跌倒预防干预措施:系统评价和荟萃分析的益处、危害以及患者的价值观和偏好。
Syst Rev. 2024 Nov 26;13(1):289. doi: 10.1186/s13643-024-02681-3.
10
Interventions to improve safe and effective medicines use by consumers: an overview of systematic reviews.改善消费者安全有效用药的干预措施:系统评价概述
Cochrane Database Syst Rev. 2014 Apr 29;2014(4):CD007768. doi: 10.1002/14651858.CD007768.pub3.

本文引用的文献

1
A vaccine chatbot intervention for parents to improve HPV vaccination uptake among middle school girls: a cluster randomized trial.一种用于帮助家长提高初中女生人乳头瘤病毒疫苗接种率的疫苗聊天机器人干预措施:一项整群随机试验。
Nat Med. 2025 Apr 7. doi: 10.1038/s41591-025-03618-6.
2
Large language model agents can use tools to perform clinical calculations.大型语言模型智能体可以使用工具来进行临床计算。
NPJ Digit Med. 2025 Mar 17;8(1):163. doi: 10.1038/s41746-025-01475-8.
3
Medical Misinformation in AI-Assisted Self-Diagnosis: Development of a Method (EvalPrompt) for Analyzing Large Language Models.
人工智能辅助自我诊断中的医学错误信息:一种用于分析大语言模型的方法(EvalPrompt)的开发
JMIR Form Res. 2025 Mar 10;9:e66207. doi: 10.2196/66207.
4
An evaluation framework for clinical use of large language models in patient interaction tasks.用于患者互动任务中大型语言模型临床应用的评估框架。
Nat Med. 2025 Jan;31(1):77-86. doi: 10.1038/s41591-024-03328-5. Epub 2025 Jan 2.
5
SurgeryLLM: a retrieval-augmented generation large language model framework for surgical decision support and workflow enhancement.外科手术语言模型:一种用于手术决策支持和工作流程优化的检索增强生成式大语言模型框架。
NPJ Digit Med. 2024 Dec 18;7(1):364. doi: 10.1038/s41746-024-01391-3.
6
Predicting hospitalization with LLMs from health insurance data.
Med Biol Eng Comput. 2025 Apr;63(4):1215-1226. doi: 10.1007/s11517-024-03251-4. Epub 2024 Dec 19.
7
Machine learning and deep learning algorithms in stroke medicine: a systematic review of hemorrhagic transformation prediction models.中风医学中的机器学习与深度学习算法:出血性转化预测模型的系统综述
J Neurol. 2024 Dec 12;272(1):37. doi: 10.1007/s00415-024-12810-6.
8
Toward Foundation Models in Radiology? Quantitative Assessment of GPT-4V's Multimodal and Multianatomic Region Capabilities.迈向放射学的基础模型?GPT-4V 的多模态和多原子区域能力的定量评估。
Radiology. 2024 Nov;313(2):e240955. doi: 10.1148/radiol.240955.
9
Precision Structuring of Free-Text Surgical Record for Enhanced Stroke Management: A Comparative Evaluation of Large Language Models.用于增强中风管理的自由文本手术记录的精准结构化:大语言模型的比较评估
J Multidiscip Healthc. 2024 Nov 14;17:5163-5175. doi: 10.2147/JMDH.S486449. eCollection 2024.
10
PhenoFlow: A Human-LLM Driven Visual Analytics System for Exploring Large and Complex Stroke Datasets.
IEEE Trans Vis Comput Graph. 2025 Jan;31(1):470-480. doi: 10.1109/TVCG.2024.3456215. Epub 2024 Nov 25.