• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于大语言模型的管道,用于从临床记录中的患者主诉和病史中提取信息以进行严重程度评估。

A large language model based pipeline for extracting information from patient complaint and anamnesis in clinical notes for severity assessment.

作者信息

Gao Hui, Wang Kaipeng, Yuan Yuan, Wang Yueguo, Liu Qingyuan, Wang Yulan, Sun Jian, Wang Wenwen, Wang Huanli, Zhou Shusheng, Jin Kui, Zhang Mengping, Lai Yinglei

机构信息

School of Mathematical Sciences, University of Science and Technology of China, Hefei, 230026, Anhui, China.

School of Mathematics and Statistics, Nanjing University of Science and Technology, Nanjing, 210094, Jiangsu, China.

出版信息

Sci Rep. 2025 Jul 14;15(1):25345. doi: 10.1038/s41598-025-07649-4.

DOI:10.1038/s41598-025-07649-4
PMID:40659653
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12260085/
Abstract

Identifying patients with critical illness in emergency departments (EDs) is an ongoing challenge, partly due to the limited information available at the time of admission. The clinical notes in patient records have already received attention for the value of improving prediction. Recent large language models (LLMs) have demonstrated their promising performance. However, the utilization of LLMs for analyzing clinical notes has not been extensively investigated. To improve the severity assessment of illness and the prediction of triage level, we developed a pipeline for utilizing LLMs (e.g. ChatGLM-2, GLM-4 and Alpaca-2) to extract information from patient complaint and anamnesis in clinical notes. In this pipeline, a LLM is supplied with the text input including complaint and anamnesis of a patient, where the input is further constructed by a prompt template, in-context learning (ICL), and retrieval-augmented generation (RAG). Then a severity score is extracted from the LLM, which is further integrated into a predictive model for improving its performance. We demonstrated the effectiveness of our pipeline based on the patient records derived from Chinese Emergency Triage, Assessment, and Treatment (CETAT) database. The extracted score were be incorporated into logistic regression as a predictor. At early stage, as vital signs were typically not yet measured, the predictive value of patient complaint and anamnesis was illustrated (evidenced by an improvement in AUC-ROC from 0.746 to 0.802). At later stage, vital signs became available, the enhancements in prediction attributable to the score were weaker, but still was observed with statistical significance in most cases. The recent LLMs are capable of extracting valuable information from clinical notes for identifying critical illness. The effectiveness has been illustrated in our study. It is still necessary to develop more efficient methods based on LLMs in order to achieve better performance.

摘要

在急诊科识别重症患者一直是一项挑战,部分原因是入院时可用信息有限。患者记录中的临床笔记因其在改善预测方面的价值而受到关注。最近的大语言模型(LLMs)已展现出其令人期待的性能。然而,利用大语言模型分析临床笔记的研究尚未广泛开展。为了改进疾病严重程度评估和分诊级别预测,我们开发了一个利用大语言模型(如ChatGLM - 2、GLM - 4和Alpaca - 2)从临床笔记中的患者主诉和病史中提取信息的流程。在这个流程中,大语言模型被提供包含患者主诉和病史的文本输入,其中输入通过提示模板、上下文学习(ICL)和检索增强生成(RAG)进一步构建。然后从大语言模型中提取严重程度评分,该评分进一步整合到预测模型中以提高其性能。我们基于源自中国急诊分诊、评估和治疗(CETAT)数据库的患者记录证明了我们流程的有效性。提取的评分被纳入逻辑回归作为预测因子。在早期,由于通常尚未测量生命体征,展示了患者主诉和病史的预测价值(AUC - ROC从0.746提高到0.802证明)。在后期,生命体征可用时,评分对预测的增强作用较弱,但在大多数情况下仍具有统计学意义。最近的大语言模型能够从临床笔记中提取有价值的信息以识别重症。我们的研究已证明了其有效性。为了实现更好的性能,仍有必要开发基于大语言模型的更高效方法。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/0e8786846eba/41598_2025_7649_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/747125fc69fc/41598_2025_7649_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/ec13154742fe/41598_2025_7649_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/70e1f5f4140d/41598_2025_7649_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/c3b732103da4/41598_2025_7649_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/80f276836402/41598_2025_7649_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/0e8786846eba/41598_2025_7649_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/747125fc69fc/41598_2025_7649_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/ec13154742fe/41598_2025_7649_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/70e1f5f4140d/41598_2025_7649_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/c3b732103da4/41598_2025_7649_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/80f276836402/41598_2025_7649_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/addc/12260085/0e8786846eba/41598_2025_7649_Fig6_HTML.jpg

相似文献

1
A large language model based pipeline for extracting information from patient complaint and anamnesis in clinical notes for severity assessment.一种基于大语言模型的管道,用于从临床记录中的患者主诉和病史中提取信息以进行严重程度评估。
Sci Rep. 2025 Jul 14;15(1):25345. doi: 10.1038/s41598-025-07649-4.
2
Signs and symptoms to determine if a patient presenting in primary care or hospital outpatient settings has COVID-19.在基层医疗机构或医院门诊环境中,如果患者出现以下症状和体征,可判断其是否患有 COVID-19。
Cochrane Database Syst Rev. 2022 May 20;5(5):CD013665. doi: 10.1002/14651858.CD013665.pub3.
3
[Volume and health outcomes: evidence from systematic reviews and from evaluation of Italian hospital data].[容量与健康结果:来自系统评价和意大利医院数据评估的证据]
Epidemiol Prev. 2013 Mar-Jun;37(2-3 Suppl 2):1-100.
4
Comparison of Two Modern Survival Prediction Tools, SORG-MLA and METSSS, in Patients With Symptomatic Long-bone Metastases Who Underwent Local Treatment With Surgery Followed by Radiotherapy and With Radiotherapy Alone.两种现代生存预测工具 SORG-MLA 和 METSSS 在接受手术联合放疗和单纯放疗治疗有症状长骨转移患者中的比较。
Clin Orthop Relat Res. 2024 Dec 1;482(12):2193-2208. doi: 10.1097/CORR.0000000000003185. Epub 2024 Jul 23.
5
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.系统性药理学治疗慢性斑块状银屑病:网络荟萃分析。
Cochrane Database Syst Rev. 2021 Apr 19;4(4):CD011535. doi: 10.1002/14651858.CD011535.pub4.
6
Enhancing Pulmonary Disease Prediction Using Large Language Models With Feature Summarization and Hybrid Retrieval-Augmented Generation: Multicenter Methodological Study Based on Radiology Report.使用具有特征总结和混合检索增强生成功能的大语言模型增强肺部疾病预测:基于放射学报告的多中心方法学研究
J Med Internet Res. 2025 Jun 11;27:e72638. doi: 10.2196/72638.
7
A dataset and benchmark for hospital course summarization with adapted large language models.一个用于医院病程总结的数据集和基准测试,采用了适配的大语言模型。
J Am Med Inform Assoc. 2025 Mar 1;32(3):470-479. doi: 10.1093/jamia/ocae312.
8
Systemic pharmacological treatments for chronic plaque psoriasis: a network meta-analysis.慢性斑块状银屑病的全身药理学治疗:一项网状Meta分析。
Cochrane Database Syst Rev. 2020 Jan 9;1(1):CD011535. doi: 10.1002/14651858.CD011535.pub3.
9
Toward Cross-Hospital Deployment of Natural Language Processing Systems: Model Development and Validation of Fine-Tuned Large Language Models for Disease Name Recognition in Japanese.迈向自然语言处理系统的跨医院部署:用于日语疾病名称识别的微调大语言模型的模型开发与验证
JMIR Med Inform. 2025 Jul 8;13:e76773. doi: 10.2196/76773.
10
Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.利用大语言模型检测医院获得性疾病:关于肺栓塞的实证研究
J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.

本文引用的文献

1
Early identification of high-risk patients admitted to emergency departments using vital signs and machine learning.利用生命体征和机器学习对急诊科收治的高危患者进行早期识别。
World J Emerg Med. 2025;16(2):113-120. doi: 10.5847/wjem.j.1920-8642.2025.031.
2
The TRIPOD-LLM reporting guideline for studies using large language models.使用大语言模型的研究的TRIPOD-LLM报告指南。
Nat Med. 2025 Jan;31(1):60-69. doi: 10.1038/s41591-024-03425-5. Epub 2025 Jan 8.
3
Toward expert-level medical question answering with large language models.
迈向使用大语言模型实现专家级医学问答
Nat Med. 2025 Mar;31(3):943-950. doi: 10.1038/s41591-024-03423-7. Epub 2025 Jan 8.
4
Using Large Language Models to Extract Core Injury Information From Emergency Department Notes.使用大语言模型从急诊科病历中提取核心损伤信息。
J Korean Med Sci. 2024 Dec 2;39(46):e291. doi: 10.3346/jkms.2024.39.e291.
5
Matching patients to clinical trials with large language models.利用大型语言模型为患者匹配临床试验。
Nat Commun. 2024 Nov 18;15(1):9074. doi: 10.1038/s41467-024-53081-z.
6
Development of fully automated models for staging liver fibrosis using non-contrast MRI and artificial intelligence: a retrospective multicenter study.使用非增强磁共振成像和人工智能开发用于肝纤维化分期的全自动模型:一项回顾性多中心研究
EClinicalMedicine. 2024 Oct 17;77:102881. doi: 10.1016/j.eclinm.2024.102881. eCollection 2024 Nov.
7
Prevalence, predictors, and patterns of patient reported non-motor outcomes six months after stroke: a prospective cohort study.中风后六个月患者报告的非运动性结局的患病率、预测因素及模式:一项前瞻性队列研究。
Lancet Reg Health Eur. 2024 Oct 19;47:101080. doi: 10.1016/j.lanepe.2024.101080. eCollection 2024 Dec.
8
Glucagon-like peptide-1 receptor agonists before upper gastrointestinal endoscopy and risk of pulmonary aspiration or discontinuation of procedure: cohort study.上消化道内镜检查前应用胰高血糖素样肽-1 受体激动剂与发生吸入性肺炎或中止操作的风险:队列研究。
BMJ. 2024 Oct 22;387:e080340. doi: 10.1136/bmj-2024-080340.
9
Impact of CCR5Δ32 on the risk of infection, Staphylococcus aureus carriage, and plasma concentrations of chemokines in Danish blood donors.丹麦献血者中 CCR5Δ32 对感染风险、金黄色葡萄球菌携带率和趋化因子血浆浓度的影响。
EBioMedicine. 2024 Nov;109:105406. doi: 10.1016/j.ebiom.2024.105406. Epub 2024 Oct 21.
10
Climate emotions, thoughts, and plans among US adolescents and young adults: a cross-sectional descriptive survey and analysis by political party identification and self-reported exposure to severe weather events.美国青少年和年轻人的气候情绪、想法和计划:一项横断面描述性调查及按政党身份认同和自我报告的严重天气事件暴露情况进行的分析。
Lancet Planet Health. 2024 Nov;8(11):e879-e893. doi: 10.1016/S2542-5196(24)00229-8. Epub 2024 Oct 17.