• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

由大语言模型分析的吞咽困难患者的非结构化电子健康记录

Unstructured Electronic Health Records of Dysphagic Patients Analyzed by Large Language Models.

作者信息

Neubig Luisa, Larsen Deirdre, Kunduk Melda, Kist Andreas M

机构信息

Department of Artificial Intelligence in Biomedical EngineeringFriedrich-Alexander-Universität Erlangen-Nürnberg Erlangen 91054 Germany.

Department of Communication Sciences and DisordersEast Carolina University Greenville NC 27858 USA.

出版信息

IEEE J Transl Eng Health Med. 2025 May 19;13:237-245. doi: 10.1109/JTEHM.2025.3571255. eCollection 2025.

DOI:10.1109/JTEHM.2025.3571255
PMID:40740828
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12310174/
Abstract

OBJECTIVE

Dysphagia is a common and complex disorder that complicates both diagnoses and treatment. Consequently, the associated electronic health records (EHR) are often unstructured and complex, posing challenges for systematic data analysis.

METHODS AND PROCEDURES

In this study, we employ natural language processing (NLP) techniques and large language models (LLMs) to automatically analyze clinical narratives and extract diagnostic information from a diverse set of EHRs. Our dataset includes medical records from 486 patients, representing a group with diverse dysphagic conditions. We analyze diagnoses provided in unstructured free text that do not follow a standardized structure. We utilize clustering algorithms on the extracted diagnostic features to identify distinct groups of patients who share similar pathophysiological swallowing dysfunctions.

RESULTS

We found that basic NLP techniques often provide limited insights due to the high variability of the data. In contrast, LLMs help to bridge the gap in understanding the nuanced medical information about dysphagia and related conditions. Although applying these advanced LLM models is not straightforward, our results demonstrate that leveraging closed-source models can effectively cluster different categories of dysphagia.

CONCLUSION

Our study provides therefore evidence that LLMs are highly promising in future dysphagia research.

CLINICAL IMPACT

Dysphagia is a symptom associated with various diseases, though its underlying relationships remain unclear. This study demonstrates how analyzing large volumes of electronic health records can help clarify the causes of dysphagia and identify contributing factors. By applying natural language processing, we aim to enhance both understanding and treatment, supporting clinical staff in improving individualized care by identifying relevant patient cohorts. Clinical and Translational Impact Statement: This study uses LLMs to efficiently preprocess unstructured EHRs, improving dysphagia diagnosis and patient clustering. It aligns with Clinical Research, enhancing diagnostic speed and enabling personalized treatment.

摘要

目的

吞咽困难是一种常见且复杂的疾病,会使诊断和治疗都变得复杂。因此,相关的电子健康记录(EHR)往往是非结构化且复杂的,给系统的数据分析带来了挑战。

方法和步骤

在本研究中,我们采用自然语言处理(NLP)技术和大语言模型(LLM)来自动分析临床叙述,并从各种不同的电子健康记录中提取诊断信息。我们的数据集包括486名患者的病历,代表了一组具有不同吞咽困难情况的患者。我们分析非结构化自由文本中提供的诊断信息,这些文本没有遵循标准化结构。我们对提取的诊断特征使用聚类算法,以识别出具有相似病理生理吞咽功能障碍的不同患者群体。

结果

我们发现,由于数据的高度变异性,基本的自然语言处理技术往往只能提供有限的见解。相比之下,大语言模型有助于弥合在理解有关吞咽困难及相关病症的细微医学信息方面的差距。尽管应用这些先进的大语言模型并非易事,但我们的结果表明,利用闭源模型可以有效地对不同类型的吞咽困难进行聚类。

结论

因此,我们的研究证明大语言模型在未来吞咽困难研究中极具前景。

临床影响

吞咽困难是一种与多种疾病相关的症状,但其潜在关系仍不明确。本研究展示了分析大量电子健康记录如何有助于阐明吞咽困难的原因并识别相关因素。通过应用自然语言处理,我们旨在增进理解和治疗效果,通过识别相关患者群体来支持临床工作人员改善个性化护理。临床与转化影响声明:本研究使用大语言模型来高效预处理非结构化电子健康记录,改善吞咽困难诊断和患者聚类。它与临床研究相契合,提高诊断速度并实现个性化治疗。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/fb70e46ac2f6/neubi11-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/c7cdaae6f3b1/neubi1-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/18d7f3fdbd6c/neubi2-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/1375b63008ab/neubi3-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/c28f58436a4e/neubi4abc-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/819e2b8ccc4e/neubi5abcdefghi-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/58d33a01dd14/neubi6-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/6e7434a852dd/neubi7-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/a1f5dd5b484b/neubi8-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/a0fcb6858547/neubi9abcdef-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/6738b4f63dc7/neubi10-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/fb70e46ac2f6/neubi11-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/c7cdaae6f3b1/neubi1-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/18d7f3fdbd6c/neubi2-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/1375b63008ab/neubi3-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/c28f58436a4e/neubi4abc-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/819e2b8ccc4e/neubi5abcdefghi-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/58d33a01dd14/neubi6-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/6e7434a852dd/neubi7-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/a1f5dd5b484b/neubi8-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/a0fcb6858547/neubi9abcdef-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/6738b4f63dc7/neubi10-3571255.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad09/12310174/fb70e46ac2f6/neubi11-3571255.jpg

相似文献

1
Unstructured Electronic Health Records of Dysphagic Patients Analyzed by Large Language Models.由大语言模型分析的吞咽困难患者的非结构化电子健康记录
IEEE J Transl Eng Health Med. 2025 May 19;13:237-245. doi: 10.1109/JTEHM.2025.3571255. eCollection 2025.
2
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
3
Improving Large Language Models' Summarization Accuracy by Adding Highlights to Discharge Notes: Comparative Evaluation.通过在出院小结中添加重点内容提高大语言模型的总结准确性:比较评估
JMIR Med Inform. 2025 Jul 24;13:e66476. doi: 10.2196/66476.
4
Utilizing large language models for detecting hospital-acquired conditions: an empirical study on pulmonary embolism.利用大语言模型检测医院获得性疾病:关于肺栓塞的实证研究
J Am Med Inform Assoc. 2025 May 1;32(5):876-884. doi: 10.1093/jamia/ocaf048.
5
Multicriteria Optimization of Language Models for Heart Failure With Preserved Ejection Fraction Symptom Detection in Spanish Electronic Health Records: Comparative Modeling Study.西班牙电子健康记录中射血分数保留的心力衰竭症状检测语言模型的多标准优化:比较建模研究
J Med Internet Res. 2025 Jul 17;27:e76433. doi: 10.2196/76433.
6
Empowering large language models for automated clinical assessment with generation-augmented retrieval and hierarchical chain-of-thought.通过生成增强检索和分层思维链赋能大型语言模型进行自动化临床评估。
Artif Intell Med. 2025 Apr;162:103078. doi: 10.1016/j.artmed.2025.103078. Epub 2025 Feb 12.
7
Toward Cross-Hospital Deployment of Natural Language Processing Systems: Model Development and Validation of Fine-Tuned Large Language Models for Disease Name Recognition in Japanese.迈向自然语言处理系统的跨医院部署:用于日语疾病名称识别的微调大语言模型的模型开发与验证
JMIR Med Inform. 2025 Jul 8;13:e76773. doi: 10.2196/76773.
8
Early diagnosis of HIV cases by means of text mining and machine learning models on clinical notes.利用文本挖掘和机器学习模型对临床记录进行 HIV 病例的早期诊断。
Comput Biol Med. 2024 Sep;179:108830. doi: 10.1016/j.compbiomed.2024.108830. Epub 2024 Jul 10.
9
Enhancing Clinical Relevance of Pretrained Language Models Through Integration of External Knowledge: Case Study on Cardiovascular Diagnosis From Electronic Health Records.通过整合外部知识提高预训练语言模型的临床相关性:来自电子健康记录的心血管诊断案例研究
JMIR AI. 2024 Aug 6;3:e56932. doi: 10.2196/56932.
10
Implementing Large Language Models in Health Care: Clinician-Focused Review With Interactive Guideline.在医疗保健中应用大语言模型:以临床医生为重点的回顾与交互式指南
J Med Internet Res. 2025 Jul 11;27:e71916. doi: 10.2196/71916.

本文引用的文献

1
LLMs-based Few-Shot Disease Predictions using EHR: A Novel Approach Combining Predictive Agent Reasoning and Critical Agent Instruction.基于大语言模型的利用电子健康记录进行少样本疾病预测:一种结合预测性智能体推理与批判性智能体指令的新方法。
AMIA Annu Symp Proc. 2025 May 22;2024:319-328. eCollection 2024.
2
EHRAgent: Code Empowers Large Language Models for Few-shot Complex Tabular Reasoning on Electronic Health Records.EHRAgent:代码助力大语言模型在电子健康记录上进行少样本复杂表格推理。
Proc Conf Empir Methods Nat Lang Process. 2024 Nov;2024:22315-22339. doi: 10.18653/v1/2024.emnlp-main.1245.
3
Retrieving Evidence from EHRs with LLMs: Possibilities and Challenges.
利用大语言模型从电子健康记录中检索证据:可能性与挑战。
Proc Mach Learn Res. 2024 Jun;248:489-505.
4
PMC-LLaMA: toward building open-source language models for medicine.PMC-LLaMA:为医学构建开源语言模型的努力。
J Am Med Inform Assoc. 2024 Sep 1;31(9):1833-1843. doi: 10.1093/jamia/ocae045.
5
Large language models in medicine.医学中的大型语言模型。
Nat Med. 2023 Aug;29(8):1930-1940. doi: 10.1038/s41591-023-02448-8. Epub 2023 Jul 17.
6
From free-text electronic health records to structured cohorts: Onconum, an innovative methodology for real-world data mining in breast cancer.从自由文本电子健康记录到结构化队列:Onconum,一种用于乳腺癌真实世界数据挖掘的创新方法。
Comput Methods Programs Biomed. 2023 Oct;240:107693. doi: 10.1016/j.cmpb.2023.107693. Epub 2023 Jun 25.
7
Symptoms timeline and outcomes in amyotrophic lateral sclerosis using artificial intelligence.使用人工智能的肌萎缩侧索硬化症的症状时间轴和结果。
Sci Rep. 2023 Jan 13;13(1):702. doi: 10.1038/s41598-023-27863-2.
8
Evaluation of a Machine Learning-Based Dysphagia Prediction Tool in Clinical Routine: A Prospective Observational Cohort Study.基于机器学习的吞咽困难预测工具在临床常规中的评估:一项前瞻性观察队列研究。
Dysphagia. 2023 Aug;38(4):1238-1246. doi: 10.1007/s00455-022-10548-9. Epub 2023 Jan 10.
9
Natural language processing: state of the art, current trends and challenges.自然语言处理:技术现状、当前趋势与挑战。
Multimed Tools Appl. 2023;82(3):3713-3744. doi: 10.1007/s11042-022-13428-4. Epub 2022 Jul 14.
10
Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.医学BERT:基于大规模结构化电子健康记录进行疾病预测的预训练上下文嵌入模型
NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.