• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。

Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.

作者信息

Si Yuqi, Du Jingcheng, Li Zhao, Jiang Xiaoqian, Miller Timothy, Wang Fei, Jim Zheng W, Roberts Kirk

机构信息

School of Biomedical Informatics, The University of Texas Health Science Center at Houston, TX, USA.

Computational Health Informatics Program (CHIP), Boston Children's Hospital and Harvard Medical School, MA, USA.

出版信息

J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.

DOI:10.1016/j.jbi.2020.103671
PMID:33387683
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11290708/
Abstract

OBJECTIVES

Patient representation learning refers to learning a dense mathematical representation of a patient that encodes meaningful information from Electronic Health Records (EHRs). This is generally performed using advanced deep learning methods. This study presents a systematic review of this field and provides both qualitative and quantitative analyses from a methodological perspective.

METHODS

We identified studies developing patient representations from EHRs with deep learning methods from MEDLINE, EMBASE, Scopus, the Association for Computing Machinery (ACM) Digital Library, and the Institute of Electrical and Electronics Engineers (IEEE) Xplore Digital Library. After screening 363 articles, 49 papers were included for a comprehensive data collection.

RESULTS

Publications developing patient representations almost doubled each year from 2015 until 2019. We noticed a typical workflow starting with feeding raw data, applying deep learning models, and ending with clinical outcome predictions as evaluations of the learned representations. Specifically, learning representations from structured EHR data was dominant (37 out of 49 studies). Recurrent Neural Networks were widely applied as the deep learning architecture (Long short-term memory: 13 studies, Gated recurrent unit: 11 studies). Learning was mainly performed in a supervised manner (30 studies) optimized with cross-entropy loss. Disease prediction was the most common application and evaluation (31 studies). Benchmark datasets were mostly unavailable (28 studies) due to privacy concerns of EHR data, and code availability was assured in 20 studies.

DISCUSSION & CONCLUSION: The existing predictive models mainly focus on the prediction of single diseases, rather than considering the complex mechanisms of patients from a holistic review. We show the importance and feasibility of learning comprehensive representations of patient EHR data through a systematic review. Advances in patient representation learning techniques will be essential for powering patient-level EHR analyses. Future work will still be devoted to leveraging the richness and potential of available EHR data. Reproducibility and transparency of reported results will hopefully improve. Knowledge distillation and advanced learning techniques will be exploited to assist the capability of learning patient representation further.

摘要

目的

患者表征学习是指学习患者的密集数学表征,该表征对来自电子健康记录(EHR)的有意义信息进行编码。这通常使用先进的深度学习方法来执行。本研究对该领域进行了系统综述,并从方法论角度提供了定性和定量分析。

方法

我们从MEDLINE、EMBASE、Scopus、美国计算机协会(ACM)数字图书馆和电气与电子工程师协会(IEEE)Xplore数字图书馆中,识别出使用深度学习方法从电子健康记录中开发患者表征的研究。在筛选了363篇文章后,纳入了49篇论文进行全面的数据收集。

结果

从2015年到2019年,开发患者表征的出版物数量几乎每年都翻一番。我们注意到一个典型的工作流程,从输入原始数据开始,应用深度学习模型,最后以临床结果预测作为对所学表征的评估。具体而言,从结构化电子健康记录数据中学习表征占主导地位(49项研究中的37项)。循环神经网络被广泛用作深度学习架构(长短期记忆:13项研究,门控循环单元:11项研究)。学习主要以监督方式进行(30项研究),并通过交叉熵损失进行优化。疾病预测是最常见的应用和评估(31项研究)。由于电子健康记录数据的隐私问题,基准数据集大多不可用(28项研究),20项研究确保了代码可用性。

讨论与结论

现有的预测模型主要侧重于单一疾病的预测,而不是从整体角度考虑患者的复杂机制。我们通过系统综述展示了学习患者电子健康记录数据综合表征的重要性和可行性。患者表征学习技术的进步对于推动患者层面的电子健康记录分析至关重要。未来的工作仍将致力于利用现有电子健康记录数据的丰富性和潜力。有望提高报告结果的可重复性和透明度。知识蒸馏和先进的学习技术将被用于进一步辅助学习患者表征的能力。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/3481f53ee08c/nihms-2008566-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/39c462a54f35/nihms-2008566-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/27cf8af15eff/nihms-2008566-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/3f9c85a27ffb/nihms-2008566-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/3481f53ee08c/nihms-2008566-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/39c462a54f35/nihms-2008566-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/27cf8af15eff/nihms-2008566-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/3f9c85a27ffb/nihms-2008566-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/9048/11290708/3481f53ee08c/nihms-2008566-f0004.jpg

相似文献

1
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。
J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.
2
Deep learning prediction models based on EHR trajectories: A systematic review.基于电子健康记录轨迹的深度学习预测模型:系统评价。
J Biomed Inform. 2023 Aug;144:104430. doi: 10.1016/j.jbi.2023.104430. Epub 2023 Jun 26.
3
Deep learning for temporal data representation in electronic health records: A systematic review of challenges and methodologies.深度学习在电子健康记录中的时间数据表示:挑战和方法的系统评价。
J Biomed Inform. 2022 Feb;126:103980. doi: 10.1016/j.jbi.2021.103980. Epub 2021 Dec 30.
4
The Use of Deep Learning and Machine Learning on Longitudinal Electronic Health Records for the Early Detection and Prevention of Diseases: Scoping Review.深度学习和机器学习在纵向电子健康记录中用于疾病的早期检测和预防的应用:范围综述。
J Med Internet Res. 2024 Aug 20;26:e48320. doi: 10.2196/48320.
5
Opportunities and challenges in developing deep learning models using electronic health records data: a systematic review.利用电子健康记录数据开发深度学习模型的机遇与挑战:系统综述。
J Am Med Inform Assoc. 2018 Oct 1;25(10):1419-1428. doi: 10.1093/jamia/ocy068.
6
Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review.通过深度学习利用结构化电子健康记录数据中的顺序诊断代码增强患者预后预测:系统评价
J Med Internet Res. 2025 Mar 18;27:e57358. doi: 10.2196/57358.
7
Graph neural networks for clinical risk prediction based on electronic health records: A survey.基于电子健康记录的临床风险预测的图神经网络:调查。
J Biomed Inform. 2024 Mar;151:104616. doi: 10.1016/j.jbi.2024.104616. Epub 2024 Feb 27.
8
Question Answering for Electronic Health Records: Scoping Review of Datasets and Models.电子健康记录问答:数据集和模型的范围综述。
J Med Internet Res. 2024 Oct 30;26:e53636. doi: 10.2196/53636.
9
Fraud detection in healthcare claims using machine learning: A systematic review.使用机器学习进行医疗保健理赔欺诈检测:一项系统综述。
Artif Intell Med. 2025 Feb;160:103061. doi: 10.1016/j.artmed.2024.103061. Epub 2024 Dec 28.
10
Applying interpretable deep learning models to identify chronic cough patients using EHR data.应用可解释的深度学习模型,利用电子病历数据识别慢性咳嗽患者。
Comput Methods Programs Biomed. 2021 Oct;210:106395. doi: 10.1016/j.cmpb.2021.106395. Epub 2021 Sep 4.

引用本文的文献

1
Multi-Modal Fusion of Routine Care Electronic Health Records (EHR): A Scoping Review.常规护理电子健康记录(EHR)的多模态融合:一项范围综述
Information (Basel). 2025 Jan;16(1). doi: 10.3390/info16010054. Epub 2025 Jan 15.
2
Leveraging BERT for embedding ICD codes from large scale cardiovascular EMR data to understand patient diagnostic patterns.利用BERT从大规模心血管电子病历数据中嵌入ICD编码,以了解患者的诊断模式。
BMC Med Inform Decis Mak. 2025 Aug 11;25(1):300. doi: 10.1186/s12911-025-03145-x.
3
Using social media listening to identify the real-world challenges faced by dog owners globally when administering oral medications.

本文引用的文献

1
Distributed Weight Consolidation: A Brain Segmentation Case Study.分布式权重巩固:一个脑部分割案例研究。
Adv Neural Inf Process Syst. 2018 Dec;31:4093-4103.
2
Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.医学BERT:基于大规模结构化电子健康记录进行疾病预测的预训练上下文嵌入模型
NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.
3
Two-stage Federated Phenotyping and Patient Representation Learning.两阶段联合表型分析与患者表征学习
利用社交媒体监听来识别全球养狗人士在投喂口服药物时面临的现实挑战。
Front Vet Sci. 2025 Jun 30;12:1502236. doi: 10.3389/fvets.2025.1502236. eCollection 2025.
4
GenAI exceeds clinical experts in predicting acute kidney injury following paediatric cardiopulmonary bypass.在预测小儿体外循环术后急性肾损伤方面,生成式人工智能优于临床专家。
Sci Rep. 2025 Jul 1;15(1):20847. doi: 10.1038/s41598-025-04651-8.
5
A scoping review of self-supervised representation learning for clinical decision making using EHR categorical data.一项使用电子健康记录分类数据进行临床决策的自监督表征学习的范围综述。
NPJ Digit Med. 2025 Jun 14;8(1):362. doi: 10.1038/s41746-025-01692-1.
6
Trajectory-Ordered Objectives for Self-Supervised Representation Learning of Temporal Healthcare Data Using Transformers: Model Development and Evaluation Study.使用Transformer进行时间序列医疗数据自监督表示学习的轨迹有序目标:模型开发与评估研究
JMIR Med Inform. 2025 Jun 4;13:e68138. doi: 10.2196/68138.
7
BadCLM: Backdoor Attack in Clinical Language Models for Electronic Health Records.BadCLM:电子健康记录临床语言模型中的后门攻击
AMIA Annu Symp Proc. 2025 May 22;2024:768-777. eCollection 2024.
8
Enhancing Patient Outcome Prediction Through Deep Learning With Sequential Diagnosis Codes From Structured Electronic Health Record Data: Systematic Review.通过深度学习利用结构化电子健康记录数据中的顺序诊断代码增强患者预后预测:系统评价
J Med Internet Res. 2025 Mar 18;27:e57358. doi: 10.2196/57358.
9
A bibliometric analysis of the advance of artificial intelligence in medicine.医学领域人工智能进展的文献计量分析
Front Med (Lausanne). 2025 Feb 21;12:1504428. doi: 10.3389/fmed.2025.1504428. eCollection 2025.
10
Towards Trustworthy AI in Healthcare: Epistemic Uncertainty Estimation for Clinical Decision Support.迈向医疗保健领域值得信赖的人工智能:临床决策支持中的认知不确定性估计
J Pers Med. 2025 Jan 31;15(2):58. doi: 10.3390/jpm15020058.
Proc Conf Assoc Comput Linguist Meet. 2019 Aug;2019:283-291. doi: 10.18653/v1/W19-5030.
4
MetaPred: Meta-Learning for Clinical Risk Prediction with Limited Patient Electronic Health Records.MetaPred:利用有限患者电子健康记录进行临床风险预测的元学习
KDD. 2019 Aug;2019:2487-2495. doi: 10.1145/3292500.3330779.
5
GRAM: Graph-based Attention Model for Healthcare Representation Learning.GRAM:用于医疗保健表示学习的基于图的注意力模型。
KDD. 2017 Aug;2017:787-795. doi: 10.1145/3097983.3098126.
6
Generalized and transferable patient language representation for phenotyping with limited data.用于有限数据表型分析的通用且可转移的患者语言表示
J Biomed Inform. 2021 Apr;116:103726. doi: 10.1016/j.jbi.2021.103726. Epub 2021 Mar 9.
7
Language models are an effective representation learning technique for electronic health record data.语言模型是一种用于电子健康记录数据的有效表示学习技术。
J Biomed Inform. 2021 Jan;113:103637. doi: 10.1016/j.jbi.2020.103637. Epub 2020 Dec 5.
8
The future of digital health with federated learning.联合学习助力数字健康的未来。
NPJ Digit Med. 2020 Sep 14;3:119. doi: 10.1038/s41746-020-00323-1. eCollection 2020.
9
Rethinking domain adaptation for machine learning over clinical language.重新思考针对临床语言的机器学习领域适应问题。
JAMIA Open. 2020 Apr 13;3(2):146-150. doi: 10.1093/jamiaopen/ooaa010. eCollection 2020 Jul.
10
Patient Representation Transfer Learning from Clinical Notes based on Hierarchical Attention Network.基于分层注意力网络的临床笔记患者表示迁移学习
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:597-606. eCollection 2020.