• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于电子病历中学习到的嵌入表示来提高急性心肌梗死住院患者结局预测的性能:开发和验证研究。

Improving the Performance of Outcome Prediction for Inpatients With Acute Myocardial Infarction Based on Embedding Representation Learned From Electronic Medical Records: Development and Validation Study.

机构信息

School of Biomedical Engineering, Capital Medical University, Beijing, China.

Beijing Key Laboratory of Fundamental Research on Biomechanics in Clinical Application, Capital Medical University, Beijing, China.

出版信息

J Med Internet Res. 2022 Aug 3;24(8):e37486. doi: 10.2196/37486.

DOI:10.2196/37486
PMID:35921141
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9386580/
Abstract

BACKGROUND

The widespread secondary use of electronic medical records (EMRs) promotes health care quality improvement. Representation learning that can automatically extract hidden information from EMR data has gained increasing attention.

OBJECTIVE

We aimed to propose a patient representation with more feature associations and task-specific feature importance to improve the outcome prediction performance for inpatients with acute myocardial infarction (AMI).

METHODS

Medical concepts, including patients' age, gender, disease diagnoses, laboratory tests, structured radiological features, procedures, and medications, were first embedded into real-value vectors using the improved skip-gram algorithm, where concepts in the context windows were selected by feature association strengths measured by association rule confidence. Then, each patient was represented as the sum of the feature embeddings weighted by the task-specific feature importance, which was applied to facilitate predictive model prediction from global and local perspectives. We finally applied the proposed patient representation into mortality risk prediction for 3010 and 1671 AMI inpatients from a public data set and a private data set, respectively, and compared it with several reference representation methods in terms of the area under the receiver operating characteristic curve (AUROC), area under the precision-recall curve (AUPRC), and F1-score.

RESULTS

Compared with the reference methods, the proposed embedding-based representation showed consistently superior predictive performance on the 2 data sets, achieving mean AUROCs of 0.878 and 0.973, AUPRCs of 0.220 and 0.505, and F1-scores of 0.376 and 0.674 for the public and private data sets, respectively, while the greatest AUROCs, AUPRCs, and F1-scores among the reference methods were 0.847 and 0.939, 0.196 and 0.283, and 0.344 and 0.361 for the public and private data sets, respectively. Feature importance integrated in patient representation reflected features that were also critical in prediction tasks and clinical practice.

CONCLUSIONS

The introduction of feature associations and feature importance facilitated an effective patient representation and contributed to prediction performance improvement and model interpretation.

摘要

背景

电子病历(EMR)的广泛二次使用促进了医疗质量的改进。能够自动从 EMR 数据中提取隐藏信息的表示学习受到了越来越多的关注。

目的

我们旨在提出一种具有更多特征关联和特定于任务的特征重要性的患者表示,以提高急性心肌梗死(AMI)住院患者的预后预测性能。

方法

首先,使用改进的 skip-gram 算法将医疗概念(包括患者的年龄、性别、疾病诊断、实验室检查、结构化放射学特征、程序和药物)嵌入到实值向量中,其中上下文窗口中的概念是通过关联规则置信度测量的特征关联强度选择的。然后,将每个患者表示为任务特定特征重要性加权的特征嵌入的总和,该总和用于从全局和局部角度促进预测模型的预测。最后,我们将提出的患者表示应用于来自公共数据集和私有数据集的 3010 名和 1671 名 AMI 住院患者的死亡率风险预测中,并在 AUC(ROC)、AUPRC(精度-召回曲线)和 F1 分数方面与几个参考表示方法进行比较。

结果

与参考方法相比,基于嵌入的表示在两个数据集上均表现出一致的卓越预测性能,分别实现了 AUC 的平均值为 0.878 和 0.973,AUPRC 的平均值为 0.220 和 0.505,以及 F1 分数的平均值为 0.376 和 0.674,而参考方法中的 AUC 最大值、AUPRC 最大值和 F1 分数分别为 0.847 和 0.939、0.196 和 0.283 以及 0.344 和 0.361。患者表示中集成的特征重要性反映了在预测任务和临床实践中也很关键的特征。

结论

特征关联和特征重要性的引入促进了有效的患者表示,并有助于提高预测性能和模型解释。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/c1d3ec29305c/jmir_v24i8e37486_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/ae21df813fe0/jmir_v24i8e37486_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/b50212989ee2/jmir_v24i8e37486_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/3b39f718b3a3/jmir_v24i8e37486_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/7c3f5f97b5e2/jmir_v24i8e37486_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/c1d3ec29305c/jmir_v24i8e37486_fig5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/ae21df813fe0/jmir_v24i8e37486_fig1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/b50212989ee2/jmir_v24i8e37486_fig2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/3b39f718b3a3/jmir_v24i8e37486_fig3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/7c3f5f97b5e2/jmir_v24i8e37486_fig4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/59b3/9386580/c1d3ec29305c/jmir_v24i8e37486_fig5.jpg

相似文献

1
Improving the Performance of Outcome Prediction for Inpatients With Acute Myocardial Infarction Based on Embedding Representation Learned From Electronic Medical Records: Development and Validation Study.基于电子病历中学习到的嵌入表示来提高急性心肌梗死住院患者结局预测的性能:开发和验证研究。
J Med Internet Res. 2022 Aug 3;24(8):e37486. doi: 10.2196/37486.
2
Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study.基于嵌入技术的结构化电子病历患者表征:开发与验证研究
JMIR Med Inform. 2021 Jul 23;9(7):e19905. doi: 10.2196/19905.
3
Representation of time-varying and time-invariant EMR data and its application in modeling outcome prediction for heart failure patients.时变和时不变电子病历数据的表示及其在心力衰竭患者预后建模中的应用。
J Biomed Inform. 2023 Jul;143:104427. doi: 10.1016/j.jbi.2023.104427. Epub 2023 Jun 18.
4
Disease Concept-Embedding Based on the Self-Supervised Method for Medical Information Extraction from Electronic Health Records and Disease Retrieval: Algorithm Development and Validation Study.基于自监督方法的疾病概念嵌入在电子健康记录中的医学信息提取和疾病检索:算法开发和验证研究。
J Med Internet Res. 2021 Jan 27;23(1):e25113. doi: 10.2196/25113.
5
Electronic Medical Record-Based Machine Learning Approach to Predict the Risk of 30-Day Adverse Cardiac Events After Invasive Coronary Treatment: Machine Learning Model Development and Validation.基于电子病历的机器学习方法预测侵入性冠状动脉治疗后30天不良心脏事件风险:机器学习模型的开发与验证
JMIR Med Inform. 2022 May 11;10(5):e26801. doi: 10.2196/26801.
6
A Hybrid Decision Tree and Deep Learning Approach Combining Medical Imaging and Electronic Medical Records to Predict Intubation Among Hospitalized Patients With COVID-19: Algorithm Development and Validation.一种结合医学影像和电子病历的混合决策树与深度学习方法,用于预测COVID-19住院患者的插管情况:算法开发与验证
JMIR Form Res. 2023 Oct 26;7:e46905. doi: 10.2196/46905.
7
Implementation and evaluation of a multivariate abstraction-based, interval-based dynamic time-warping method as a similarity measure for longitudinal medical records.基于多元抽象和区间的动态时间规整方法的实现和评估,作为一种用于纵向医疗记录的相似性度量方法。
J Biomed Inform. 2021 Nov;123:103919. doi: 10.1016/j.jbi.2021.103919. Epub 2021 Oct 8.
8
Development of Electronic Health Record-Based Prediction Models for 30-Day Readmission Risk Among Patients Hospitalized for Acute Myocardial Infarction.基于电子健康记录的急性心肌梗死住院患者 30 天再入院风险预测模型的建立。
JAMA Netw Open. 2021 Jan 4;4(1):e2035782. doi: 10.1001/jamanetworkopen.2020.35782.
9
Adverse Drug Event Prediction Using Noisy Literature-Derived Knowledge Graphs: Algorithm Development and Validation.使用有噪声的文献衍生知识图谱进行药物不良事件预测:算法开发与验证
JMIR Med Inform. 2021 Oct 25;9(10):e32730. doi: 10.2196/32730.
10
Establishment of noninvasive diabetes risk prediction model based on tongue features and machine learning techniques.基于舌象特征和机器学习技术的无创糖尿病风险预测模型的建立。
Int J Med Inform. 2021 May;149:104429. doi: 10.1016/j.ijmedinf.2021.104429. Epub 2021 Feb 22.

引用本文的文献

1
Development and Validation of an Explainable Deep Learning Model to Predict In-Hospital Mortality for Patients With Acute Myocardial Infarction: Algorithm Development and Validation Study.开发和验证一种可解释的深度学习模型以预测急性心肌梗死患者的住院死亡率:算法开发和验证研究。
J Med Internet Res. 2024 May 10;26:e49848. doi: 10.2196/49848.

本文引用的文献

1
Sequential Data-Based Patient Similarity Framework for Patient Outcome Prediction: Algorithm Development.基于序列数据的患者相似性框架用于患者预后预测:算法开发。
J Med Internet Res. 2022 Jan 6;24(1):e30720. doi: 10.2196/30720.
2
Applying interpretable deep learning models to identify chronic cough patients using EHR data.应用可解释的深度学习模型,利用电子病历数据识别慢性咳嗽患者。
Comput Methods Programs Biomed. 2021 Oct;210:106395. doi: 10.1016/j.cmpb.2021.106395. Epub 2021 Sep 4.
3
Long-term prediction for temporal propagation of seasonal influenza using Transformer-based model.
基于 Transformer 的季节性流感时间传播的长期预测。
J Biomed Inform. 2021 Oct;122:103894. doi: 10.1016/j.jbi.2021.103894. Epub 2021 Aug 26.
4
A novel hierarchical machine learning model for hospital-acquired venous thromboembolism risk assessment among multiple-departments.一种用于多科室医院获得性静脉血栓栓塞风险评估的新型分层机器学习模型。
J Biomed Inform. 2021 Oct;122:103892. doi: 10.1016/j.jbi.2021.103892. Epub 2021 Aug 26.
5
Explainable Dynamic Multimodal Variational Autoencoder for the Prediction of Patients With Suspected Central Precocious Puberty.可解释动态多模态变分自动编码器在疑似中枢性性早熟患者预测中的应用。
IEEE J Biomed Health Inform. 2022 Mar;26(3):1362-1373. doi: 10.1109/JBHI.2021.3103271. Epub 2022 Mar 7.
6
Short- and long-term mortality prediction after an acute ST-elevation myocardial infarction (STEMI) in Asians: A machine learning approach.亚洲人急性ST段抬高型心肌梗死(STEMI)后的短期和长期死亡率预测:一种机器学习方法。
PLoS One. 2021 Aug 2;16(8):e0254894. doi: 10.1371/journal.pone.0254894. eCollection 2021.
7
Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study.基于嵌入技术的结构化电子病历患者表征:开发与验证研究
JMIR Med Inform. 2021 Jul 23;9(7):e19905. doi: 10.2196/19905.
8
Treatment initiation prediction by EHR mapped PPD tensor based convolutional neural networks boosting algorithm.基于 EHR 映射 PPD 张量的卷积神经网络增强算法的治疗启动预测。
J Biomed Inform. 2021 Aug;120:103840. doi: 10.1016/j.jbi.2021.103840. Epub 2021 Jun 15.
9
Predicting morbidity by local similarities in multi-scale patient trajectories.基于多尺度患者轨迹的局部相似性预测发病率。
J Biomed Inform. 2021 Aug;120:103837. doi: 10.1016/j.jbi.2021.103837. Epub 2021 Jun 10.
10
Med-BERT: pretrained contextualized embeddings on large-scale structured electronic health records for disease prediction.医学BERT:基于大规模结构化电子健康记录进行疾病预测的预训练上下文嵌入模型
NPJ Digit Med. 2021 May 20;4(1):86. doi: 10.1038/s41746-021-00455-y.