• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于预测任务的电子健康记录中医疗编码的表示学习。

Prediction task guided representation learning of medical codes in EHR.

机构信息

Department of Industrial Engineering, Tsinghua University, Beijing, China.

Department of Industrial Engineering, Tsinghua University, Beijing, China.

出版信息

J Biomed Inform. 2018 Aug;84:1-10. doi: 10.1016/j.jbi.2018.06.013. Epub 2018 Jun 19.

DOI:10.1016/j.jbi.2018.06.013
PMID:29928997
Abstract

There have been rapidly growing applications using machine learning models for predictive analytics in Electronic Health Records (EHR) to improve the quality of hospital services and the efficiency of healthcare resource utilization. A fundamental and crucial step in developing such models is to convert medical codes in EHR to feature vectors. These medical codes are used to represent diagnoses or procedures. Their vector representations have a tremendous impact on the performance of machine learning models. Recently, some researchers have utilized representation learning methods from Natural Language Processing (NLP) to learn vector representations of medical codes. However, most previous approaches are unsupervised, i.e. the generation of medical code vectors is independent from prediction tasks. Thus, the obtained feature vectors may be inappropriate for a specific prediction task. Moreover, unsupervised methods often require a lot of samples to obtain reliable results, but most practical problems have very limited patient samples. In this paper, we develop a new method called Prediction Task Guided Health Record Aggregation (PTGHRA), which aggregates health records guided by prediction tasks, to construct training corpus for various representation learning models. Compared with unsupervised approaches, representation learning models integrated with PTGHRA yield a significant improvement in predictive capability of generated medical code vectors, especially for limited training samples.

摘要

机器学习模型在电子病历(EHR)中的预测分析中的应用迅速增长,以提高医院服务质量和医疗资源利用效率。开发此类模型的一个基本而关键的步骤是将 EHR 中的医疗代码转换为特征向量。这些医疗代码用于表示诊断或程序。它们的向量表示对机器学习模型的性能有很大的影响。最近,一些研究人员利用自然语言处理(NLP)中的表示学习方法来学习医疗代码的向量表示。然而,大多数以前的方法都是无监督的,即医疗代码向量的生成与预测任务无关。因此,获得的特征向量可能不适合特定的预测任务。此外,无监督方法通常需要大量样本才能获得可靠的结果,但大多数实际问题的患者样本非常有限。在本文中,我们开发了一种称为“预测任务引导健康记录聚合(PTGHRA)”的新方法,该方法根据预测任务聚合健康记录,为各种表示学习模型构建训练语料库。与无监督方法相比,与 PTGHRA 集成的表示学习模型在生成的医疗代码向量的预测能力方面有了显著提高,特别是对于有限的训练样本。

相似文献

1
Prediction task guided representation learning of medical codes in EHR.基于预测任务的电子健康记录中医疗编码的表示学习。
J Biomed Inform. 2018 Aug;84:1-10. doi: 10.1016/j.jbi.2018.06.013. Epub 2018 Jun 19.
2
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
3
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.
4
Modeling Healthcare Quality via Compact Representations of Electronic Health Records.通过电子健康记录的紧凑表示来建模医疗质量。
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):545-554. doi: 10.1109/TCBB.2016.2591523. Epub 2016 Jul 14.
5
Using clinical text to refine unspecific condition codes in Dutch general practitioner EHR data.利用临床文本完善荷兰全科医生电子健康记录数据中不明确的病症编码。
Int J Med Inform. 2024 Sep;189:105506. doi: 10.1016/j.ijmedinf.2024.105506. Epub 2024 May 29.
6
EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.通过将医疗概念和词汇联合嵌入到统一的向量空间中进行 EHR 表型分析。
BMC Med Inform Decis Mak. 2018 Dec 12;18(Suppl 4):123. doi: 10.1186/s12911-018-0672-0.
7
Risk prediction using natural language processing of electronic mental health records in an inpatient forensic psychiatry setting.利用电子心理健康记录的自然语言处理进行住院法医精神病学环境中的风险预测。
J Biomed Inform. 2018 Oct;86:49-58. doi: 10.1016/j.jbi.2018.08.007. Epub 2018 Aug 14.
8
Comparing natural language processing representations of coded disease sequences for prediction in electronic health records.比较编码疾病序列的自然语言处理表示,以用于电子健康记录中的预测。
J Am Med Inform Assoc. 2024 Jun 20;31(7):1451-1462. doi: 10.1093/jamia/ocae091.
9
Patient representation learning and interpretable evaluation using clinical notes.利用临床记录进行患者表示学习和可解释评估。
J Biomed Inform. 2018 Aug;84:103-113. doi: 10.1016/j.jbi.2018.06.016. Epub 2018 Jul 3.
10
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.

引用本文的文献

1
Improving the Performance of Outcome Prediction for Inpatients With Acute Myocardial Infarction Based on Embedding Representation Learned From Electronic Medical Records: Development and Validation Study.基于电子病历中学习到的嵌入表示来提高急性心肌梗死住院患者结局预测的性能:开发和验证研究。
J Med Internet Res. 2022 Aug 3;24(8):e37486. doi: 10.2196/37486.
2
A Machine-Learning Method of Predicting Vital Capacity Plateau Value for Ventilatory Pump Failure Based on Data Mining.一种基于数据挖掘的预测通气泵衰竭肺活量平台值的机器学习方法。
Healthcare (Basel). 2021 Sep 30;9(10):1306. doi: 10.3390/healthcare9101306.
3
Patient Representation From Structured Electronic Medical Records Based on Embedding Technique: Development and Validation Study.
基于嵌入技术的结构化电子病历患者表征:开发与验证研究
JMIR Med Inform. 2021 Jul 23;9(7):e19905. doi: 10.2196/19905.
4
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。
J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.
5
Representation learning for clinical time series prediction tasks in electronic health records.电子健康记录中临床时间序列预测任务的表示学习。
BMC Med Inform Decis Mak. 2019 Dec 17;19(Suppl 8):259. doi: 10.1186/s12911-019-0985-7.
6
Findings from the 2019 International Medical Informatics Association Yearbook Section on Health Information Management.2019年国际医学信息学协会健康信息管理年鉴章节的研究结果。
Yearb Med Inform. 2019 Aug;28(1):65-68. doi: 10.1055/s-0039-1677941. Epub 2019 Aug 16.