• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

相似文献

1
Joint Learning of Representations of Medical Concepts and Words from EHR Data.基于电子健康记录数据的医学概念与词汇表示的联合学习
Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:764-769. doi: 10.1109/BIBM.2017.8217752. Epub 2017 Dec 18.
2
EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.通过将医疗概念和词汇联合嵌入到统一的向量空间中进行 EHR 表型分析。
BMC Med Inform Decis Mak. 2018 Dec 12;18(Suppl 4):123. doi: 10.1186/s12911-018-0672-0.
3
EHR2Vec: Representation Learning of Medical Concepts From Temporal Patterns of Clinical Notes Based on Self-Attention Mechanism.EHR2Vec:基于自注意力机制从临床记录的时间模式中学习医学概念表示
Front Genet. 2020 Jun 29;11:630. doi: 10.3389/fgene.2020.00630. eCollection 2020.
4
Extraction of Information Related to Drug Safety Surveillance From Electronic Health Record Notes: Joint Modeling of Entities and Relations Using Knowledge-Aware Neural Attentive Models.从电子健康记录笔记中提取与药物安全监测相关的信息:使用知识感知神经注意力模型对实体和关系进行联合建模
JMIR Med Inform. 2020 Jul 10;8(7):e18417. doi: 10.2196/18417.
5
Harmonized representation learning on dynamic EHR graphs.动态电子健康记录图上的协调表示学习。
J Biomed Inform. 2020 Jun;106:103426. doi: 10.1016/j.jbi.2020.103426. Epub 2020 Apr 25.
6
Medical Concept Representation Learning from Multi-source Data.基于多源数据的医学概念表示学习
IJCAI (U S). 2019 Jul;2019:4897-4903. doi: 10.24963/ijcai.2019/680.
7
Transitive Sequencing Medical Records for Mining Predictive and Interpretable Temporal Representations.用于挖掘预测性和可解释性时间表征的传递性序列医疗记录
Patterns (N Y). 2020 Jul 10;1(4):100051. doi: 10.1016/j.patter.2020.100051. Epub 2020 Jun 18.
8
A comparison of word embeddings for the biomedical natural language processing.生物医学自然语言处理中词嵌入的比较。
J Biomed Inform. 2018 Nov;87:12-20. doi: 10.1016/j.jbi.2018.09.008. Epub 2018 Sep 12.
9
Fusion of sequential visits and medical ontology for mortality prediction.基于序列就诊记录和医学本体的死亡率预测融合方法。
J Biomed Inform. 2022 Mar;127:104012. doi: 10.1016/j.jbi.2022.104012. Epub 2022 Feb 7.
10
Identifying adverse drug event information in clinical notes with distributional semantic representations of context.利用上下文的分布语义表示识别临床记录中的药物不良事件信息。
J Biomed Inform. 2015 Oct;57:333-49. doi: 10.1016/j.jbi.2015.08.013. Epub 2015 Aug 17.

引用本文的文献

1
Improving medical term embeddings using UMLS Metathesaurus.利用 UMLS 语义学术语表改进医学术语嵌入。
BMC Med Inform Decis Mak. 2022 Apr 29;22(1):114. doi: 10.1186/s12911-022-01850-5.
2
Generating contextual embeddings for emergency department chief complaints.为急诊科主要症状生成上下文嵌入。
JAMIA Open. 2020 Jul 15;3(2):160-166. doi: 10.1093/jamiaopen/ooaa022. eCollection 2020 Jul.
3
Medical Concept Representation Learning from Multi-source Data.基于多源数据的医学概念表示学习
IJCAI (U S). 2019 Jul;2019:4897-4903. doi: 10.24963/ijcai.2019/680.
4
Distributed learning from multiple EHR databases: Contextual embedding models for medical events.从多个电子健康记录数据库中进行分布式学习:用于医疗事件的上下文嵌入模型。
J Biomed Inform. 2019 Apr;92:103138. doi: 10.1016/j.jbi.2019.103138. Epub 2019 Feb 27.
5
EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.通过将医疗概念和词汇联合嵌入到统一的向量空间中进行 EHR 表型分析。
BMC Med Inform Decis Mak. 2018 Dec 12;18(Suppl 4):123. doi: 10.1186/s12911-018-0672-0.

本文引用的文献

1
Doctor AI: Predicting Clinical Events via Recurrent Neural Networks.人工智能医生:通过循环神经网络预测临床事件
JMLR Workshop Conf Proc. 2016 Aug;56:301-318. Epub 2016 Dec 10.
2
A Visualization of Evolving Clinical Sentiment Using Vector Representations of Clinical Notes.使用临床记录的向量表示对不断演变的临床情感进行可视化
Comput Cardiol (2010). 2015 Sep;2015:629-632. doi: 10.1109/CIC.2015.7410989. Epub 2016 Feb 18.
3
Learning Low-Dimensional Representations of Medical Concepts.学习医学概念的低维表示。
AMIA Jt Summits Transl Sci Proc. 2016 Jul 20;2016:41-50. eCollection 2016.
4
Using recurrent neural network models for early detection of heart failure onset.使用循环神经网络模型进行心力衰竭发作的早期检测。
J Am Med Inform Assoc. 2017 Mar 1;24(2):361-370. doi: 10.1093/jamia/ocw112.
5
Modeling Healthcare Quality via Compact Representations of Electronic Health Records.通过电子健康记录的紧凑表示来建模医疗质量。
IEEE/ACM Trans Comput Biol Bioinform. 2017 May-Jun;14(3):545-554. doi: 10.1109/TCBB.2016.2591523. Epub 2016 Jul 14.
6
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
7
A Study of Neural Word Embeddings for Named Entity Recognition in Clinical Text.用于临床文本中命名实体识别的神经词嵌入研究
AMIA Annu Symp Proc. 2015 Nov 5;2015:1326-33. eCollection 2015.
8
Learning probabilistic phenotypes from heterogeneous EHR data.从异构电子健康记录数据中学习概率性表型。
J Biomed Inform. 2015 Dec;58:156-165. doi: 10.1016/j.jbi.2015.10.001. Epub 2015 Oct 14.
9
Care episode retrieval: distributional semantic models for information retrieval in the clinical domain.护理事件检索:临床领域信息检索的分布式语义模型
BMC Med Inform Decis Mak. 2015;15 Suppl 2(Suppl 2):S2. doi: 10.1186/1472-6947-15-S2-S2. Epub 2015 Jun 15.
10
Unfolding Physiological State: Mortality Modelling in Intensive Care Units.展开生理状态:重症监护病房的死亡率建模
KDD. 2014 Aug 24;2014:75-84. doi: 10.1145/2623330.2623742.

基于电子健康记录数据的医学概念与词汇表示的联合学习

Joint Learning of Representations of Medical Concepts and Words from EHR Data.

作者信息

Bai Tian, Chanda Ashis Kumar, Egleston Brian L, Vucetic Slobodan

机构信息

Department of Computer and Information Sciences, Temple University, Philadelphia, PA 19122, USA.

Fox Chase Cancer Center, Philadelphia, PA 19111, USA.

出版信息

Proceedings (IEEE Int Conf Bioinformatics Biomed). 2017 Nov;2017:764-769. doi: 10.1109/BIBM.2017.8217752. Epub 2017 Dec 18.

DOI:10.1109/BIBM.2017.8217752
PMID:29375929
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC5783648/
Abstract

There has been an increasing interest in learning low-dimensional vector representations of medical concepts from electronic health records (EHRs). While EHRs contain structured data such as diagnostic codes and laboratory tests, they also contain unstructured clinical notes, which provide more nuanced details on a patient's health status. In this work, we propose a method that jointly learns medical concept and word representations. In particular, we focus on capturing the relationship between medical codes and words by using a novel learning scheme for word2vec model. Our method exploits relationships between different parts of EHRs in the same visit and embeds both codes and words in the same continuous vector space. In the end, we are able to derive clusters which reflect distinct disease and treatment patterns. In our experiments, we qualitatively show how our methods of grouping words for given diagnostic codes compares with a topic modeling approach. We also test how well our representations can be used to predict disease patterns of the next visit. The results show that our approach outperforms several common methods.

摘要

从电子健康记录(EHR)中学习医学概念的低维向量表示越来越受到关注。虽然EHR包含诊断代码和实验室检查等结构化数据,但它们也包含非结构化的临床记录,这些记录提供了关于患者健康状况更细微的细节。在这项工作中,我们提出了一种联合学习医学概念和单词表示的方法。具体来说,我们通过对word2vec模型使用一种新颖的学习方案来专注于捕捉医学代码和单词之间的关系。我们的方法利用同一次就诊中EHR不同部分之间的关系,并将代码和单词嵌入到同一个连续向量空间中。最后,我们能够得出反映不同疾病和治疗模式的聚类。在我们的实验中,我们定性地展示了我们为给定诊断代码分组单词的方法与主题建模方法相比如何。我们还测试了我们的表示在预测下次就诊的疾病模式方面的效果如何。结果表明,我们的方法优于几种常用方法。