• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用临床记录进行患者表示学习和可解释评估。

Patient representation learning and interpretable evaluation using clinical notes.

机构信息

Antwerp University Hospital, ICT Department, Wilrijkstraat 10, Edegem 2650, Belgium; Computational Linguistics and Psycholinguistics (CLiPS) Research Center, University of Antwerp, Prinsstraat 13, Antwerp 2000, Belgium.

Computational Linguistics and Psycholinguistics (CLiPS) Research Center, University of Antwerp, Prinsstraat 13, Antwerp 2000, Belgium.

出版信息

J Biomed Inform. 2018 Aug;84:103-113. doi: 10.1016/j.jbi.2018.06.016. Epub 2018 Jul 3.

DOI:10.1016/j.jbi.2018.06.016
PMID:29966746
Abstract

We have three contributions in this work: 1. We explore the utility of a stacked denoising autoencoder and a paragraph vector model to learn task-independent dense patient representations directly from clinical notes. To analyze if these representations are transferable across tasks, we evaluate them in multiple supervised setups to predict patient mortality, primary diagnostic and procedural category, and gender. We compare their performance with sparse representations obtained from a bag-of-words model. We observe that the learned generalized representations significantly outperform the sparse representations when we have few positive instances to learn from, and there is an absence of strong lexical features. 2. We compare the model performance of the feature set constructed from a bag of words to that obtained from medical concepts. In the latter case, concepts represent problems, treatments, and tests. We find that concept identification does not improve the classification performance. 3. We propose novel techniques to facilitate model interpretability. To understand and interpret the representations, we explore the best encoded features within the patient representations obtained from the autoencoder model. Further, we calculate feature sensitivity across two networks to identify the most significant input features for different classification tasks when we use these pretrained representations as the supervised input. We successfully extract the most influential features for the pipeline using this technique.

摘要

我们在这项工作中有三个贡献

  1. 我们探索了堆叠去噪自动编码器和段落向量模型的实用性,以直接从临床笔记中学习与任务无关的密集患者表示。为了分析这些表示是否可以跨任务转移,我们在多个监督设置中评估它们,以预测患者死亡率、主要诊断和程序类别以及性别。我们将它们的性能与从词袋模型获得的稀疏表示进行比较。我们观察到,当我们从很少的正例学习并且没有强烈的词汇特征时,学习到的广义表示明显优于稀疏表示。2. 我们比较了从单词袋构造的特征集与从医学概念获得的模型性能。在后一种情况下,概念代表问题、治疗和测试。我们发现概念识别不能提高分类性能。3. 我们提出了新的技术来促进模型可解释性。为了理解和解释表示,我们探索了从自动编码器模型获得的患者表示中最佳编码的特征。此外,我们在两个网络中计算特征敏感性,以在使用这些预训练表示作为监督输入时,为不同的分类任务识别最相关的输入特征。我们成功地使用该技术提取了该流水线的最具影响力的特征。

相似文献

1
Patient representation learning and interpretable evaluation using clinical notes.利用临床记录进行患者表示学习和可解释评估。
J Biomed Inform. 2018 Aug;84:103-113. doi: 10.1016/j.jbi.2018.06.016. Epub 2018 Jul 3.
2
Prediction task guided representation learning of medical codes in EHR.基于预测任务的电子健康记录中医疗编码的表示学习。
J Biomed Inform. 2018 Aug;84:1-10. doi: 10.1016/j.jbi.2018.06.013. Epub 2018 Jun 19.
3
Medical subdomain classification of clinical notes using a machine learning-based natural language processing approach.基于机器学习的自然语言处理方法对临床笔记进行医学子域分类。
BMC Med Inform Decis Mak. 2017 Dec 1;17(1):155. doi: 10.1186/s12911-017-0556-8.
4
Artificial Intelligence Learning Semantics via External Resources for Classifying Diagnosis Codes in Discharge Notes.人工智能通过外部资源学习语义以对出院小结中的诊断代码进行分类。
J Med Internet Res. 2017 Nov 6;19(11):e380. doi: 10.2196/jmir.8344.
5
Recognizing clinical entities in hospital discharge summaries using Structural Support Vector Machines with word representation features.使用带有词表示特征的结构支持向量机识别医院出院小结中的临床实体。
BMC Med Inform Decis Mak. 2013;13 Suppl 1(Suppl 1):S1. doi: 10.1186/1472-6947-13-S1-S1. Epub 2013 Apr 5.
6
Semi-supervised distributed representations of documents for sentiment analysis.用于情感分析的文档的半监督分布式表示。
Neural Netw. 2019 Nov;119:139-150. doi: 10.1016/j.neunet.2019.08.001. Epub 2019 Aug 6.
7
Development and evaluation of RapTAT: a machine learning system for concept mapping of phrases from medical narratives.开发和评估 RapTAT:一种用于从医学叙述中映射短语概念的机器学习系统。
J Biomed Inform. 2014 Apr;48:54-65. doi: 10.1016/j.jbi.2013.11.008. Epub 2013 Dec 4.
8
EHR phenotyping via jointly embedding medical concepts and words into a unified vector space.通过将医疗概念和词汇联合嵌入到统一的向量空间中进行 EHR 表型分析。
BMC Med Inform Decis Mak. 2018 Dec 12;18(Suppl 4):123. doi: 10.1186/s12911-018-0672-0.
9
Dependency-based Siamese long short-term memory network for learning sentence representations.基于依赖的孪生长短时记忆网络用于学习句子表示。
PLoS One. 2018 Mar 7;13(3):e0193919. doi: 10.1371/journal.pone.0193919. eCollection 2018.
10
Mol2vec: Unsupervised Machine Learning Approach with Chemical Intuition.Mol2vec:具有化学直觉的无监督机器学习方法。
J Chem Inf Model. 2018 Jan 22;58(1):27-35. doi: 10.1021/acs.jcim.7b00616. Epub 2018 Jan 10.

引用本文的文献

1
From explainable to interpretable deep learning for natural language processing in healthcare: How far from reality?从可解释到可理解的深度学习在医疗保健自然语言处理中的应用:离现实还有多远?
Comput Struct Biotechnol J. 2024 May 9;24:362-373. doi: 10.1016/j.csbj.2024.05.004. eCollection 2024 Dec.
2
Fair patient model: Mitigating bias in the patient representation learned from the electronic health records.公平患者模型:减轻从电子健康记录中学习到的患者表示中的偏差。
J Biomed Inform. 2023 Dec;148:104544. doi: 10.1016/j.jbi.2023.104544. Epub 2023 Nov 22.
3
Real-world Patient Trajectory Prediction from Clinical Notes Using Artificial Neural Networks and UMLS-Based Extraction of Concepts.
使用人工神经网络和基于统一医学语言系统的概念提取从临床记录中进行真实世界患者轨迹预测
J Healthc Inform Res. 2021 Jun 5;5(4):474-496. doi: 10.1007/s41666-021-00100-z. eCollection 2021 Dec.
4
Patient Representation Learning From Heterogeneous Data Sources and Knowledge Graphs Using Deep Collective Matrix Factorization: Evaluation Study.使用深度集体矩阵分解从异构数据源和知识图谱中进行患者表示学习:评估研究
JMIR Med Inform. 2022 Jan 20;10(1):e28842. doi: 10.2196/28842.
5
Generalized and transferable patient language representation for phenotyping with limited data.用于有限数据表型分析的通用且可转移的患者语言表示
J Biomed Inform. 2021 Apr;116:103726. doi: 10.1016/j.jbi.2021.103726. Epub 2021 Mar 9.
6
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。
J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.
7
Pre-training phenotyping classifiers.预训练表型分类器。
J Biomed Inform. 2021 Jan;113:103626. doi: 10.1016/j.jbi.2020.103626. Epub 2020 Nov 28.
8
Combining structured and unstructured data for predictive models: a deep learning approach.将结构化和非结构化数据结合用于预测模型:一种深度学习方法。
BMC Med Inform Decis Mak. 2020 Oct 29;20(1):280. doi: 10.1186/s12911-020-01297-6.
9
Patient Representation Transfer Learning from Clinical Notes based on Hierarchical Attention Network.基于分层注意力网络的临床笔记患者表示迁移学习
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:597-606. eCollection 2020.
10
Representation learning for clinical time series prediction tasks in electronic health records.电子健康记录中临床时间序列预测任务的表示学习。
BMC Med Inform Decis Mak. 2019 Dec 17;19(Suppl 8):259. doi: 10.1186/s12911-019-0985-7.