• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于多模态潜在主题模型的异质临床笔记挖掘。

Mining heterogeneous clinical notes by multi-modal latent topic model.

机构信息

School of Computer Science and McGill Centre for Bioinformatics, McGill University, Montreal, Quebec, Canada.

Dana-Farber Cancer Institute, Boston, Massachusetts, United States of America.

出版信息

PLoS One. 2021 Apr 8;16(4):e0249622. doi: 10.1371/journal.pone.0249622. eCollection 2021.

DOI:10.1371/journal.pone.0249622
PMID:33831055
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8031429/
Abstract

Latent knowledge can be extracted from the electronic notes that are recorded during patient encounters with the health system. Using these clinical notes to decipher a patient's underlying comorbidites, symptom burdens, and treatment courses is an ongoing challenge. Latent topic model as an efficient Bayesian method can be used to model each patient's clinical notes as "documents" and the words in the notes as "tokens". However, standard latent topic models assume that all of the notes follow the same topic distribution, regardless of the type of note or the domain expertise of the author (such as doctors or nurses). We propose a novel application of latent topic modeling, using multi-note topic model (MNTM) to jointly infer distinct topic distributions of notes of different types. We applied our model to clinical notes from the MIMIC-III dataset to infer distinct topic distributions over the physician and nursing note types. Based on manual assessments made by clinicians, we observed a significant improvement in topic interpretability using MNTM modeling over the baseline single-note topic models that ignore the note types. Moreover, our MNTM model led to a significantly higher prediction accuracy for prolonged mechanical ventilation and mortality using only the first 48 hours of patient data. By correlating the patients' topic mixture with hospital mortality and prolonged mechanical ventilation, we identified several diagnostic topics that are associated with poor outcomes. Because of its elegant and intuitive formation, we envision a broad application of our approach in mining multi-modality text-based healthcare information that goes beyond clinical notes. Code available at https://github.com/li-lab-mcgill/heterogeneous_ehr.

摘要

潜在知识可以从与医疗系统交互时记录的电子病历中提取出来。使用这些临床记录来推断患者的潜在合并症、症状负担和治疗过程是一个持续的挑战。潜在主题模型作为一种有效的贝叶斯方法,可以用来将每个患者的临床记录建模为“文档”,记录中的单词建模为“标记”。然而,标准的潜在主题模型假设所有记录都遵循相同的主题分布,而不管记录的类型或作者的领域专业知识(如医生或护士)如何。我们提出了一种潜在主题建模的新应用,使用多记录主题模型(MNTM)联合推断不同类型记录的不同主题分布。我们将模型应用于 MIMIC-III 数据集的临床记录中,以推断医师和护理记录类型的不同主题分布。基于临床医生的手动评估,我们观察到,与忽略记录类型的基线单记录主题模型相比,使用 MNTM 建模可以显著提高主题的可解释性。此外,我们的 MNTM 模型仅使用患者数据的前 48 小时,就能显著提高机械通气时间延长和死亡率的预测准确性。通过将患者的主题混合与医院死亡率和机械通气时间延长相关联,我们确定了一些与不良预后相关的诊断主题。由于其优雅直观的形成方式,我们设想我们的方法可以广泛应用于挖掘基于多模态文本的医疗保健信息,而不仅仅是临床记录。代码可在 https://github.com/li-lab-mcgill/heterogeneous_ehr 上获得。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/c4c0dd17cef9/pone.0249622.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/bc73d172442a/pone.0249622.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/5a736a5ac22a/pone.0249622.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/f166980723aa/pone.0249622.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/f67ff3b7c038/pone.0249622.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/c4c0dd17cef9/pone.0249622.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/bc73d172442a/pone.0249622.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/5a736a5ac22a/pone.0249622.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/f166980723aa/pone.0249622.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/f67ff3b7c038/pone.0249622.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5e61/8031429/c4c0dd17cef9/pone.0249622.g005.jpg

相似文献

1
Mining heterogeneous clinical notes by multi-modal latent topic model.基于多模态潜在主题模型的异质临床笔记挖掘。
PLoS One. 2021 Apr 8;16(4):e0249622. doi: 10.1371/journal.pone.0249622. eCollection 2021.
2
MixEHR-SurG: A joint proportional hazard and guided topic model for inferring mortality-associated topics from electronic health records.MixEHR-SurG:一种联合比例风险和引导主题模型,用于从电子健康记录中推断与死亡率相关的主题。
J Biomed Inform. 2024 May;153:104638. doi: 10.1016/j.jbi.2024.104638. Epub 2024 Apr 15.
3
Redundancy in electronic health record corpora: analysis, impact on text mining performance and mitigation strategies.电子健康记录语料库中的冗余:分析、对文本挖掘性能的影响和缓解策略。
BMC Bioinformatics. 2013 Jan 16;14:10. doi: 10.1186/1471-2105-14-10.
4
Inferring multimodal latent topics from electronic health records.从电子健康记录中推断多模态潜在主题。
Nat Commun. 2020 May 21;11(1):2536. doi: 10.1038/s41467-020-16378-3.
5
Redundancy-aware topic modeling for patient record notes.用于病历记录的冗余感知主题建模
PLoS One. 2014 Feb 13;9(2):e87555. doi: 10.1371/journal.pone.0087555. eCollection 2014.
6
TGRA-P: Task-driven model predicts 90-day mortality from ICU clinical notes on mechanical ventilation.TGRA-P:任务驱动模型根据重症监护病房机械通气临床记录预测90天死亡率。
Comput Methods Programs Biomed. 2023 Dec;242:107783. doi: 10.1016/j.cmpb.2023.107783. Epub 2023 Sep 1.
7
Mining fall-related information in clinical notes: Comparison of rule-based and novel word embedding-based machine learning approaches.挖掘临床记录中与跌倒相关的信息:基于规则和基于新颖词嵌入的机器学习方法的比较。
J Biomed Inform. 2019 Feb;90:103103. doi: 10.1016/j.jbi.2019.103103. Epub 2019 Jan 9.
8
ComprehENotes, an Instrument to Assess Patient Reading Comprehension of Electronic Health Record Notes: Development and Validation.ComprehENotes,一种评估患者对电子健康记录笔记阅读理解能力的工具:开发与验证
J Med Internet Res. 2018 Apr 25;20(4):e139. doi: 10.2196/jmir.9380.
9
Prevalence and Sources of Duplicate Information in the Electronic Medical Record.电子病历中重复信息的流行率和来源。
JAMA Netw Open. 2022 Sep 1;5(9):e2233348. doi: 10.1001/jamanetworkopen.2022.33348.
10
MixEHR-Guided: A guided multi-modal topic modeling approach for large-scale automatic phenotyping using the electronic health record.混合 EHR 引导:一种使用电子健康记录进行大规模自动表型分析的引导式多模态主题建模方法。
J Biomed Inform. 2022 Oct;134:104190. doi: 10.1016/j.jbi.2022.104190. Epub 2022 Sep 1.

引用本文的文献

1
Latent disease similarities and therapeutic repurposing possibilities uncovered by multi-modal generative topic modeling of human diseases.通过人类疾病的多模态生成主题建模发现潜在疾病相似性和治疗方法重新利用的可能性。
Bioinform Adv. 2023 Apr 12;3(1):vbad047. doi: 10.1093/bioadv/vbad047. eCollection 2023.

本文引用的文献

1
Inferring multimodal latent topics from electronic health records.从电子健康记录中推断多模态潜在主题。
Nat Commun. 2020 May 21;11(1):2536. doi: 10.1038/s41467-020-16378-3.
2
Readmission prediction via deep contextual embedding of clinical concepts.基于临床概念的深度上下文嵌入的再入院预测。
PLoS One. 2018 Apr 9;13(4):e0195024. doi: 10.1371/journal.pone.0195024. eCollection 2018.
3
MIMIC-III, a freely accessible critical care database.MIMIC-III,一个免费获取的重症监护数据库。
Sci Data. 2016 May 24;3:160035. doi: 10.1038/sdata.2016.35.
4
Learning probabilistic phenotypes from heterogeneous EHR data.从异构电子健康记录数据中学习概率性表型。
J Biomed Inform. 2015 Dec;58:156-165. doi: 10.1016/j.jbi.2015.10.001. Epub 2015 Oct 14.
5
Building bridges across electronic health record systems through inferred phenotypic topics.通过推断的表型主题在电子健康记录系统之间搭建桥梁。
J Biomed Inform. 2015 Jun;55:82-93. doi: 10.1016/j.jbi.2015.03.011. Epub 2015 Apr 1.
6
Clinical review: scoring systems in the critically ill.临床综述:危重症患者的评分系统。
Crit Care. 2010;14(2):207. doi: 10.1186/cc8204. Epub 2010 Mar 26.
7
Weaning from mechanical ventilation.机械通气的撤机
Eur Respir J. 2007 May;29(5):1033-56. doi: 10.1183/09031936.00010206.
8
Systematic review and meta-analysis of studies of the timing of tracheostomy in adult patients undergoing artificial ventilation.对接受人工通气的成年患者气管切开术时机研究的系统评价和荟萃分析。
BMJ. 2005 May 28;330(7502):1243. doi: 10.1136/bmj.38467.485671.E0. Epub 2005 May 18.
9
Long-term outcome and quality of life of patients requiring prolonged mechanical ventilation after cardiac surgery.心脏手术后需要长期机械通气的患者的长期预后和生活质量
Eur J Cardiothorac Surg. 2004 Apr;25(4):548-52. doi: 10.1016/j.ejcts.2003.11.034.