• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用部分数据源进行临床结果预测的对比学习

Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources.

作者信息

Xia Meng, Wilson Jonathan, Goldstein Benjamin, Henao Ricardo

机构信息

Department of Electrical and Computer Engineering, Duke University, Durham, US.

Department of Biostatistics and Bioinformatics, Duke University, Durham, US.

出版信息

Proc Mach Learn Res. 2024 Jul;235:54156-54177.

PMID:39148511
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11326519/
Abstract

The use of machine learning models to predict clinical outcomes from (longitudinal) electronic health record (EHR) data is becoming increasingly popular due to advances in deep architectures, representation learning, and the growing availability of large EHR datasets. Existing models generally assume access to the same data sources during both training and inference stages. However, this assumption is often challenged by the fact that real-world clinical datasets originate from various data sources (with distinct sets of covariates), which though can be available for training (in a research or retrospective setting), are more realistically only partially available (a subset of such sets) for inference when deployed. So motivated, we introduce Contrastive Learning for clinical Outcome Prediction with Partial data Sources (CLOPPS), that trains encoders to capture information across different data sources and then leverages them to build classifiers restricting access to a single data source. This approach can be used with existing cross-sectional or longitudinal outcome classification models. We present experiments on two real-world datasets demonstrating that CLOPPS consistently outperforms strong baselines in several practical scenarios.

摘要

由于深度架构、表示学习的进展以及大型电子健康记录(EHR)数据集的日益普及,使用机器学习模型从(纵向)EHR数据预测临床结果正变得越来越流行。现有模型通常假设在训练和推理阶段都能访问相同的数据源。然而,这一假设常常受到现实世界临床数据集源自各种数据源(具有不同的协变量集)这一事实的挑战,这些数据源虽然在训练时(在研究或回顾性设置中)可用,但在实际部署进行推理时,更现实的情况是只能部分可用(这些数据集的一个子集)。受此启发,我们引入了用于部分数据源临床结果预测的对比学习(CLOPPS),它训练编码器以跨不同数据源捕获信息,然后利用这些信息构建限制访问单个数据源的分类器。这种方法可以与现有的横断面或纵向结果分类模型一起使用。我们在两个真实世界数据集上进行了实验,结果表明CLOPPS在几种实际场景中始终优于强大的基线模型。

相似文献

1
Contrastive Learning for Clinical Outcome Prediction with Partial Data Sources.利用部分数据源进行临床结果预测的对比学习
Proc Mach Learn Res. 2024 Jul;235:54156-54177.
2
Multi-view representation learning for tabular data integration using inter-feature relationships.使用特征间关系进行表格数据集成的多视图表示学习。
J Biomed Inform. 2024 Mar;151:104602. doi: 10.1016/j.jbi.2024.104602. Epub 2024 Feb 10.
3
SCEHR: Supervised Contrastive Learning for Clinical Risk Prediction using Electronic Health Records.SCEHR:使用电子健康记录进行临床风险预测的监督对比学习
Proc IEEE Int Conf Data Min. 2021 Dec;2021:857-866. doi: 10.1109/icdm51629.2021.00097.
4
Local contrastive loss with pseudo-label based self-training for semi-supervised medical image segmentation.基于伪标签自训练的局部对比损失的半监督医学图像分割。
Med Image Anal. 2023 Jul;87:102792. doi: 10.1016/j.media.2023.102792. Epub 2023 Mar 11.
5
Weakly Semi-supervised phenotyping using Electronic Health records.基于电子健康记录的弱监督表型研究
J Biomed Inform. 2022 Oct;134:104175. doi: 10.1016/j.jbi.2022.104175. Epub 2022 Sep 5.
6
Deep learning for electronic health records: A comparative review of multiple deep neural architectures.深度学习在电子健康记录中的应用:多种深度神经网络架构的比较综述。
J Biomed Inform. 2020 Jan;101:103337. doi: 10.1016/j.jbi.2019.103337.
7
Systematic review of approaches to use of neighborhood-level risk factors with clinical data to predict clinical risk and recommend interventions.使用社区水平风险因素与临床数据相结合预测临床风险和推荐干预措施的方法的系统评价。
J Biomed Inform. 2021 Apr;116:103713. doi: 10.1016/j.jbi.2021.103713. Epub 2021 Feb 18.
8
Inferring gene regulatory networks from single-cell gene expression data via deep multi-view contrastive learning.通过深度多视图对比学习从单细胞基因表达数据推断基因调控网络。
Brief Bioinform. 2023 Jan 19;24(1). doi: 10.1093/bib/bbac586.
9
Meta-EHR: A meta-learning approach for electronic health records with a high imbalanced ratio and missing rate.元电子健康记录:一种针对高不平衡比和缺失率的电子健康记录的元学习方法。
Annu Int Conf IEEE Eng Med Biol Soc. 2023 Jul;2023:1-4. doi: 10.1109/EMBC40787.2023.10340634.
10
Multi-gate Mixture of Multi-view Graph Contrastive Learning on Electronic Health Record.电子健康记录上的多视图图对比学习的多门混合
IEEE J Biomed Health Inform. 2023 Oct 18;PP. doi: 10.1109/JBHI.2023.3325221.

本文引用的文献

1
Embracing Large Language Models for Medical Applications: Opportunities and Challenges.拥抱用于医学应用的大语言模型:机遇与挑战。
Cureus. 2023 May 21;15(5):e39305. doi: 10.7759/cureus.39305. eCollection 2023 May.
2
Predicting the Survival of Patients With Cancer From Their Initial Oncology Consultation Document Using Natural Language Processing.利用自然语言处理技术从癌症患者的初始肿瘤学咨询文档中预测其生存情况。
JAMA Netw Open. 2023 Feb 1;6(2):e230813. doi: 10.1001/jamanetworkopen.2023.0813.
3
Mortality prediction of patients in intensive care units using machine learning algorithms based on electronic health records.基于电子健康记录的机器学习算法对重症监护病房患者的死亡率预测。
Sci Rep. 2022 May 3;12(1):7180. doi: 10.1038/s41598-022-11226-4.
4
Deep-CSA: Deep Contrastive Learning for Dynamic Survival Analysis With Competing Risks.深度CSA:用于具有竞争风险的动态生存分析的深度对比学习
IEEE J Biomed Health Inform. 2022 Aug;26(8):4248-4257. doi: 10.1109/JBHI.2022.3161145. Epub 2022 Aug 11.
5
A Review of Challenges and Opportunities in Machine Learning for Health.机器学习在健康领域的挑战与机遇综述。
AMIA Jt Summits Transl Sci Proc. 2020 May 30;2020:191-200. eCollection 2020.
6
Machine learning-based mortality prediction of patients undergoing cardiac resynchronization therapy: the SEMMELWEIS-CRT score.基于机器学习的心脏再同步治疗患者死亡率预测:SEMMELWEIS-CRT 评分。
Eur Heart J. 2020 May 7;41(18):1747-1756. doi: 10.1093/eurheartj/ehz902.
7
Scalable and accurate deep learning with electronic health records.借助电子健康记录实现可扩展且准确的深度学习。
NPJ Digit Med. 2018 May 8;1:18. doi: 10.1038/s41746-018-0029-1. eCollection 2018.
8
Multitask learning and benchmarking with clinical time series data.多任务学习与临床时间序列数据的基准测试。
Sci Data. 2019 Jun 17;6(1):96. doi: 10.1038/s41597-019-0103-9.
9
Doctor AI: Predicting Clinical Events via Recurrent Neural Networks.人工智能医生:通过循环神经网络预测临床事件
JMLR Workshop Conf Proc. 2016 Aug;56:301-318. Epub 2016 Dec 10.
10
Using recurrent neural network models for early detection of heart failure onset.使用循环神经网络模型进行心力衰竭发作的早期检测。
J Am Med Inform Assoc. 2017 Mar 1;24(2):361-370. doi: 10.1093/jamia/ocw112.