• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

多任务学习在利用电子健康记录数据进行表型分析中的有效性。

The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data.

作者信息

Ding Daisy Yi, Simpson Chloé, Pfohl Stephen, Kale Dave C, Jung Kenneth, Shah Nigam H

机构信息

Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA.

出版信息

Pac Symp Biocomput. 2019;24:18-29.

PMID:30864307
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6662921/
Abstract

Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.

摘要

电子表型分析是通过分析个人病历确定其是否患有感兴趣的疾病的任务,是临床信息学的基础。越来越多地,电子表型分析是通过监督学习来执行的。我们研究了使用电子健康记录(EHR)数据进行多任务学习以进行表型分析的有效性。多任务学习旨在通过联合学习额外的辅助任务来提高目标任务的模型性能,并且已在机器学习的不同领域中使用。然而,其应用于EHR数据时的效用尚未确定,并且先前的工作表明其益处并不一致。我们展示了一些实验,这些实验阐明了相对于为单一表型训练的神经网络和经过良好调优的基线,使用神经网络进行多任务学习在利用EHR数据进行表型分析时何时能提高性能。我们发现,对于罕见表型,多任务神经网络始终优于单任务神经网络,但对于相对更常见的表型则表现较差。随着添加更多辅助任务,效应大小会增加。此外,多任务学习降低了神经网络对罕见表型超参数设置的敏感性。最后,我们对表型复杂性进行了量化,发现无论是否使用多任务学习训练的神经网络,除非表型足够复杂,否则都不会比简单基线有改进。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/975133652a82/nihms-1040926-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/3a11e8956dbc/nihms-1040926-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/e8151baa368b/nihms-1040926-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/e3ea13c36868/nihms-1040926-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/cdadf9cb8662/nihms-1040926-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/b376170c680e/nihms-1040926-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/975133652a82/nihms-1040926-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/3a11e8956dbc/nihms-1040926-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/e8151baa368b/nihms-1040926-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/e3ea13c36868/nihms-1040926-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/cdadf9cb8662/nihms-1040926-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/b376170c680e/nihms-1040926-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/bd59/6662921/975133652a82/nihms-1040926-f0006.jpg

相似文献

1
The Effectiveness of Multitask Learning for Phenotyping with Electronic Health Records Data.多任务学习在利用电子健康记录数据进行表型分析中的有效性。
Pac Symp Biocomput. 2019;24:18-29.
2
Weakly Semi-supervised phenotyping using Electronic Health records.基于电子健康记录的弱监督表型研究
J Biomed Inform. 2022 Oct;134:104175. doi: 10.1016/j.jbi.2022.104175. Epub 2022 Sep 5.
3
Automated feature selection of predictors in electronic medical records data.电子病历数据中预测指标的自动特征选择
Biometrics. 2019 Mar;75(1):268-277. doi: 10.1111/biom.12987. Epub 2019 Apr 2.
4
Electronic health record phenotyping improves detection and screening of type 2 diabetes in the general United States population: A cross-sectional, unselected, retrospective study.电子健康记录表型分析改善了美国普通人群中2型糖尿病的检测和筛查:一项横断面、非选择性、回顾性研究。
J Biomed Inform. 2016 Apr;60:162-8. doi: 10.1016/j.jbi.2015.12.006. Epub 2015 Dec 17.
5
Relational machine learning for electronic health record-driven phenotyping.用于电子健康记录驱动的表型分析的关系机器学习。
J Biomed Inform. 2014 Dec;52:260-70. doi: 10.1016/j.jbi.2014.07.007. Epub 2014 Jul 15.
6
HPO2Vec+: Leveraging heterogeneous knowledge resources to enrich node embeddings for the Human Phenotype Ontology.HPO2Vec+:利用异构知识资源丰富人类表型本体的节点嵌入。
J Biomed Inform. 2019 Aug;96:103246. doi: 10.1016/j.jbi.2019.103246. Epub 2019 Jun 27.
7
Multitask learning and benchmarking with clinical time series data.多任务学习与临床时间序列数据的基准测试。
Sci Data. 2019 Jun 17;6(1):96. doi: 10.1038/s41597-019-0103-9.
8
Machine learning approaches for electronic health records phenotyping: a methodical review.基于机器学习的电子健康记录表型分析方法:系统评价
J Am Med Inform Assoc. 2023 Jan 18;30(2):367-381. doi: 10.1093/jamia/ocac216.
9
Natural Language Processing for EHR-Based Computational Phenotyping.基于电子健康记录的自然语言处理计算表型。
IEEE/ACM Trans Comput Biol Bioinform. 2019 Jan-Feb;16(1):139-153. doi: 10.1109/TCBB.2018.2849968. Epub 2018 Jun 25.
10
Detecting rare diseases in electronic health records using machine learning and knowledge engineering: Case study of acute hepatic porphyria.使用机器学习和知识工程在电子健康记录中检测罕见病:急性肝卟啉症案例研究。
PLoS One. 2020 Jul 2;15(7):e0235574. doi: 10.1371/journal.pone.0235574. eCollection 2020.

引用本文的文献

1
Dynamic predictions of postoperative complications from explainable, uncertainty-aware, and multi-task deep neural networks.基于可解释、不确定性感知和多任务深度神经网络的术后并发症动态预测。
Sci Rep. 2023 Jan 21;13(1):1224. doi: 10.1038/s41598-023-27418-5.
2
Generalized and transferable patient language representation for phenotyping with limited data.用于有限数据表型分析的通用且可转移的患者语言表示
J Biomed Inform. 2021 Apr;116:103726. doi: 10.1016/j.jbi.2021.103726. Epub 2021 Mar 9.
3
Deep representation learning of patient data from Electronic Health Records (EHR): A systematic review.

本文引用的文献

1
LogPar: Logistic PARAFAC2 Factorization for Temporal Binary Data with Missing Values.LogPar:用于处理带有缺失值的时态二元数据的逻辑PARAFAC2分解
KDD. 2020 Aug;2020:1625-1635. doi: 10.1145/3394486.3403213.
2
Electronic phenotyping with APHRODITE and the Observational Health Sciences and Informatics (OHDSI) data network.利用阿佛洛狄忒(APHRODITE)和观察性健康科学与信息学(OHDSI)数据网络进行电子表型分析。
AMIA Jt Summits Transl Sci Proc. 2017 Jul 26;2017:48-57. eCollection 2017.
3
Evaluating phecodes, clinical classification software, and ICD-9-CM codes for phenome-wide association studies in the electronic health record.
电子健康记录(EHR)中患者数据的深度表征学习:一项系统综述。
J Biomed Inform. 2021 Mar;115:103671. doi: 10.1016/j.jbi.2020.103671. Epub 2020 Dec 31.
4
High-throughput phenotyping with temporal sequences.高通量表型分析与时间序列。
J Am Med Inform Assoc. 2021 Mar 18;28(4):772-781. doi: 10.1093/jamia/ocaa288.
5
Generative transfer learning for measuring plausibility of EHR diagnosis records.基于生成式迁移学习的电子病历诊断记录可信度评估
J Am Med Inform Assoc. 2021 Mar 1;28(3):559-568. doi: 10.1093/jamia/ocaa215.
6
A clinically applicable approach to continuous prediction of future acute kidney injury.一种临床适用的急性肾损伤未来发生的连续预测方法。
Nature. 2019 Aug;572(7767):116-119. doi: 10.1038/s41586-019-1390-1. Epub 2019 Jul 31.
评估电子健康记录中全表型关联研究的疾病编码、临床分类软件和国际疾病分类第九版临床修订本编码。
PLoS One. 2017 Jul 7;12(7):e0175508. doi: 10.1371/journal.pone.0175508. eCollection 2017.
4
Population-Level Prediction of Type 2 Diabetes From Claims Data and Analysis of Risk Factors.从索赔数据预测 2 型糖尿病的人群水平及危险因素分析。
Big Data. 2015 Dec;3(4):277-87. doi: 10.1089/big.2015.0020.
5
A Multivariate Timeseries Modeling Approach to Severity of Illness Assessment and Forecasting in ICU with Sparse, Heterogeneous Clinical Data.一种利用稀疏、异构临床数据对重症监护病房(ICU)疾病严重程度进行评估和预测的多变量时间序列建模方法。
Proc AAAI Conf Artif Intell. 2015 Jan;2015:446-453.
6
Learning statistical models of phenotypes using noisy labeled training data.使用带有噪声标签的训练数据学习表型的统计模型。
J Am Med Inform Assoc. 2016 Nov;23(6):1166-1173. doi: 10.1093/jamia/ocw028. Epub 2016 May 12.
7
Electronic medical record phenotyping using the anchor and learn framework.使用锚定与学习框架进行电子病历表型分析。
J Am Med Inform Assoc. 2016 Jul;23(4):731-40. doi: 10.1093/jamia/ocw011. Epub 2016 Apr 23.
8
PheKB: a catalog and workflow for creating electronic phenotype algorithms for transportability.PheKB:一个用于创建可移植电子表型算法的目录和工作流程。
J Am Med Inform Assoc. 2016 Nov;23(6):1046-1052. doi: 10.1093/jamia/ocv202. Epub 2016 Mar 28.
9
Desiderata for computable representations of electronic health records-driven phenotype algorithms.电子健康记录驱动的表型算法可计算表示的要求。
J Am Med Inform Assoc. 2015 Nov;22(6):1220-30. doi: 10.1093/jamia/ocv112. Epub 2015 Sep 5.
10
Towards drug repositioning: a unified computational framework for integrating multiple aspects of drug similarity and disease similarity.迈向药物重新定位:一个整合药物相似性和疾病相似性多个方面的统一计算框架。
AMIA Annu Symp Proc. 2014 Nov 14;2014:1258-67. eCollection 2014.