Ding Daisy Yi, Simpson Chloé, Pfohl Stephen, Kale Dave C, Jung Kenneth, Shah Nigam H
Stanford Center for Biomedical Informatics Research, Stanford University, Stanford, CA, USA.
Pac Symp Biocomput. 2019;24:18-29.
Electronic phenotyping is the task of ascertaining whether an individual has a medical condition of interest by analyzing their medical record and is foundational in clinical informatics. Increasingly, electronic phenotyping is performed via supervised learning. We investigate the effectiveness of multitask learning for phenotyping using electronic health records (EHR) data. Multitask learning aims to improve model performance on a target task by jointly learning additional auxiliary tasks and has been used in disparate areas of machine learning. However, its utility when applied to EHR data has not been established, and prior work suggests that its benefits are inconsistent. We present experiments that elucidate when multitask learning with neural nets improves performance for phenotyping using EHR data relative to neural nets trained for a single phenotype and to well-tuned baselines. We find that multitask neural nets consistently outperform single-task neural nets for rare phenotypes but underperform for relatively more common phenotypes. The effect size increases as more auxiliary tasks are added. Moreover, multitask learning reduces the sensitivity of neural nets to hyperparameter settings for rare phenotypes. Last, we quantify phenotype complexity and find that neural nets trained with or without multitask learning do not improve on simple baselines unless the phenotypes are sufficiently complex.
电子表型分析是通过分析个人病历确定其是否患有感兴趣的疾病的任务,是临床信息学的基础。越来越多地,电子表型分析是通过监督学习来执行的。我们研究了使用电子健康记录(EHR)数据进行多任务学习以进行表型分析的有效性。多任务学习旨在通过联合学习额外的辅助任务来提高目标任务的模型性能,并且已在机器学习的不同领域中使用。然而,其应用于EHR数据时的效用尚未确定,并且先前的工作表明其益处并不一致。我们展示了一些实验,这些实验阐明了相对于为单一表型训练的神经网络和经过良好调优的基线,使用神经网络进行多任务学习在利用EHR数据进行表型分析时何时能提高性能。我们发现,对于罕见表型,多任务神经网络始终优于单任务神经网络,但对于相对更常见的表型则表现较差。随着添加更多辅助任务,效应大小会增加。此外,多任务学习降低了神经网络对罕见表型超参数设置的敏感性。最后,我们对表型复杂性进行了量化,发现无论是否使用多任务学习训练的神经网络,除非表型足够复杂,否则都不会比简单基线有改进。