Goncalves Andre, Ray Priyadip, Soper Braden, Widemann David, Nygård Mari, Nygård Jan F, Sales Ana Paula
Lawrence Livermore National Laboratory, Livermore, CA, USA.
Lawrence Livermore National Laboratory, Livermore, CA, USA.
J Biomed Inform. 2019;100S:100059. doi: 10.1016/j.yjbinx.2019.100059. Epub 2019 Oct 18.
Multitask learning (MTL) leverages commonalities across related tasks with the aim of improving individual task performance. A key modeling choice in designing MTL models is the structure of the tasks' relatedness, which may not be known. Here we propose a Bayesian multitask learning model that is able to infer the task relationship structure directly from the data. We present two variations of the model in terms of a priori information of task relatedness. First, a diffuse Wishart prior is placed on a task precision matrix so that all tasks are assumed to be equally related a priori. Second, a Bayesian graphical LASSO prior is used on the task precision matrix to impose sparsity in the task relatedness. Motivated by machine learning applications in the biomedical domain, we emphasize interpretability and uncertainty quantification in our models. To encourage model interpretability, linear mappings from the shared input spaces to task-dependent output spaces are used. To encourage uncertainty quantification, conjugate priors are used so that full posterior inference is possible. Using synthetic data, we show that our model is able to recover the underlying task relationships as well as features jointly relevant for all tasks. We demonstrate the utility of our model on three distinct biomedical applications: Alzheimer's disease progression, Parkinson's disease assessment, and cervical cancer screening compliance. We show that our model outperforms Single Task (STL) models in terms of predictive performance, and performs better than existing MTL methods for the majority of the scenarios.
多任务学习(MTL)利用相关任务之间的共性来提高单个任务的性能。设计MTL模型时的一个关键建模选择是任务相关性的结构,而这可能是未知的。在此,我们提出一种贝叶斯多任务学习模型,它能够直接从数据中推断任务关系结构。我们根据任务相关性的先验信息给出了该模型的两种变体。首先,在任务精度矩阵上放置一个扩散威沙特先验,使得所有任务在先验情况下被假定为同等相关。其次,在任务精度矩阵上使用贝叶斯图形LASSO先验,以在任务相关性中施加稀疏性。受生物医学领域机器学习应用的启发,我们在模型中强调可解释性和不确定性量化。为了促进模型的可解释性,使用了从共享输入空间到任务相关输出空间的线性映射。为了促进不确定性量化,使用共轭先验,以便进行完整的后验推断。通过合成数据,我们表明我们的模型能够恢复潜在的任务关系以及对所有任务共同相关的特征。我们在三个不同的生物医学应用中展示了我们模型的实用性:阿尔茨海默病进展、帕金森病评估和宫颈癌筛查依从性。我们表明,我们的模型在预测性能方面优于单任务(STL)模型,并且在大多数情况下比现有的MTL方法表现更好。