Choi Edward, Bahadori Mohammad Taha, Song Le, Stewart Walter F, Sun Jimeng
Georgia Institute of Technology, Atlanta, GA, USA.
Sutter Health, Walnut Creek, CA, USA.
KDD. 2017 Aug;2017:787-795. doi: 10.1145/3097983.3098126.
Deep learning methods exhibit promising performance for predictive modeling in healthcare, but two important challenges remain: Often in healthcare predictive modeling, the sample size is insufficient for deep learning methods to achieve satisfactory results. The representations learned by deep learning methods should align with medical knowledge. To address these challenges, we propose GRaph-based Attention Model (GRAM) that supplements electronic health records (EHR) with hierarchical information inherent to medical ontologies. Based on the data volume and the ontology structure, GRAM represents a medical concept as a combination of its ancestors in the ontology via an attention mechanism. We compared predictive performance ( accuracy, data needs, interpretability) of GRAM to various methods including the recurrent neural network (RNN) in two sequential diagnoses prediction tasks and one heart failure prediction task. Compared to the basic RNN, GRAM achieved 10% higher accuracy for predicting diseases rarely observed in the training data and 3% improved area under the ROC curve for predicting heart failure using an order of magnitude less training data. Additionally, unlike other methods, the medical concept representations learned by GRAM are well aligned with the medical ontology. Finally, GRAM exhibits intuitive attention behaviors by adaptively generalizing to higher level concepts when facing data insufficiency at the lower level concepts.
深度学习方法在医疗保健领域的预测建模中展现出了良好的性能,但仍存在两个重要挑战:在医疗保健预测建模中,样本量通常不足以让深度学习方法取得令人满意的结果。深度学习方法所学习到的表示应与医学知识保持一致。为应对这些挑战,我们提出了基于图的注意力模型(GRAM),该模型利用医学本体中固有的层次信息来补充电子健康记录(EHR)。基于数据量和本体结构,GRAM通过注意力机制将医学概念表示为其在本体中祖先的组合。我们在两项连续诊断预测任务和一项心力衰竭预测任务中,将GRAM的预测性能(准确性、数据需求、可解释性)与包括递归神经网络(RNN)在内的各种方法进行了比较。与基本的RNN相比,GRAM在预测训练数据中很少出现的疾病时,准确率提高了10%,并且在使用少一个数量级的训练数据预测心力衰竭时,ROC曲线下面积提高了3%。此外,与其他方法不同,GRAM所学习到的医学概念表示与医学本体高度一致。最后,当面对较低层次概念的数据不足时,GRAM通过自适应地归纳到更高层次概念,展现出直观的注意力行为。