Xiao Cao, Hoang Trong Nghia, Hong Shenda, Ma Tengfei, Sun Jimeng
Analytics Center of Excellence, IQVIA, Cambridge, MA, 02139.
MIT-IBM Watson AI Lab, Cambridge, MA, 02142.
IEEE Trans Knowl Data Eng. 2022 Feb;34(2):531-543. doi: 10.1109/tkde.2020.2989405. Epub 2020 Apr 22.
There is a growing interest in applying deep learning (DL) to healthcare, driven by the availability of data with multiple feature channels in environments (e.g., intensive care units). However, in many other practical situations, we can only access data with much fewer feature channels in a environments (e.g., at home), which often results in predictive models with poor performance. How can we boost the performance of models learned from such environment by leveraging knowledge extracted from existing models trained using in a related environment? To address this question, we develop a knowledge infusion framework named CHEER that can succinctly summarize such into transferable representations, which can be incorporated into the to improve its performance. The infused model is analyzed theoretically and evaluated empirically on several datasets. Our empirical results showed that CHEER outperformed baselines by 5.60% to 46.80% in terms of the macro-F1 score on multiple physiological datasets.
受环境(如重症监护病房)中具有多特征通道的数据可用性的推动,将深度学习(DL)应用于医疗保健的兴趣日益浓厚。然而,在许多其他实际情况下,我们只能在环境(如在家中)中访问特征通道少得多的数据,这通常会导致预测模型性能不佳。我们如何通过利用从在相关环境中使用的数据训练的现有模型中提取的知识来提高从此类环境中学习的模型的性能?为了解决这个问题,我们开发了一个名为CHEER的知识注入框架,该框架可以简洁地将此类知识总结为可转移的表示形式,这些表示形式可以合并到模型中以提高其性能。对注入知识后的模型进行了理论分析,并在几个数据集上进行了实证评估。我们的实证结果表明,在多个生理数据集上,CHEER在宏F1分数方面比基线高出5.60%至46.80%。