School of Computer Science, Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan 528402, China; School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China; Institute for Infocomm Research, Agency for Science, Technology and Research, Singapore.
School of Computer Science, Zhongshan Institute, University of Electronic Science and Technology of China, Zhongshan 528402, China; School of Computer Science and Engineering, University of Electronic Science and Technology of China, Chengdu 611731, China.
Neural Netw. 2019 Nov;119:151-161. doi: 10.1016/j.neunet.2019.08.004. Epub 2019 Aug 17.
Transfer learning has achieved a lot of success in deep neural networks to reuse useful knowledge from source domains. However, most of the existing transfer learning strategies on neural networks are for classification tasks or based on simple training strategies, which have limited use in multi-source knowledge regression due to the ineffectiveness of learning common latent features and source information loss in regression. In this paper, we propose transferable Recurrent Neural Network (RNN) units on the Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU) to adapt source knowledge in multi-source regression scenarios. Specifically, two knowledge adaptation methods are proposed, the first one utilizes similarity weights as the transfer coefficients of each source, and the other defines a transfer-gate to control the flow of source knowledge. By using the proposed methods, useful source knowledge embedded in both internal state and output is adapted. Extensive experiments on both synthetic data and human motion prediction tasks on the Human 3.6M dataset demonstrate the superiority of our transfer RNN units compared with conventional models.
迁移学习在深度神经网络中取得了很大的成功,可以重用来自源域的有用知识。然而,大多数现有的神经网络迁移学习策略都是针对分类任务或基于简单的训练策略,由于在回归中学习公共潜在特征和源信息丢失的效果不佳,因此在多源知识回归中的应用有限。在本文中,我们提出了可迁移的递归神经网络 (RNN) 单元,用于适应多源回归场景中的源知识。具体来说,提出了两种知识适应方法,第一种方法利用相似性权重作为每个源的转移系数,另一种方法定义了转移门来控制源知识的流动。通过使用所提出的方法,可以适应嵌入在内部状态和输出中的有用源知识。在合成数据和 Human 3.6M 数据集上的人体运动预测任务上的广泛实验表明,与传统模型相比,我们的迁移 RNN 单元具有优越性。