Knowledge Technology Institute, Department of Informatics, University of Hamburg, Germany.
Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology (OIST), Japan.
Neural Netw. 2017 Dec;96:137-149. doi: 10.1016/j.neunet.2017.09.001. Epub 2017 Sep 20.
Lifelong learning is fundamental in autonomous robotics for the acquisition and fine-tuning of knowledge through experience. However, conventional deep neural models for action recognition from videos do not account for lifelong learning but rather learn a batch of training data with a predefined number of action classes and samples. Thus, there is the need to develop learning systems with the ability to incrementally process available perceptual cues and to adapt their responses over time. We propose a self-organizing neural architecture for incrementally learning to classify human actions from video sequences. The architecture comprises growing self-organizing networks equipped with recurrent neurons for processing time-varying patterns. We use a set of hierarchically arranged recurrent networks for the unsupervised learning of action representations with increasingly large spatiotemporal receptive fields. Lifelong learning is achieved in terms of prediction-driven neural dynamics in which the growth and the adaptation of the recurrent networks are driven by their capability to reconstruct temporally ordered input sequences. Experimental results on a classification task using two action benchmark datasets show that our model is competitive with state-of-the-art methods for batch learning also when a significant number of sample labels are missing or corrupted during training sessions. Additional experiments show the ability of our model to adapt to non-stationary input avoiding catastrophic interference.
终身学习对于自主机器人来说是至关重要的,它可以通过经验来获取和微调知识。然而,传统的基于视频的动作识别深度神经网络模型并没有考虑终身学习,而是使用预定数量的动作类和样本学习一批训练数据。因此,需要开发具有能够增量处理可用感知线索并随时间调整响应能力的学习系统。我们提出了一种自组织神经网络架构,用于从视频序列中增量学习分类人类动作。该架构包括配备用于处理时变模式的递归神经元的生长自组织网络。我们使用一组层次排列的递归网络来进行无监督学习,以获得具有越来越大的时空感受野的动作表示。终身学习是通过预测驱动的神经动力学来实现的,其中递归网络的生长和适应是由它们对时间顺序输入序列进行重构的能力驱动的。在使用两个动作基准数据集的分类任务上的实验结果表明,我们的模型在批量学习方面与最先进的方法具有竞争力,即使在训练过程中丢失或损坏了大量样本标签。额外的实验表明,我们的模型能够适应非平稳输入,避免灾难性干扰。