Nie Xiangli, Deng Zhiguang, He Mingdong, Fan Mingyu, Tang Zheng
IEEE Trans Neural Netw Learn Syst. 2024 Dec;35(12):17790-17804. doi: 10.1109/TNNLS.2023.3308900. Epub 2024 Dec 2.
In real-world applications, robotic systems collect vast amounts of new data from ever-changing environments over time. They need to continually interact and learn new knowledge from the external world to adapt to the environment. Particularly, lifelong object recognition in an online and interactive manner is a crucial and fundamental capability for robotic systems. To meet this realistic demand, in this article, we propose an online active continual learning (OACL) framework for robotic lifelong object recognition, in the scenario of both classes and domains changing with dynamic environments. First, to reduce the labeling cost as much as possible while maximizing the performance, a new online active learning (OAL) strategy is designed by taking both the uncertainty and diversity of samples into account to protect the information volume and distribution of data. In addition, to prevent catastrophic forgetting and reduce memory costs, a novel online continual learning (OCL) algorithm is proposed based on the deep feature semantic augmentation and a new loss-based deep model and replay buffer update, which can mitigate the class imbalance between the old and new classes and alleviate confusion between two similar classes. Moreover, the mistake bound of the proposed method is analyzed in theory. OACL allows robots to select the most representative new samples to query labels and continually learn new objects and new variants of previously learned objects from a nonindependent and identically distributed (i.i.d.) data stream without catastrophic forgetting. Extensive experiments conducted on real lifelong robotic vision datasets demonstrate that our algorithm, even trained with fewer labeled samples and replay exemplars, can achieve state-of-the-art performance on OCL tasks.
在实际应用中,机器人系统会随着时间的推移从不断变化的环境中收集大量新数据。它们需要持续与外部世界进行交互并学习新知识,以适应环境。特别是,以在线和交互式方式进行终身目标识别是机器人系统一项至关重要且基础的能力。为满足这一现实需求,在本文中,我们针对机器人终身目标识别提出了一种在线主动持续学习(OACL)框架,适用于类别和领域随动态环境变化的场景。首先,为在尽可能降低标注成本的同时最大化性能,通过综合考虑样本的不确定性和多样性来设计一种新的在线主动学习(OAL)策略,以保护数据的信息量和分布。此外,为防止灾难性遗忘并降低内存成本,基于深度特征语义增强以及一种基于新损失的深度模型和重放缓冲区更新,提出了一种新颖的在线持续学习(OCL)算法,该算法可以减轻新旧类别之间的类别不平衡,并缓解两个相似类别之间的混淆。此外,从理论上分析了所提方法的错误界。OACL允许机器人选择最具代表性的新样本以查询标签,并从非独立同分布(i.i.d.)数据流中持续学习新对象以及先前学习对象的新变体,而不会出现灾难性遗忘。在真实的终身机器人视觉数据集上进行的大量实验表明,我们的算法即使在使用较少标注样本和重放示例进行训练的情况下,也能在OCL任务上实现领先的性能。