Keller Ingo, Lohan Katrin S
Department of Mathematical and Computer Science, Heriot-Watt University, Edinburgh, United Kingdom.
EMS Institute for Development of Mechatronic Systems, NTB University of Applied Sciences in Technology, Buchs, Switzerland.
Front Robot AI. 2020 Jan 21;6:154. doi: 10.3389/frobt.2019.00154. eCollection 2019.
Most collaborative tasks require interaction with everyday objects (e.g., utensils while cooking). Thus, robots must perceive everyday objects in an effective and efficient way. This highlights the necessity of understanding environmental factors and their impact on visual perception, such as illumination changes throughout the day on robotic systems in the real world. In object recognition, two of these factors are changes due to illumination of the scene and differences in the sensors capturing it. In this paper, we will present data augmentations for object recognition that enhance a deep learning architecture. We will show how simple linear and non-linear illumination models and feature concatenation can be used to improve deep learning-based approaches. The aim of this work is to allow for more realistic Human-Robot Interaction scenarios with a small amount of training data in combination with incremental interactive object learning. This will benefit the interaction with the robot to maximize object learning for long-term and location-independent learning in unshaped environments. With our model-based analysis, we showed that changes in illumination affect recognition approaches that use Deep Convolutional Neural Network to encode features for object recognition. Using data augmentation, we were able to show that such a system can be modified toward a more robust recognition without retraining the network. Additionally, we have shown that using simple brightness change models can help to improve the recognition across all training set sizes.
大多数协作任务都需要与日常物品进行交互(例如,烹饪时使用器具)。因此,机器人必须以有效且高效的方式感知日常物品。这凸显了理解环境因素及其对视觉感知的影响的必要性,比如现实世界中机器人系统一整天内的光照变化。在物体识别中,其中两个因素是场景光照引起的变化以及捕捉场景的传感器之间的差异。在本文中,我们将展示用于物体识别的数据增强方法,这些方法能增强深度学习架构。我们将展示简单的线性和非线性光照模型以及特征拼接如何用于改进基于深度学习的方法。这项工作的目的是在结合增量交互式物体学习的少量训练数据的情况下,实现更逼真的人机交互场景。这将有利于与机器人的交互,以在无特定形状的环境中实现长期且与位置无关的学习,从而最大限度地进行物体学习。通过我们基于模型的分析,我们表明光照变化会影响使用深度卷积神经网络对物体识别特征进行编码的识别方法。通过数据增强,我们能够证明这样的系统无需重新训练网络就能朝着更稳健的识别方向进行修改。此外,我们还表明使用简单的亮度变化模型有助于在所有训练集规模下提高识别效果。