CNR-Italian National Research Council, Institute of Cognitive Sciences and Technologies, Via Gaifami 18, 95126 Catania, Italy.
CNR-Italian National Research Council, Institute of Cognitive Sciences and Technologies, Via S. Martino della Battaglia 44, 00185 Rome, Italy.
Sensors (Basel). 2022 Apr 28;22(9):3366. doi: 10.3390/s22093366.
The development of a Social Intelligence System based on artificial intelligence is one of the cutting edge technologies in Assistive Robotics. Such systems need to create an empathic interaction with the users; therefore, it os required to include an Emotion Recognition (ER) framework which has to run, in near real-time, together with several other intelligent services. Most of the low-cost commercial robots, however, although more accessible by users and healthcare facilities, have to balance costs and effectiveness, resulting in under-performing hardware in terms of memory and processing unit. This aspect makes the design of the systems challenging, requiring a trade-off between the accuracy and the complexity of the adopted models. This paper proposes a compact and robust service for Assistive Robotics, called (LEMON), which uses image processing, Computer Vision and Deep Learning (DL) algorithms to recognize facial expressions. Specifically, the proposed DL model is based on with the combination of and . The first remarkable result is the few numbers (i.e., 1.6 Million) of parameters characterizing our model. In addition, expand receptive fields exponentially with preserving resolution, less computation and memory cost to recognize the distinction among facial expressions by capturing the displacement of the pixels. Finally, to reduce the dying ReLU problem and improve the stability of the model, we apply an (ELU) activation function in the initial layers of the model. We have performed training and evaluation (via one- and five-fold cross validation) of the model with five datasets available in the community and one mixed dataset created by taking samples from all of them. With respect to the other approaches, our model achieves comparable results with a significant reduction in terms of the number of parameters.
基于人工智能的社交智能系统的开发是辅助机器人领域的前沿技术之一。此类系统需要与用户建立共情交互,因此需要包含情感识别 (ER) 框架,该框架必须与其他几个智能服务一起近乎实时运行。然而,大多数低成本商业机器人虽然更容易被用户和医疗机构使用,但必须在成本和有效性之间取得平衡,这导致硬件在内存和处理单元方面表现不佳。这一方面使得系统的设计具有挑战性,需要在采用的模型的准确性和复杂性之间进行权衡。本文提出了一种名为 (LEMON) 的紧凑且强大的辅助机器人服务,它使用图像处理、计算机视觉和深度学习 (DL) 算法来识别面部表情。具体来说,所提出的 DL 模型基于 ,结合 和 。第一个显著的结果是我们的模型所具有的特征数量很少(即 160 万)。此外, 指数级扩展感受野,同时保持分辨率,计算和存储成本更低,通过捕捉像素的位移来识别面部表情的区别。最后,为了减少 ReLU 死亡问题并提高模型的稳定性,我们在模型的初始层应用了 (ELU)激活函数。我们使用社区中可用的五个数据集和一个从所有数据集获取样本创建的混合数据集对模型进行了训练和评估(通过一次和五折交叉验证)。与其他方法相比,我们的模型在参数数量显著减少的情况下取得了可比的结果。