Feng Liqi, Zhao Yaqin, Sun Yichao, Zhao Wenxuan, Tang Jiaxi
College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China.
Kidswant Children Products Co., Ltd., Nanjing 211135, China.
Animals (Basel). 2021 Feb 12;11(2):485. doi: 10.3390/ani11020485.
Behavior analysis of wild felines has significance for the protection of a grassland ecological environment. Compared with human action recognition, fewer researchers have focused on feline behavior analysis. This paper proposes a novel two-stream architecture that incorporates spatial and temporal networks for wild feline action recognition. The spatial portion outlines the object region extracted by Mask region-based convolutional neural network (R-CNN) and builds a Tiny Visual Geometry Group (VGG) network for static action recognition. Compared with VGG16, the Tiny VGG network can reduce the number of network parameters and avoid overfitting. The temporal part presents a novel skeleton-based action recognition model based on the bending angle fluctuation amplitude of the knee joints in a video clip. Due to its temporal features, the model can effectively distinguish between different upright actions, such as standing, ambling, and galloping, particularly when the felines are occluded by objects such as plants, fallen trees, and so on. The experimental results showed that the proposed two-stream network model can effectively outline the wild feline targets in captured images and can significantly improve the performance of wild feline action recognition due to its spatial and temporal features.
野生猫科动物的行为分析对于保护草原生态环境具有重要意义。与人类行为识别相比,关注猫科动物行为分析的研究人员较少。本文提出了一种新颖的双流架构,该架构结合了空间和时间网络用于野生猫科动物行为识别。空间部分勾勒出基于掩码区域的卷积神经网络(R-CNN)提取的目标区域,并构建一个微小视觉几何组(VGG)网络用于静态行为识别。与VGG16相比,微小VGG网络可以减少网络参数数量并避免过拟合。时间部分提出了一种基于视频片段中膝关节弯曲角度波动幅度的新颖的基于骨架的行为识别模型。由于其时间特征,该模型可以有效地区分不同的直立行为,例如站立、慢步和奔跑,特别是当猫科动物被植物、倒下的树木等物体遮挡时。实验结果表明,所提出的双流网络模型可以有效地勾勒出捕获图像中的野生猫科动物目标,并且由于其空间和时间特征,可以显著提高野生猫科动物行为识别的性能。