Suppr超能文献

使用时空网络对野生猫科动物进行动作识别

Action Recognition Using a Spatial-Temporal Network for Wild Felines.

作者信息

Feng Liqi, Zhao Yaqin, Sun Yichao, Zhao Wenxuan, Tang Jiaxi

机构信息

College of Mechanical and Electronic Engineering, Nanjing Forestry University, Nanjing 210037, China.

Kidswant Children Products Co., Ltd., Nanjing 211135, China.

出版信息

Animals (Basel). 2021 Feb 12;11(2):485. doi: 10.3390/ani11020485.

Abstract

Behavior analysis of wild felines has significance for the protection of a grassland ecological environment. Compared with human action recognition, fewer researchers have focused on feline behavior analysis. This paper proposes a novel two-stream architecture that incorporates spatial and temporal networks for wild feline action recognition. The spatial portion outlines the object region extracted by Mask region-based convolutional neural network (R-CNN) and builds a Tiny Visual Geometry Group (VGG) network for static action recognition. Compared with VGG16, the Tiny VGG network can reduce the number of network parameters and avoid overfitting. The temporal part presents a novel skeleton-based action recognition model based on the bending angle fluctuation amplitude of the knee joints in a video clip. Due to its temporal features, the model can effectively distinguish between different upright actions, such as standing, ambling, and galloping, particularly when the felines are occluded by objects such as plants, fallen trees, and so on. The experimental results showed that the proposed two-stream network model can effectively outline the wild feline targets in captured images and can significantly improve the performance of wild feline action recognition due to its spatial and temporal features.

摘要

野生猫科动物的行为分析对于保护草原生态环境具有重要意义。与人类行为识别相比,关注猫科动物行为分析的研究人员较少。本文提出了一种新颖的双流架构,该架构结合了空间和时间网络用于野生猫科动物行为识别。空间部分勾勒出基于掩码区域的卷积神经网络(R-CNN)提取的目标区域,并构建一个微小视觉几何组(VGG)网络用于静态行为识别。与VGG16相比,微小VGG网络可以减少网络参数数量并避免过拟合。时间部分提出了一种基于视频片段中膝关节弯曲角度波动幅度的新颖的基于骨架的行为识别模型。由于其时间特征,该模型可以有效地区分不同的直立行为,例如站立、慢步和奔跑,特别是当猫科动物被植物、倒下的树木等物体遮挡时。实验结果表明,所提出的双流网络模型可以有效地勾勒出捕获图像中的野生猫科动物目标,并且由于其空间和时间特征,可以显著提高野生猫科动物行为识别的性能。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8851/7917733/67310d7611a4/animals-11-00485-g001.jpg

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验