Fujian Normal University, Fuzhou 350108, Fujian, China.
Comput Intell Neurosci. 2022 May 29;2022:3832118. doi: 10.1155/2022/3832118. eCollection 2022.
Long jump is a test item of national student physical health monitoring, which can reflect the quality of students' lower limb strength. Long jump is a highly technical activity, which includes four basic movements: running aid, jumping, vacating, and landing. Many students have problems with the technical aspects, resulting in test scores that do not objectively reflect the true physical condition of the students, which affects the accuracy of the test results. From the perspective of rapid diagnostic feedback of students' long jump movements, we design and develop a long jump movement recognition method based on deep convolutional neural network. In this paper, we firstly summarize the traditional visual action recognition algorithm, then apply 3D convolution to extract the spatiotemporal features of long jump action from three directions of the video block, and fuse the spatiotemporal features of the three directions in different ways to achieve feature complementation; finally, using the multimodality of long jump action data, we use 3D convolutional neural network to train the RGB images and then train the depth. This joint training method can accelerate the convergence speed and improve the accuracy of the network on both depth and edge images. The experiments compared the recognition effects of the tandem fusion of features, the maximum fusion, and the multiplicative fusion in the scoring layer, and the highest accuracy of 82.3% was achieved by the tandem fusion of features with the fusion of three modalities.
跳远是国家学生体质健康监测的测试项目之一,它可以反映学生下肢力量的质量。跳远是一项高度技术性的活动,包括四个基本动作:助跑、起跳、腾空和落地。许多学生在技术方面存在问题,导致测试成绩不能客观反映学生的真实身体状况,从而影响测试结果的准确性。从学生跳远动作快速诊断反馈的角度出发,我们设计并开发了一种基于深度卷积神经网络的跳远动作识别方法。本文首先总结了传统的视觉动作识别算法,然后应用 3D 卷积从视频块的三个方向提取跳远动作的时空特征,并以不同的方式融合三个方向的时空特征,实现特征互补;最后,利用跳远动作数据的多模态性,使用 3D 卷积神经网络对 RGB 图像进行训练,然后对深度图像进行训练。这种联合训练方法可以加快网络在深度和边缘图像上的收敛速度,提高网络的准确性。实验比较了特征串联融合、最大融合和评分层乘积融合的识别效果,特征串联融合三种模态的融合效果最高,准确率达到 82.3%。