Liu Shaoqing, Li Cun
Langfang Health Vocational College, Langfang, Hebei, China.
PeerJ Comput Sci. 2023 Jan 20;9:e1206. doi: 10.7717/peerj-cs.1206. eCollection 2023.
In the era of health big data, with the continuous development of information technology, students' physical health management also relies more on various information technologies. Blockchain, as an emerging technology in recent years, has the characteristics of high efficiency and intelligence. College physical education is an important part of college students' health big data. Unlike cultural classes, physical education with its rich movements and activities, leaves teachers no time to monitor students' real classroom performance. Therefore, we propose a human pose estimation method based on cross-attention-based Transformer multi-scale representation learning to monitor students' class concentration. Firstly, the feature maps with different resolution are obtained by deep convolutional network and these feature maps are transformed into multi-scale visual markers. Secondly, we propose a cross-attention module with the multi-scales. The module reduces the redundancy of key point markers and the number of cross fusion operations through multiple interactions between feature markers with different resolutions and the strategy of moving key points for key point markers. Finally, the cross-attention fusion module extracts feature information of different scales from feature tags to form key tags. We can confirm the performance of the cross-attention module and the fusion module by the experimental results conducting on MSCOCO datasets, which can effectively promote the Transformer encoder to learn the association relationship between key points. Compared with the completive TokenPose method, our method can reduce the computational cost by 11.8% without reducing the performance.
在健康大数据时代,随着信息技术的不断发展,学生的身体健康管理也越来越依赖于各种信息技术。区块链作为近年来新兴的技术,具有高效性和智能性的特点。高校体育是大学生健康大数据的重要组成部分。与文化课程不同,体育课程有着丰富的动作和活动,使得教师无暇监控学生在课堂上的真实表现。因此,我们提出一种基于基于交叉注意力的Transformer多尺度表示学习的人体姿态估计方法,以监控学生的课堂专注度。首先,通过深度卷积网络获得具有不同分辨率的特征图,并将这些特征图转换为多尺度视觉标记。其次,我们提出一种多尺度交叉注意力模块。该模块通过不同分辨率的特征标记之间的多次交互以及关键点标记的关键点移动策略,减少了关键点标记的冗余和交叉融合操作的数量。最后,交叉注意力融合模块从特征标签中提取不同尺度的特征信息以形成关键标签。通过在MSCOCO数据集上进行的实验结果,我们可以确认交叉注意力模块和融合模块的性能,这可以有效地促进Transformer编码器学习关键点之间的关联关系。与竞争方法TokenPose相比,我们的方法在不降低性能的情况下可以将计算成本降低11.8%。