Wang Guanlin
Faculty of Education, University of Macau, Macau, Macau SAR, China.
Front Neurorobot. 2024 Aug 14;18:1439188. doi: 10.3389/fnbot.2024.1439188. eCollection 2024.
In swimming, the posture and technique of athletes are crucial for improving performance. However, traditional swimming coaches often struggle to capture and analyze athletes' movements in real-time, which limits the effectiveness of coaching. Therefore, this paper proposes RL-CWtrans Net: a robot vision-driven multimodal swimming training system that provides precise and real-time guidance and feedback to swimmers. The system utilizes the Swin-Transformer as a computer vision model to effectively extract the motion and posture features of swimmers. Additionally, with the help of the CLIP model, the system can understand natural language instructions and descriptions related to swimming. By integrating visual and textual features, the system achieves a more comprehensive and accurate information representation. Finally, by employing reinforcement learning to train an intelligent agent, the system can provide personalized guidance and feedback based on multimodal inputs. Experimental results demonstrate significant advancements in accuracy and practicality for this multimodal robot swimming coaching system. The system is capable of capturing real-time movements and providing immediate feedback, thereby enhancing the effectiveness of swimming instruction. This technology holds promise.
在游泳运动中,运动员的姿势和技术对于提高成绩至关重要。然而,传统的游泳教练常常难以实时捕捉和分析运动员的动作,这限制了训练效果。因此,本文提出了RL-CWtrans Net:一种由机器人视觉驱动的多模态游泳训练系统,该系统能为游泳者提供精确的实时指导和反馈。该系统利用Swin-Transformer作为计算机视觉模型,有效提取游泳者的动作和姿势特征。此外,借助CLIP模型,该系统能够理解与游泳相关的自然语言指令和描述。通过整合视觉和文本特征,系统实现了更全面、准确的信息表示。最后,通过应用强化学习来训练智能体,该系统能够基于多模态输入提供个性化的指导和反馈。实验结果表明,这种多模态机器人游泳训练系统在准确性和实用性方面取得了显著进展。该系统能够捕捉实时动作并提供即时反馈,从而提高游泳教学的效果。这项技术前景广阔。