RL-CWtrans网络：基于机器人视觉驱动的多模态游泳训练指导

RL-CWtrans Net: multimodal swimming coaching driven via robot vision.

作者信息

Wang Guanlin

机构信息

Faculty of Education, University of Macau, Macau, Macau SAR, China.

出版信息

Front Neurorobot. 2024 Aug 14;18:1439188. doi: 10.3389/fnbot.2024.1439188. eCollection 2024.

DOI:10.3389/fnbot.2024.1439188

PMID:39205877

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11349712/

Abstract

In swimming, the posture and technique of athletes are crucial for improving performance. However, traditional swimming coaches often struggle to capture and analyze athletes' movements in real-time, which limits the effectiveness of coaching. Therefore, this paper proposes RL-CWtrans Net: a robot vision-driven multimodal swimming training system that provides precise and real-time guidance and feedback to swimmers. The system utilizes the Swin-Transformer as a computer vision model to effectively extract the motion and posture features of swimmers. Additionally, with the help of the CLIP model, the system can understand natural language instructions and descriptions related to swimming. By integrating visual and textual features, the system achieves a more comprehensive and accurate information representation. Finally, by employing reinforcement learning to train an intelligent agent, the system can provide personalized guidance and feedback based on multimodal inputs. Experimental results demonstrate significant advancements in accuracy and practicality for this multimodal robot swimming coaching system. The system is capable of capturing real-time movements and providing immediate feedback, thereby enhancing the effectiveness of swimming instruction. This technology holds promise.

摘要

在游泳运动中，运动员的姿势和技术对于提高成绩至关重要。然而，传统的游泳教练常常难以实时捕捉和分析运动员的动作，这限制了训练效果。因此，本文提出了RL-CWtrans Net：一种由机器人视觉驱动的多模态游泳训练系统，该系统能为游泳者提供精确的实时指导和反馈。该系统利用Swin-Transformer作为计算机视觉模型，有效提取游泳者的动作和姿势特征。此外，借助CLIP模型，该系统能够理解与游泳相关的自然语言指令和描述。通过整合视觉和文本特征，系统实现了更全面、准确的信息表示。最后，通过应用强化学习来训练智能体，该系统能够基于多模态输入提供个性化的指导和反馈。实验结果表明，这种多模态机器人游泳训练系统在准确性和实用性方面取得了显著进展。该系统能够捕捉实时动作并提供即时反馈，从而提高游泳教学的效果。这项技术前景广阔。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/2968/11349712/5ac0d8bbfdad/fnbot-18-1439188-g0001.jpg

相似文献

RL-CWtrans Net: multimodal swimming coaching driven via robot vision.

Front Neurorobot. 2024 Aug 14;18:1439188. doi: 10.3389/fnbot.2024.1439188. eCollection 2024.

Swimtrans Net: a multimodal robotic system for swimming action recognition driven via Swin-Transformer.

Front Neurorobot. 2024 Sep 24;18:1452019. doi: 10.3389/fnbot.2024.1452019. eCollection 2024.

Sports competition tactical analysis model of cross-modal transfer learning intelligent robot based on Swin Transformer and CLIP.

Front Neurorobot. 2023 Oct 30;17:1275645. doi: 10.3389/fnbot.2023.1275645. eCollection 2023.

Construction of Swimmer's Underwater Posture Training Model Based on Multimodal Neural Network Model.

Comput Intell Neurosci. 2022 Apr 11;2022:1134558. doi: 10.1155/2022/1134558. eCollection 2022.

Framework for Intelligent Swimming Analytics with Wearable Sensors for Stroke Classification.

Sensors (Basel). 2021 Jul 30;21(15):5162. doi: 10.3390/s21155162.

Swin-GA-RF: genetic algorithm-based Swin Transformer and random forest for enhancing cervical cancer classification.

Front Oncol. 2024 Jul 19;14:1392301. doi: 10.3389/fonc.2024.1392301. eCollection 2024.

An adaptive reinforcement learning-based multimodal data fusion framework for human-robot confrontation gaming.

Neural Netw. 2023 Jul;164:489-496. doi: 10.1016/j.neunet.2023.04.043. Epub 2023 May 6.

Swin-Net: A Swin-Transformer-Based Network Combing with Multi-Scale Features for Segmentation of Breast Tumor Ultrasound Images.

Diagnostics (Basel). 2024 Jan 26;14(3):269. doi: 10.3390/diagnostics14030269.

SwinCross: Cross-modal Swin transformer for head-and-neck tumor segmentation in PET/CT images.

Med Phys. 2024 Mar;51(3):2096-2107. doi: 10.1002/mp.16703. Epub 2023 Sep 30.

Multi-dimensional fusion: transformer and GANs-based multimodal audiovisual perception robot for musical performance art.

Front Neurorobot. 2023 Sep 29;17:1281944. doi: 10.3389/fnbot.2023.1281944. eCollection 2023.

引用本文的文献

Interdisciplinary approaches to image processing for medical robotics.

Front Med (Lausanne). 2025 Jun 2;12:1564678. doi: 10.3389/fmed.2025.1564678. eCollection 2025.

Cross-attention swin-transformer for detailed segmentation of ancient architectural color patterns.

Front Neurorobot. 2024 Dec 13;18:1513488. doi: 10.3389/fnbot.2024.1513488. eCollection 2024.

本文引用的文献

Learning-based personalisation of robot behaviour for robot-assisted therapy.

Front Robot AI. 2024 Apr 8;11:1352152. doi: 10.3389/frobt.2024.1352152. eCollection 2024.

Cross-modal self-attention mechanism for controlling robot volleyball motion.

Front Neurorobot. 2023 Nov 10;17:1288463. doi: 10.3389/fnbot.2023.1288463. eCollection 2023.

Editorial: Recent advances in image fusion and quality improvement for cyber-physical systems.

Front Neurorobot. 2023 May 4;17:1201266. doi: 10.3389/fnbot.2023.1201266. eCollection 2023.

AquaClimber: a limbed swimming and climbing robot based on reduced order models.

Bioinspir Biomim. 2022 Nov 16;18(1). doi: 10.1088/1748-3190/aca05c.

SmartSwim, a Novel IMU-Based Coaching Assistance.

Sensors (Basel). 2022 Apr 27;22(9):3356. doi: 10.3390/s22093356.

Stability and manoeuvrability in animal movement: lessons from biology, modelling and robotics.

Proc Biol Sci. 2022 Jan 26;289(1967):20212492. doi: 10.1098/rspb.2021.2492. Epub 2022 Jan 19.

Design of a Robotic Coach for Motor, Social and Cognitive Skills Training Toward Applications With ASD Children.

IEEE Trans Neural Syst Rehabil Eng. 2021;29:1223-1232. doi: 10.1109/TNSRE.2021.3091320. Epub 2021 Jul 1.

Intelligent Trainer for Dyna-Style Model-Based Deep Reinforcement Learning.

IEEE Trans Neural Netw Learn Syst. 2021 Jun;32(6):2758-2771. doi: 10.1109/TNNLS.2020.3008249. Epub 2021 Jun 2.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

RL-CWtrans网络：基于机器人视觉驱动的多模态游泳训练指导

RL-CWtrans Net: multimodal swimming coaching driven via robot vision.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献