• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于双向长短期记忆融合注意力机制的排球训练视频分类描述

Volleyball training video classification description using the BiLSTM fusion attention mechanism.

作者信息

Ruiye Zhao

机构信息

School of Civil Aviation and Transportation, Zhejiang Yuying College of Vocational Technology, Hangzhou, 310018, China.

出版信息

Heliyon. 2024 Jul 16;10(15):e34735. doi: 10.1016/j.heliyon.2024.e34735. eCollection 2024 Aug 15.

DOI:10.1016/j.heliyon.2024.e34735
PMID:39144994
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11320147/
Abstract

This study aims to explore methods for classifying and describing volleyball training videos using deep learning techniques. By developing an innovative model that integrates Bi-directional Long Short-Term Memory (BiLSTM) and attention mechanisms, referred to BiLSTM-Multimodal Attention Fusion Temporal Classification (BiLSTM-MAFTC), the study enhances the accuracy and efficiency of volleyball video content analysis. Initially, the model encodes features from various modalities into feature vectors, capturing different types of information such as positional and modal data. The BiLSTM network is then used to model multi-modal temporal information, while spatial and channel attention mechanisms are incorporated to form a dual-attention module. This module establishes correlations between different modality features, extracting valuable information from each modality and uncovering complementary information across modalities. Extensive experiments validate the method's effectiveness and state-of-the-art performance. Compared to conventional recurrent neural network algorithms, the model achieves recognition accuracies exceeding 95 % under Top-1 and Top-5 metrics for action recognition, with a recognition speed of 0.04 s per video. The study demonstrates that the model can effectively process and analyze multimodal temporal information, including athlete movements, positional relationships on the court, and ball trajectories. Consequently, precise classification and description of volleyball training videos are achieved. This advancement significantly enhances the efficiency of coaches and athletes in volleyball training and provides valuable insights for broader sports video analysis research.

摘要

本研究旨在探索使用深度学习技术对排球训练视频进行分类和描述的方法。通过开发一种集成双向长短期记忆(BiLSTM)和注意力机制的创新模型,即BiLSTM-多模态注意力融合时间分类(BiLSTM-MAFTC),该研究提高了排球视频内容分析的准确性和效率。最初,该模型将来自各种模态的特征编码为特征向量,捕获诸如位置和模态数据等不同类型的信息。然后使用BiLSTM网络对多模态时间信息进行建模,同时结合空间和通道注意力机制形成双注意力模块。该模块在不同模态特征之间建立关联,从每个模态中提取有价值的信息,并揭示跨模态的互补信息。大量实验验证了该方法的有效性和领先性能。与传统递归神经网络算法相比,该模型在动作识别的Top-1和Top-5指标下实现了超过95%的识别准确率,每个视频的识别速度为0.04秒。该研究表明,该模型可以有效地处理和分析多模态时间信息,包括运动员动作、场上位置关系和球的轨迹。因此,实现了对排球训练视频的精确分类和描述。这一进展显著提高了排球训练中教练和运动员的效率,并为更广泛的体育视频分析研究提供了有价值的见解。

相似文献

1
Volleyball training video classification description using the BiLSTM fusion attention mechanism.基于双向长短期记忆融合注意力机制的排球训练视频分类描述
Heliyon. 2024 Jul 16;10(15):e34735. doi: 10.1016/j.heliyon.2024.e34735. eCollection 2024 Aug 15.
2
Research on cross-modal emotion recognition based on multi-layer semantic fusion.基于多层语义融合的跨模态情感识别研究
Math Biosci Eng. 2024 Jan 17;21(2):2488-2514. doi: 10.3934/mbe.2024110.
3
Spatial-frequency-temporal convolutional recurrent network for olfactory-enhanced EEG emotion recognition.基于空间频率-时间卷积循环网络的嗅觉增强脑电情感识别
J Neurosci Methods. 2022 Jul 1;376:109624. doi: 10.1016/j.jneumeth.2022.109624. Epub 2022 May 16.
4
Analysis of Volleyball Video Intelligent Description Technology Based on Computer Memory Network and Attention Mechanism.基于计算机存储网络和注意力机制的排球视频智能描述技术分析。
Comput Intell Neurosci. 2021 Dec 28;2021:7976888. doi: 10.1155/2021/7976888. eCollection 2021.
5
Research on Volleyball Video Intelligent Description Technology Combining the Long-Term and Short-Term Memory Network and Attention Mechanism.结合长短时记忆网络和注意力机制的排球视频智能描述技术研究。
Comput Intell Neurosci. 2021 Oct 14;2021:7088837. doi: 10.1155/2021/7088837. eCollection 2021.
6
Cross-modal self-attention mechanism for controlling robot volleyball motion.用于控制机器人排球运动的跨模态自注意力机制。
Front Neurorobot. 2023 Nov 10;17:1288463. doi: 10.3389/fnbot.2023.1288463. eCollection 2023.
7
The intelligent football players' motion recognition system based on convolutional neural network and big data.基于卷积神经网络和大数据的智能足球运动员动作识别系统
Heliyon. 2023 Nov 14;9(11):e22316. doi: 10.1016/j.heliyon.2023.e22316. eCollection 2023 Nov.
8
Volleyball Movement Standardization Recognition Model Based on Convolutional Neural Network.基于卷积神经网络的排球动作标准化识别模型。
Comput Intell Neurosci. 2023 Jan 25;2023:6116144. doi: 10.1155/2023/6116144. eCollection 2023.
9
Multimodal Gait Abnormality Recognition Using a Convolutional Neural Network-Bidirectional Long Short-Term Memory (CNN-BiLSTM) Network Based on Multi-Sensor Data Fusion.基于多传感器数据融合的卷积神经网络-双向长短时记忆网络(CNN-BiLSTM)的多模态步态异常识别
Sensors (Basel). 2023 Nov 10;23(22):9101. doi: 10.3390/s23229101.
10
CMAF-Net: a cross-modal attention fusion-based deep neural network for incomplete multi-modal brain tumor segmentation.CMAF-Net:一种基于跨模态注意力融合的深度神经网络,用于不完全多模态脑肿瘤分割。
Quant Imaging Med Surg. 2024 Jul 1;14(7):4579-4604. doi: 10.21037/qims-24-9. Epub 2024 Jun 27.

引用本文的文献

1
Verification and application of deep learning models in daily sports activities of teenagers.深度学习模型在青少年日常体育活动中的验证与应用
PLoS One. 2025 Jun 4;20(6):e0322166. doi: 10.1371/journal.pone.0322166. eCollection 2025.

本文引用的文献

1
Individual Locating of Soccer Players from a Single Moving View.基于单一移动视角的足球运动员个体定位
Sensors (Basel). 2023 Sep 16;23(18):7938. doi: 10.3390/s23187938.
2
Skilled Performers Show Right Parietal Lateralization during Anticipation of Volleyball Attacks.熟练表演者在预判排球进攻时表现出右侧顶叶偏侧化。
Brain Sci. 2023 Aug 15;13(8):1204. doi: 10.3390/brainsci13081204.
3
Towards the development of a quality youth sport experience measure: Understanding participant and stakeholder perspectives.迈向优质青少年体育体验衡量标准的发展:理解参与者和利益相关者的视角。
PLoS One. 2023 Jul 7;18(7):e0287387. doi: 10.1371/journal.pone.0287387. eCollection 2023.
4
Mechanisms of human dynamic object recognition revealed by sequential deep neural networks.基于序贯深度神经网络揭示的人类动态物体识别机制
PLoS Comput Biol. 2023 Jun 9;19(6):e1011169. doi: 10.1371/journal.pcbi.1011169. eCollection 2023 Jun.
5
Intelligent auxiliary system for music performance under edge computing and long short-term recurrent neural networks.基于边缘计算和长短时记忆递归神经网络的音乐演奏智能辅助系统。
PLoS One. 2023 May 8;18(5):e0285496. doi: 10.1371/journal.pone.0285496. eCollection 2023.
6
Study on Volleyball-Movement Pose Recognition Based on Joint Point Sequence.基于关节点序列的排球运动姿势识别研究。
Comput Intell Neurosci. 2023 Feb 17;2023:2198495. doi: 10.1155/2023/2198495. eCollection 2023.
7
RNN and BiLSTM Fusion for Accurate Automatic Epileptic Seizure Diagnosis Using EEG Signals.基于脑电图信号的RNN与双向长短期记忆网络融合用于准确自动癫痫发作诊断
Life (Basel). 2022 Nov 22;12(12):1946. doi: 10.3390/life12121946.
8
AAP-MIT: Attentive Atrous Pyramid Network and Memory Incorporated Transformer for Multisentence Video Description.AAP-MIT:用于多句子视频描述的注意多孔金字塔网络和记忆整合转换器。
IEEE Trans Image Process. 2022;31:5559-5569. doi: 10.1109/TIP.2022.3195643. Epub 2022 Aug 26.
9
EEG-Based Emotion Recognition Using Spatial-Temporal Graph Convolutional LSTM With Attention Mechanism.基于 EEG 的情绪识别使用时空图卷积 LSTM 与注意力机制。
IEEE J Biomed Health Inform. 2022 Nov;26(11):5406-5417. doi: 10.1109/JBHI.2022.3198688. Epub 2022 Nov 10.
10
A novel hybrid learning paradigm with feature extraction for carbon price prediction based on Bi-directional long short-term memory network optimized by an improved sparrow search algorithm.基于双向长短期记忆网络优化的改进麻雀搜索算法的碳价预测新型混合学习范式及特征提取
Environ Sci Pollut Res Int. 2022 Sep;29(43):65585-65598. doi: 10.1007/s11356-022-20450-4. Epub 2022 Apr 30.