• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于体育分析中运动员群体活动识别的Transformer中的分层查询设计与分布式注意力

Hierarchical query design and distributed attention in transformer for player group activity recognition in sports analysis.

作者信息

Chen Xiao, Liu Zhi

机构信息

College of Physical Education and Recreation, Guangdong Ocean University, Zhanjiang, 524000, China.

Sports Department, University of Electronic Science and Technology of China, Chengdu, 611731, Sichuan, China.

出版信息

Sci Rep. 2025 Aug 27;15(1):31571. doi: 10.1038/s41598-025-16752-5.

DOI:10.1038/s41598-025-16752-5
PMID:40866462
Abstract

Group activity recognition in sports analysis is a critical challenge in computer vision, requiring robust modeling of complex player interactions and dynamic scenarios. Existing approaches predominantly rely on region-based features and two-stage pipelines involving individual localization and activity classification. These methods are inherently limited by their dependency on accurate bounding box detection and often struggle with feature entanglement, occlusions, and the integration of broader contextual information. To address these gaps, this study introduces the hierarchical query design and distributed attention framework within a transformer architecture, tailored specifically for player group activity recognition in sports. The proposed model, named the hierarchical attention query transformer (HAQT), leverages a novel dual-pathway architecture to decouple individual and group activity recognition. By employing hierarchical query design, the framework ensures efficient disentanglement of individual and group-level features. In contrast, a distributed attention mechanism facilitates refined communication within and across player groups. Additionally, the deformable transformer backbone dynamically aggregates multi-scale spatiotemporal features, enhancing the model's robustness to occlusions, variable player formations, and motion dynamics. The proposed set prediction paradigm eliminates reliance on bounding box accuracy, enabling precise player localization and activity classification. Comprehensive experiments on Volleyball and Basketball-51 datasets validate the effectiveness of the HAQT. On the Volleyball dataset, HAQT achieves a state-of-the-art mean Average Precision (mAP) of 92.8% for group activity recognition, significantly surpassing existing models. On the Basketball-51 dataset, it achieves an impressive accuracy of 92.76%, demonstrating its superior ability to model complex spatiotemporal dependencies.

摘要

体育分析中的群体活动识别是计算机视觉中的一项关键挑战,需要对复杂的运动员互动和动态场景进行稳健建模。现有方法主要依赖基于区域的特征和涉及个体定位与活动分类的两阶段流程。这些方法本质上受限于对精确边界框检测的依赖,并且常常在特征纠缠、遮挡以及更广泛上下文信息的整合方面存在困难。为了解决这些差距,本研究在Transformer架构中引入了分层查询设计和分布式注意力框架,专门针对体育中的运动员群体活动识别。所提出的模型名为分层注意力查询Transformer(HAQT),利用一种新颖的双路径架构来解耦个体和群体活动识别。通过采用分层查询设计,该框架确保了个体和群体级特征的有效解缠。相比之下,分布式注意力机制促进了运动员群体内部和之间的精细通信。此外,可变形Transformer主干动态聚合多尺度时空特征,增强了模型对遮挡、可变运动员阵型和运动动态的鲁棒性。所提出的集合预测范式消除了对边界框准确性的依赖,实现了精确的运动员定位和活动分类。在排球和篮球 - 51数据集上的综合实验验证了HAQT的有效性。在排球数据集上,HAQT在群体活动识别方面实现了92.8%的当前最优平均精度(mAP),显著超过现有模型。在篮球 - 51数据集上,它达到了令人印象深刻的92.76%的准确率,展示了其对复杂时空依赖进行建模的卓越能力。

相似文献

1
Hierarchical query design and distributed attention in transformer for player group activity recognition in sports analysis.用于体育分析中运动员群体活动识别的Transformer中的分层查询设计与分布式注意力
Sci Rep. 2025 Aug 27;15(1):31571. doi: 10.1038/s41598-025-16752-5.
2
Integrated neural network framework for multi-object detection and recognition using UAV imagery.用于使用无人机图像进行多目标检测与识别的集成神经网络框架。
Front Neurorobot. 2025 Jul 30;19:1643011. doi: 10.3389/fnbot.2025.1643011. eCollection 2025.
3
Dual-stream interactive mechanism with multi-modal hierarchical aggregation transformer for gait recognition.基于多模态分层聚合变换器的双流交互机制用于步态识别。
Sci Rep. 2025 Jul 18;15(1):26079. doi: 10.1038/s41598-025-10930-1.
4
Video swin-CLSTM transformer: Enhancing human action recognition with optical flow and long-term dependencies.视频双流卷积长短期记忆变压器:利用光流和长期依赖性增强人体动作识别
PLoS One. 2025 Jul 7;20(7):e0327717. doi: 10.1371/journal.pone.0327717. eCollection 2025.
5
Multi-level channel-spatial attention and light-weight scale-fusion network (MCSLF-Net): multi-level channel-spatial attention and light-weight scale-fusion transformer for 3D brain tumor segmentation.多级通道空间注意力与轻量级尺度融合网络(MCSLF-Net):用于3D脑肿瘤分割的多级通道空间注意力与轻量级尺度融合变换器
Quant Imaging Med Surg. 2025 Jul 1;15(7):6301-6325. doi: 10.21037/qims-2025-354. Epub 2025 Jun 30.
6
iACP-DPNet: a dual-pooling causal dilated convolutional network for interpretable anticancer peptide identification.iACP-DPNet:一种用于可解释抗癌肽识别的双池因果扩张卷积网络。
Funct Integr Genomics. 2025 Jul 4;25(1):147. doi: 10.1007/s10142-025-01641-x.
7
A dual-branch deep learning model based on fNIRS for assessing 3D visual fatigue.一种基于功能近红外光谱技术的双分支深度学习模型,用于评估三维视觉疲劳。
Front Neurosci. 2025 Jun 5;19:1589152. doi: 10.3389/fnins.2025.1589152. eCollection 2025.
8
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
9
ConvTransNet-S: A CNN-Transformer Hybrid Disease Recognition Model for Complex Field Environments.ConvTransNet-S:一种用于复杂现场环境的卷积神经网络与Transformer混合疾病识别模型。
Plants (Basel). 2025 Jul 22;14(15):2252. doi: 10.3390/plants14152252.
10
WSDC-ViT: a novel transformer network for pneumonia image classification based on windows scalable attention and dynamic rectified linear unit convolutional modules.WSDC-ViT:一种基于窗口可扩展注意力和动态整流线性单元卷积模块的新型肺炎图像分类变压器网络。
Sci Rep. 2025 Jul 30;15(1):27868. doi: 10.1038/s41598-025-12117-0.

本文引用的文献

1
ACA-Net: adaptive context-aware network for basketball action recognition.ACA-Net:用于篮球动作识别的自适应上下文感知网络
Front Neurorobot. 2024 Sep 25;18:1471327. doi: 10.3389/fnbot.2024.1471327. eCollection 2024.
2
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction.用于人体骨骼运动预测的时空协同注意力循环神经网络
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3300-3315. doi: 10.1109/TPAMI.2021.3050918. Epub 2022 May 5.
3
HiGCIN: Hierarchical Graph-Based Cross Inference Network for Group Activity Recognition.
HiGCIN:基于分层图的群组活动识别交叉推断网络。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6955-6968. doi: 10.1109/TPAMI.2020.3034233. Epub 2023 May 5.
4
Symbiotic Attention for Egocentric Action Recognition With Object-Centric Alignment.共生注意力的自我中心动作识别与目标中心对齐。
IEEE Trans Pattern Anal Mach Intell. 2023 Jun;45(6):6605-6617. doi: 10.1109/TPAMI.2020.3015894. Epub 2023 May 5.
5
EleAtt-RNN: Adding Attentiveness to Neurons in Recurrent Neural Networks.EleAtt-RNN:在循环神经网络中为神经元增添注意力机制
IEEE Trans Image Process. 2019 Sep 2. doi: 10.1109/TIP.2019.2937724.
6
Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.更快的 R-CNN:基于区域建议网络的实时目标检测。
IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149. doi: 10.1109/TPAMI.2016.2577031. Epub 2016 Jun 6.