• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

通过分层姿态引导的多阶段对比回归进行动作质量评估

Action Quality Assessment via Hierarchical Pose-Guided Multi-Stage Contrastive Regression.

作者信息

Qi Mengshi, Ye Hao, Peng Jiaxuan, Ma Huadong

出版信息

IEEE Trans Image Process. 2025;34:6461-6474. doi: 10.1109/TIP.2025.3613952.

DOI:10.1109/TIP.2025.3613952
PMID:41032565
Abstract

Action Quality Assessment (AQA), which aims at the automatic and fair evaluation of athletic performance, has gained increasing attention in recent years. However, athletes are often in rapid movement and the corresponding visual appearance variances are subtle, making it challenging to capture fine-grained pose differences and leading to poor estimation performance. Furthermore, most common AQA tasks, such as diving in sports, are usually divided into multiple sub-actions, each of which contains different durations. However, existing methods focus on segmenting the video into fixed frames, which disrupts the temporal continuity of sub-actions resulting in unavoidable prediction errors. To address these challenges, we propose a novel action quality assessment method through hierarchically pose-guided multi-stage contrastive regression. Firstly, we introduce a multi-scale dynamic visual-skeleton encoder to capture fine-grained spatio-temporal visual and skeletal features. Compared to mask or auxiliary visual features, skeletal features provide a more accurate representation during athletic movements. Then, a procedure segmentation network is introduced to separate different sub-actions and obtain segmented features. Afterwards, the segmented visual and skeletal features are both fed into a multi-modal fusion module as physics structural priors, to guide the model in learning refined activity similarities and variances. Finally, a multi-stage contrastive learning regression approach is employed to learn discriminative representations and output prediction results. In addition, we introduce a newly-annotated FineDiving-Pose Dataset to improve the current low-quality human pose labels. In experiments, the results on FineDiving and MTL-AQA datasets demonstrate the effectiveness and superiority of our proposed approach. Our source code and dataset are available at https://github.com/Lumos0507/HP-MCoRe.

摘要

动作质量评估(AQA)旨在对运动表现进行自动且公平的评估,近年来受到了越来越多的关注。然而,运动员通常处于快速运动中,相应的视觉外观差异很细微,这使得捕捉细粒度的姿势差异具有挑战性,并导致估计性能不佳。此外,大多数常见的AQA任务,如体育项目中的跳水,通常分为多个子动作,每个子动作包含不同的持续时间。然而,现有方法专注于将视频分割成固定的帧,这破坏了子动作的时间连续性,导致不可避免的预测误差。为了应对这些挑战,我们提出了一种通过分层姿势引导的多阶段对比回归的新型动作质量评估方法。首先,我们引入了一个多尺度动态视觉骨架编码器来捕捉细粒度的时空视觉和骨骼特征。与掩码或辅助视觉特征相比,骨骼特征在运动过程中提供了更准确的表示。然后,引入一个过程分割网络来分离不同的子动作并获得分割后的特征。之后,分割后的视觉和骨骼特征都作为物理结构先验输入到一个多模态融合模块中,以指导模型学习精细的活动相似性和差异。最后,采用多阶段对比学习回归方法来学习判别性表示并输出预测结果。此外,我们引入了一个新标注的FineDiving-Pose数据集来改善当前低质量的人体姿势标签。在实验中,在FineDiving和MTL-AQA数据集上的结果证明了我们提出的方法的有效性和优越性。我们的源代码和数据集可在https://github.com/Lumos0507/HP-MCoRe获取。

相似文献

1
Action Quality Assessment via Hierarchical Pose-Guided Multi-Stage Contrastive Regression.通过分层姿态引导的多阶段对比回归进行动作质量评估
IEEE Trans Image Process. 2025;34:6461-6474. doi: 10.1109/TIP.2025.3613952.
2
Mid Forehead Brow Lift额中眉提升术
3
Shoulder Arthrogram肩关节造影
4
Prescription of Controlled Substances: Benefits and Risks管制药品的处方:益处与风险
5
Vesicoureteral Reflux膀胱输尿管反流
6
FLAG3D++: A Benchmark for 3D Fitness Activity Comprehension With Language Instruction.
IEEE Trans Pattern Anal Mach Intell. 2025 Nov;47(11):9731-9748. doi: 10.1109/TPAMI.2025.3590012.
7
MSCT-UNET: multi-scale contrastive transformer within U-shaped network for medical image segmentation.MSCT-UNET:U 形网络中的多尺度对比变换用于医学图像分割。
Phys Med Biol. 2023 Dec 28;69(1). doi: 10.1088/1361-6560/ad135d.
8
Short-Term Memory Impairment短期记忆障碍
9
Human-Centric Fine-Grained Action Quality Assessment.
IEEE Trans Pattern Anal Mach Intell. 2025 Aug;47(8):6242-6255. doi: 10.1109/TPAMI.2025.3556935.
10
Retinal vessel segmentation driven by structure prior tokens.由结构先验标记驱动的视网膜血管分割
Med Phys. 2025 Aug;52(8):e18018. doi: 10.1002/mp.18018.