Learning Constrained Dynamic Correlations in Spatiotemporal Graphs for Motion Prediction.

Authors

Fu Jiajun, Yang Fuxing, Dang Yonghao, Liu Xiaoli, Yin Jianqin

Publication information

IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14273-14287. doi: 10.1109/TNNLS.2023.3277476. Epub 2024 Oct 7.

DOI: 10.1109/TNNLS.2023.3277476
PMID: 37256808
Abstract

Human motion prediction is challenging because it requires modeling complex spatiotemporal features. Among existing methods, graph convolutional networks (GCNs) are widely used because they model connections explicitly. Within a GCN, the graph correlation adjacency matrix drives feature aggregation and is therefore key to extracting predictive motion features. State-of-the-art methods decompose the spatiotemporal correlation into spatial correlations for each frame and temporal correlations for each joint. Directly parameterizing these correlations introduces redundant parameters to represent common relations shared by all frames and all joints. In addition, the spatiotemporal graph adjacency matrix is the same for different motion samples and thus cannot reflect sample-wise correspondence variances. To overcome these two bottlenecks, we propose dynamic spatiotemporal decompose graph convolution (DSTD-GC), which uses only 28.6% of the parameters of the state-of-the-art GC. The key to DSTD-GC is constrained dynamic correlation modeling, which explicitly parameterizes the common static constraints as a spatial/temporal vanilla adjacency matrix shared by all frames/joints and dynamically extracts correspondence variances for each frame/joint with an adjustment modeling function. For each sample, the common constrained adjacency matrices are fixed to represent generic motion patterns, while the extracted variances complete the matrices with specific pattern adjustments. Meanwhile, we mathematically reformulate GCs on spatiotemporal graphs into a unified form and find that DSTD-GC relaxes certain constraints of other GCs, which contributes to a better representation capability. Moreover, by combining DSTD-GC with prior knowledge such as body connections and temporal context, we propose a powerful spatiotemporal GCN called DSTD-GCN. On the Human3.6M, Carnegie Mellon University (CMU) Mocap, and 3D Poses in the Wild (3DPW) datasets, DSTD-GCN outperforms state-of-the-art methods by 3.9%-8.7% in prediction accuracy with 55.0%-96.9% fewer parameters. Code is available at https://github.com/Jaakk0F/DSTD-GCN.
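
The constrained dynamic correlation idea in the abstract can be illustrated with a minimal NumPy sketch, restricted to the spatial case. This is not the authors' implementation: the function names, the bilinear `tanh` adjustment function, the scale `alpha`, and all tensor sizes are hypothetical choices made for illustration. The sketch only shows the general pattern of one adjacency matrix shared by all frames, completed by a frame-specific adjustment computed from the input sample.

```python
import numpy as np

def spatial_gc_static(x, adj, weight):
    """Spatial graph convolution with one adjacency shared by all frames.

    x:      (T, J, C) features for T frames, J joints, C channels
    adj:    (J, J) static joint-to-joint adjacency (the shared constraint)
    weight: (C, C_out) feature transform
    """
    return np.einsum("ij,tjc,cd->tid", adj, x, weight)

def frame_adjustment(x, proj_q, proj_k):
    """Hypothetical adjustment function (an assumption, not the paper's).

    Projects each joint's features and compares joints pairwise, giving one
    (J, J) correction per frame that depends on the input sample.
    """
    q = x @ proj_q  # (T, J, H)
    k = x @ proj_k  # (T, J, H)
    return np.tanh(np.einsum("tih,tjh->tij", q, k))  # (T, J, J)

def spatial_gc_constrained_dynamic(x, adj, weight, proj_q, proj_k, alpha=0.1):
    """Shared adjacency plus a sample-dependent, frame-specific adjustment."""
    adj_dyn = adj[None, :, :] + alpha * frame_adjustment(x, proj_q, proj_k)
    return np.einsum("tij,tjc,cd->tid", adj_dyn, x, weight)

# Illustrative sizes only: 10 frames, 22 joints, 16 channels, 4 hidden dims.
rng = np.random.default_rng(0)
T, J, C, H = 10, 22, 16, 4
x = rng.standard_normal((T, J, C))
adj = 0.1 * rng.standard_normal((J, J))
weight = 0.1 * rng.standard_normal((C, C))
proj_q = 0.1 * rng.standard_normal((C, H))
proj_k = 0.1 * rng.standard_normal((C, H))

y_static = spatial_gc_static(x, adj, weight)
y_dynamic = spatial_gc_constrained_dynamic(x, adj, weight, proj_q, proj_k)

# Correlation parameters: one free (J, J) matrix per frame versus one shared
# matrix plus the small projections of the adjustment function.
per_frame_params = T * J * J
constrained_params = J * J + proj_q.size + proj_k.size
```

With these illustrative sizes, a separate adjacency matrix per frame needs `T * J * J` correlation parameters, while the shared matrix plus the adjustment projections needs far fewer. The counts depend entirely on the assumed sizes and are not meant to reproduce the 28.6% figure reported in the abstract.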

Similar articles

1. Learning Constrained Dynamic Correlations in Spatiotemporal Graphs for Motion Prediction.
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14273-14287. doi: 10.1109/TNNLS.2023.3277476. Epub 2024 Oct 7.
2. Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction.
IEEE Trans Image Process. 2024;33:1-15. doi: 10.1109/TIP.2023.3334954. Epub 2023 Dec 6.
3. Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction.
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3316-3333. doi: 10.1109/TPAMI.2021.3053765. Epub 2022 May 5.
4. An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.
Front Comput Neurosci. 2023 Apr 5;17:1145209. doi: 10.3389/fncom.2023.1145209. eCollection 2023.
5. Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based Motion Prediction.
IEEE Trans Image Process. 2021;30:7760-7775. doi: 10.1109/TIP.2021.3108708. Epub 2021 Sep 14.
6. Granger-Causality-Based Multi-Frequency Band EEG Graph Feature Extraction and Fusion for Emotion Recognition.
Brain Sci. 2022 Dec 1;12(12):1649. doi: 10.3390/brainsci12121649.
7. AMHGCN: Adaptive multi-level hypergraph convolution network for human motion prediction.
Neural Netw. 2024 Apr;172:106153. doi: 10.1016/j.neunet.2024.106153. Epub 2024 Jan 29.
8. Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos.
IEEE Trans Image Process. 2021;30:7914-7925. doi: 10.1109/TIP.2021.3109517. Epub 2021 Sep 21.
9. Graph Diffusion Convolutional Network for Skeleton Based Semantic Recognition of Two-Person Actions.
IEEE Trans Pattern Anal Mach Intell. 2023 Jul;45(7):8477-8493. doi: 10.1109/TPAMI.2023.3238411. Epub 2023 Jun 5.
10. STTG-net: a Spatio-temporal network for human motion prediction based on transformer and graph convolution network.
Vis Comput Ind Biomed Art. 2022 Jul 29;5(1):19. doi: 10.1186/s42492-022-00112-5.

Cited by

1. Learning behavior aware features across spaces for improved 3D human motion prediction.
Sci Rep. 2025 Aug 4;15(1):28355. doi: 10.1038/s41598-025-11073-z.
2. GT-SRR: A Structured Method for Social Relation Recognition with GGNN-Based Transformer.
Sensors (Basel). 2025 May 9;25(10):2992. doi: 10.3390/s25102992.
3. Parallel multi-stage rectification networks for 3D skeleton-based motion prediction.
Sci Rep. 2024 Oct 30;14(1):26058. doi: 10.1038/s41598-024-75782-7.