• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

STTG网络:一种基于Transformer和图卷积网络的人体运动预测时空网络。

STTG-net: a Spatio-temporal network for human motion prediction based on transformer and graph convolution network.

作者信息

Chen Lujing, Liu Rui, Yang Xin, Zhou Dongsheng, Zhang Qiang, Wei Xiaopeng

机构信息

National and Local Joint Engineering Laboratory of Computer Aided Design, School of Software Engineering, Dalian University, Dalian, 116622, China.

School of Computer Science and Technology, Dalian University of Technology, Dalian, 116024, China.

出版信息

Vis Comput Ind Biomed Art. 2022 Jul 29;5(1):19. doi: 10.1186/s42492-022-00112-5.

DOI:10.1186/s42492-022-00112-5
PMID:35904666
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9338210/
Abstract

In recent years, human motion prediction has become an active research topic in computer vision. However, owing to the complexity and stochastic nature of human motion, it remains a challenging problem. In previous works, human motion prediction has always been treated as a typical inter-sequence problem, and most works have aimed to capture the temporal dependence between successive frames. However, although these approaches focused on the effects of the temporal dimension, they rarely considered the correlation between different joints in space. Thus, the spatio-temporal coupling of human joints is considered, to propose a novel spatio-temporal network based on a transformer and a gragh convolutional network (GCN) (STTG-Net). The temporal transformer is used to capture the global temporal dependencies, and the spatial GCN module is used to establish local spatial correlations between the joints for each frame. To overcome the problems of error accumulation and discontinuity in the motion prediction, a revision method based on fusion strategy is also proposed, in which the current prediction frame is fused with the previous frame. The experimental results show that the proposed prediction method has less prediction error and the prediction motion is smoother than previous prediction methods. The effectiveness of the proposed method is also demonstrated comparing it with the state-of-the-art method on the Human3.6 M dataset.

摘要

近年来,人体运动预测已成为计算机视觉领域一个活跃的研究课题。然而,由于人体运动的复杂性和随机性,它仍然是一个具有挑战性的问题。在以往的工作中,人体运动预测一直被视为一个典型的序列间问题,大多数工作旨在捕捉连续帧之间的时间依赖性。然而,尽管这些方法关注时间维度的影响,但它们很少考虑不同关节在空间上的相关性。因此,考虑人体关节的时空耦合,提出了一种基于Transformer和图卷积网络(GCN)的新型时空网络(STTG-Net)。时间Transformer用于捕捉全局时间依赖性,空间GCN模块用于为每一帧建立关节之间的局部空间相关性。为了克服运动预测中的误差累积和不连续性问题,还提出了一种基于融合策略的修正方法,即将当前预测帧与前一帧进行融合。实验结果表明,所提出的预测方法具有较小的预测误差,且预测运动比以往的预测方法更平滑。通过在Human3.6 M数据集上与现有最先进方法进行比较,也证明了所提方法的有效性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/57d6568640ce/42492_2022_112_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/05fd501d07b8/42492_2022_112_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/ef5143d220fd/42492_2022_112_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/2b8a6ebca876/42492_2022_112_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/71ffbdc3e531/42492_2022_112_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/57d6568640ce/42492_2022_112_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/05fd501d07b8/42492_2022_112_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/ef5143d220fd/42492_2022_112_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/2b8a6ebca876/42492_2022_112_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/71ffbdc3e531/42492_2022_112_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7f13/9338210/57d6568640ce/42492_2022_112_Fig5_HTML.jpg

相似文献

1
STTG-net: a Spatio-temporal network for human motion prediction based on transformer and graph convolution network.STTG网络:一种基于Transformer和图卷积网络的人体运动预测时空网络。
Vis Comput Ind Biomed Art. 2022 Jul 29;5(1):19. doi: 10.1186/s42492-022-00112-5.
2
Automated freezing of gait assessment with marker-based motion capture and multi-stage spatial-temporal graph convolutional neural networks.基于标记的运动捕捉和多阶段时空图卷积神经网络的自动化冻结步态评估。
J Neuroeng Rehabil. 2022 May 21;19(1):48. doi: 10.1186/s12984-022-01025-3.
3
SGGformer: Shifted Graph Convolutional Graph-Transformer for Traffic Prediction.SGGformer:用于交通预测的移位图卷积图变换器
Sensors (Basel). 2022 Nov 21;22(22):9024. doi: 10.3390/s22229024.
4
A novel hybrid framework based on temporal convolution network and transformer for network traffic prediction.基于时间卷积网络和转换器的新型混合框架用于网络流量预测。
PLoS One. 2023 Sep 8;18(9):e0288935. doi: 10.1371/journal.pone.0288935. eCollection 2023.
5
An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.一种基于改进图卷积网络(GCN)的用于3D人体运动预测的初始预测和微调模型。
Front Comput Neurosci. 2023 Apr 5;17:1145209. doi: 10.3389/fncom.2023.1145209. eCollection 2023.
6
Spatial linear transformer and temporal convolution network for traffic flow prediction.用于交通流预测的空间线性变压器和时间卷积网络
Sci Rep. 2024 Feb 19;14(1):4040. doi: 10.1038/s41598-024-54114-9.
7
MSST-RT: Multi-Stream Spatial-Temporal Relative Transformer for Skeleton-Based Action Recognition.基于骨架的动作识别的多流时空相对Transformer(MSST-RT):Multi-Stream Spatial-Temporal Relative Transformer for Skeleton-Based Action Recognition。
Sensors (Basel). 2021 Aug 7;21(16):5339. doi: 10.3390/s21165339.
8
Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos.学习用于视频中3D姿态估计的动态人体关节亲和性
IEEE Trans Image Process. 2021;30:7914-7925. doi: 10.1109/TIP.2021.3109517. Epub 2021 Sep 21.
9
Learning Constrained Dynamic Correlations in Spatiotemporal Graphs for Motion Prediction.用于运动预测的时空图中学习约束动态相关性
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14273-14287. doi: 10.1109/TNNLS.2023.3277476. Epub 2024 Oct 7.
10
Exploiting dynamic spatio-temporal graph convolutional neural networks for citywide traffic flows prediction.利用动态时空图卷积神经网络进行全市交通流预测。
Neural Netw. 2022 Jan;145:233-247. doi: 10.1016/j.neunet.2021.10.021. Epub 2021 Oct 28.

引用本文的文献

1
Achieving view-distance and -angle invariance in motion prediction using a simple network.使用简单网络在运动预测中实现视距和视角不变性。
Vis Comput Ind Biomed Art. 2024 Oct 28;7(1):26. doi: 10.1186/s42492-024-00176-5.

本文引用的文献

1
Generative model-enhanced human motion prediction.生成模型增强的人体运动预测
Appl AI Lett. 2022 Apr;3(2):e63. doi: 10.1002/ail2.63. Epub 2022 Mar 23.
2
Skeleton-Based Action Recognition Using Spatio-Temporal LSTM Network with Trust Gates.基于时空长短期记忆网络及信任门控的骨骼动作识别
IEEE Trans Pattern Anal Mach Intell. 2018 Dec;40(12):3007-3021. doi: 10.1109/TPAMI.2017.2771306. Epub 2017 Nov 9.
3
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.Human3.6M:自然环境中 3D 人体感应的大规模数据集和预测方法。
IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1325-39. doi: 10.1109/TPAMI.2013.248.
4
Gaussian process dynamical models for human motion.用于人体运动的高斯过程动态模型。
IEEE Trans Pattern Anal Mach Intell. 2008 Feb;30(2):283-98. doi: 10.1109/TPAMI.2007.1167.