• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种基于改进图卷积网络(GCN)的用于3D人体运动预测的初始预测和微调模型。

An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.

作者信息

He Zhiquan, Zhang Lujun, Wang Hengyou

机构信息

Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen, China.

Guangdong Multimedia Information Service Engineering Technology Research Center, Shenzhen University, Shenzhen, China.

出版信息

Front Comput Neurosci. 2023 Apr 5;17:1145209. doi: 10.3389/fncom.2023.1145209. eCollection 2023.

DOI:10.3389/fncom.2023.1145209
PMID:37089134
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC10116871/
Abstract

Human motion prediction is one of the fundamental studies of computer vision. Much work based on deep learning has shown impressive performance for it in recent years. However, long-term prediction and human skeletal deformation are still challenging tasks for human motion prediction. For accurate prediction, this paper proposes a GCN-based two-stage prediction method. We train a prediction model in the first stage. Using multiple cascaded spatial attention graph convolution layers (SAGCL) to extract features, the prediction model generates an initial motion sequence of future actions based on the observed pose. Since the initial pose generated in the first stage often deviates from natural human body motion, such as a motion sequence in which the length of a bone is changed. So the task of the second stage is to fine-tune the predicted pose and make it closer to natural motion. We present a fine-tuning model including multiple cascaded causally temporal-graph convolution layers (CT-GCL). We apply the spatial coordinate error of joints and bone length error as loss functions to train the fine-tuning model. We validate our model on Human3.6m and CMU-MoCap datasets. Extensive experiments show that the two-stage prediction method outperforms state-of-the-art methods. The limitations of proposed methods are discussed as well, hoping to make a breakthrough in future exploration.

摘要

人体运动预测是计算机视觉的基础研究之一。近年来,许多基于深度学习的工作在这方面展现出了令人印象深刻的性能。然而,长期预测和人体骨骼变形对于人体运动预测而言仍然是具有挑战性的任务。为了实现准确预测,本文提出了一种基于图卷积网络(GCN)的两阶段预测方法。我们在第一阶段训练一个预测模型。该预测模型使用多个级联的空间注意力图卷积层(SAGCL)来提取特征,并基于观察到的姿势生成未来动作的初始运动序列。由于在第一阶段生成的初始姿势往往偏离自然人体运动,例如出现骨骼长度发生变化的运动序列。所以第二阶段的任务是对预测姿势进行微调,使其更接近自然运动。我们提出了一个包含多个级联因果时间图卷积层(CT - GCL)的微调模型。我们将关节的空间坐标误差和骨骼长度误差作为损失函数来训练微调模型。我们在Human3.6m和CMU - MoCap数据集上对我们的模型进行了验证。大量实验表明,这种两阶段预测方法优于现有方法。同时也讨论了所提方法的局限性,希望在未来的探索中取得突破。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/216d9446a676/fncom-17-1145209-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/4c3cd94cdb02/fncom-17-1145209-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/26ba3e098299/fncom-17-1145209-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/a1a6794edda0/fncom-17-1145209-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/216d9446a676/fncom-17-1145209-g0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/4c3cd94cdb02/fncom-17-1145209-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/26ba3e098299/fncom-17-1145209-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/a1a6794edda0/fncom-17-1145209-g0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1b00/10116871/216d9446a676/fncom-17-1145209-g0004.jpg

相似文献

1
An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.一种基于改进图卷积网络(GCN)的用于3D人体运动预测的初始预测和微调模型。
Front Comput Neurosci. 2023 Apr 5;17:1145209. doi: 10.3389/fncom.2023.1145209. eCollection 2023.
2
Dynamic Dense Graph Convolutional Network for Skeleton-Based Human Motion Prediction.基于骨架的人体运动预测的动态密集图卷积网络。
IEEE Trans Image Process. 2024;33:1-15. doi: 10.1109/TIP.2023.3334954. Epub 2023 Dec 6.
3
Symbiotic Graph Neural Networks for 3D Skeleton-Based Human Action Recognition and Motion Prediction.基于共生图神经网络的 3D 骨骼人类动作识别与运动预测。
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3316-3333. doi: 10.1109/TPAMI.2021.3053765. Epub 2022 May 5.
4
Learning Constrained Dynamic Correlations in Spatiotemporal Graphs for Motion Prediction.用于运动预测的时空图中学习约束动态相关性
IEEE Trans Neural Netw Learn Syst. 2024 Oct;35(10):14273-14287. doi: 10.1109/TNNLS.2023.3277476. Epub 2024 Oct 7.
5
Corrigendum: An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.勘误:基于改进的图卷积网络(GCN)的3D人体运动预测初始预测和微调模型
Front Comput Neurosci. 2023 Jun 13;17:1232765. doi: 10.3389/fncom.2023.1232765. eCollection 2023.
6
STTG-net: a Spatio-temporal network for human motion prediction based on transformer and graph convolution network.STTG网络:一种基于Transformer和图卷积网络的人体运动预测时空网络。
Vis Comput Ind Biomed Art. 2022 Jul 29;5(1):19. doi: 10.1186/s42492-022-00112-5.
7
Learning Dynamical Human-Joint Affinity for 3D Pose Estimation in Videos.学习用于视频中3D姿态估计的动态人体关节亲和性
IEEE Trans Image Process. 2021;30:7914-7925. doi: 10.1109/TIP.2021.3109517. Epub 2021 Sep 21.
8
Automated freezing of gait assessment with marker-based motion capture and multi-stage spatial-temporal graph convolutional neural networks.基于标记的运动捕捉和多阶段时空图卷积神经网络的自动化冻结步态评估。
J Neuroeng Rehabil. 2022 May 21;19(1):48. doi: 10.1186/s12984-022-01025-3.
9
Multiscale Spatio-Temporal Graph Neural Networks for 3D Skeleton-Based Motion Prediction.基于多尺度时空图神经网络的 3D 骨骼运动预测
IEEE Trans Image Process. 2021;30:7760-7775. doi: 10.1109/TIP.2021.3108708. Epub 2021 Sep 14.
10
Human Motion Prediction via Dual-Attention and Multi-Granularity Temporal Convolutional Networks.基于双注意力和多粒度时间卷积网络的人体运动预测。
Sensors (Basel). 2023 Jun 16;23(12):5653. doi: 10.3390/s23125653.

引用本文的文献

1
Improving the accuracy of prediction models for small datasets of Cytochrome P450 inhibition with deep learning.利用深度学习提高细胞色素P450抑制作用小数据集预测模型的准确性。
J Cheminform. 2025 Apr 30;17(1):66. doi: 10.1186/s13321-025-01015-2.

本文引用的文献

1
Identification of the ubiquitin-proteasome pathway domain by hyperparameter optimization based on a 2D convolutional neural network.基于二维卷积神经网络的超参数优化识别泛素-蛋白酶体途径结构域
Front Genet. 2022 Jul 22;13:851688. doi: 10.3389/fgene.2022.851688. eCollection 2022.
2
Prediction of disease-associated nsSNPs by integrating multi-scale ResNet models with deep feature fusion.通过整合多尺度 ResNet 模型与深度特征融合来预测疾病相关的 nsSNPs。
Brief Bioinform. 2022 Jan 17;23(1). doi: 10.1093/bib/bbab530.
3
MutTMPredictor: Robust and accurate cascade XGBoost classifier for prediction of mutations in transmembrane proteins.
MutTMPredictor:用于预测跨膜蛋白突变的强大且准确的级联XGBoost分类器。
Comput Struct Biotechnol J. 2021 Nov 19;19:6400-6416. doi: 10.1016/j.csbj.2021.11.024. eCollection 2021.
4
DeepCPPred: A Deep Learning Framework for the Discrimination of Cell-Penetrating Peptides and Their Uptake Efficiencies.DeepCPPred:一种用于区分细胞穿透肽及其摄取效率的深度学习框架。
IEEE/ACM Trans Comput Biol Bioinform. 2022 Sep-Oct;19(5):2749-2759. doi: 10.1109/TCBB.2021.3102133. Epub 2022 Oct 10.
5
Online Adaptive Prediction of Human Motion Intention Based on sEMG.基于表面肌电信号的人体运动意图在线自适应预测。
Sensors (Basel). 2021 Apr 20;21(8):2882. doi: 10.3390/s21082882.
6
Spatiotemporal Co-Attention Recurrent Neural Networks for Human-Skeleton Motion Prediction.用于人体骨骼运动预测的时空协同注意力循环神经网络
IEEE Trans Pattern Anal Mach Intell. 2022 Jun;44(6):3300-3315. doi: 10.1109/TPAMI.2021.3050918. Epub 2022 May 5.
7
TargetMM: Accurate Missense Mutation Prediction by Utilizing Local and Global Sequence Information with Classifier Ensemble.利用分类器集成利用局部和全局序列信息进行准确的错义突变预测。
Comb Chem High Throughput Screen. 2022;25(1):38-52. doi: 10.2174/1386207323666201204140438.
8
A Pairwise Attentive Adversarial Spatiotemporal Network for Cross-Domain Few-Shot Action Recognition-R2.用于跨域少样本动作识别的成对注意力对抗时空网络 - R2
IEEE Trans Image Process. 2021;30:767-782. doi: 10.1109/TIP.2020.3038372. Epub 2020 Dec 4.
9
Scene Recognition with Prototype-agnostic Scene Layout.
IEEE Trans Image Process. 2020 Apr 14. doi: 10.1109/TIP.2020.2986599.
10
Human3.6M: Large Scale Datasets and Predictive Methods for 3D Human Sensing in Natural Environments.Human3.6M:自然环境中 3D 人体感应的大规模数据集和预测方法。
IEEE Trans Pattern Anal Mach Intell. 2014 Jul;36(7):1325-39. doi: 10.1109/TPAMI.2013.248.