

Two-step deep-learning identification of heel keypoints from video-recorded gait.

Author information

Halvorsen Kjartan, Peng Wei, Olsson Fredrik, Åberg Anna Cristina

Affiliations

School of Health and Welfare, Dalarna University, Falun, Sweden.

Department of Public Health and Caring Sciences, Uppsala University, Uppsala, Sweden.

Publication information

Med Biol Eng Comput. 2025 Jan;63(1):229-237. doi: 10.1007/s11517-024-03189-7. Epub 2024 Sep 18.

Abstract

Accurate and fast extraction of step parameters from video recordings of gait allows for richer information to be obtained from clinical tests such as Timed Up and Go. Current deep-learning methods are promising, but lack in accuracy for many clinical use cases. Extracting step parameters will often depend on extracted landmarks (keypoints) on the feet. We hypothesize that such keypoints can be determined with an accuracy relevant for clinical practice from video recordings by combining an existing general-purpose pose estimation method (OpenPose) with custom convolutional neural networks (convnets) specifically trained to identify keypoints on the heel. The combined method finds keypoints on the posterior and lateral aspects of the heel of the foot in side-view and frontal-view images from which step length and step width can be determined for calibrated cameras. Six different candidate convnets were evaluated, combining three different standard architectures as networks for feature extraction (backbone), and with two different networks for predicting keypoints on the heel (head networks). Using transfer learning, the backbone networks were pre-trained on the ImageNet dataset, and the combined networks (backbone + head) were fine-tuned on data from 184 trials of older, unimpaired adults. The data was recorded at three different locations and consisted of 193 k side-view images and 110 k frontal-view images. We evaluated the six different models using the absolute distance on the floor between predicted keypoints and manually labelled keypoints. For the best-performing convnet, the median error was 0.55 cm and the 75% quartile was below 1.26 cm using data from the side-view camera. The predictions are overall accurate, but show some outliers. The results indicate potential for future clinical use by automating a key step in marker-less gait parameter extraction.
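The reported error statistics (median 0.55 cm, 75% quartile below 1.26 cm) are summaries of the absolute distance on the floor between predicted and manually labelled heel keypoints. A minimal sketch of that evaluation metric, assuming keypoints are given as 2D floor-plane coordinates in centimetres (function names are hypothetical; the paper's actual implementation is not shown):

```python
import numpy as np

def keypoint_floor_errors(pred_xy, true_xy):
    """Euclidean distance on the floor plane between predicted and
    manually labelled heel keypoints (both arrays of shape (n, 2), in cm)."""
    pred_xy = np.asarray(pred_xy, dtype=float)
    true_xy = np.asarray(true_xy, dtype=float)
    return np.linalg.norm(pred_xy - true_xy, axis=-1)

def summarize_errors(errors):
    """Median and 75th-percentile error, the two statistics the abstract reports."""
    errors = np.asarray(errors, dtype=float)
    return float(np.median(errors)), float(np.percentile(errors, 75))

# Toy example with made-up predictions and labels:
pred = [[0.0, 0.0], [3.0, 4.0]]
true = [[0.0, 0.0], [0.0, 0.0]]
errs = keypoint_floor_errors(pred, true)   # distances: 0.0 cm and 5.0 cm
median_err, q75_err = summarize_errors(errs)
```

Because the metric is computed on the floor plane, it requires the calibrated-camera step mentioned in the abstract to map image keypoints to world coordinates first.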


Figure 1: https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8a7c/11695559/71efcec7623e/11517_2024_3189_Fig1_HTML.jpg
