IEEE Trans Neural Syst Rehabil Eng. 2024;32:1715-1724. doi: 10.1109/TNSRE.2024.3391908. Epub 2024 May 1.
Evaluation of human gait through smartphone-based pose estimation algorithms provides an attractive alternative to costly lab-bound instrumented assessment and offers a paradigm shift with real time gait capture for clinical assessment. Systems based on smart phones, such as OpenPose and BlazePose have demonstrated potential for virtual motion assessment but still lack the accuracy and repeatability standards required for clinical viability. Seq2seq architecture offers an alternative solution to conventional deep learning techniques for predicting joint kinematics during gait. This study introduces a novel enhancement to the low-powered BlazePose algorithm by incorporating a Seq2seq autoencoder deep learning model. To ensure data accuracy and reliability, synchronized motion capture involving an RGB camera and ten Vicon cameras were employed across three distinct self-selected walking speeds. This investigation presents a groundbreaking avenue for remote gait assessment, harnessing the potential of Seq2seq architectures inspired by natural language processing (NLP) to enhance pose estimation accuracy. When comparing BlazePose alone to the combination of BlazePose and 1D convolution Long Short-term Memory Network (1D-LSTM), Gated Recurrent Unit (GRU) and Long Short-Term Memory (LSTM), the average mean absolute errors decreased from 13.4° to 5.3° for fast gait, from 16.3° to 7.5° for normal gait, and from 15.5° to 7.5° for slow gait at the left ankle joint angle respectively. The strategic utilization of synchronized data and rigorous testing methodologies further bolsters the robustness and credibility of these findings.
通过基于智能手机的姿势估计算法评估人类步态,为昂贵的实验室仪器评估提供了一种有吸引力的替代方案,并为临床评估提供了实时步态捕捉的范式转变。基于智能手机的系统,如 OpenPose 和 BlazePose,已经证明了虚拟运动评估的潜力,但仍然缺乏临床可行性所需的准确性和可重复性标准。序列到序列(Seq2seq)架构为预测步态过程中的关节运动学提供了一种替代传统深度学习技术的解决方案。本研究通过将 Seq2seq 自动编码器深度学习模型纳入低功耗的 BlazePose 算法,对其进行了改进。为了确保数据的准确性和可靠性,使用同步运动捕捉技术,涉及一个 RGB 相机和十个 Vicon 相机,在三个不同的自主选择的步行速度下进行。这项研究为远程步态评估开辟了一条新途径,利用受自然语言处理(NLP)启发的 Seq2seq 架构的潜力,提高姿势估计的准确性。将单独的 BlazePose 与 BlazePose 和 1D 卷积长短期记忆网络(1D-LSTM)、门控循环单元(GRU)和长短期记忆(LSTM)的组合进行比较时,左踝关节角度的快速步态的平均绝对误差从 13.4°降至 5.3°,正常步态从 16.3°降至 7.5°,缓慢步态从 15.5°降至 7.5°。同步数据的策略性利用和严格的测试方法学进一步增强了这些发现的稳健性和可信度。