IEEE Trans Neural Syst Rehabil Eng. 2024;32:431-441. doi: 10.1109/TNSRE.2024.3352416. Epub 2024 Jan 24.
This paper leverages the OpenSim physics-based simulation environment for the forward dynamic simulation of an osseointegrated transfemoral amputee musculoskeletal model, wearing a generic prosthesis. A deep reinforcement learning architecture, which combines the proximal policy optimization algorithm with imitation learning, is designed to enable the model to walk by using three different observation states. The first is a complete state that includes the agent's kinematics, ground reaction forces, and muscle data; the second is a reduced state that only includes the kinematics and ground reaction forces; the third is an augmented state that combines the kinematics and ground reaction forces with a prediction of the muscle data generated by a fully-connected feed-forward neural network. The empirical results demonstrate that the model trained with the augmented observation state can achieve walking patterns with rewards and gait symmetry ratings comparable to those of the model trained with the complete observation state, while there are no symmetric walking patterns when using the reduced observation state. This paper shows the importance of including muscle data in a deep reinforcement learning architecture for the forward dynamic simulation of musculoskeletal models of transfemoral amputees.
本文利用 OpenSim 基于物理的仿真环境,对穿着通用假肢的一体化股骨截肢者肌肉骨骼模型进行正向动力学仿真。设计了一种深度强化学习架构,该架构将近端策略优化算法与模仿学习相结合,使模型能够通过三种不同的观察状态进行行走。第一个是完整的状态,包括代理的运动学、地面反作用力和肌肉数据;第二个是简化的状态,仅包括运动学和地面反作用力;第三个是增强的状态,将运动学和地面反作用力与通过全连接前馈神经网络生成的肌肉数据的预测相结合。实验结果表明,使用增强观察状态训练的模型可以实现具有奖励和步态对称评分的行走模式,与使用完整观察状态训练的模型相当,而使用简化观察状态则没有对称的行走模式。本文表明,在股骨截肢者肌肉骨骼模型的正向动力学仿真中,将肌肉数据纳入深度强化学习架构的重要性。