Thomas Philip, Branicky Michael, van den Bogert Antonie, Jagodnik Kathleen
Department of Electrical Engineering and Computer Science, Case Western Reserve University.
Proc Innov Appl Artif Intell Conf. 2009;2009:165-172.
Clinical tests have shown that the dynamics of a human arm, controlled using Functional Electrical Stimulation (FES), can vary significantly between and during trials. In this paper, we study the application of the actor-critic architecture, with neural networks for the both the actor and the critic, as a controller that can adapt to these changing dynamics of a human arm. Development and tests were done in simulation using a planar arm model and Hill-based muscle dynamics. We begin by training it using a Proportional Derivative (PD) controller as a supervisor. We then make clinically relevant changes to the dynamics of the arm and test the actor-critic's ability to adapt without supervision in a reasonable number of episodes. Finally, we devise methods for achieving both rapid learning and long-term stability.
临床试验表明,使用功能性电刺激(FES)控制的人体手臂动力学在试验之间和试验过程中可能会有显著差异。在本文中,我们研究了使用神经网络分别作为行为体和评论家的行为-评论家架构的应用,将其作为一种能够适应人体手臂这些变化动力学的控制器。使用平面手臂模型和基于希尔的肌肉动力学在模拟中进行了开发和测试。我们首先使用比例微分(PD)控制器作为监督器对其进行训练。然后,我们对手臂动力学进行与临床相关的改变,并测试行为-评论家在合理数量的情节中无监督适应的能力。最后,我们设计了实现快速学习和长期稳定性的方法。