Adaptive dynamic programming as a theory of sensorimotor control.

Author information

Jiang Yu, Jiang Zhong-Ping

Affiliation

Control and Networks Laboratory, Department of Electrical and Computer Engineering, Polytechnic School of Engineering, New York University, 5 Metrotech Center, Brooklyn, NY 11201, USA.

Publication information

Biol Cybern. 2014 Aug;108(4):459-73. doi: 10.1007/s00422-014-0613-7. Epub 2014 Jun 25.

DOI: 10.1007/s00422-014-0613-7

PMID: 24962078

Abstract

Many characteristics of sensorimotor control can be explained by models based on optimization and optimal control theories. However, most previous models assume that the central nervous system has precise knowledge of the sensorimotor system and the environment it interacts with. This viewpoint is difficult to justify theoretically and has not been convincingly validated by experiments. To address this problem, this paper presents a new computational mechanism for sensorimotor control from the perspective of adaptive dynamic programming (ADP), which shares some features of reinforcement learning. The ADP-based model suggests that a command signal for human movement is derived directly from real-time sensory data, without the need to identify the system dynamics. An iterative learning scheme based on the proposed ADP theory is developed, along with a rigorous convergence analysis. The computational model advocated here reproduces the motor learning behavior observed in experiments where a divergent force field or a velocity-dependent force field was present. In addition, this modeling strategy provides a clear way to perform stability analysis of the overall system. Hence, we conjecture that human sensorimotor systems use an ADP-type mechanism to control movements and to adapt successfully to uncertainties in the environment.

Similar articles

An overview of adaptive model theory: solving the problems of redundancy, resources, and nonlinear interactions in human movement control.

J Neural Eng. 2005 Sep;2(3):S279-312. doi: 10.1088/1741-2560/2/3/S10. Epub 2005 Aug 31.

Optimal coordination and control of posture and movements.

J Physiol Paris. 2009 Sep-Dec;103(3-5):159-77. doi: 10.1016/j.jphysparis.2009.08.013. Epub 2009 Aug 9.

Computational mechanisms of sensorimotor control.

Neuron. 2011 Nov 3;72(3):425-42. doi: 10.1016/j.neuron.2011.10.006.

Generalization in adaptation to stable and unstable dynamics.

PLoS One. 2012;7(10):e45075. doi: 10.1371/journal.pone.0045075. Epub 2012 Oct 8.

Simulating closed- and open-loop voluntary movement: a nonlinear control-systems approach.

IEEE Trans Biomed Eng. 2002 Nov;49(11):1242-52. doi: 10.1109/TBME.2002.804601.

Model-Free Robust Optimal Feedback Mechanisms of Biological Motor Control.

Neural Comput. 2020 Mar;32(3):562-595. doi: 10.1162/neco_a_01260. Epub 2020 Jan 17.

Temporal specificity of the initial adaptive response in motor adaptation.

PLoS Comput Biol. 2017 Jul 10;13(7):e1005438. doi: 10.1371/journal.pcbi.1005438. eCollection 2017 Jul.

Dynamics systems vs. optimal control--a unifying view.

Prog Brain Res. 2007;165:425-45. doi: 10.1016/S0079-6123(06)65027-9.

Control of constraint forces and trajectories in a rich sensory and actuation environment.

Math Biosci. 2010 Dec;228(2):171-84. doi: 10.1016/j.mbs.2010.10.001. Epub 2010 Oct 12.

Cited by

Computations underlying sensorimotor learning.

Curr Opin Neurobiol. 2016 Apr;37:7-11. doi: 10.1016/j.conb.2015.12.003. Epub 2015 Dec 23.
