Institute of Neuroscience, UCLouvain, Louvain-la-Neuve, Belgium.
Institute of Information and Communication Technologies, Electronics and Applied Mathematics, UCLouvain, Louvain-la-Neuve, Belgium.
PLoS Comput Biol. 2023 Sep 27;19(9):e1011493. doi: 10.1371/journal.pcbi.1011493. eCollection 2023 Sep.
Humans consider the parameters linked to the movement goal during reaching to adjust their control strategy online. Indeed, rapid changes in target structure, or disturbances that interfere with the initial plan, elicit rapid changes in behavior. Here, we hypothesize that these changes could result from the continuous use of a decision variable combining motor and cognitive components. We combine an optimal feedback controller with a real-time evaluation of the expected cost-to-go, which considers target- and movement-related costs, in a common theoretical framework. This model reproduces human behavior in the presence of changes in target structure occurring during movement, as well as online decisions to flexibly change target following external perturbations. It also predicts that the time taken to decide to select a new goal after a perturbation depends on the amplitude of the disturbance and on the rewards of the different options, which is a direct result of the continuous monitoring of the cost-to-go. We show that this result was present in our previously collected dataset. Together, our developments point towards a continuous evaluation of the cost-to-go during reaching to update control online and make efficient decisions about the movement goal.
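To make the idea of continuously monitoring the cost-to-go concrete, the following is a minimal sketch and not the authors' implementation. It assumes a one-dimensional (lateral) point mass, a noise-free finite-horizon linear-quadratic controller without state estimation, a constant load that the controller does not model, and a total cost defined as the quadratic motor cost-to-go minus a scalar reward. All names (`riccati`, `total_cost`, `decision_time`) and all parameter values are hypothetical and chosen only for illustration.

```python
import numpy as np

DT, N_STEPS, MASS = 0.01, 60, 1.0      # assumed: 10 ms steps, 0.6 s reach, 1 kg mass
A = np.array([[1.0, DT], [0.0, 1.0]])  # state: [lateral position, lateral velocity]
B = np.array([[0.0], [DT / MASS]])
R = np.array([[1.0]])                  # effort weight (arbitrary units)
Q_FINAL = np.diag([1e5, 1e3])          # terminal penalty on position and velocity error


def riccati(n_steps=N_STEPS):
    """Backward Riccati recursion: cost-to-go matrices S[k] and feedback gains K[k]."""
    S = [None] * (n_steps + 1)
    K = [None] * n_steps
    S[n_steps] = Q_FINAL
    for k in range(n_steps - 1, -1, -1):
        K[k] = np.linalg.solve(R + B.T @ S[k + 1] @ B, B.T @ S[k + 1] @ A)
        S[k] = A.T @ S[k + 1] @ (A - B @ K[k])
    return S, K


def total_cost(x, goal, S_k, reward):
    """Motor cost-to-go toward `goal` minus the reward attached to that goal."""
    err = x - np.array([goal, 0.0])
    return float(err @ S_k @ err) - reward


def decision_time(force, goal_a=0.0, goal_b=0.1, reward_a=0.0, reward_b=0.0, onset=20):
    """Steps from load onset until goal B becomes cheaper than goal A, else None."""
    S, K = riccati()
    x = np.zeros(2)                    # hand starts aligned with goal A
    for k in range(N_STEPS):
        if k >= onset:
            cost_a = total_cost(x, goal_a, S[k], reward_a)
            cost_b = total_cost(x, goal_b, S[k], reward_b)
            if cost_b < cost_a:        # the alternative goal is now the better option
                return k - onset
        u = -K[k] @ (x - np.array([goal_a, 0.0]))  # feedback toward goal A
        load = force if k >= onset else 0.0        # unmodeled lateral load
        x = A @ x + B @ (u + load)
    return None


if __name__ == "__main__":
    for f in (0.0, 5.0, 20.0):
        print("load", f, "N -> decision after", decision_time(f), "steps")
```

In this simplified setting, calling `decision_time` with a larger `force` or a larger `reward_b` can only shorten (never lengthen) the time at which the alternative goal becomes cheaper, which mirrors the qualitative prediction stated in the abstract. With no load, the sketch never switches goals when the rewards are equal.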