Mosberger A C, Sibener L J, Chen T X, Rodrigues Hfm, Hormigo R, Ingram J N, Athalye V R, Tabachnik T, Wolpert D M, Murray J M, Costa R M
bioRxiv. 2023 May 8:2023.05.08.539291. doi: 10.1101/2023.05.08.539291.
The brain can learn to generate actions, such as reaching to a target, using different movement strategies. Understanding how different variables bias which strategies are learned to produce such a reach is important for our understanding of the neural bases of movement. Here we introduce a novel spatial forelimb target task in which perched head-fixed mice learn to reach to a circular target area from a set start position using a joystick. These reaches can be achieved by learning to move into a specific direction or to a specific endpoint location. We find that mice gradually learn to successfully reach the covert target. With time, they refine their initially exploratory complex joystick trajectories into controlled targeted reaches. The execution of these controlled reaches depends on the sensorimotor cortex. Using a probe test with shifting start positions, we show that individual mice learned to use strategies biased to either direction or endpoint-based movements. The degree of endpoint learning bias was correlated with the spatial directional variability with which the workspace was explored early in training. Furthermore, we demonstrate that reinforcement learning model agents exhibit a similar correlation between directional variability during training and learned strategy. These results provide evidence that individual exploratory behavior during training biases the control strategies that mice use to perform forelimb covert target reaches.
大脑能够学会运用不同的运动策略来产生动作,比如伸手去够一个目标。理解不同变量如何影响所学习的用于产生此类伸手动作的策略,对于我们理解运动的神经基础至关重要。在此,我们引入了一种新颖的空间前肢目标任务,在该任务中,处于栖息状态且头部固定的小鼠学会使用操纵杆从设定的起始位置伸手去够一个圆形目标区域。这些伸手动作可以通过学习朝着特定方向移动或到达特定的终点位置来实现。我们发现小鼠逐渐学会成功够到隐蔽目标。随着时间推移,它们将最初探索性的复杂操纵杆轨迹细化为可控的目标伸手动作。这些可控伸手动作的执行依赖于感觉运动皮层。通过使用具有不同起始位置的探测测试,我们表明个体小鼠学会了使用偏向于基于方向或基于终点的运动策略。终点学习偏向的程度与训练早期探索工作空间时的空间方向变异性相关。此外,我们证明强化学习模型主体在训练期间的方向变异性与所学习的策略之间也表现出类似的相关性。这些结果提供了证据,表明训练期间的个体探索行为会影响小鼠用于执行前肢隐蔽目标伸手动作的控制策略。