Stoloff Rebecca H, Taylor Jordan A, Xu Jing, Ridderikhoff Arne, Ivry Richard B
UCSF Joint Graduate Group in Bioengineering, University of California, Berkeley, Berkeley, CA, USA.
Front Neurosci. 2011 Mar 23;5:41. doi: 10.3389/fnins.2011.00041. eCollection 2011.
Choosing which hand to use for an action is one of the most frequent decisions people make in everyday behavior. We developed a simple reaching task in which we varied the lateral position of a target and the participant was free to reach to it with either the right or left hand. While people exhibit a strong preference to use the hand ipsilateral to the target, there is a region of uncertainty within which hand choice varies across trials. We manipulated the reinforcement rates for the two hands, either by increasing the likelihood that a reach with the non-dominant hand would successfully intersect the target or by decreasing the likelihood that a reach with the dominant hand would be successful. Although participants had minimal awareness of these manipulations, we observed an increase in the use of the non-dominant hand for targets presented in the region of uncertainty. We modeled the shift in hand use with a Q-learning model of reinforcement learning. The model provided a good fit to the data and indicates that the effects of increasing and decreasing the rate of positive reinforcement are additive. These experiments emphasize the role of decision processes in effector selection, and may point to a novel approach for physical rehabilitation based on intrinsic reinforcement.
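The abstract does not give the details of the Q-learning model, so the following is only a minimal sketch of how such a model could shift hand choice, not the authors' implementation. It assumes a single target in the region of uncertainty, one value per hand, a softmax choice rule, and a delta-rule update. The names `choice_prob`, `q_update`, and `simulate`, and the parameters `alpha` (learning rate), `beta` (choice sensitivity), and the success probabilities, are all hypothetical.

```python
import math
import random


def choice_prob(q_chosen, q_other, beta):
    """Softmax (logistic) probability of picking the hand with value q_chosen.

    Assumption: a two-option softmax rule; beta sets how strongly
    value differences drive choice.
    """
    return 1.0 / (1.0 + math.exp(-beta * (q_chosen - q_other)))


def q_update(q, reward, alpha):
    """Delta-rule update: move q toward the obtained reward by a fraction alpha."""
    return q + alpha * (reward - q)


def simulate(p_hit_nondominant, p_hit_dominant, n_trials=5000,
             alpha=0.1, beta=5.0, seed=0):
    """Return the fraction of trials on which the non-dominant hand is chosen.

    Assumption: one target location, so there is a single state and
    one Q-value per hand. Reward is 1 for a hit and 0 for a miss.
    """
    rng = random.Random(seed)
    q = {"nondominant": 0.5, "dominant": 0.5}  # illustrative starting values
    nondominant_count = 0
    for _ in range(n_trials):
        p_nd = choice_prob(q["nondominant"], q["dominant"], beta)
        hand = "nondominant" if rng.random() < p_nd else "dominant"
        p_hit = p_hit_nondominant if hand == "nondominant" else p_hit_dominant
        reward = 1.0 if rng.random() < p_hit else 0.0
        q[hand] = q_update(q[hand], reward, alpha)
        if hand == "nondominant":
            nondominant_count += 1
    return nondominant_count / n_trials


if __name__ == "__main__":
    # Hypothetical reinforcement rates, chosen only for illustration.
    print("equal rates:         ", simulate(0.5, 0.5))
    print("boost non-dominant:  ", simulate(0.9, 0.5))
    print("penalize dominant:   ", simulate(0.5, 0.1))
```

In this sketch, raising the hit rate for the non-dominant hand or lowering it for the dominant hand both increase the value difference that feeds the softmax, which is one simple way the two manipulations could each raise non-dominant hand use.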