Kimura Kenta, Kanayama Noriaki, Toyama Asako, Katahira Kentaro
Human Informatics and Interaction Research Institute, National Institute of Advanced Industrial Science and Technology (AIST), Tsukuba, Japan.
Japan Society for the Promotion of Science, Tokyo, Japan.
Front Neurosci. 2022 Jun 2;16:889440. doi: 10.3389/fnins.2022.889440. eCollection 2022.
This study aimed to investigate whether instrumental reward learning is affected by the cardiac cycle. To this end, we examined the effects of the cardiac cycle (systole or diastole) on the computational processes underlying the participants' choices in the instrumental learning task. In the instrumental learning task, participants were required to select one of two discriminative stimuli (neutral visual stimuli) and immediately receive reward/punishment feedback depending on the probability assigned to the chosen stimuli. To manipulate the cardiac cycle, the presentation of discriminative stimuli was timed to coincide with either cardiac systole or diastole. We fitted the participants' choices in the task with reinforcement learning (RL) models and estimated parameters involving instrumental learning (i.e., learning rate and inverse temperature) separately in the systole and diastole trials. Model-based analysis revealed that the learning rate for positive prediction errors was higher than that for negative prediction errors in the systole trials; however, learning rates did not differ between positive and negative prediction errors in the diastole trials. These results demonstrate that the natural fluctuation of cardiac afferent signals can affect asymmetric value updating in instrumental reward learning.
本研究旨在调查工具性奖励学习是否受心动周期的影响。为此,我们考察了心动周期(收缩期或舒张期)对参与者在工具性学习任务中做出选择的计算过程的影响。在工具性学习任务中,参与者需要从两个辨别性刺激(中性视觉刺激)中选择一个,并根据分配给所选刺激的概率立即获得奖励/惩罚反馈。为了操纵心动周期,辨别性刺激的呈现时间与心脏收缩期或舒张期同步。我们用强化学习(RL)模型拟合参与者在任务中的选择,并分别在收缩期和舒张期试验中估计涉及工具性学习的参数(即学习率和逆温度)。基于模型的分析表明,在收缩期试验中,正向预测误差的学习率高于负向预测误差的学习率;然而,在舒张期试验中,正向和负向预测误差的学习率没有差异。这些结果表明,心脏传入信号的自然波动会影响工具性奖励学习中的不对称价值更新。