Inokawa Hitoshi, Matsumoto Naoyuki, Kimura Minoru, Yamada Hiroshi
Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto 602-8566, Japan; Department of Physiology and System Bioscience, Kyoto Prefectural University of Medicine, Kyoto 602-8566, Japan.
Department of Physiology, Kyoto Prefectural University of Medicine, Kyoto 602-8566, Japan; Division of Food & Health Environmental Sciences, Faculty of Environmental & Symbiotic Sciences, Prefectural University of Kumamoto, Kumamoto 862-8502, Japan.
Neuroscience. 2020 Oct 15;446:271-284. doi: 10.1016/j.neuroscience.2020.08.007. Epub 2020 Aug 13.
An animal's choice behavior is shaped by the outcome feedback from selected actions in a trial-and-error approach. Tonically active neurons (TANs), presumed cholinergic interneurons in the striatum, are thought to be involved in the learning and performance of reward-directed behaviors, but it remains unclear how TANs are involved in shaping reward-directed choice behaviors based on the outcome feedback. To this end, we recorded activity of TANs from the dorsal striatum of two macaque monkeys (Macaca fuscata; 1 male, 1 female) while they performed a multi-step choice task to obtain multiple rewards. In this task, the monkeys first searched for a rewarding target from among three alternatives in a trial-and-error manner and then earned additional rewards by repeatedly choosing the rewarded target. We found that a considerable proportion of TANs selectively responded to either the reward or the no-reward outcome feedback during the trial-and-error search, but these feedback responses were not observed during repeat trials. Moreover, the feedback responses of TANs were similarly observed in any search trials, without distinctions regarding the predicted probability of rewards and the location of chosen targets. Unambiguously, TANs detected reward and no-reward feedback specifically when the monkeys performed trial-and-error searches, in which the monkeys were learning the value of the targets and adjusting their subsequent choice behavior based on the reward and no-reward feedback. These results suggest that striatal cholinergic interneurons signal outcome feedback specifically during search behavior, in circumstances where the choice outcomes cannot be predicted with certainty by the animals.
动物的选择行为是通过试错法中所选行动的结果反馈来塑造的。紧张性活动神经元(TANs),被认为是纹状体中的胆碱能中间神经元,被认为参与奖励导向行为的学习和表现,但目前尚不清楚TANs如何基于结果反馈参与塑造奖励导向的选择行为。为此,我们记录了两只猕猴(食蟹猴;1只雄性,1只雌性)背侧纹状体中TANs的活动,同时它们执行一个多步选择任务以获得多个奖励。在这个任务中,猴子首先以试错的方式从三个选项中寻找一个有奖励的目标,然后通过反复选择有奖励的目标来获得额外的奖励。我们发现,在试错搜索过程中,相当一部分TANs对奖励或无奖励的结果反馈有选择性地做出反应,但在重复试验中未观察到这些反馈反应。此外,在任何搜索试验中都类似地观察到了TANs的反馈反应,而与奖励的预测概率和所选目标的位置无关。明确地说,当猴子进行试错搜索时,TANs特别检测到奖励和无奖励反馈,在这种情况下,猴子正在学习目标的价值,并根据奖励和无奖励反馈调整它们随后的选择行为。这些结果表明,纹状体胆碱能中间神经元在搜索行为期间,即在动物无法确定预测选择结果的情况下,特异性地发出结果反馈信号。