Centre for Studies in Behavioural Neurobiology, Concordia University, Montreal, Québec, Canada.
PLoS One. 2020 Jun 5;15(6):e0226722. doi: 10.1371/journal.pone.0226722. eCollection 2020.
The neurobiological study of reward was launched by the discovery of intracranial self-stimulation (ICSS). Subsequent investigation of this phenomenon provided the initial link between reward-seeking behavior and dopaminergic neurotransmission. We re-evaluated this relationship by psychophysical, pharmacological, optogenetic, and computational means. In rats working for direct, optical activation of midbrain dopamine neurons, we varied the strength and opportunity cost of the stimulation and measured time allocation, the proportion of trial time devoted to reward pursuit. We found that the dependence of time allocation on the strength and cost of stimulation was similar formally to that observed when electrical stimulation of the medial forebrain bundle served as the reward. When the stimulation is strong and cheap, the rats devote almost all their time to reward pursuit; time allocation falls off as stimulation strength is decreased and/or its opportunity cost is increased. A 3D plot of time allocation versus stimulation strength and cost produces a surface resembling the corner of a plateau (the "reward mountain"). We show that dopamine-transporter blockade shifts the mountain along both the strength and cost axes in rats working for optical activation of midbrain dopamine neurons. In contrast, the same drug shifted the mountain uniquely along the opportunity-cost axis when rats worked for electrical MFB stimulation in a prior study. Dopamine neurons are an obligatory stage in the dominant model of ICSS, which positions them at a key nexus in the final common path for reward seeking. This model fails to provide a cogent account for the differential effect of dopamine transporter blockade on the reward mountain. Instead, we propose that midbrain dopamine neurons and neurons with non-dopaminergic, MFB axons constitute parallel limbs of brain-reward circuitry that ultimately converge on the final-common path for the evaluation and pursuit of rewards.
奖励的神经生物学研究是由颅内自我刺激(ICSS)的发现而启动的。对这一现象的后续研究为寻求奖励的行为和多巴胺能神经传递之间提供了最初的联系。我们通过心理物理学、药理学、光遗传学和计算方法重新评估了这种关系。在为中脑多巴胺神经元的直接光学激活而工作的大鼠中,我们改变了刺激的强度和机会成本,并测量了时间分配,即用于奖励追求的试验时间的比例。我们发现,时间分配对刺激强度和成本的依赖性在形式上与电刺激内侧前脑束作为奖励时观察到的相似。当刺激强烈且廉价时,大鼠几乎将所有时间都用于奖励追求;随着刺激强度的降低和/或其机会成本的增加,时间分配会减少。时间分配与刺激强度和成本的 3D 图产生类似于高原一角(“奖励山”)的表面。我们表明,在为中脑多巴胺神经元的光学激活而工作的大鼠中,多巴胺转运体阻滞剂将山沿着强度和成本轴移动。相比之下,当大鼠在之前的研究中通过电 MFB 刺激工作时,相同的药物仅沿着机会成本轴移动山。多巴胺神经元是 ICSS 主导模型中的一个必需阶段,将它们置于寻求奖励的最终共同途径的关键枢纽。该模型未能为多巴胺转运体阻滞剂对奖励山的差异影响提供有力解释。相反,我们提出中脑多巴胺神经元和具有非多巴胺能、MFB 轴突的神经元构成了大脑奖励回路的平行分支,最终汇聚到奖励评估和追求的最终共同途径上。