Nakanishi S, Hikida T, Yawata S
Department of Systems Biology, Osaka Bioscience Institute, 6-2-4 Furuedai, Suita, Osaka 565-0874, Japan.
Medical Innovation Center, Kyoto University Graduate School of Medicine, 53, Shogoin Kawahara-chou, Sakyo-ku, Kyoto 606-8507, Japan.
Neuroscience. 2014 Dec 12;282:49-59. doi: 10.1016/j.neuroscience.2014.04.026. Epub 2014 Apr 24.
The nucleus accumbens (NAc) plays a pivotal role in reward and aversive learning and learning flexibility. Outputs of the NAc are transmitted through two parallel routes termed the direct and indirect pathways and controlled by the dopamine (DA) neurotransmitter. To explore how reward-based and avoidance learning is controlled in the NAc of the mouse, we developed the reversible neurotransmission-blocking (RNB) technique, in which transmission of each pathway could be selectively and reversibly blocked by the pathway-specific expression of transmission-blocking tetanus toxin and the asymmetric RNB technique, in which one side of the NAc was blocked by the RNB technique and the other intact side was pharmacologically manipulated by a transmitter agonist or antagonist. Our studies demonstrated that the activation of D1 receptors in the direct pathway and the inactivation of D2 receptors in the indirect pathway are key determinants that distinctly control reward-based and avoidance learning, respectively. The D2 receptor inactivation is also critical for flexibility of reward learning. Furthermore, reward and aversive learning is regulated by a set of common downstream receptors and signaling cascades, all of which are involved in the induction of long-term potentiation at cortico-accumbens synapses of the two pathways. In this article, we review our studies that specify the regulatory mechanisms of each pathway in learning behavior and propose a mechanistic model to explain how dynamic DA modulation promotes selection of actions that achieve reward-seeking outcomes and avoid aversive ones. The biological significance of the network organization consisting of two parallel transmission pathways is also discussed from the point of effective and prompt selection of neural outcomes in the neural network.
伏隔核(NAc)在奖赏、厌恶学习及学习灵活性中起关键作用。NAc的输出通过被称为直接通路和间接通路的两条平行途径进行传递,并受多巴胺(DA)神经递质的控制。为探究小鼠NAc中基于奖赏和回避的学习是如何被控制的,我们开发了可逆性神经传递阻断(RNB)技术,其中每条通路的传递可通过传递阻断破伤风毒素的通路特异性表达被选择性且可逆地阻断;还开发了不对称RNB技术,其中NAc的一侧通过RNB技术被阻断,而另一侧完整的部分则通过递质激动剂或拮抗剂进行药理学操作。我们的研究表明,直接通路中D1受体的激活和间接通路中D2受体的失活分别是明显控制基于奖赏和回避学习的关键决定因素。D2受体失活对于奖赏学习的灵活性也至关重要。此外,奖赏和厌恶学习受一组共同的下游受体和信号级联调节,所有这些都参与了两条通路的皮质-伏隔核突触处的长时程增强的诱导。在本文中,我们回顾了我们的研究,这些研究明确了每条通路在学习行为中的调节机制,并提出了一个机制模型来解释动态DA调节如何促进对实现奖赏寻求结果和避免厌恶结果的行为的选择。还从神经网络中有效且迅速地选择神经结果的角度讨论了由两条平行传递通路组成的网络组织的生物学意义。