Department of Systems Biology, Osaka Bioscience Institute, Suita, Osaka 565-0874, Japan.
Proc Natl Acad Sci U S A. 2012 Jul 31;109(31):12764-9. doi: 10.1073/pnas.1210797109. Epub 2012 Jul 16.
In the basal ganglia, inputs from the nucleus accumbens (NAc) are transmitted through both direct and indirect pathways and control reward-based learning. In the NAc, dopamine (DA) serves as a key neurotransmitter, modulating these two parallel pathways. This study explored how reward learning and its flexibility are controlled in a pathway-specific and DA receptor-dependent manner. We used two techniques (i) reversible neurotransmission blocking (RNB), in which transmission of the direct (D-RNB) or the indirect pathway (I-RNB) in the NAc on both sides of the hemispheres was selectively blocked by transmission-blocking tetanus toxin; and (ii) asymmetric RNB, in which transmission of the direct (D-aRNB) or the indirect pathway (I-aRNB) was unilaterally blocked by RNB techniques and the intact side of the NAc was infused with DA agonists or antagonists. Reward-based learning was assessed by measuring goal-directed learning ability based on visual cue tasks (VCTs) or response-direction tasks (RDTs). Learning flexibility was then tested by switching from a previously learned VCT to a new VCT or RDT. D-RNB mice and D1 receptor antagonist-treated D-aRNB mice showed severe impairments in learning acquisition but normal flexibility to switch from a previously learned strategy. In contrast, I-RNB mice and D2 receptor agonist-treated I-aRNB mice showed normal learning acquisition but severe impairments not only in the flexibility to the learning switch but also in the subsequent acquisition of learning a new strategy. D1 and D2 receptors thus play distinct but cooperative roles in reward learning and its flexibility in a pathway-specific manner.
在基底神经节中,来自伏隔核(NAc)的输入通过直接和间接途径传递,控制基于奖励的学习。在 NAc 中,多巴胺(DA)作为关键神经递质,调节这两条平行途径。本研究探讨了奖励学习及其灵活性如何以特定途径和 DA 受体依赖的方式进行控制。我们使用了两种技术:(i)可还原神经传递阻断(RNB),其中通过传递阻断破伤风毒素选择性地阻断 NAc 两侧的直接(D-RNB)或间接途径(I-RNB)的传递;和(ii)不对称 RNB,其中通过 RNB 技术单侧阻断直接(D-aRNB)或间接途径(I-aRNB)的传递,并用 DA 激动剂或拮抗剂输注 NAc 的完整侧。基于视觉线索任务(VCTs)或反应方向任务(RDTs),通过测量目标导向学习能力来评估奖励学习。然后通过从先前学习的 VCT 切换到新的 VCT 或 RDT 来测试学习灵活性。D-RNB 小鼠和 D1 受体拮抗剂处理的 D-aRNB 小鼠在学习获得方面表现出严重的损伤,但在从先前学习的策略切换时具有正常的灵活性。相比之下,I-RNB 小鼠和 D2 受体激动剂处理的 I-aRNB 小鼠在学习获得方面表现正常,但不仅在学习策略切换的灵活性方面,而且在随后学习新策略的获得方面都表现出严重的损伤。因此,D1 和 D2 受体以特定途径的方式在奖励学习及其灵活性中发挥着不同但协作的作用。