Behavioral and Systems Neuroscience Program and the Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, Oregon 97239.
Behavioral and Systems Neuroscience Program and the Department of Behavioral Neuroscience, Oregon Health and Science University, Portland, Oregon 97239
J Neurosci. 2020 Jun 24;40(26):5063-5077. doi: 10.1523/JNEUROSCI.0310-20.2020. Epub 2020 May 14.
Actions executed toward obtaining a reward are frequently associated with the probability of harm occurring during action execution. Learning this probability allows for appropriate computation of future harm to guide action selection. Impaired learning of this probability may be critical for the pathogenesis of anxiety or reckless and impulsive behavior. Here we designed a task for punishment probability learning during reward-guided actions to begin to understand the neuronal basis of this form of learning, and the biological or environmental variables that influence action selection after learning. Male and female Long-Evans rats were trained in a seek-take behavioral paradigm where the seek action was associated with varying probability of punishment. The take action remained safe and was followed by reward delivery. Learning was evident as subjects selectively adapted seek action behavior as a function of punishment probability. Recording of neural activity in the mPFC during learning revealed changes in phasic mPFC neuronal activity during risky-seek actions but not during the safe take actions or reward delivery, revealing that this region is involved in learning of probabilistic punishment. After learning, the variables that influenced behavior included reinforcer and punisher value, pretreatment with the anxiolytic diazepam, and biological sex. In particular, females were more sensitive to probabilistic punishment than males. These data demonstrate that flexible encoding of risky actions by mPFC is involved in probabilistic punishment learning and provide a novel behavioral approach for studying the pathogenesis of anxiety and impulsivity with inclusion of sex as a biological variable. Actions we choose to execute toward obtaining a reward are often associated with the probability of harm occurring. Impaired learning of this probability may be critical for the pathogenesis of anxiety or reckless behavior and impulsivity. We developed a behavioral model to assess this mode of learning. This procedure allowed us to determine biological and environmental factors that influence the resistance of reward seeking to probabilistic punishment and to identify the mPFC as a region that flexibly adapts its response to risky actions as contingencies are learned.
我们执行的获取奖励的行为通常与行为执行过程中发生伤害的概率有关。学习这种概率可以帮助我们合理地计算未来的伤害,从而指导行为选择。这种概率学习的损伤可能对焦虑症或鲁莽、冲动行为的发病机制至关重要。在这里,我们设计了一个在奖励引导的行为中进行惩罚概率学习的任务,以开始理解这种学习形式的神经基础,以及学习后影响行为选择的生物或环境变量。雄性和雌性长耳大仓鼠在寻求-采取行为范式中接受训练,其中寻求行为与惩罚概率的变化相关联。采取行动是安全的,之后会有奖励。随着受试者根据惩罚概率选择性地调整寻求行为,学习是明显的。在学习过程中,mPFC 中的神经活动记录显示,在冒险寻求行为中,相期 mPFC 神经元活动发生变化,但在安全的采取行为或奖励传递中没有变化,这表明该区域参与了概率性惩罚的学习。学习后,影响行为的变量包括强化剂和惩罚剂的价值、预先给予抗焦虑药地西泮,以及生物性别。特别是,女性对概率性惩罚比男性更敏感。这些数据表明,mPFC 对冒险行为的灵活编码参与了概率性惩罚学习,并提供了一种新的行为方法,用于研究焦虑和冲动的发病机制,包括将性别作为生物学变量。我们选择执行以获取奖励的行为通常与发生伤害的概率有关。这种概率学习的损伤可能对焦虑症或鲁莽行为和冲动的发病机制至关重要。我们开发了一种行为模型来评估这种学习模式。该程序使我们能够确定影响奖励寻求对概率性惩罚的抵抗力的生物和环境因素,并确定 mPFC 作为一个区域,根据条件的学习,灵活地调整其对冒险行为的反应。