Department of Pharmacology, Vanderbilt University, Nashville TN 37232.
Department of Pharmacology, Vanderbilt University, Nashville TN 37232
eNeuro. 2023 Sep 27;10(9). doi: 10.1523/ENEURO.0063-23.2023. Print 2023 Sep.
Behavioral strategies are often classified based on whether reinforcer value controls reinforcement. Value-sensitive behaviors, in which animals update their actions when reinforcer value is changed, are classified as goal-directed; conversely, value-insensitive actions, where behavior remains consistent when the reinforcer is removed or devalued, are considered habitual. Basic reinforcement schedules can help to bias behavior toward either process: random ratio (RR) schedules are thought to promote the formation of goal-directed behaviors while random intervals (RIs) promote habitual control. However, how the schedule-specific features of these tasks interact with other factors that influence learning to control behavior has not been well characterized. Using male and female mice, we asked how distinct food restriction levels, a strategy often used to increase task engagement, interact with RR and RI schedules to control performance during task acquisition and devaluation procedures. We determined that food restriction level has a stronger effect on the behavior of mice following RR schedules compared with RI schedules, and that it promotes a decrease in response rate during devaluation procedures that is best explained by the effects of extinction rather than devaluation. Surprisingly, food restriction accelerated the decrease in response rates observed following devaluation across sequential extinction sessions, but not within a single session. Our results support the idea that the relationships between schedules and behavioral control strategies are not clear-cut and suggest that an animal's engagement in a task must be accounted for, together with the structure of reinforcement schedules, to appropriately interpret the cognitive underpinnings of behavior.
行为策略通常根据强化值是否控制强化来分类。当强化值发生变化时,动物会更新其行为的价值敏感行为被归类为目标导向;相反,当强化物被移除或贬值时,行为保持一致的价值不敏感行为被认为是习惯的。基本强化程序可以帮助行为偏向于这两个过程:随机比率(RR)程序被认为可以促进目标导向行为的形成,而随机间隔(RI)程序则促进习惯控制。然而,这些任务的特定于计划的特征如何与影响控制行为的其他因素相互作用尚未得到很好的描述。使用雄性和雌性小鼠,我们询问了不同的食物限制水平(一种常用于增加任务参与度的策略)如何与 RR 和 RI 程序相互作用,以控制任务获得和贬值程序期间的表现。我们确定,与 RI 程序相比,食物限制水平对 RR 程序后小鼠的行为有更强的影响,并且它促进了在贬值过程中反应率的降低,这最好通过灭绝而不是贬值的影响来解释。令人惊讶的是,食物限制加速了在连续灭绝期间观察到的反应率下降,而不是在单个会话中。我们的结果支持这样一种观点,即计划和行为控制策略之间的关系不是一成不变的,并表明动物对任务的参与度必须与强化计划的结构一起考虑,以正确解释行为的认知基础。