Tosun Tuğçe, Gür Ezgi, Balcı Fuat
Department of Psychology, Koç University, Rumelifeneri Yolu, Sarıyer, Istanbul 34450, Turkey.
Department of Psychology, Koç University, Rumelifeneri Yolu, Sarıyer, Istanbul 34450, Turkey
Proc Natl Acad Sci U S A. 2016 Jan 19;113(3):787-92. doi: 10.1073/pnas.1518316113. Epub 2016 Jan 5.
Animals can shape their timed behaviors based on experienced probabilistic relations in a nearly optimal fashion. On the other hand, it is not clear if they adopt these timed decisions by making computations based on previously learnt task parameters (time intervals, locations, and probabilities) or if they gradually develop their decisions based on trial and error. To address this question, we tested mice in the timed-switching task, which required them to anticipate when (after a short or long delay) and at which of the two delay locations a reward would be presented. The probability of short trials differed between test groups in two experiments. Critically, we first trained mice on relevant task parameters by signaling the active trial with a discriminative stimulus and delivered the corresponding reward after the associated delay without any response requirement (without inducing switching behavior). During the test phase, both options were presented simultaneously to characterize the emergence and temporal characteristics of the switching behavior. Mice exhibited timed-switching behavior starting from the first few test trials, and their performance remained stable throughout testing in the majority of the conditions. Furthermore, as the probability of the short trial increased, mice waited longer before switching from the short to long location (experiment 1). These behavioral adjustments were in directions predicted by reward maximization. These results suggest that rather than gradually adjusting their time-dependent choice behavior, mice abruptly adopted temporal decision strategies by directly integrating their previous knowledge of task parameters into their timed behavior, supporting the model-based representational account of temporal risk assessment.
动物能够以近乎最优的方式,根据经验性概率关系来塑造其定时行为。另一方面,目前尚不清楚它们是通过基于先前学习的任务参数(时间间隔、位置和概率)进行计算来做出这些定时决策,还是基于试错逐渐形成决策。为了解决这个问题,我们在定时切换任务中对小鼠进行了测试,该任务要求它们预测奖励将在何时(短延迟或长延迟之后)以及在两个延迟位置中的哪一个出现。在两个实验中,测试组之间短试验的概率有所不同。关键的是,我们首先通过用辨别性刺激标记活跃试验,让小鼠对相关任务参数进行训练,并在相关延迟后给予相应奖励,且没有任何反应要求(不诱导切换行为)。在测试阶段,同时呈现两种选择,以表征切换行为的出现和时间特征。小鼠从最初的几次测试试验开始就表现出定时切换行为,并且在大多数情况下,它们在整个测试过程中的表现保持稳定。此外,随着短试验概率的增加,小鼠在从短位置切换到长位置之前等待的时间更长(实验1)。这些行为调整符合奖励最大化预测的方向。这些结果表明,小鼠并非逐渐调整其与时间相关的选择行为,而是通过将先前对任务参数的了解直接整合到其定时行为中,突然采用了时间决策策略,这支持了基于模型的时间风险评估表征解释。