了解奖励身份和时间。

Learning about reward identities and time.

作者信息

Delamater Andrew R, Siegel Daniel B, Tu Norman C

机构信息

Brooklyn College of the City University of New York, USA; Graduate Center of the City University of New York, USA.

Graduate Center of the City University of New York, USA.

出版信息

Behav Processes. 2023 Apr;207:104859. doi: 10.1016/j.beproc.2023.104859. Epub 2023 Mar 22.

DOI:10.1016/j.beproc.2023.104859

PMID:36963726

Abstract

We discuss three empirical findings that we think any theory attempting to integrate interval timing with associative learning concepts will need to address. These empirical phenomena all come from studies that combine peak timing procedures with reinforcer devaluation or conditional discrimination tasks commonly employed, respectively, in interval timing or associative learning research traditions. The three phenomena we discuss include: (1) the observation that disruptions in reward identity encoding have little to no impact on the encoding of reward time in the peak procedure (Delamateret al., 2018), (2) the findings that organisms tend to average their time estimates when presented with a stimulus compound consisting of separately learned stimuli indicating short or long reward times but that such temporal averaging, itself, is sensitive to post-conditioning selective reward devaluation, and (3) that rats can learn a temporal patterning task in which two stimuli presented independently indicate one time to reward availability while their compound indicates another. We review our prior results and present new findings illustrating these three phenomena and we discuss the special challenges they pose for cascade theories of timing, for multiple-oscillator models, and for any approach that attempts to integrate interval timing and associative models. We close by illustrating some ways in which multi-layer connectionist network models might begin to address some of our key findings. We believe this will require an approach that includes separate mechanisms that code for reward identity and time, but that does so in a way that permits for integration between the two systems.

摘要

我们讨论了三项实证研究结果，我们认为任何试图将间隔计时与联想学习概念相结合的理论都需要应对这些结果。这些实证现象均来自于将峰值计时程序分别与间隔计时或联想学习研究传统中常用的强化物贬值或条件辨别任务相结合的研究。我们讨论的三个现象包括：（1）观察到奖励身份编码的破坏对峰值程序中奖励时间的编码几乎没有影响（Delamater等人，2018）；（2）研究结果表明，当生物体面对由分别学习的指示短奖励时间或长奖励时间的刺激组成的刺激复合物时，它们倾向于平均其时间估计，但这种时间平均本身对条件后选择性奖励贬值敏感；（3）大鼠可以学习一种时间模式任务，其中独立呈现的两个刺激指示奖励可获得的一个时间，而它们的复合物指示另一个时间。我们回顾了我们之前的结果，并展示了说明这三个现象的新发现，我们讨论了它们对计时的级联理论、多振荡器模型以及任何试图整合间隔计时和联想模型的方法所带来的特殊挑战。我们通过说明多层连接主义网络模型可能开始解决我们一些关键发现的一些方式来结束本文。我们认为这将需要一种方法，该方法包括用于奖励身份和时间编码的单独机制，但要以允许两个系统之间进行整合的方式来实现。