Suppr超能文献

封闭训练与交错训练对相对价值学习的影响。

Effects of blocked versus interleaved training on relative value learning.

作者信息

Hayes William M, Wedell Douglas H

机构信息

Department of Psychology, University of South Carolina, 1512 Pendleton St, Columbia, SC, 29208, USA.

出版信息

Psychon Bull Rev. 2023 Oct;30(5):1895-1907. doi: 10.3758/s13423-023-02290-6. Epub 2023 Apr 18.

Abstract

In reinforcement learning tasks, people learn the values of options relative to other options in the local context. Prior research suggests that relative value learning is enhanced when choice contexts are temporally clustered in a blocked sequence compared to a randomly interleaved sequence. The present study was aimed at further investigating the effects of blocked versus interleaved training using a choice task that distinguishes among different contextual encoding models. Our results showed that the presentation format in which contexts are experienced can lead to qualitatively distinct forms of relative value learning. This conclusion was supported by a combination of model-free and model-based analyses. In the blocked condition, choice behavior was most consistent with a reference point model in which outcomes are encoded relative to a dynamic estimate of the contextual average reward. In contrast, the interleaved condition was best described by a range-frequency encoding model. We propose that blocked training makes it easier to track contextual outcome statistics, such as the average reward, which may then be used to relativize the values of experienced outcomes. When contexts are interleaved, range-frequency encoding may serve as a more efficient means of storing option values in memory for later retrieval.

摘要

在强化学习任务中,人们在局部情境中学习选项相对于其他选项的价值。先前的研究表明,与随机交错序列相比,当选择情境按时间顺序聚类成一个分块序列时,相对价值学习会得到增强。本研究旨在使用一种能区分不同情境编码模型的选择任务,进一步探究分块训练与交错训练的效果。我们的结果表明,体验情境的呈现格式会导致相对价值学习出现质的不同形式。这一结论得到了无模型分析和基于模型分析的共同支持。在分块条件下,选择行为最符合一个参考点模型,在该模型中,结果是相对于情境平均奖励的动态估计进行编码的。相比之下,交错条件最好用范围频率编码模型来描述。我们提出,分块训练使跟踪情境结果统计信息(如平均奖励)变得更容易,然后这些信息可用于将所体验结果的价值相对化。当情境交错时,范围频率编码可能是一种在记忆中存储选项价值以便日后检索的更有效方式。

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验