Suppr超能文献

强化学习中有多少是工作记忆,而不是强化学习?一项行为、计算和神经遗传学分析。

How much of reinforcement learning is working memory, not reinforcement learning? A behavioral, computational, and neurogenetic analysis.

机构信息

Department of Cognitive, Linguistic and Psychological Sciences, Brown Institute for Brain Science, Brown University, Providence, RI, USA.

出版信息

Eur J Neurosci. 2012 Apr;35(7):1024-35. doi: 10.1111/j.1460-9568.2011.07980.x.

Abstract

Instrumental learning involves corticostriatal circuitry and the dopaminergic system. This system is typically modeled in the reinforcement learning (RL) framework by incrementally accumulating reward values of states and actions. However, human learning also implicates prefrontal cortical mechanisms involved in higher level cognitive functions. The interaction of these systems remains poorly understood, and models of human behavior often ignore working memory (WM) and therefore incorrectly assign behavioral variance to the RL system. Here we designed a task that highlights the profound entanglement of these two processes, even in simple learning problems. By systematically varying the size of the learning problem and delay between stimulus repetitions, we separately extracted WM-specific effects of load and delay on learning. We propose a new computational model that accounts for the dynamic integration of RL and WM processes observed in subjects' behavior. Incorporating capacity-limited WM into the model allowed us to capture behavioral variance that could not be captured in a pure RL framework even if we (implausibly) allowed separate RL systems for each set size. The WM component also allowed for a more reasonable estimation of a single RL process. Finally, we report effects of two genetic polymorphisms having relative specificity for prefrontal and basal ganglia functions. Whereas the COMT gene coding for catechol-O-methyl transferase selectively influenced model estimates of WM capacity, the GPR6 gene coding for G-protein-coupled receptor 6 influenced the RL learning rate. Thus, this study allowed us to specify distinct influences of the high-level and low-level cognitive functions on instrumental learning, beyond the possibilities offered by simple RL models.

摘要

工具性学习涉及皮质纹状体回路和多巴胺能系统。该系统通常在强化学习 (RL) 框架中通过逐步积累状态和动作的奖励值来建模。然而,人类学习也涉及到参与更高层次认知功能的前额叶皮质机制。这些系统的相互作用仍然知之甚少,人类行为模型通常忽略工作记忆 (WM),因此错误地将行为方差分配给 RL 系统。在这里,我们设计了一项任务,突出了这两个过程的深刻纠缠,即使在简单的学习问题中也是如此。通过系统地改变学习问题的大小和刺激重复之间的延迟,我们分别提取了 WM 特定的负载和延迟对学习的影响。我们提出了一个新的计算模型,该模型解释了观察到的受试者行为中 RL 和 WM 过程的动态整合。将容量有限的 WM 纳入模型中,使我们能够捕捉到即使在纯 RL 框架中,如果我们(不合理地)允许为每个集合大小设置单独的 RL 系统,也无法捕捉到的行为方差。WM 组件还允许对单个 RL 过程进行更合理的估计。最后,我们报告了两种遗传多态性的影响,这些多态性对前额叶和基底神经节功能具有相对特异性。编码儿茶酚-O-甲基转移酶的 COMT 基因选择性地影响 WM 容量的模型估计,而编码 G 蛋白偶联受体 6 的 GPR6 基因影响 RL 学习率。因此,这项研究使我们能够指定高水平和低水平认知功能对工具性学习的独特影响,而不仅仅是简单 RL 模型提供的可能性。

相似文献

4
Distentangling the systems contributing to changes in learning during adolescence.厘清导致青少年学习变化的系统。
Dev Cogn Neurosci. 2020 Feb;41:100732. doi: 10.1016/j.dcn.2019.100732. Epub 2019 Nov 14.
5
Relevance of working memory for reinforcement learning in older adults varies with timescale of learning.工作记忆与老年人强化学习的相关性随学习时间尺度而异。
Neuropsychol Dev Cogn B Aging Neuropsychol Cogn. 2020 Sep;27(5):654-676. doi: 10.1080/13825585.2019.1664389. Epub 2019 Sep 22.
6
Working Memory Load Strengthens Reward Prediction Errors.工作记忆负荷增强奖励预测误差。
J Neurosci. 2017 Apr 19;37(16):4332-4342. doi: 10.1523/JNEUROSCI.2700-16.2017. Epub 2017 Mar 20.
10
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.

引用本文的文献

3
Social inequity disrupts reward-based learning.社会不平等会扰乱基于奖励的学习。
Commun Psychol. 2025 Aug 16;3(1):125. doi: 10.1038/s44271-025-00300-y.

本文引用的文献

1
The number and quality of representations in working memory.工作记忆中的表象数量和质量。
Psychol Sci. 2011 Nov;22(11):1434-41. doi: 10.1177/0956797611417006. Epub 2011 Oct 10.
2
A neural signature of hierarchical reinforcement learning.分层强化学习的神经特征。
Neuron. 2011 Jul 28;71(2):370-9. doi: 10.1016/j.neuron.2011.05.042.
8
Neurogenetics and pharmacology of learning, motivation, and cognition.学习、动机和认知的神经遗传学和药理学。
Neuropsychopharmacology. 2011 Jan;36(1):133-52. doi: 10.1038/npp.2010.96. Epub 2010 Jul 14.

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验