Dynamic Interaction between Reinforcement Learning and Attention in Multidimensional Environments.

Authors

Leong Yuan Chang, Radulescu Angela, Daniel Reka, DeWoskin Vivian, Niv Yael

Affiliations

Department of Psychology, Stanford University, Stanford, CA 94305, USA.

Department of Psychology, Princeton University, Princeton, NJ 08544, USA.

Publication information

Neuron. 2017 Jan 18;93(2):451-463. doi: 10.1016/j.neuron.2016.12.040.

DOI: 10.1016/j.neuron.2016.12.040

PMID: 28103483

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5287409/

Abstract

Little is known about the relationship between attention and learning during decision making. Using eye tracking and multivariate pattern analysis of fMRI data, we measured participants' dimensional attention as they performed a trial-and-error learning task in which only one of three stimulus dimensions was relevant for reward at any given time. Analysis of participants' choices revealed that attention biased both value computation during choice and value update during learning. Value signals in the ventromedial prefrontal cortex and prediction errors in the striatum were similarly biased by attention. In turn, participants' focus of attention was dynamically modulated by ongoing learning. Attentional switches across dimensions correlated with activity in a frontoparietal attention network, which showed enhanced connectivity with the ventromedial prefrontal cortex between switches. Our results suggest a bidirectional interaction between attention and learning: attention constrains learning to relevant dimensions of the environment, while we learn what to attend to via trial and error.
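
The two effects of attention described in the abstract, on value computation during choice and on value update during learning, can be illustrated with a minimal sketch. The abstract does not give equations, so everything here is an assumption made for illustration: the function names, the additive attention-weighted value, the learning rate, and the example attention weights are hypothetical and should not be read as the authors' fitted model.

```python
def stimulus_value(values, attention, stimulus):
    """Attention-weighted sum of the feature values of one stimulus.

    values[d][f] is the learned value of feature f on dimension d,
    attention[d] is the (assumed) attention weight on dimension d,
    stimulus[d] is the index of the feature the stimulus has on dimension d.
    """
    return sum(attention[d] * values[d][f] for d, f in enumerate(stimulus))


def update_values(values, attention, stimulus, reward, learning_rate=0.5):
    """Apply one attention-weighted prediction-error update in place."""
    # Prediction error: obtained reward minus the attention-weighted value.
    delta = reward - stimulus_value(values, attention, stimulus)
    for d, f in enumerate(stimulus):
        # Features on attended dimensions receive a larger share of the update.
        values[d][f] += learning_rate * attention[d] * delta
    return delta


# Three dimensions with three features each; all values start at zero.
values = [[0.0, 0.0, 0.0] for _ in range(3)]
attention = [0.8, 0.1, 0.1]  # hypothetical attention weights that sum to 1
stimulus = (0, 2, 1)         # feature index of the chosen stimulus per dimension
delta = update_values(values, attention, stimulus, reward=1.0)
```

In this sketch the same weights enter twice: once when the chosen stimulus is valued and once when the prediction error is distributed across its features, so learning is concentrated on the attended dimension.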

Similar articles

Learning relative values in the striatum induces violations of normative decision making.

Nat Commun. 2017 Jun 20;8:16033. doi: 10.1038/ncomms16033.

Reinforcement learning in multidimensional environments relies on attention mechanisms.

J Neurosci. 2015 May 27;35(21):8145-57. doi: 10.1523/JNEUROSCI.2978-14.2015.

Reward-dependent learning in neuronal networks for planning and decision making.

Prog Brain Res. 2000;126:217-29. doi: 10.1016/S0079-6123(00)26016-0.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Parametric modulation of reward sequences during a reversal task in ACC and VMPFC but not amygdala and striatum.

Neuroimage. 2016 Dec;143:50-57. doi: 10.1016/j.neuroimage.2016.09.022. Epub 2016 Sep 10.

Role of the medial prefrontal cortex in impaired decision making in juvenile attention-deficit/hyperactivity disorder.

JAMA Psychiatry. 2014 Oct;71(10):1165-73. doi: 10.1001/jamapsychiatry.2014.1093.

Intact Reinforcement Learning But Impaired Attentional Control During Multidimensional Probabilistic Learning in Older Adults.

J Neurosci. 2020 Jan 29;40(5):1084-1096. doi: 10.1523/JNEUROSCI.0254-19.2019. Epub 2019 Dec 11.

Individual differences and the neural representations of reward expectation and reward prediction error.

Soc Cogn Affect Neurosci. 2007 Mar;2(1):20-30. doi: 10.1093/scan/nsl021.

Separate mesocortical and mesolimbic pathways encode effort and reward learning signals.

Proc Natl Acad Sci U S A. 2017 Aug 29;114(35):E7395-E7404. doi: 10.1073/pnas.1705643114. Epub 2017 Aug 14.

Cited by

How working memory and reinforcement learning interact when avoiding punishment and pursuing reward concurrently.

J Exp Psychol Gen. 2025 Sep 1. doi: 10.1037/xge0001817.

Abstract rule learning promotes cognitive flexibility in complex environments across species.

Nat Commun. 2025 Jun 25;16(1):5396. doi: 10.1038/s41467-025-60943-7.

Impaired reinforcement learning and coding of prediction errors in patients with cerebellar degeneration - a study with EEG and voxel-based morphometry.

Cogn Affect Behav Neurosci. 2025 May 28. doi: 10.3758/s13415-025-01303-2.

Reinforcement learning increasingly relates to memory specificity from childhood to adulthood.

Nat Commun. 2025 Apr 30;16(1):4074. doi: 10.1038/s41467-025-59379-w.

Humans learn generalizable representations through efficient coding.

Nat Commun. 2025 Apr 29;16(1):3989. doi: 10.1038/s41467-025-58848-6.

Negative affect-driven impulsivity as hierarchical model-based overgeneralization.

Trends Cogn Sci. 2025 May;29(5):407-420. doi: 10.1016/j.tics.2025.01.002. Epub 2025 Feb 6.

Intracranial substrates of meditation-induced neuromodulation in the amygdala and hippocampus.

Proc Natl Acad Sci U S A. 2025 Feb 11;122(6):e2409423122. doi: 10.1073/pnas.2409423122. Epub 2025 Feb 4.

Error-driven upregulation of memory representations.

Commun Psychol. 2025 Jan 30;3(1):17. doi: 10.1038/s44271-025-00199-5.

Rewards transiently and automatically enhance sustained attention.

J Exp Psychol Gen. 2025 Apr;154(4):1063-1079. doi: 10.1037/xge0001727. Epub 2025 Jan 20.

Schemas, reinforcement learning and the medial prefrontal cortex.

Nat Rev Neurosci. 2025 Mar;26(3):141-157. doi: 10.1038/s41583-024-00893-z. Epub 2025 Jan 7.
