人类概率奖励学习中的神经结构映射。

Neural structure mapping in human probabilistic reward learning.

机构信息

Department of Experimental Psychology, University of Oxford, Oxford, United Kingdom.

Wellcome Centre for Integrative Neuroimaging, University of Oxford, Oxford, United Kingdom.

出版信息

Elife. 2019 Mar 7;8:e42816. doi: 10.7554/eLife.42816.

DOI:10.7554/eLife.42816

PMID:30843789

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6405242/

Abstract

Humans can learn abstract concepts that describe invariances over relational patterns in data. One such concept, known as magnitude, allows stimuli to be compactly represented on a single dimension (i.e. on a mental line). Here, we measured representations of magnitude in humans by recording neural signals whilst they viewed symbolic numbers. During a subsequent reward-guided learning task, the neural patterns elicited by novel complex visual images reflected their payout probability in a way that suggested they were encoded onto the same mental number line, with 'bad' bandits sharing neural representation with 'small' numbers and 'good' bandits with 'large' numbers. Using neural network simulations, we provide a mechanistic model that explains our findings and shows how structural alignment can promote transfer learning. Our findings suggest that in humans, learning about reward probability is accompanied by structural alignment of value representations with neural codes for the abstract concept of magnitude.

摘要

人类可以学习描述数据中关系模式不变性的抽象概念。其中一个概念称为“数量”，它允许刺激在单个维度上（即在心理线上）进行紧凑表示。在这里，我们通过记录人类在观看符号数字时的神经信号来测量数量的表示。在随后的奖励引导学习任务中，新的复杂视觉图像引起的神经模式以一种表明它们被编码到相同的心理数字线上的方式反映了它们的支付概率，其中“坏”的赌徒与“小”的数字具有相同的神经表示，而“好”的赌徒与“大”的数字具有相同的神经表示。使用神经网络模拟，我们提供了一个机械模型，解释了我们的发现，并展示了结构对齐如何促进迁移学习。我们的发现表明，在人类中，对奖励概率的学习伴随着价值表示与数量的抽象概念的神经编码的结构对齐。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/6c9b/6405242/6fbf798ded15/elife-42816-fig1.jpg

相似文献

Neural structure mapping in human probabilistic reward learning.人类概率奖励学习中的神经结构映射。

Elife. 2019 Mar 7;8:e42816. doi: 10.7554/eLife.42816.

Learning what matters: A neural explanation for the sparsity bias.学习重要的东西：稀疏偏差的神经解释。

Int J Psychophysiol. 2018 May;127:62-72. doi: 10.1016/j.ijpsycho.2018.03.006. Epub 2018 Mar 15.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.我们如何学习做决策：强化学习预测错误在人类中的快速传播。

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Correlates of reward-predictive value in learning-related hippocampal neural activity.学习相关海马体神经活动中奖励预测价值的相关因素。

Hippocampus. 2009 May;19(5):487-506. doi: 10.1002/hipo.20535.

Neural state space alignment for magnitude generalization in humans and recurrent networks.用于人类和递归网络中幅度泛化的神经状态空间对准。

Neuron. 2021 Apr 7;109(7):1214-1226.e8. doi: 10.1016/j.neuron.2021.02.004. Epub 2021 Feb 23.

Reward and avoidance learning in the context of aversive environments and possible implications for depressive symptoms.在厌恶环境背景下的奖励和回避学习及其对抑郁症状的可能影响。

Psychopharmacology (Berl). 2019 Aug;236(8):2437-2449. doi: 10.1007/s00213-019-05299-9. Epub 2019 Jun 28.

Neural evidence for adaptive strategy selection in value-based decision-making.基于价值的决策中适应性策略选择的神经证据。

Cereb Cortex. 2014 Aug;24(8):2009-21. doi: 10.1093/cercor/bht049. Epub 2013 Mar 8.

Reward-based training of recurrent neural networks for cognitive and value-based tasks.用于认知和基于价值任务的循环神经网络的基于奖励的训练。

Elife. 2017 Jan 13;6:e21492. doi: 10.7554/eLife.21492.

Adult age differences in frontostriatal representation of prediction error but not reward outcome.预测误差而非奖励结果在额纹状体表征中的成人年龄差异。

Cogn Affect Behav Neurosci. 2014 Jun;14(2):672-82. doi: 10.3758/s13415-014-0297-4.

Dopaminergic modulation of the trade-off between probability and time in economic decision-making.经济决策中概率与时间权衡的多巴胺能调节

Eur Neuropsychopharmacol. 2015 Jun;25(6):817-27. doi: 10.1016/j.euroneuro.2015.02.011. Epub 2015 Mar 16.

引用本文的文献

Trait anxiety is associated with reduced reward-related replay at rest.特质焦虑与静息时与奖励相关的重演减少有关。

Nat Commun. 2025 Aug 26;16(1):7975. doi: 10.1038/s41467-025-63281-w.

The neural dynamics of loss aversion.损失厌恶的神经动力学

Imaging Neurosci (Camb). 2023 Dec 18;1. doi: 10.1162/imag_a_00047. eCollection 2023.

Distinct neural representational geometries of numerosity in early visual and association regions across visual streams.跨视觉流的早期视觉区域和联合区域中数字的不同神经表征几何结构。

Commun Biol. 2025 Jul 9;8(1):1029. doi: 10.1038/s42003-025-08395-z.

Thinking as Analogy-Making: Toward a Neural Process Account of General Intelligence.作为类比推理的思维：迈向关于一般智力的神经过程解释。

J Neurosci. 2025 Apr 30;45(18):e1555242025. doi: 10.1523/JNEUROSCI.1555-24.2025.

Humans actively reconfigure neural task states.人类会主动重新配置神经任务状态。

bioRxiv. 2025 Feb 28:2024.09.29.615736. doi: 10.1101/2024.09.29.615736.

A geometrical solution underlies general neural principle for serial ordering.一种几何解为序列排序的一般神经原则提供了基础。

Nat Commun. 2024 Sep 19;15(1):8238. doi: 10.1038/s41467-024-52240-6.

Emergent neural dynamics and geometry for generalization in a transitive inference task.在传递性推理任务中用于泛化的紧急神经动力学和几何形状。

PLoS Comput Biol. 2024 Apr 25;20(4):e1011954. doi: 10.1371/journal.pcbi.1011954. eCollection 2024 Apr.

Inferior parietal cortex represents relational structures for explicit transitive inference.下顶叶皮层为外显传递推理的关系结构提供了表征。

Cereb Cortex. 2024 Apr 1;34(4). doi: 10.1093/cercor/bhae137.

Identifying content-invariant neural signatures of perceptual vividness.识别感知生动性的内容不变神经特征。

PNAS Nexus. 2024 Feb 14;3(2):pgae061. doi: 10.1093/pnasnexus/pgae061. eCollection 2024 Feb.

The parieto-occipital cortex is a candidate neural substrate for the human ability to approximate Bayesian inference.顶枕叶皮层是人类进行近似贝叶斯推理能力的候选神经基础。

Commun Biol. 2024 Feb 9;7(1):165. doi: 10.1038/s42003-024-05821-6.

本文引用的文献

Inferring exemplar discriminability in brain representations.在大脑表征中推断范例可辨别性。

PLoS One. 2020 Jun 10;15(6):e0232551. doi: 10.1371/journal.pone.0232551. eCollection 2020.

Selective overweighting of larger magnitudes during noisy numerical comparison.在有噪声的数字比较过程中对较大数值进行选择性加权。

Nat Hum Behav. 2017 Jul 17;1(8):145. doi: 10.1038/s41562-017-0145.

Putting the pieces together: Generating a novel representational space through deductive reasoning.将碎片组合在一起：通过演绎推理生成新的表示空间。

Neuroimage. 2018 Dec;183:99-111. doi: 10.1016/j.neuroimage.2018.07.062. Epub 2018 Aug 4.

Number concepts: abstract and embodied.数字概念：抽象与具体。

Philos Trans R Soc Lond B Biol Sci. 2018 Aug 5;373(1752). doi: 10.1098/rstb.2017.0125.

Decoding Digits and Dice with Magnetoencephalography: Evidence for a Shared Representation of Magnitude.用脑磁图解码数字和骰子：数量的共享表示的证据。

J Cogn Neurosci. 2018 Jul;30(7):999-1010. doi: 10.1162/jocn_a_01257. Epub 2018 Mar 21.

Flexible timing by temporal scaling of cortical responses.通过皮质响应的时间缩放实现灵活的定时。

Nat Neurosci. 2018 Jan;21(1):102-110. doi: 10.1038/s41593-017-0028-6. Epub 2017 Dec 4.

Neural correlates of evidence accumulation during value-based decisions revealed via simultaneous EEG-fMRI.基于价值的决策中证据积累的神经关联通过同时 EEG-fMRI 揭示。

Nat Commun. 2017 Jun 9;8:15808. doi: 10.1038/ncomms15808.

Building machines that learn and think like people.建造像人一样学习和思考的机器。

Behav Brain Sci. 2017 Jan;40:e253. doi: 10.1017/S0140525X16001837. Epub 2016 Nov 24.

Toward the neural implementation of structure learning.迈向结构学习的神经实现。

Curr Opin Neurobiol. 2016 Apr;37:99-105. doi: 10.1016/j.conb.2016.01.014. Epub 2016 Feb 11.

The classic P300 encodes a build-to-threshold decision variable.经典的P300编码一个累积至阈值的决策变量。

Eur J Neurosci. 2015 Jul;42(1):1636-43. doi: 10.1111/ejn.12936. Epub 2015 May 28.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

人类概率奖励学习中的神经结构映射。

Neural structure mapping in human probabilistic reward learning.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献