训练多样性促进绝对值引导选择。

Training diversity promotes absolute-value-guided choice.

机构信息

The Hebrew University of Jerusalem, Jerusalem, Israel.

出版信息

PLoS Comput Biol. 2022 Nov 2;18(11):e1010664. doi: 10.1371/journal.pcbi.1010664. eCollection 2022 Nov.

DOI:10.1371/journal.pcbi.1010664

PMID:36322560

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9678339/

Abstract

Many decision-making studies have demonstrated that humans learn either expected values or relative preferences among choice options, yet little is known about what environmental conditions promote one strategy over the other. Here, we test the novel hypothesis that humans adapt the degree to which they form absolute values to the diversity of the learning environment. Since absolute values generalize better to new sets of options, we predicted that the more options a person learns about the more likely they would be to form absolute values. To test this, we designed a multi-day learning experiment comprising twenty learning sessions in which subjects chose among pairs of images each associated with a different probability of reward. We assessed the degree to which subjects formed absolute values and relative preferences by asking them to choose between images they learned about in separate sessions. We found that concurrently learning about more images within a session enhanced absolute-value, and suppressed relative-preference, learning. Conversely, cumulatively pitting each image against a larger number of other images across multiple sessions did not impact the form of learning. These results show that the way humans encode preferences is adapted to the diversity of experiences offered by the immediate learning context.

摘要

许多决策研究表明，人类学习的是选择选项的预期值或相对偏好，但对于什么环境条件促进一种策略而不是另一种策略知之甚少。在这里，我们检验了一个新颖的假设，即人类会根据学习环境的多样性来调整形成绝对价值的程度。由于绝对价值可以更好地推广到新的选项集，我们预测一个人学习的选项越多，他们就越有可能形成绝对价值。为了验证这一点，我们设计了一个为期多天的学习实验，由二十个学习阶段组成，每个阶段参与者在两组图像中进行选择，每组图像都与不同的奖励概率相关联。我们通过让参与者在单独的阶段中选择他们学习过的图像，来评估他们形成绝对价值和相对偏好的程度。我们发现，在一个阶段内同时学习更多的图像可以增强绝对价值学习，并抑制相对偏好学习。相反，在多个阶段中，将每张图像与更多的其他图像进行累计比较，并不会影响学习的形式。这些结果表明，人类编码偏好的方式适应了即时学习环境提供的多样性体验。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/40bc/9678339/4b058b235ad9/pcbi.1010664.g001.jpg

相似文献

Training diversity promotes absolute-value-guided choice.

PLoS Comput Biol. 2022 Nov 2;18(11):e1010664. doi: 10.1371/journal.pcbi.1010664. eCollection 2022 Nov.

Context-dependent preferences in starlings: linking ecology, foraging and choice.

PLoS One. 2013 May 21;8(5):e64934. doi: 10.1371/journal.pone.0064934. Print 2013.

The effect of preference learning on context effects in multi-alternative, multi-attribute choice.

Cognition. 2023 Apr;233:105365. doi: 10.1016/j.cognition.2022.105365. Epub 2022 Dec 30.

The Good, the Bad, and the Irrelevant: Neural Mechanisms of Learning Real and Hypothetical Rewards and Effort.

J Neurosci. 2015 Aug 12;35(32):11233-51. doi: 10.1523/JNEUROSCI.0396-15.2015.

Preferences for nutrients and sensory food qualities identify biological sources of economic values in monkeys.

Proc Natl Acad Sci U S A. 2021 Jun 29;118(26). doi: 10.1073/pnas.2101954118.

Influence of learning strategy on response time during complex value-based learning and choice.

PLoS One. 2018 May 22;13(5):e0197263. doi: 10.1371/journal.pone.0197263. eCollection 2018.

Intrinsic motivation for choice varies with individual risk attitudes and the controllability of the environment.

PLoS Comput Biol. 2023 Aug 11;19(8):e1010551. doi: 10.1371/journal.pcbi.1010551. eCollection 2023 Aug.

Impaired Expected Value Computations Coupled With Overreliance on Stimulus-Response Learning in Schizophrenia.

Biol Psychiatry Cogn Neurosci Neuroimaging. 2018 Nov;3(11):916-926. doi: 10.1016/j.bpsc.2018.03.014. Epub 2018 Apr 3.

Visual fixation patterns during economic choice reflect covert valuation processes that emerge with learning.

Proc Natl Acad Sci U S A. 2019 Nov 5;116(45):22795-22801. doi: 10.1073/pnas.1906662116. Epub 2019 Oct 21.

Learned valuation during forage decision-making in cuttlefish.

R Soc Open Sci. 2020 Dec 16;7(12):201602. doi: 10.1098/rsos.201602. eCollection 2020 Dec.

引用本文的文献

Experience-based risk taking is primarily shaped by prior learning rather than by decision-making.

Nat Commun. 2025 Jul 9;16(1):6310. doi: 10.1038/s41467-025-61609-0.

The Experience-Experience Gap: Distributional Learning Is Associated with a Divergence of Preferences from Estimations.

Res Sq. 2025 Apr 10:rs.3.rs-6282612. doi: 10.21203/rs.3.rs-6282612/v1.

Emotions as computations.

Neurosci Biobehav Rev. 2023 Jan;144:104977. doi: 10.1016/j.neubiorev.2022.104977. Epub 2022 Nov 24.

本文引用的文献

Value-free reinforcement learning: policy optimization as a minimal model of operant behavior.

Curr Opin Behav Sci. 2021 Oct;41:114-121. doi: 10.1016/j.cobeha.2021.04.020. Epub 2021 May 28.

Reinforcement learning in and out of context: The effects of attentional focus.

J Exp Psychol Learn Mem Cogn. 2023 Aug;49(8):1193-1217. doi: 10.1037/xlm0001145. Epub 2022 Jul 4.

Human value learning and representation reflect rational adaptation to task demands.

Nat Hum Behav. 2022 Sep;6(9):1268-1279. doi: 10.1038/s41562-022-01360-4. Epub 2022 May 30.

Humans perseverate on punishment avoidance goals in multigoal reinforcement learning.

Elife. 2022 Feb 24;11:e74402. doi: 10.7554/eLife.74402.

Asymmetric reinforcement learning facilitates human inference of transitive relations.

Nat Hum Behav. 2022 Apr;6(4):555-564. doi: 10.1038/s41562-021-01263-w. Epub 2022 Jan 31.

Memory and decision making interact to shape the value of unchosen options.

Nat Commun. 2021 Jul 30;12(1):4648. doi: 10.1038/s41467-021-24907-x.

Context-sensitive valuation and learning.

Curr Opin Behav Sci. 2021 Oct;41:122-127. doi: 10.1016/j.cobeha.2021.05.001. Epub 2021 Jun 9.

The case against economic values in the orbitofrontal cortex (or anywhere else in the brain).

Behav Neurosci. 2021 Apr;135(2):192-201. doi: 10.1037/bne0000448.

Two sides of the same coin: Beneficial and detrimental consequences of range adaptation in human reinforcement learning.

Sci Adv. 2021 Apr 2;7(14). doi: 10.1126/sciadv.abe0340. Print 2021 Apr.

It's all relative: Reward-induced cognitive control modulation depends on context.

J Exp Psychol Gen. 2021 Feb;150(2):306-313. doi: 10.1037/xge0000842. Epub 2020 Aug 13.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

训练多样性促进绝对值引导选择。

Training diversity promotes absolute-value-guided choice.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献

本文引用的文献