Information about action outcomes differentially affects learning from self-determined versus imposed choices.

Affiliations

Institut Jean Nicod, Département d'Études Cognitives, École Normale Supérieure, EHESS, CNRS, PSL University, Paris, France.

Laboratoire de Neurosciences Cognitives et Computationnelles, Département d'Études Cognitives, École Normale Supérieure, INSERM, PSL University, Paris, France.

Publication information

Nat Hum Behav. 2020 Oct;4(10):1067-1079. doi: 10.1038/s41562-020-0919-5. Epub 2020 Aug 3.

DOI: 10.1038/s41562-020-0919-5
PMID: 32747804

Abstract

The valence of new information influences learning rates in humans: good news tends to receive more weight than bad news. We investigated this learning bias in four experiments, by systematically manipulating the source of required action (free versus forced choices), outcome contingencies (low versus high reward) and motor requirements (go versus no-go choices). Analysis of model-estimated learning rates showed that the confirmation bias in learning rates was specific to free choices, but was independent of outcome contingencies. The bias was also unaffected by the motor requirements, thus suggesting that it operates in the representational space of decisions, rather than motoric actions. Finally, model simulations revealed that learning rates estimated from the choice-confirmation model had the effect of maximizing performance across low- and high-reward environments. We therefore suggest that choice-confirmation bias may be adaptive for efficient learning of action-outcome contingencies, above and beyond fostering person-level dispositions such as self-esteem.
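
To make the notion of valence-dependent learning rates concrete, here is a minimal sketch of a value update in which free choices use separate learning rates for confirming and disconfirming outcomes, while forced choices use a single rate. This is an illustration under stated assumptions, not the authors' fitted model: the abstract does not give the model equations, and the function name `update_value`, the parameter names, the learning-rate values, and the +1/-1 outcome coding are all hypothetical.

```python
def update_value(q, reward, free_choice,
                 alpha_confirm=0.3, alpha_disconfirm=0.1, alpha_forced=0.2):
    """Return the updated value estimate of the chosen option after one outcome.

    All default learning rates are hypothetical placeholders, not estimates
    reported in the paper.
    """
    # Prediction error: how much better or worse the outcome was than expected.
    delta = reward - q
    if free_choice:
        # Assumed choice-confirmation bias: outcomes that confirm the chosen
        # option (positive prediction error) are weighted more heavily.
        alpha = alpha_confirm if delta > 0 else alpha_disconfirm
    else:
        # Assumed unbiased update when the choice was imposed.
        alpha = alpha_forced
    return q + alpha * delta


if __name__ == "__main__":
    for label, free in (("free", True), ("forced", False)):
        gain = update_value(0.0, 1.0, free)
        loss = update_value(0.0, -1.0, free)
        print(f"{label}: after +1 -> {gain:+.2f}, after -1 -> {loss:+.2f}")
```

With these placeholder values, a free choice moves the estimate three times as far after a good outcome as after a bad one, whereas a forced choice moves it symmetrically.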
