Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings.

Affiliation

University of New South Wales, Sydney, Australia.

Publication information

PLoS Comput Biol. 2011 Jan 20;7(1):e1001048. doi: 10.1371/journal.pcbi.1001048.

DOI:10.1371/journal.pcbi.1001048
PMID:21283774
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC3024253/
Abstract

Recently, evidence has emerged that humans approach learning using Bayesian updating rather than (model-free) reinforcement algorithms in a six-arm restless bandit problem. Here, we investigate what this implies for human appreciation of uncertainty. In our task, a Bayesian learner distinguishes three equally salient levels of uncertainty. First, the Bayesian perceives irreducible uncertainty or risk: even knowing the payoff probabilities of a given arm, the outcome remains uncertain. Second, there is (parameter) estimation uncertainty or ambiguity: payoff probabilities are unknown and need to be estimated. Third, the outcome probabilities of the arms change: the sudden jumps are referred to as unexpected uncertainty. We document how the three levels of uncertainty evolved during the course of our experiment and how it affected the learning rate. We then zoom in on estimation uncertainty, which has been suggested to be a driving force in exploration, in spite of evidence of widespread aversion to ambiguity. Our data corroborate the latter. We discuss neural evidence that foreshadowed the ability of humans to distinguish between the three levels of uncertainty. Finally, we investigate the boundaries of human capacity to implement Bayesian learning. We repeat the experiment with different instructions, reflecting varying levels of structural uncertainty. Under this fourth notion of uncertainty, choices were no better explained by Bayesian updating than by (model-free) reinforcement learning. Exit questionnaires revealed that participants remained unaware of the presence of unexpected uncertainty and failed to acquire the right model with which to implement Bayesian updating.
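
The three levels of uncertainty named in the abstract can be made concrete with a small sketch. This is not the authors' model: it assumes a single arm with a binary payoff, a Beta prior, and a hypothetical shortcut for unexpected uncertainty in which past evidence is discounted toward the prior by an assumed jump probability (`jump_prob`). The names `BetaArm`, `jump_prob`, and `delta_rule` are illustrative only. Risk is taken as the Bernoulli outcome variance at the posterior mean, and estimation uncertainty as the variance of the Beta posterior.

```python
from dataclasses import dataclass


@dataclass
class BetaArm:
    """Beta-Bernoulli belief about one arm's payoff probability (illustrative)."""
    alpha: float = 1.0  # prior plus observed rewarded trials
    beta: float = 1.0   # prior plus observed unrewarded trials

    def mean(self) -> float:
        return self.alpha / (self.alpha + self.beta)

    def risk(self) -> float:
        # Irreducible uncertainty: Bernoulli outcome variance at the posterior mean.
        p = self.mean()
        return p * (1.0 - p)

    def estimation_uncertainty(self) -> float:
        # Reducible uncertainty: variance of the Beta posterior over the probability.
        n = self.alpha + self.beta
        return self.alpha * self.beta / (n * n * (n + 1.0))

    def update(self, reward: int, jump_prob: float = 0.0) -> float:
        """Observe reward in {0, 1}; return the effective learning rate.

        Assumption: unexpected uncertainty is approximated by shrinking the
        accumulated counts toward the uniform prior before each observation.
        """
        keep = 1.0 - jump_prob
        self.alpha = 1.0 + keep * (self.alpha - 1.0)
        self.beta = 1.0 + keep * (self.beta - 1.0)
        # The posterior mean moves by rate * (reward - mean).
        rate = 1.0 / (self.alpha + self.beta + 1.0)
        self.alpha += reward
        self.beta += 1 - reward
        return rate


def delta_rule(value: float, reward: float, rate: float = 0.1) -> float:
    """Model-free comparison: fixed learning rate, no explicit uncertainty."""
    return value + rate * (reward - value)


if __name__ == "__main__":
    stable, volatile = BetaArm(), BetaArm()
    for _ in range(20):
        rate_stable = stable.update(1)
        rate_volatile = volatile.update(1, jump_prob=0.2)
    print("learning rate:", rate_stable, rate_volatile)
    print("estimation uncertainty:",
          stable.estimation_uncertainty(), volatile.estimation_uncertainty())
```

Under these assumptions, the stable learner's rate decays as evidence accumulates, while an assumed nonzero jump probability keeps both the learning rate and the estimation uncertainty from shrinking toward zero. The fixed-rate `delta_rule` has no counterpart to either quantity.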


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/d820056f3c62/pcbi.1001048.g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/13c3de72ed00/pcbi.1001048.g002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/dfca047246ef/pcbi.1001048.g003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/ba3b7fd72ad8/pcbi.1001048.g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/a729fb8c49ab/pcbi.1001048.g005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/be36/3024253/4f16f4e9bf96/pcbi.1001048.g006.jpg

Similar articles

1
Risk, unexpected uncertainty, and estimation uncertainty: Bayesian learning in unstable settings.
PLoS Comput Biol. 2011 Jan 20;7(1):e1001048. doi: 10.1371/journal.pcbi.1001048.
2
Uncertainty-driven regulation of learning and exploration in adolescents: A computational account.
PLoS Comput Biol. 2020 Sep 30;16(9):e1008276. doi: 10.1371/journal.pcbi.1008276. eCollection 2020 Sep.
3
Uncertainty in action-value estimation affects both action choice and learning rate of the choice behaviors of rats.
Eur J Neurosci. 2012 Apr;35(7):1180-9. doi: 10.1111/j.1460-9568.2012.08025.x.
4
Neural correlates of valence-dependent belief and value updating during uncertainty reduction: An fNIRS study.
Neuroimage. 2023 Oct 1;279:120327. doi: 10.1016/j.neuroimage.2023.120327. Epub 2023 Aug 13.
5
How uncertain are you? Disentangling expected and unexpected uncertainty in pupil-linked brain arousal during reversal learning.
Cogn Affect Behav Neurosci. 2023 Jun;23(3):578-599. doi: 10.3758/s13415-023-01072-w. Epub 2023 Feb 23.
6
Neural Mechanisms of Updating under Reducible and Irreducible Uncertainty.
J Neurosci. 2017 Jul 19;37(29):6972-6982. doi: 10.1523/JNEUROSCI.0535-17.2017. Epub 2017 Jun 16.
7
Do not Bet on the Unknown Versus Try to Find Out More: Estimation Uncertainty and "Unexpected Uncertainty" Both Modulate Exploration.
Front Neurosci. 2012 Oct 16;6:150. doi: 10.3389/fnins.2012.00150. eCollection 2012.
8
Bayesian Solutions for Handling Uncertainty in Survival Extrapolation.
Med Decis Making. 2017 May;37(4):367-376. doi: 10.1177/0272989X16650669. Epub 2016 Jun 8.
9
State anxiety biases estimates of uncertainty and impairs reward learning in volatile environments.
Neuroimage. 2021 Jan 1;224:117424. doi: 10.1016/j.neuroimage.2020.117424. Epub 2020 Oct 6.
10
Bayesian regression explains how human participants handle parameter uncertainty.
PLoS Comput Biol. 2020 May 18;16(5):e1007886. doi: 10.1371/journal.pcbi.1007886. eCollection 2020 May.

Cited by

1
Dynamic estimation of the attentional field from visual cortical activity.
Elife. 2025 Sep 11;14:RP104222. doi: 10.7554/eLife.104222.
2
Neural correlates of reduced sensitivity to information about uncertainty during valuation in older adults: An fNIRS study.
Imaging Neurosci (Camb). 2025 Jun 24;3. doi: 10.1162/IMAG.a.61. eCollection 2025.
3
Higher-order and distributed synergistic functional interactions encode information gain in goal-directed learning.
Nat Commun. 2025 Aug 5;16(1):7179. doi: 10.1038/s41467-025-62507-1.
4
Estimation-uncertainty affects decisions with and without learning opportunities.
Nat Commun. 2025 Jul 21;16(1):6706. doi: 10.1038/s41467-025-61960-2.
5
Neural mechanisms in resolving prior and likelihood uncertainty in scene recognition.
iScience. 2025 May 13;28(6):112663. doi: 10.1016/j.isci.2025.112663. eCollection 2025 Jun 20.
6
Uncertainty-modulated prediction errors in cortical microcircuits.
Elife. 2025 Jun 5;13:RP95127. doi: 10.7554/eLife.95127.
7
Understanding learning through uncertainty and bias.
Commun Psychol. 2025 Feb 13;3(1):24. doi: 10.1038/s44271-025-00203-y.
8
Reinforcement-Learning-Informed Queries Guide Behavioral Change.
Clin Psychol Sci. 2024 Nov;12(6):1146-1161. doi: 10.1177/21677026231213368. Epub 2024 Jan 24.
9
Dopamine transients encode reward prediction errors independent of learning rates.
Cell Rep. 2024 Oct 22;43(10):114840. doi: 10.1016/j.celrep.2024.114840. Epub 2024 Oct 11.
10
Dynamic estimation of the attentional field from visual cortical activity.
bioRxiv. 2024 Oct 8:2024.09.05.611383. doi: 10.1101/2024.09.05.611383.

References

1
Neural correlates of value, risk, and risk aversion contributing to decision making under risk.
J Neurosci. 2009 Oct 7;29(40):12574-83. doi: 10.1523/JNEUROSCI.2614-09.2009.
2
Neural correlates of risk prediction error during reinforcement learning in humans.
Neuroimage. 2009 Oct 1;47(4):1929-39. doi: 10.1016/j.neuroimage.2009.04.096. Epub 2009 May 13.
3
The hippocampus codes the uncertainty of cue-outcome associations: an intracranial electrophysiological study in humans.
J Neurosci. 2009 Apr 22;29(16):5287-94. doi: 10.1523/JNEUROSCI.5298-08.2009.
4
Modulators of decision making.
Nat Neurosci. 2008 Apr;11(4):410-6. doi: 10.1038/nn2077.
5
Choice, uncertainty and value in prefrontal and cingulate cortex.
Nat Neurosci. 2008 Apr;11(4):389-97. doi: 10.1038/nn2066. Epub 2008 Mar 26.
6
Human insula activation reflects risk prediction errors as well as risk.
J Neurosci. 2008 Mar 12;28(11):2745-52. doi: 10.1523/JNEUROSCI.4286-07.2008.
7
Learning the value of information in an uncertain world.
Nat Neurosci. 2007 Sep;10(9):1214-21. doi: 10.1038/nn1954. Epub 2007 Aug 5.
8
Should I stay or should I go? How the human brain manages the trade-off between exploitation and exploration.
Philos Trans R Soc Lond B Biol Sci. 2007 May 29;362(1481):933-42. doi: 10.1098/rstb.2007.2098.
9
Adding prediction risk to the theory of reward learning.
Ann N Y Acad Sci. 2007 May;1104:135-46. doi: 10.1196/annals.1390.005. Epub 2007 Mar 7.
10
Reward value coding distinct from risk attitude-related uncertainty coding in human reward systems.
J Neurophysiol. 2007 Feb;97(2):1621-32. doi: 10.1152/jn.00745.2006. Epub 2006 Nov 22.