贝叶斯因子惊喜在多变环境中的学习

Learning in Volatile Environments With the Bayes Factor Surprise.

机构信息

École Polytechnique Fédérale de Lausanne, School of Computer and Communication Sciences and School of Life Sciences, 1015 Lausanne, Switzerland

出版信息

Neural Comput. 2021 Feb;33(2):269-340. doi: 10.1162/neco_a_01352. Epub 2021 Jan 5.

DOI:10.1162/neco_a_01352

PMID:33400898

Abstract

Surprise-based learning allows agents to rapidly adapt to nonstationary stochastic environments characterized by sudden changes. We show that exact Bayesian inference in a hierarchical model gives rise to a surprise-modulated trade-off between forgetting old observations and integrating them with the new ones. The modulation depends on a probability ratio, which we call the Bayes Factor Surprise, that tests the prior belief against the current belief. We demonstrate that in several existing approximate algorithms, the Bayes Factor Surprise modulates the rate of adaptation to new observations. We derive three novel surprise-based algorithms, one in the family of particle filters, one in the family of variational learning, and one in the family of message passing, that have constant scaling in observation sequence length and particularly simple update dynamics for any distribution in the exponential family. Empirical results show that these surprise-based algorithms estimate parameters better than alternative approximate approaches and reach levels of performance comparable to computationally more expensive algorithms. The Bayes Factor Surprise is related to but different from the Shannon Surprise. In two hypothetical experiments, we make testable predictions for physiological indicators that dissociate the Bayes Factor Surprise from the Shannon Surprise. The theoretical insight of casting various approaches as surprise-based learning, as well as the proposed online algorithms, may be applied to the analysis of animal and human behavior and to reinforcement learning in nonstationary environments.

摘要

基于惊讶的学习允许代理快速适应具有突然变化的非平稳随机环境。我们表明，在分层模型中的精确贝叶斯推断会导致旧观测值的遗忘与新观测值的整合之间的惊讶调节权衡。这种调制取决于一个概率比，我们称之为贝叶斯因子惊讶，它测试先验信念与当前信念的一致性。我们证明，在几个现有的近似算法中，贝叶斯因子惊讶会调节对新观测值的适应速度。我们推导出三种新的基于惊讶的算法，一种是在粒子滤波器家族中，一种是在变分学习家族中，一种是在消息传递家族中，它们在观测序列长度上具有常数缩放，并且对于指数族中的任何分布，更新动态都特别简单。实证结果表明，这些基于惊讶的算法比其他近似方法更好地估计参数，并达到与计算成本更高的算法相当的性能水平。贝叶斯因子惊讶与香农惊讶有关但不同。在两个假设实验中，我们对能够将贝叶斯因子惊讶与香农惊讶区分开来的生理指标做出了可检验的预测。将各种方法视为基于惊讶的学习的理论见解，以及提出的在线算法，可应用于非平稳环境中动物和人类行为的分析以及强化学习。

相似文献

Learning in Volatile Environments With the Bayes Factor Surprise.贝叶斯因子惊喜在多变环境中的学习

Neural Comput. 2021 Feb;33(2):269-340. doi: 10.1162/neco_a_01352. Epub 2021 Jan 5.

Learning and forgetting using reinforced Bayesian change detection.基于强化贝叶斯变化检测的学习和遗忘。

PLoS Comput Biol. 2019 Apr 17;15(4):e1006713. doi: 10.1371/journal.pcbi.1006713. eCollection 2019 Apr.

Of bits and wows: A Bayesian theory of surprise with applications to attention.比特与惊叹：应用于注意力的贝叶斯惊奇理论。

Neural Netw. 2010 Jun;23(5):649-66. doi: 10.1016/j.neunet.2009.12.007. Epub 2009 Dec 28.

Brain signals of a Surprise-Actor-Critic model: Evidence for multiple learning modules in human decision making.惊奇行动者-评论家模型的脑信号：人类决策中多个学习模块的证据。

Neuroimage. 2022 Feb 1;246:118780. doi: 10.1016/j.neuroimage.2021.118780. Epub 2021 Dec 5.

Lifelong Incremental Reinforcement Learning With Online Bayesian Inference.终身增量强化学习与在线贝叶斯推断。

IEEE Trans Neural Netw Learn Syst. 2022 Aug;33(8):4003-4016. doi: 10.1109/TNNLS.2021.3055499. Epub 2022 Aug 3.

A new method of Bayesian causal inference in non-stationary environments.一种新的非平稳环境下贝叶斯因果推断方法。

PLoS One. 2020 May 22;15(5):e0233559. doi: 10.1371/journal.pone.0233559. eCollection 2020.

Latent-space variational bayes.潜在空间变分贝叶斯

IEEE Trans Pattern Anal Mach Intell. 2008 Dec;30(12):2236-42. doi: 10.1109/TPAMI.2008.157.

Balancing New against Old Information: The Role of Puzzlement Surprise in Learning.平衡新旧信息：困惑惊喜在学习中的作用。

Neural Comput. 2018 Jan;30(1):34-83. doi: 10.1162/neco_a_01025. Epub 2017 Oct 24.

Brain dynamics for confidence-weighted learning.脑动力学与置信权重学习。

PLoS Comput Biol. 2020 Jun 2;16(6):e1007935. doi: 10.1371/journal.pcbi.1007935. eCollection 2020 Jun.

The Rosenblatt Bayesian algorithm learning in a nonstationary environment.罗森布拉特贝叶斯算法在非平稳环境中的学习。

IEEE Trans Neural Netw. 2007 Mar;18(2):584-8. doi: 10.1109/TNN.2006.889943.

引用本文的文献

Higher-order and distributed synergistic functional interactions encode information gain in goal-directed learning.高阶和分布式协同功能相互作用在目标导向学习中编码信息增益。

Nat Commun. 2025 Aug 5;16(1):7179. doi: 10.1038/s41467-025-62507-1.

Uncertainty estimation with prediction-error circuits.使用预测误差电路进行不确定性估计。

Nat Commun. 2025 Mar 28;16(1):3036. doi: 10.1038/s41467-025-58311-6.

Fast adaptation to rule switching using neuronal surprise.利用神经元惊讶实现快速规则切换适应。

PLoS Comput Biol. 2024 Feb 20;20(2):e1011839. doi: 10.1371/journal.pcbi.1011839. eCollection 2024 Feb.

P3-like signatures of temporal predictions: a computational EEG study.具有时间预测特征的 P3 样信号：一项基于 EEG 的计算研究。

Exp Brain Res. 2023 Jul;241(7):1919-1930. doi: 10.1007/s00221-023-06656-z. Epub 2023 Jun 24.

Neural spiking for causal inference and learning.神经尖峰用于因果推理和学习。

PLoS Comput Biol. 2023 Apr 4;19(4):e1011005. doi: 10.1371/journal.pcbi.1011005. eCollection 2023 Apr.

Revealing human sensitivity to a latent temporal structure of changes.揭示人类对变化的潜在时间结构的敏感性。

Front Behav Neurosci. 2022 Oct 17;16:962494. doi: 10.3389/fnbeh.2022.962494. eCollection 2022.

Active inference and the two-step task.主动推断与两步任务。

Sci Rep. 2022 Oct 21;12(1):17682. doi: 10.1038/s41598-022-21766-4.

A Bayesian Surprise Approach in Designing Cognitive Radar for Autonomous Driving.一种用于自动驾驶认知雷达设计的贝叶斯惊喜方法。

Entropy (Basel). 2022 May 10;24(5):672. doi: 10.3390/e24050672.

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.新颖性不是惊喜：人类在序列决策中的探索和适应行为。

PLoS Comput Biol. 2021 Jun 3;17(6):e1009070. doi: 10.1371/journal.pcbi.1009070. eCollection 2021 Jun.

文献检索

告别复杂PubMed语法，用中文像聊天一样搜索，搜遍4000万医学文献。AI智能推荐，让科研检索更轻松。

立即免费搜索

文件翻译

保留排版，准确专业，支持PDF/Word/PPT等文件格式，支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述，25分钟生成高质量综述，智能提取关键信息，辅助科研写作。

立即免费体验

贝叶斯因子惊喜在多变环境中的学习

Learning in Volatile Environments With the Bayes Factor Surprise.

机构信息

出版信息

相似文献

引用本文的文献

文献检索

文件翻译

深度研究

Suppr 超能文献

相似文献

引用本文的文献