Caching mechanisms for habit formation in Active Inference.

Authors

Maisto D, Friston K, Pezzulo G

Affiliations

Institute for High Performance Computing and Networking, National Research Council, Via P. Castellino, 111, Naples 80131, Italy.

The Wellcome Trust Centre for Neuroimaging, Institute of Neurology, University College London, London, UK.

Publication information

Neurocomputing (Amst). 2019 Sep 24;359:298-314. doi: 10.1016/j.neucom.2019.05.083.

DOI:10.1016/j.neucom.2019.05.083
PMID:32055104
Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC7001981/
Abstract

A popular distinction in the human and animal learning literature is between deliberate (or willed) and habitual (or automatic) modes of control. Extensive evidence indicates that, after sufficient learning, living organisms develop behavioural habits that allow them to save computational resources. Furthermore, humans and other animals are able to transfer control from deliberate to habitual modes (and vice versa), efficiently trading off flexibility against parsimony - an ability that is currently unparalleled by artificial control systems. Here, we discuss a computational implementation of habit formation, and of the transfer of control from deliberate to habitual modes (and vice versa), within Active Inference: a computational framework that merges aspects of cybernetic theory and of Bayesian inference. To model habit formation, we endow an Active Inference agent with a mechanism to "cache" (or memorize) policy probabilities from previous trials, and to reuse them to skip - in part or in full - the inferential steps of deliberative processing. We exploit the fact that the relative quality of policies, conditioned upon hidden states, is constant over trials, provided that contingencies and prior preferences do not change. This means that the only quantity that can change policy selection is the prior distribution over the initial state, where this prior is based upon the posterior beliefs from previous trials. Thus, an agent that caches the quality (or the probability) of policies can safely reuse cached values to save on cognitive and computational resources, unless contingencies change. Our simulations illustrate the computational benefits, but also the limits, of three caching schemes under Active Inference. They suggest that key aspects of habitual behaviour, such as perseveration, can be explained in terms of caching policy probabilities. Furthermore, they suggest that there may be many kinds (or stages) of habitual behaviour, each associated with a different caching scheme; for example, caching with or without contextual estimation. These schemes are more or less impervious to contextual and contingency changes.
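
The caching idea described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation and does not reproduce any of their three schemes. It assumes that policy probabilities conditioned on the initial hidden state are a softmax over a per-state quality score, and that the overall policy distribution is their mixture under the prior over the initial state. The class `CachingAgent`, the callback `evaluate_policies`, and the `QUALITY` table are hypothetical names introduced only for illustration.

```python
import math


def softmax(values):
    """Numerically stable softmax over a list of floats."""
    peak = max(values)
    exps = [math.exp(v - peak) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


class CachingAgent:
    """Toy agent that caches policy probabilities per initial hidden state."""

    def __init__(self, evaluate_policies):
        # evaluate_policies(state) -> list of policy qualities (higher is better).
        # It stands in for the costly deliberative inference (an assumption).
        self.evaluate_policies = evaluate_policies
        self.cache = {}        # state -> cached conditional policy probabilities
        self.evaluations = 0   # number of costly deliberative evaluations

    def conditional_policy_probs(self, state):
        # Deliberate only on a cache miss; otherwise reuse the stored values.
        if state not in self.cache:
            self.evaluations += 1
            self.cache[state] = softmax(self.evaluate_policies(state))
        return self.cache[state]

    def policy_probs(self, prior):
        # Mix the conditional distributions with the prior over initial states.
        mixture = None
        for state, weight in prior.items():
            conditional = self.conditional_policy_probs(state)
            if mixture is None:
                mixture = [0.0] * len(conditional)
            for i, p in enumerate(conditional):
                mixture[i] += weight * p
        return mixture

    def invalidate(self):
        # Drop cached values, e.g. once a change in contingencies is detected.
        self.cache.clear()


# Hypothetical two-state, two-policy task.
QUALITY = {"left": [2.0, 0.0], "right": [0.0, 2.0]}
agent = CachingAgent(lambda state: QUALITY[state])

first = agent.policy_probs({"left": 0.9, "right": 0.1})   # fills the cache
second = agent.policy_probs({"left": 0.2, "right": 0.8})  # only the prior changed
evals_after_two_trials = agent.evaluations                # no new evaluations

# Contingencies change, but the cache is not refreshed: the agent perseverates.
QUALITY["left"] = [0.0, 2.0]
stale = agent.policy_probs({"left": 1.0})

# After invalidation the agent deliberates again and adapts.
agent.invalidate()
fresh = agent.policy_probs({"left": 1.0})
```

In this sketch a new prior over the initial state changes the selected policy without any further policy evaluation, while a change in contingencies goes unnoticed until the cache is cleared, which loosely mirrors the perseveration mentioned in the abstract.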


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/a1fef136a977/gr1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/cb3c125ed2c9/gr2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/357c2e3916ba/gr3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/8648a8316e7a/gr4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/8f4ba87c3b10/gr5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/664e/7001981/b3b965e9f07c/gr6.jpg
