• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

无价值观的习惯。

Habits without values.

机构信息

Princeton Neuroscience Institute, Princeton University.

Department of Cognitive, Linguistic, and Psychological Sciences, Brown Institute for Brain Science, Brown University.

出版信息

Psychol Rev. 2019 Mar;126(2):292-311. doi: 10.1037/rev0000120. Epub 2019 Jan 24.

DOI:10.1037/rev0000120
PMID:30676040
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6548181/
Abstract

Habits form a crucial component of behavior. In recent years, key computational models have conceptualized habits as arising from model-free reinforcement learning mechanisms, which typically select between available actions based on the future value expected to result from each. Traditionally, however, habits have been understood as behaviors that can be triggered directly by a stimulus, without requiring the animal to evaluate expected outcomes. Here, we develop a computational model instantiating this traditional view, in which habits develop through the direct strengthening of recently taken actions rather than through the encoding of outcomes. We demonstrate that this model accounts for key behavioral manifestations of habits, including insensitivity to outcome devaluation and contingency degradation, as well as the effects of reinforcement schedule on the rate of habit formation. The model also explains the prevalent observation of perseveration in repeated-choice tasks as an additional behavioral manifestation of the habit system. We suggest that mapping habitual behaviors onto value-free mechanisms provides a parsimonious account of existing behavioral and neural data. This mapping may provide a new foundation for building robust and comprehensive models of the interaction of habits with other, more goal-directed types of behaviors and help to better guide research into the neural mechanisms underlying control of instrumental behavior more generally. (PsycINFO Database Record (c) 2019 APA, all rights reserved).

摘要

习惯是行为的一个重要组成部分。近年来,关键的计算模型将习惯概念化为源于无模型强化学习机制,这些机制通常根据每个动作预期的未来价值在可用动作之间进行选择。然而,传统上,习惯被理解为可以直接由刺激触发的行为,而不需要动物评估预期的结果。在这里,我们开发了一个计算模型,实例化了这种传统观点,其中习惯是通过最近采取的行动的直接强化而不是通过结果的编码来发展的。我们证明,该模型解释了习惯的关键行为表现,包括对结果贬值和条件性降低的不敏感性,以及强化时间表对习惯形成速度的影响。该模型还解释了在重复选择任务中普遍存在的坚持现象,这是习惯系统的另一种行为表现。我们认为,将习惯性行为映射到无价值机制上,为现有的行为和神经数据提供了一种简洁的解释。这种映射可能为习惯与其他更有目标导向的行为类型的相互作用建立稳健而全面的模型提供新的基础,并有助于更好地指导更普遍的工具行为控制的神经机制的研究。(PsycINFO 数据库记录(c)2019 APA,保留所有权利)。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/42308bed113f/nihms-984385-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/c492abf1ed21/nihms-984385-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/c3e139943d19/nihms-984385-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/688ab8de8a30/nihms-984385-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/6dad62649be1/nihms-984385-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/19f52536be99/nihms-984385-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/adaa3ace2e44/nihms-984385-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/42308bed113f/nihms-984385-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/c492abf1ed21/nihms-984385-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/c3e139943d19/nihms-984385-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/688ab8de8a30/nihms-984385-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/6dad62649be1/nihms-984385-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/19f52536be99/nihms-984385-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/adaa3ace2e44/nihms-984385-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/67dd/6548181/42308bed113f/nihms-984385-f0007.jpg

相似文献

1
Habits without values.无价值观的习惯。
Psychol Rev. 2019 Mar;126(2):292-311. doi: 10.1037/rev0000120. Epub 2019 Jan 24.
2
Delayed rewards facilitate habit formation.延迟奖励有助于习惯养成。
J Exp Psychol Anim Learn Cogn. 2019 Oct;45(4):413-421. doi: 10.1037/xan0000221. Epub 2019 Aug 1.
3
Goal-directed decision making as probabilistic inference: a computational framework and potential neural correlates.目标导向决策作为概率推理:计算框架和潜在的神经关联。
Psychol Rev. 2012 Jan;119(1):120-54. doi: 10.1037/a0026435.
4
A theory of actions and habits: The interaction of rate correlation and contiguity systems in free-operant behavior.行动和习惯理论:自由操作行为中比率关联和接近系统的相互作用。
Psychol Rev. 2020 Nov;127(6):945-971. doi: 10.1037/rev0000201. Epub 2020 May 14.
5
Stimulus control of actions and habits: A role for reinforcer predictability and attention in the development of habitual behavior.动作与习惯的刺激控制:强化物可预测性和注意力在习惯行为发展中的作用。
J Exp Psychol Anim Learn Cogn. 2018 Oct;44(4):370-384. doi: 10.1037/xan0000188.
6
Goal-Directed Decision Making with Spiking Neurons.基于脉冲神经元的目标导向决策
J Neurosci. 2016 Feb 3;36(5):1529-46. doi: 10.1523/JNEUROSCI.2854-15.2016.
7
A model of prefrontal cortical mechanisms for goal-directed behavior.一种用于目标导向行为的前额叶皮质机制模型。
J Cogn Neurosci. 2005 Jul;17(7):1115-29. doi: 10.1162/0898929054475190.
8
Model-based learning protects against forming habits.基于模型的学习可防止形成习惯。
Cogn Affect Behav Neurosci. 2015 Sep;15(3):523-36. doi: 10.3758/s13415-015-0347-6.
9
Goal-Directed and Habit-Like Modulations of Stimulus Processing during Reinforcement Learning.强化学习过程中刺激处理的目标导向与习惯样调制
J Neurosci. 2017 Mar 15;37(11):3009-3017. doi: 10.1523/JNEUROSCI.3205-16.2017. Epub 2017 Feb 13.
10
Habits, action sequences and reinforcement learning.习惯、动作序列和强化学习。
Eur J Neurosci. 2012 Apr;35(7):1036-51. doi: 10.1111/j.1460-9568.2012.08050.x.

引用本文的文献

1
How working memory and reinforcement learning interact when avoiding punishment and pursuing reward concurrently.当同时避免惩罚和追求奖励时,工作记忆与强化学习是如何相互作用的。
J Exp Psychol Gen. 2025 Sep 1. doi: 10.1037/xge0001817.
2
Modelling cognitive flexibility with deep neural networks.使用深度神经网络对认知灵活性进行建模。
Curr Opin Behav Sci. 2024 Jun;57:101361. doi: 10.1016/j.cobeha.2024.101361.
3
Neural and behavioral signatures of policy compression in cognitive control.认知控制中策略压缩的神经和行为特征

本文引用的文献

1
Dorsal hippocampus contributes to model-based planning.背侧海马体有助于基于模型的规划。
Nat Neurosci. 2017 Sep;20(9):1269-1276. doi: 10.1038/nn.4613. Epub 2017 Jul 31.
2
Selective impairment of goal-directed decision-making following lesions to the human ventromedial prefrontal cortex.人类腹内侧前额叶皮质受损后目标导向决策的选择性损伤。
Brain. 2017 Jun 1;140(6):1743-1756. doi: 10.1093/brain/awx105.
3
Coherency Maximizing Exploration in the Supermarket.超市中的一致性最大化探索
bioRxiv. 2025 May 7:2025.05.06.652533. doi: 10.1101/2025.05.06.652533.
4
Model-based algorithms shape automatic evaluative processing.基于模型的算法塑造自动评价性加工。
Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2417068122. doi: 10.1073/pnas.2417068122. Epub 2025 Jun 20.
5
Cognitive computational model reveals repetition bias in a sequential decision-making task.认知计算模型揭示了序列决策任务中的重复偏差。
Commun Psychol. 2025 Jun 13;3(1):92. doi: 10.1038/s44271-025-00271-0.
6
Fentanyl reinforcement history has sex-specific effects on multi-step decision-making.芬太尼强化史对多步骤决策具有性别特异性影响。
bioRxiv. 2025 Jun 5:2024.10.10.617707. doi: 10.1101/2024.10.10.617707.
7
Delayed rewards weaken human goal directed actions.延迟奖励会削弱人类的目标导向行为。
NPJ Sci Learn. 2025 Jun 7;10(1):36. doi: 10.1038/s41539-025-00325-2.
8
The reward positivity is insensitive to reinforcer devaluation.奖励积极性对强化物贬值不敏感。
Cogn Affect Behav Neurosci. 2025 Jun 5. doi: 10.3758/s13415-025-01306-z.
9
Dissociable habits of response preparation versus response initiation.反应准备与反应启动的可分离习惯
Nat Hum Behav. 2025 Jun 2. doi: 10.1038/s41562-025-02215-4.
10
Computationally Informed Insights Into Anhedonia and Treatment by Kappa Opioid Receptor Antagonism.通过κ阿片受体拮抗作用对快感缺失及其治疗的计算性洞察
Biol Psychiatry Cogn Neurosci Neuroimaging. 2025 May 28. doi: 10.1016/j.bpsc.2025.05.011.
Nat Hum Behav. 2017 Jan 9;1. doi: 10.1038/s41562-016-0017.
4
Adaptive integration of habits into depth-limited planning defines a habitual-goal-directed spectrum.将习惯适应性地整合到深度受限的规划中定义了一个习惯-目标导向频谱。
Proc Natl Acad Sci U S A. 2016 Nov 8;113(45):12868-12873. doi: 10.1073/pnas.1609094113. Epub 2016 Oct 24.
5
Midbrain dopamine neurons compute inferred and cached value prediction errors in a common framework.中脑多巴胺神经元在一个通用框架中计算推断和缓存的价值预测误差。
Elife. 2016 Mar 7;5:e13665. doi: 10.7554/eLife.13665.
6
Characterizing a psychiatric symptom dimension related to deficits in goal-directed control.表征与目标导向控制缺陷相关的一种精神症状维度。
Elife. 2016 Mar 1;5:e11305. doi: 10.7554/eLife.11305.
7
Vicarious trial and error.替代性试错。
Nat Rev Neurosci. 2016 Mar;17(3):147-59. doi: 10.1038/nrn.2015.30.
8
Goal-Directed Decision Making with Spiking Neurons.基于脉冲神经元的目标导向决策
J Neurosci. 2016 Feb 3;36(5):1529-46. doi: 10.1523/JNEUROSCI.2854-15.2016.
9
Dopamine selectively remediates 'model-based' reward learning: a computational approach.多巴胺选择性修复“基于模型”的奖赏学习:一种计算方法。
Brain. 2016 Feb;139(Pt 2):355-64. doi: 10.1093/brain/awv347. Epub 2015 Dec 17.
10
Sources of noise during accumulation of evidence in unrestrained and voluntarily head-restrained rats.在无约束和自愿头部约束大鼠证据积累过程中的噪声来源。
Elife. 2015 Dec 17;4:e11308. doi: 10.7554/eLife.11308.