Hunger improves reinforcement-driven but not planned action.

Affiliation

Nuffield Department of Clinical Neuroscience, University of Oxford, Oxford, UK.

Publication information

Cogn Affect Behav Neurosci. 2021 Dec;21(6):1196-1206. doi: 10.3758/s13415-021-00921-w. Epub 2021 Oct 15.

DOI:10.3758/s13415-021-00921-w

PMID:34652602

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC8563670/

Abstract

Human decisions can be reflexive or planned, being governed respectively by model-free and model-based learning systems. These two systems might differ in their responsiveness to our needs. Hunger drives us to specifically seek food rewards, but here we ask whether it might have more general effects on these two decision systems. On one hand, the model-based system is often considered flexible and context-sensitive, and might therefore be modulated by metabolic needs. On the other hand, the model-free system's primitive reinforcement mechanisms may have closer ties to biological drives. Here, we tested participants on a well-established two-stage sequential decision-making task that dissociates the contribution of model-based and model-free control. Hunger enhanced overall performance by increasing model-free control, without affecting model-based control. These results demonstrate a generalized effect of hunger on decision-making that enhances reliance on primitive reinforcement learning, which in some situations translates into adaptive benefits.
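
To make the distinction between the two systems concrete, here is a minimal sketch of how a two-stage task can pull model-free and model-based control apart. It is not the authors' model: the 0.7 common-transition probability, the learning rate, the weighting parameter `w`, and all function names are assumptions chosen for illustration, following a hybrid formulation that is common in the two-step task literature.

```python
import math

# Assumed transition structure (hypothetical; the abstract does not state it):
# first-stage action 0 usually leads to second-stage state 0, action 1 to state 1.
COMMON = 0.7
TRANSITIONS = [[COMMON, 1 - COMMON], [1 - COMMON, COMMON]]


def model_based_values(q_stage2):
    """Plan ahead: expected best second-stage value under the transition model."""
    best = [max(q) for q in q_stage2]
    return [sum(p * b for p, b in zip(row, best)) for row in TRANSITIONS]


def model_free_update(q_mf, action, reward, alpha):
    """Reinforce the chosen first-stage action directly from the outcome."""
    updated = list(q_mf)
    updated[action] += alpha * (reward - updated[action])
    return updated


def hybrid_values(q_mf, q_mb, w):
    """Mix the two systems; w = 1 is purely model-based, w = 0 purely model-free."""
    return [w * mb + (1 - w) * mf for mf, mb in zip(q_mf, q_mb)]


def softmax(values, beta):
    """Turn values into choice probabilities with inverse temperature beta."""
    top = max(values)
    exps = [math.exp(beta * (v - top)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


# One illustrative trial: action 0 is chosen, a rare transition leads to
# second-stage state 1, and the choice made there is rewarded.
alpha = 0.5  # assumed learning rate
q_stage2 = [[0.0, 0.0], [0.0, 0.0]]
q_stage2[1][0] += alpha * (1.0 - q_stage2[1][0])

q_mf = model_free_update([0.0, 0.0], action=0, reward=1.0, alpha=alpha)
q_mb = model_based_values(q_stage2)

for w in (0.0, 0.5, 1.0):
    p_repeat = softmax(hybrid_values(q_mf, q_mb, w), beta=5.0)[0]
    print(f"w={w:.1f}  P(repeat action 0)={p_repeat:.3f}")
```

After a rewarded rare transition, the model-free learner favours repeating the first-stage action, while the model-based learner favours switching to the action that most often reaches the rewarded state. Under this sketch, the reported hunger effect would correspond to a stronger model-free contribution with the model-based contribution unchanged.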

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/26897b214fcc/13415_2021_921_Fig1_HTML.jpg

Similar articles

Reward-Mediated, Model-Free Reinforcement-Learning Mechanisms in Pavlovian and Instrumental Tasks Are Related.

J Neurosci. 2023 Jan 18;43(3):458-471. doi: 10.1523/JNEUROSCI.1113-22.2022. Epub 2022 Oct 10.

Dorsal-Ventral Reinforcement Learning Network Connectivity and Incentive-Driven Changes in Exploration.

J Neurosci. 2025 Apr 9;45(15):e0422242025. doi: 10.1523/JNEUROSCI.0422-24.2025.

Multiple memory systems as substrates for multiple decision systems.

Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.

Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making.

J Neurosci. 2007 Nov 21;27(47):12860-7. doi: 10.1523/JNEUROSCI.2496-07.2007.

Higher motivation and pleasure scores predict more reliance on model-free decision making.

Cogn Affect Behav Neurosci. 2025 May 22. doi: 10.3758/s13415-025-01302-3.

Effort Reinforces Learning.

J Neurosci. 2022 Oct 5;42(40):7648-7658. doi: 10.1523/JNEUROSCI.2223-21.2022. Epub 2022 Sep 12.

Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.

PLoS Comput Biol. 2021 Jun 3;17(6):e1009070. doi: 10.1371/journal.pcbi.1009070. eCollection 2021 Jun.

Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems.

Psychol Sci. 2017 Sep;28(9):1321-1333. doi: 10.1177/0956797617708288. Epub 2017 Jul 21.

A reinforcement learning diffusion decision model for value-based decisions.

Psychon Bull Rev. 2019 Aug;26(4):1099-1121. doi: 10.3758/s13423-018-1554-2.

Cited by

The interoceptive origin of reinforcement learning.

Trends Cogn Sci. 2025 Sep;29(9):840-854. doi: 10.1016/j.tics.2025.05.008. Epub 2025 Jun 10.

Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types.

PLoS Comput Biol. 2024 Nov 19;20(11):e1012580. doi: 10.1371/journal.pcbi.1012580. eCollection 2024 Nov.

Gambling on an empty stomach: Hunger modulates preferences for learned but not described risks.

Brain Behav. 2023 May;13(5):e2978. doi: 10.1002/brb3.2978. Epub 2023 Apr 5.

The effect of body image dissatisfaction on goal-directed decision making in a population marked by negative appearance beliefs and disordered eating.

PLoS One. 2022 Nov 28;17(11):e0276750. doi: 10.1371/journal.pone.0276750. eCollection 2022.
