• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

饥饿改善了强化驱动但不是计划好的行为。

Hunger improves reinforcement-driven but not planned action.

机构信息

Nuffield Department of Clinical Neuroscience, University of Oxford, Oxford, UK.

出版信息

Cogn Affect Behav Neurosci. 2021 Dec;21(6):1196-1206. doi: 10.3758/s13415-021-00921-w. Epub 2021 Oct 15.

DOI:10.3758/s13415-021-00921-w
PMID:34652602
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8563670/
Abstract

Human decisions can be reflexive or planned, being governed respectively by model-free and model-based learning systems. These two systems might differ in their responsiveness to our needs. Hunger drives us to specifically seek food rewards, but here we ask whether it might have more general effects on these two decision systems. On one hand, the model-based system is often considered flexible and context-sensitive, and might therefore be modulated by metabolic needs. On the other hand, the model-free system's primitive reinforcement mechanisms may have closer ties to biological drives. Here, we tested participants on a well-established two-stage sequential decision-making task that dissociates the contribution of model-based and model-free control. Hunger enhanced overall performance by increasing model-free control, without affecting model-based control. These results demonstrate a generalized effect of hunger on decision-making that enhances reliance on primitive reinforcement learning, which in some situations translates into adaptive benefits.

摘要

人类的决策可以是反射性的,也可以是有计划的,分别由无模型和基于模型的学习系统来控制。这两个系统在响应我们的需求方面可能有所不同。饥饿驱使我们专门寻找食物奖励,但在这里我们要问的是,它是否可能对这两个决策系统产生更普遍的影响。一方面,基于模型的系统通常被认为是灵活和敏感的,因此可能会受到代谢需求的调节。另一方面,无模型系统的原始强化机制可能与生物驱动更紧密相关。在这里,我们在一个成熟的两阶段序列决策任务中测试了参与者,该任务可以分离基于模型和无模型控制的贡献。饥饿通过增加无模型控制来提高整体表现,而不影响基于模型的控制。这些结果表明,饥饿对决策有普遍的影响,增强了对原始强化学习的依赖,在某些情况下,这转化为适应性的好处。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/c39884a7a770/13415_2021_921_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/26897b214fcc/13415_2021_921_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/57eb157ca83e/13415_2021_921_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/3caf70a58b71/13415_2021_921_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/c39884a7a770/13415_2021_921_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/26897b214fcc/13415_2021_921_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/57eb157ca83e/13415_2021_921_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/3caf70a58b71/13415_2021_921_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/7a7b/8563670/c39884a7a770/13415_2021_921_Fig4_HTML.jpg

相似文献

1
Hunger improves reinforcement-driven but not planned action.饥饿改善了强化驱动但不是计划好的行为。
Cogn Affect Behav Neurosci. 2021 Dec;21(6):1196-1206. doi: 10.3758/s13415-021-00921-w. Epub 2021 Oct 15.
2
Reward-Mediated, Model-Free Reinforcement-Learning Mechanisms in Pavlovian and Instrumental Tasks Are Related.奖赏介导的、无模型的强化学习机制在条件反射和工具性任务中是相关的。
J Neurosci. 2023 Jan 18;43(3):458-471. doi: 10.1523/JNEUROSCI.1113-22.2022. Epub 2022 Oct 10.
3
Dorsal-Ventral Reinforcement Learning Network Connectivity and Incentive-Driven Changes in Exploration.背腹侧强化学习网络连接性与探索中动机驱动的变化
J Neurosci. 2025 Apr 9;45(15):e0422242025. doi: 10.1523/JNEUROSCI.0422-24.2025.
4
Multiple memory systems as substrates for multiple decision systems.多种记忆系统作为多种决策系统的基础。
Neurobiol Learn Mem. 2015 Jan;117:4-13. doi: 10.1016/j.nlm.2014.04.014. Epub 2014 May 15.
5
Reinforcement learning signals in the human striatum distinguish learners from nonlearners during reward-based decision making.在基于奖励的决策过程中,人类纹状体中的强化学习信号可区分学习者和非学习者。
J Neurosci. 2007 Nov 21;27(47):12860-7. doi: 10.1523/JNEUROSCI.2496-07.2007.
6
Higher motivation and pleasure scores predict more reliance on model-free decision making.更高的动机和愉悦得分预示着对无模型决策的更多依赖。
Cogn Affect Behav Neurosci. 2025 May 22. doi: 10.3758/s13415-025-01302-3.
7
Effort Reinforces Learning.努力强化学习。
J Neurosci. 2022 Oct 5;42(40):7648-7658. doi: 10.1523/JNEUROSCI.2223-21.2022. Epub 2022 Sep 12.
8
Novelty is not surprise: Human exploratory and adaptive behavior in sequential decision-making.新颖性不是惊喜:人类在序列决策中的探索和适应行为。
PLoS Comput Biol. 2021 Jun 3;17(6):e1009070. doi: 10.1371/journal.pcbi.1009070. eCollection 2021 Jun.
9
Cost-Benefit Arbitration Between Multiple Reinforcement-Learning Systems.多强化学习系统的成本效益仲裁。
Psychol Sci. 2017 Sep;28(9):1321-1333. doi: 10.1177/0956797617708288. Epub 2017 Jul 21.
10
A reinforcement learning diffusion decision model for value-based decisions.基于价值的决策的强化学习扩散决策模型。
Psychon Bull Rev. 2019 Aug;26(4):1099-1121. doi: 10.3758/s13423-018-1554-2.

引用本文的文献

1
The interoceptive origin of reinforcement learning.强化学习的内感受起源
Trends Cogn Sci. 2025 Sep;29(9):840-854. doi: 10.1016/j.tics.2025.05.008. Epub 2025 Jun 10.
2
Reward Bases: A simple mechanism for adaptive acquisition of multiple reward types.奖励基础:一种用于适应性获取多种奖励类型的简单机制。
PLoS Comput Biol. 2024 Nov 19;20(11):e1012580. doi: 10.1371/journal.pcbi.1012580. eCollection 2024 Nov.
3
Gambling on an empty stomach: Hunger modulates preferences for learned but not described risks.空腹赌博:饥饿调节对习得但未描述风险的偏好。

本文引用的文献

1
Humans primarily use model-based inference in the two-stage task.人类主要在两阶段任务中使用基于模型的推理。
Nat Hum Behav. 2020 Oct;4(10):1053-1066. doi: 10.1038/s41562-020-0905-y. Epub 2020 Jul 6.
2
Ten simple rules for the computational modeling of behavioral data.计算行为数据建模的 10 个简单规则。
Elife. 2019 Nov 26;8:e49547. doi: 10.7554/eLife.49547.
3
No substantial change in the balance between model-free and model-based control via training on the two-step task.通过两步任务训练,在无模型和基于模型控制之间的平衡没有实质性变化。
Brain Behav. 2023 May;13(5):e2978. doi: 10.1002/brb3.2978. Epub 2023 Apr 5.
4
The effect of body image dissatisfaction on goal-directed decision making in a population marked by negative appearance beliefs and disordered eating.身体意象不满对具有负面外貌信念和饮食失调人群的目标导向决策的影响。
PLoS One. 2022 Nov 28;17(11):e0276750. doi: 10.1371/journal.pone.0276750. eCollection 2022.
PLoS Comput Biol. 2019 Nov 14;15(11):e1007443. doi: 10.1371/journal.pcbi.1007443. eCollection 2019 Nov.
4
Hunger increases delay discounting of food and non-food rewards.饥饿会增加对食物和非食物奖励的延迟折扣。
Psychon Bull Rev. 2019 Oct;26(5):1729-1737. doi: 10.3758/s13423-019-01655-0.
5
Credit assignment to state-independent task representations and its relationship with model-based decision making.状态独立任务表示的信用分配及其与基于模型的决策的关系。
Proc Natl Acad Sci U S A. 2019 Aug 6;116(32):15871-15876. doi: 10.1073/pnas.1821647116. Epub 2019 Jul 18.
6
Objective Physiological Measurements but Not Subjective Reports Moderate the Effect of Hunger on Choice Behavior.目的 生理测量而非主观报告调节饥饿对选择行为的影响。
Front Psychol. 2018 May 23;9:750. doi: 10.3389/fpsyg.2018.00750. eCollection 2018.
7
Interactions between metabolic, reward and cognitive processes in appetite control: Implications for novel weight management therapies.代谢、奖励和认知过程在食欲控制中的相互作用:对新型体重管理疗法的启示。
J Psychopharmacol. 2017 Nov;31(11):1460-1474. doi: 10.1177/0269881117736917. Epub 2017 Oct 26.
8
Confirmation bias in human reinforcement learning: Evidence from counterfactual feedback processing.人类强化学习中的确认偏差:来自反事实反馈处理的证据。
PLoS Comput Biol. 2017 Aug 11;13(8):e1005684. doi: 10.1371/journal.pcbi.1005684. eCollection 2017 Aug.
9
Integration of homeostatic signaling and food reward processing in the human brain.人体大脑中稳态信号与食物奖励处理的整合。
JCI Insight. 2017 Aug 3;2(15). doi: 10.1172/jci.insight.92970.
10
Stress enhances model-free reinforcement learning only after negative outcome.压力仅在负面结果后增强无模型强化学习。
PLoS One. 2017 Jul 19;12(7):e0180588. doi: 10.1371/journal.pone.0180588. eCollection 2017.