• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

自适应控制策略的强化学习

Reinforcement learning of adaptive control strategies.

作者信息

Held Leslie K, Vermeylen Luc, Dignath David, Notebaert Wim, Krebs Ruth M, Braem Senne

机构信息

Department of Experimental Psychology, Ghent University, Henri Dunantlaan 2, 9000, Ghent, Belgium.

Department of Psychology, Tübingen University, Schleichstraße 4, 72076, Tübingen, Germany.

出版信息

Commun Psychol. 2024 Jan 12;2(1):8. doi: 10.1038/s44271-024-00055-y.

DOI:10.1038/s44271-024-00055-y
PMID:39242891
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11332247/
Abstract

Humans can up- or downregulate the degree to which they rely on task information for goal-directed behaviour, a process often referred to as cognitive control. Adjustments in cognitive control are traditionally studied in response to experienced or expected task-rule conflict. However, recent theories suggest that people can also learn to adapt control settings through reinforcement. Across three preregistered task switching experiments (n = 415), we selectively rewarded correct performance on trials with either more (incongruent) or less (congruent) task-rule conflict. Results confirmed the hypothesis that people rewarded more on incongruent trials showed smaller task-rule congruency effects, thus optimally adapting their control settings to the reward scheme. Using drift diffusion modelling, we further show that this reinforcement of cognitive control may occur through conflict-dependent within-trial adjustments of response thresholds after conflict detection. Together, our findings suggest that, while people remain more efficient at learning stimulus-response associations through reinforcement, they can similarly learn cognitive control strategies through reinforcement.

摘要

人类可以上调或下调其在目标导向行为中依赖任务信息的程度,这一过程通常被称为认知控制。传统上,认知控制的调整是针对经历过的或预期的任务规则冲突进行研究的。然而,最近的理论表明,人们也可以通过强化来学习调整控制设置。在三项预先注册的任务切换实验(n = 415)中,我们在任务规则冲突较多(不一致)或较少(一致)的试验中,选择性地奖励正确表现。结果证实了这一假设,即在不一致试验中获得更多奖励的人表现出较小的任务规则一致性效应,从而使他们的控制设置最佳地适应奖励方案。使用漂移扩散模型,我们进一步表明,这种认知控制的强化可能是通过在冲突检测后根据冲突进行的试验内反应阈值调整来实现的。总之,我们的研究结果表明,虽然人们通过强化在学习刺激-反应关联方面仍然更有效率,但他们同样可以通过强化来学习认知控制策略。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/07fe3d7ed1e3/44271_2024_55_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/76c7808eab10/44271_2024_55_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/59c63665ce82/44271_2024_55_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/019392916686/44271_2024_55_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/31113adaa2e4/44271_2024_55_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/9fdbdd040be7/44271_2024_55_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/b7065226ff30/44271_2024_55_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/9156c696d63a/44271_2024_55_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/07fe3d7ed1e3/44271_2024_55_Fig8_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/76c7808eab10/44271_2024_55_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/59c63665ce82/44271_2024_55_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/019392916686/44271_2024_55_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/31113adaa2e4/44271_2024_55_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/9fdbdd040be7/44271_2024_55_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/b7065226ff30/44271_2024_55_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/9156c696d63a/44271_2024_55_Fig7_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/d399/11332247/07fe3d7ed1e3/44271_2024_55_Fig8_HTML.jpg

相似文献

1
Reinforcement learning of adaptive control strategies.自适应控制策略的强化学习
Commun Psychol. 2024 Jan 12;2(1):8. doi: 10.1038/s44271-024-00055-y.
2
Reinforcement learning of irrelevant stimulus-response associations modulates cognitive control.无关刺激-反应关联的强化学习调节认知控制。
J Exp Psychol Learn Mem Cogn. 2021 Oct;47(10):1585-1598. doi: 10.1037/xlm0000850. Epub 2020 Apr 23.
3
Performance-contingent reward increases the use of congruent distracting information.绩效相关的奖励会增加对一致分心信息的使用。
Atten Percept Psychophys. 2023 Apr;85(3):905-929. doi: 10.3758/s13414-023-02682-9. Epub 2023 Mar 14.
4
The selective use of punishments on congruent versus incongruent trials in the Stroop task.在斯特鲁普任务中,对一致和不一致试验有选择地使用惩罚。
Neurobiol Learn Mem. 2022 Sep;193:107654. doi: 10.1016/j.nlm.2022.107654. Epub 2022 Jun 28.
5
Conditioning task switching behavior.条件性任务转换行为。
Cognition. 2017 Sep;166:272-276. doi: 10.1016/j.cognition.2017.05.037. Epub 2017 Jun 5.
6
It's more than just conflict: The functional role of congruency in the sequential control adaptation.这不仅仅是冲突:一致性在序列控制适应中的功能作用。
Acta Psychol (Amst). 2019 Jun;197:64-72. doi: 10.1016/j.actpsy.2019.04.016. Epub 2019 May 16.
7
Local and global control adjustments to stimulus-based task conflict in task switching.任务切换中基于刺激的任务冲突的局部和全局控制调整。
Q J Exp Psychol (Hove). 2025 May;78(5):963-977. doi: 10.1177/17470218231200442. Epub 2023 Sep 28.
8
Implicit learning of the one-back reinforcement matching-mismatching task by pigeons.鸽子对 1 -back 强化匹配-不匹配任务的内隐学习。
Curr Biol. 2023 Jun 19;33(12):2582-2585.e2. doi: 10.1016/j.cub.2023.05.045. Epub 2023 Jun 9.
9
Adjustments of selective attention to response conflict - controlling for perceptual conflict, target-distractor identity, and congruency level sequence pertaining to the congruency sequence effect.调整对反应冲突的选择性注意——控制知觉冲突、目标-干扰物身份以及与一致性序列效应有关的一致性水平序列。
Atten Percept Psychophys. 2021 Aug;83(6):2531-2550. doi: 10.3758/s13414-021-02294-1. Epub 2021 May 4.
10
Selective reinforcement of conflict processing in the Stroop task.斯特鲁普任务中冲突处理的选择性强化。
PLoS One. 2021 Jul 30;16(7):e0255430. doi: 10.1371/journal.pone.0255430. eCollection 2021.

引用本文的文献

1
Cognitive biases as Bayesian probability weighting in context.认知偏差作为情境中的贝叶斯概率加权。
Front Psychol. 2025 Aug 6;16:1572168. doi: 10.3389/fpsyg.2025.1572168. eCollection 2025.
2
The temporal dynamics of metacognitive experiences track rational adaptations in task performance.元认知体验的时间动态追踪任务表现中的理性适应。
Commun Psychol. 2025 Jul 3;3(1):96. doi: 10.1038/s44271-025-00282-x.
3
Experience and advice consequences shape information sharing strategies.经验和建议结果塑造信息共享策略。

本文引用的文献

1
Feeling and deciding: Subjective experiences rather than objective factors drive the decision to invest cognitive control.感觉和决策:主观体验而非客观因素驱动着投入认知控制的决策。
Cognition. 2023 Nov;240:105587. doi: 10.1016/j.cognition.2023.105587. Epub 2023 Aug 17.
2
Learning how to reason and deciding when to decide.学习推理和决定何时决策。
Behav Brain Sci. 2023 Jul 18;46:e115. doi: 10.1017/S0140525X22003090.
3
The influence of reward in the Simon task: Differences and similarities to the Stroop and Eriksen flanker tasks.
Commun Psychol. 2024 Dec 19;2(1):123. doi: 10.1038/s44271-024-00175-5.
4
Revolutionizing Molecular Design for Innovative Therapeutic Applications through Artificial Intelligence.通过人工智能为创新治疗应用彻底改变分子设计。
Molecules. 2024 Sep 29;29(19):4626. doi: 10.3390/molecules29194626.
5
Cognitive Control.认知控制
Annu Rev Psychol. 2025 Jan;76(1):167-195. doi: 10.1146/annurev-psych-022024-103901. Epub 2024 Dec 3.
奖励在 Simon 任务中的影响:与 Stroop 和 Eriksen 侧翼任务的差异和相似之处。
Atten Percept Psychophys. 2023 Apr;85(3):949-959. doi: 10.3758/s13414-022-02563-7. Epub 2022 Oct 31.
4
The selective use of punishments on congruent versus incongruent trials in the Stroop task.在斯特鲁普任务中,对一致和不一致试验有选择地使用惩罚。
Neurobiol Learn Mem. 2022 Sep;193:107654. doi: 10.1016/j.nlm.2022.107654. Epub 2022 Jun 28.
5
Is the juice worth the squeeze? Learning the marginal value of mental effort over time.付出的努力是否值得?随着时间的推移了解精神努力的边际价值。
J Exp Psychol Gen. 2022 Oct;151(10):2324-2341. doi: 10.1037/xge0001208. Epub 2022 Apr 7.
6
Reward Modulates Unconsciously Triggered Adaptive Control Processes.奖励调节无意识触发的适应性控制过程。
Iperception. 2022 Feb 14;13(1):20416695211073819. doi: 10.1177/20416695211073819. eCollection 2022 Jan-Feb.
7
Cognitive Control as a Multivariate Optimization Problem.认知控制作为一个多元优化问题。
J Cogn Neurosci. 2022 Mar 5;34(4):569-591. doi: 10.1162/jocn_a_01822.
8
Aversive motivation and cognitive control.厌恶动机与认知控制。
Neurosci Biobehav Rev. 2022 Feb;133:104493. doi: 10.1016/j.neubiorev.2021.12.016. Epub 2021 Dec 12.
9
Filling the gaps: Cognitive control as a critical lens for understanding mechanisms of value-based decision-making.填补空白:认知控制作为理解基于价值的决策机制的关键视角。
Neurosci Biobehav Rev. 2022 Mar;134:104483. doi: 10.1016/j.neubiorev.2021.12.006. Epub 2021 Dec 10.
10
Bayesian model-averaged meta-analysis in medicine.贝叶斯模型平均荟萃分析在医学中的应用。
Stat Med. 2021 Dec 30;40(30):6743-6761. doi: 10.1002/sim.9170. Epub 2021 Oct 27.