Dorsal-Ventral Reinforcement Learning Network Connectivity and Incentive-Driven Changes in Exploration.

Authors

Campbell Ethan M, Zhong Wanting, Hogeveen Jeremy, Grafman Jordan

Affiliations

Department of Psychology, University of New Mexico, Albuquerque, New Mexico 87131.

Clinical Neuroscience Center, University of New Mexico, Albuquerque, New Mexico 87131.

Publication information

J Neurosci. 2025 Apr 9;45(15):e0422242025. doi: 10.1523/JNEUROSCI.0422-24.2025.

DOI:10.1523/JNEUROSCI.0422-24.2025

PMID:40015985

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC11984077/

Abstract

Probabilistic reinforcement learning (RL) tasks assay how individuals make decisions under uncertainty. The use of internal models (model-based) or direct learning from experiences (model-free), and the degree of choice stochasticity across alternatives (i.e., exploration), can all be influenced by the state space of the decision-making task. There is considerable individual variation in the balance between model-based and model-free control during decision-making, and this balance is affected by incentive motivation. The effect of variable reward incentives on the arbitration between model-based and model-free learning remains understudied, and individual differences in neural signatures and cognitive traits that moderate the effect of reward on model-free/model-based control are unknown. Here we combined a two-stage decision-making task utilizing differing reward incentives with computational modeling, neuropsychological tests, and neuroimaging to address these questions. Results showed the prospect of greater reward decreased exploration of alternative options and increased the balance toward model-based learning. These behavioral effects were replicated across two independent datasets including both sexes. Individual differences in processing speed and analytical thinking style affected how reward altered the dependence on both systems. Using a systems neuroscience-inspired approach to resting-state functional connectivity, we found reduced exploration of the options during the first stage of our task under high relative to low incentives was predicted by increased cross-network coupling between ventral and dorsal RL circuitry. These findings suggest that integrity of functional connections between stimulus valuation (ventral) and action valuation (dorsal) RL networks is associated with changes in the balance between explore-exploit decisions under changing reward incentives.
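The abstract refers to two quantities without defining them: the balance between model-based and model-free control, and the degree of choice stochasticity (exploration). The sketch below illustrates one common way such quantities are parameterized in two-stage task models, as a mixing weight `w` and a softmax inverse temperature `beta`. It is an assumption for illustration only, not the authors' fitted model; the function names, transition probabilities, and value estimates are all hypothetical.

```python
import math

def softmax(values, beta):
    """Choice probabilities; beta is the inverse temperature (lower beta = more exploration)."""
    m = max(values)  # subtract the max for numerical stability
    exps = [math.exp(beta * (v - m)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]

def hybrid_first_stage_values(q_mf, q_stage2, transitions, w):
    """Mix model-based and model-free first-stage values with weight w in [0, 1]."""
    # Model-based value: expected best second-stage value under the transition model.
    q_mb = [
        sum(p * max(q_stage2[s]) for s, p in enumerate(row))
        for row in transitions
    ]
    # w = 1 is purely model-based, w = 0 is purely model-free.
    return [w * mb + (1 - w) * mf for mb, mf in zip(q_mb, q_mf)]

# Hypothetical numbers, chosen only to make the example concrete.
transitions = [[0.7, 0.3], [0.3, 0.7]]  # P(second-stage state | first-stage action)
q_stage2 = [[0.8, 0.2], [0.4, 0.1]]     # learned values of second-stage options
q_mf = [0.3, 0.6]                       # cached model-free first-stage values

q_net = hybrid_first_stage_values(q_mf, q_stage2, transitions, w=0.8)
p_low_beta = softmax(q_net, beta=1.0)   # noisier, more exploratory choices
p_high_beta = softmax(q_net, beta=5.0)  # more deterministic, less exploratory choices
```

Under this parameterization, the reported behavioral effect of higher incentives would correspond to a larger `w` (more model-based weighting) and a larger `beta` (less exploration), though the exact model used in the paper may differ.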
