Dorsal striatal-midbrain connectivity in humans predicts how reinforcements are used to guide decisions.

Authors

Kahnt Thorsten, Park Soyoung Q, Cohen Michael X, Beck Anne, Heinz Andreas, Wrase Jana

Affiliation

Department of Psychiatry and Psychotherapy, Charité-Universitätsmedizin Berlin (Charité Campus Mitte), Berlin, Germany.

Publication information

J Cogn Neurosci. 2009 Jul;21(7):1332-45. doi: 10.1162/jocn.2009.21092.

DOI:10.1162/jocn.2009.21092

PMID:18752410

Abstract

It has been suggested that the target areas of dopaminergic midbrain neurons, the dorsal (DS) and ventral striatum (VS), are differently involved in reinforcement learning especially as actor and critic. Whereas the critic learns to predict rewards, the actor maintains action values to guide future decisions. The different midbrain connections to the DS and the VS seem to play a critical role in this functional distinction. Here, subjects performed a dynamic, reward-based decision-making task during fMRI acquisition. A computational model of reinforcement learning was used to estimate the different effects of positive and negative reinforcements on future decisions for each subject individually. We found that activity in both the DS and the VS correlated with reward prediction errors. Using functional connectivity, we show that the DS and the VS are differentially connected to different midbrain regions (possibly corresponding to the substantia nigra [SN] and the ventral tegmental area [VTA], respectively). However, only functional connectivity between the DS and the putative SN predicted the impact of different reinforcement types on future behavior. These results suggest that connections between the putative SN and the DS are critical for modulating action values in the DS according to both positive and negative reinforcements to guide future decision making.
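The abstract does not give the model equations, so the following is only a minimal sketch of one common way to let positive and negative reinforcements affect action values differently: a value-learning rule with separate learning rates for positive and negative prediction errors, combined with a softmax choice rule. The names `alpha_pos`, `alpha_neg`, and `beta`, the +1/-1 reward coding, and the two-option task are illustrative assumptions, not the authors' specification.

```python
import math
import random


def softmax_choice_prob(q_values, beta):
    """Choice probabilities under a softmax rule (beta = inverse temperature, assumed)."""
    m = max(q_values)  # subtract the maximum for numerical stability
    exps = [math.exp(beta * (q - m)) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def update_action_value(q, reward, alpha_pos, alpha_neg):
    """Update one action value; return (new_q, prediction_error).

    The learning rate depends on the sign of the prediction error, so
    positive and negative reinforcements can be weighted differently.
    """
    delta = reward - q  # reward prediction error
    alpha = alpha_pos if delta >= 0 else alpha_neg
    return q + alpha * delta, delta


def simulate(n_trials, reward_probs, alpha_pos, alpha_neg, beta, seed=0):
    """Simulate a hypothetical agent on a probabilistic choice task."""
    rng = random.Random(seed)
    q = [0.0] * len(reward_probs)
    history = []
    for _ in range(n_trials):
        probs = softmax_choice_prob(q, beta)
        choice = rng.choices(range(len(q)), weights=probs)[0]
        # Assumed outcome coding: +1 for a win, -1 for a loss.
        reward = 1.0 if rng.random() < reward_probs[choice] else -1.0
        q[choice], delta = update_action_value(
            q[choice], reward, alpha_pos, alpha_neg
        )
        history.append((choice, reward, delta))
    return q, history


if __name__ == "__main__":
    final_q, trials = simulate(
        n_trials=200, reward_probs=[0.8, 0.2],
        alpha_pos=0.3, alpha_neg=0.1, beta=3.0,
    )
    print("final action values:", final_q)
```

In a sketch like this, fitting `alpha_pos` and `alpha_neg` to each subject's choices would yield per-subject estimates of how strongly each reinforcement type shapes later decisions, which is the kind of individual measure the abstract relates to functional connectivity.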

Similar articles

Dorsal striatal-midbrain connectivity in humans predicts how reinforcements are used to guide decisions.

J Cogn Neurosci. 2009 Jul;21(7):1332-45. doi: 10.1162/jocn.2009.21092.

Heterarchical reinforcement-learning model for integration of multiple cortico-striatal loops: fMRI examination in stimulus-action-reward association learning.

Neural Netw. 2006 Oct;19(8):1242-54. doi: 10.1016/j.neunet.2006.06.007. Epub 2006 Sep 20.

Frontal-striatal circuitry activated by human peak-interval timing in the supra-seconds range.

Brain Res Cogn Brain Res. 2004 Oct;21(2):171-82. doi: 10.1016/j.cogbrainres.2004.08.005.

Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.

J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.

"Virus and epidemic": causal knowledge activates prediction error circuitry.

J Cogn Neurosci. 2010 Oct;22(10):2151-63. doi: 10.1162/jocn.2009.21387.

Expected value and prediction error abnormalities in depression and schizophrenia.

Brain. 2011 Jun;134(Pt 6):1751-64. doi: 10.1093/brain/awr059. Epub 2011 Apr 10.

Dissociating early and late error signals in perceptual recognition.

J Cogn Neurosci. 2008 Dec;20(12):2211-25. doi: 10.1162/jocn.2008.20155.

Novelty increases the mesolimbic functional connectivity of the substantia nigra/ventral tegmental area (SN/VTA) during reward anticipation: Evidence from high-resolution fMRI.

Neuroimage. 2011 Sep 15;58(2):647-55. doi: 10.1016/j.neuroimage.2011.06.038. Epub 2011 Jun 24.

Attentional control of task and response in lateral and medial frontal cortex: brain activity and reaction time distributions.

Neuropsychologia. 2009 Aug;47(10):2089-99. doi: 10.1016/j.neuropsychologia.2009.03.019. Epub 2009 Apr 5.

Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: effects of dopaminergic modulation.

Biol Psychiatry. 2007 Oct 1;62(7):765-72. doi: 10.1016/j.biopsych.2006.10.020. Epub 2007 Jan 16.

Cited by

Aha! and D'oh! experiences enhance learning for incidental information-new evidence supports the insight memory advantage.

Cogn Affect Behav Neurosci. 2024 Jun;24(3):505-516. doi: 10.3758/s13415-024-01184-x. Epub 2024 Mar 27.

Observational reinforcement learning in children and young adults.

NPJ Sci Learn. 2024 Mar 13;9(1):18. doi: 10.1038/s41539-024-00227-9.

Dynamics Learning Rate Bias in Pigeons: Insights from Reinforcement Learning and Neural Correlates.

Animals (Basel). 2024 Feb 1;14(3):489. doi: 10.3390/ani14030489.

Test-retest reliability of reinforcement learning parameters.

Behav Res Methods. 2024 Aug;56(5):4582-4599. doi: 10.3758/s13428-023-02203-4. Epub 2023 Sep 8.

Translational models of addiction phenotypes to advance addiction pharmacotherapy.

Ann N Y Acad Sci. 2023 Jan;1519(1):118-128. doi: 10.1111/nyas.14929. Epub 2022 Nov 17.

Computational reinforcement learning, reward (and punishment), and dopamine in psychiatric disorders.

Front Psychiatry. 2022 Oct 20;13:886297. doi: 10.3389/fpsyt.2022.886297. eCollection 2022.

Learning under social versus nonsocial uncertainty: A meta-analytic approach.

Hum Brain Mapp. 2022 Sep;43(13):4185-4206. doi: 10.1002/hbm.25948. Epub 2022 May 27.

Learning at Variable Attentional Load Requires Cooperation of Working Memory, Meta-learning, and Attention-augmented Reinforcement Learning.

J Cogn Neurosci. 2021 Dec 6;34(1):79-107. doi: 10.1162/jocn_a_01780.

Spontaneous eye blink rate predicts individual differences in exploration and exploitation during reinforcement learning.

Sci Rep. 2019 Nov 22;9(1):17436. doi: 10.1038/s41598-019-53805-y.

Learning in Visual Regions as Support for the Bias in Future Value-Driven Choice.

Cereb Cortex. 2020 Apr 14;30(4):2005-2018. doi: 10.1093/cercor/bhz218.
