

An extended reinforcement learning model of basal ganglia to understand the contributions of serotonin and dopamine in risk-based decision making, reward prediction, and punishment learning.

Affiliations

Department of Biotechnology, Indian Institute of Technology Madras, Chennai, India.

Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India.

Publication Information

Front Comput Neurosci. 2014 Apr 16;8:47. doi: 10.3389/fncom.2014.00047. eCollection 2014.

Abstract

Although empirical and neural studies show that serotonin (5HT) plays many functional roles in the brain, prior computational models have mostly focused on its role in behavioral inhibition. In this study, we present a model of risk-based decision making in a modified Reinforcement Learning (RL) framework. The model depicts the roles of dopamine (DA) and serotonin (5HT) in the Basal Ganglia (BG). In this model, the DA signal is represented by the temporal difference error (δ), while the 5HT signal is represented by a parameter (α) that controls the risk prediction error. This formulation, which accommodates both 5HT and DA, reconciles some of the diverse roles of 5HT, particularly in connection with the BG system. We apply the model to different experimental paradigms used to study the role of 5HT: (1) risk-sensitive decision making, where 5HT controls risk assessment; (2) temporal reward prediction, where 5HT controls the time scale of reward prediction; and (3) reward/punishment sensitivity, in which the punishment prediction error depends on 5HT levels. Thus, the proposed integrated RL model reconciles several existing theories of 5HT and DA in the BG.
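The abstract's division of labor, with DA as the temporal difference error (δ) and 5HT as a parameter (α) weighting risk, can be illustrated with a small risk-sensitive bandit simulation. The sketch below is not the paper's exact equations; it assumes a common risk-sensitive RL formulation in which a value trace Q is updated by the TD error, a risk trace h tracks the squared TD error, and actions are chosen on the utility Q − α·√h. All function and parameter names are illustrative.

```python
import random

def run_risk_sensitive_bandit(alpha, n_trials=5000, lr=0.1, seed=0):
    """Two-armed bandit with a 'sure' arm and a 'risky' arm of equal mean payoff.

    Q tracks expected reward (its update, delta, plays the DA-like role);
    h tracks expected squared TD error (risk); alpha (the 5HT-like parameter)
    weights risk in the action utility. Illustrative sketch only.
    """
    rng = random.Random(seed)
    arms = {
        "sure":  lambda: 1.0,                                  # always pays 1
        "risky": lambda: 2.0 if rng.random() < 0.5 else 0.0,   # mean 1, high variance
    }
    Q = {a: 0.0 for a in arms}        # value estimate per arm
    h = {a: 0.0 for a in arms}        # risk estimate (mean squared TD error)
    choices = {a: 0 for a in arms}
    for _ in range(n_trials):
        # risk-adjusted utility: value minus alpha-weighted risk
        util = {a: Q[a] - alpha * h[a] ** 0.5 for a in arms}
        # epsilon-greedy choice on utility
        a = max(util, key=util.get) if rng.random() > 0.1 else rng.choice(list(arms))
        r = arms[a]()
        delta = r - Q[a]              # reward prediction error (DA-like signal)
        Q[a] += lr * delta
        xi = delta ** 2 - h[a]        # risk prediction error
        h[a] += lr * xi
        choices[a] += 1
    return choices

# Higher alpha means stronger risk aversion, so the risky arm is chosen less often.
risk_averse = run_risk_sensitive_bandit(alpha=1.0)
risk_neutral = run_risk_sensitive_bandit(alpha=0.0)
```

Varying `alpha` while holding the environment fixed mirrors the first paradigm in the abstract: the same reward statistics yield different risk preferences depending on the 5HT-like parameter.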


https://cdn.ncbi.nlm.nih.gov/pmc/blobs/1079/3997037/1f37a92def17/fncom-08-00047-g0001.jpg
