人类强化学习过程中风险预测误差的神经关联

Neural correlates of risk prediction error during reinforcement learning in humans.

作者信息

d'Acremont Mathieu, Lu Zhong-Lin, Li Xiangrui, Van der Linden Martial, Bechara Antoine

机构信息

National Centre of Competence in Research (NCCR) in Affective Sciences, University of Geneva, Geneva, Switzerland.

出版信息

Neuroimage. 2009 Oct 1;47(4):1929-39. doi: 10.1016/j.neuroimage.2009.04.096. Epub 2009 May 13.

DOI:10.1016/j.neuroimage.2009.04.096

PMID:19442744

Abstract

Behavioral studies have shown for decades that humans are sensitive to risk when making decisions. More recently, brain activities have been shown to be correlated with risky choices. But an important gap needs to be filled: How does the human brain learn which decisions are risky? In cognitive neuroscience, reinforcement learning has never been used to estimate reward variance, a common measure of risk in economics and psychology. It is thus unknown which brain regions are involved in risk learning. To address this question, participants completed a decision-making task during fMRI. They chose repetitively from four decks of cards and each selection was followed by a stochastic payoff. Expected reward and risk differed among the decks. Participants' aim was to maximize payoffs. Risk and reward prediction errors were calculated after each payoff based on a novel reinforcement learning model. For reward prediction error, the strongest correlation was found with the BOLD response in the striatum. For risk prediction error, the strongest correlation was found with the BOLD responses in the insula and inferior frontal gyrus. We conclude that risk and reward prediction errors are processed by distinct neural circuits during reinforcement learning. Additional analyses revealed that the BOLD response in the inferior frontal gyrus was more pronounced for risk aversive participants, suggesting that this region also serves to inhibit risky choices.

摘要

几十年来，行为研究表明，人类在做决策时对风险很敏感。最近，大脑活动已被证明与风险选择相关。但一个重要的空白有待填补：人类大脑是如何学会识别哪些决策是有风险的？在认知神经科学中，强化学习从未被用于估计奖励方差，而奖励方差是经济学和心理学中衡量风险的常用指标。因此，尚不清楚哪些脑区参与了风险学习。为了解决这个问题，参与者在功能磁共振成像（fMRI）期间完成了一项决策任务。他们从四组牌中反复进行选择，每次选择后都会有一个随机的收益。不同组牌的预期奖励和风险各不相同。参与者的目标是使收益最大化。基于一种新颖的强化学习模型，在每次收益后计算风险和奖励预测误差。对于奖励预测误差，发现与纹状体中的血氧水平依赖（BOLD）反应相关性最强。对于风险预测误差，发现与脑岛和额下回中的BOLD反应相关性最强。我们得出结论，在强化学习过程中，风险和奖励预测误差由不同的神经回路处理。进一步的分析表明，对于风险厌恶型参与者，额下回中的BOLD反应更为明显，这表明该区域也有助于抑制风险选择。

相似文献

Neural correlates of risk prediction error during reinforcement learning in humans.

Neuroimage. 2009 Oct 1;47(4):1929-39. doi: 10.1016/j.neuroimage.2009.04.096. Epub 2009 May 13.

How we learn to make decisions: rapid propagation of reinforcement learning prediction errors in humans.

J Cogn Neurosci. 2014 Mar;26(3):635-44. doi: 10.1162/jocn_a_00509. Epub 2013 Oct 29.

Expected value and prediction error abnormalities in depression and schizophrenia.

Brain. 2011 Jun;134(Pt 6):1751-64. doi: 10.1093/brain/awr059. Epub 2011 Apr 10.

Overlapping prediction errors in dorsal striatum during instrumental learning with juice and money reward in the human brain.

J Neurophysiol. 2009 Dec;102(6):3384-91. doi: 10.1152/jn.91195.2008. Epub 2009 Sep 30.

Learning from other people's experience: a neuroimaging study of decisional interactive-learning.

Neuroimage. 2011 Mar 1;55(1):353-62. doi: 10.1016/j.neuroimage.2010.11.065. Epub 2010 Nov 29.

Neural correlates of traditional Chinese medicine induced advantageous risk-taking decision making.

Brain Cogn. 2009 Dec;71(3):354-61. doi: 10.1016/j.bandc.2009.06.006. Epub 2009 Aug 12.

Reinforcement learning signal predicts social conformity.

Neuron. 2009 Jan 15;61(1):140-51. doi: 10.1016/j.neuron.2008.11.027.

Temporal difference modeling of the blood-oxygen level dependent response during aversive conditioning in humans: effects of dopaminergic modulation.

Biol Psychiatry. 2007 Oct 1;62(7):765-72. doi: 10.1016/j.biopsych.2006.10.020. Epub 2007 Jan 16.

Individual differences and the neural representations of reward expectation and reward prediction error.

Soc Cogn Affect Neurosci. 2007 Mar;2(1):20-30. doi: 10.1093/scan/nsl021.

Imaging the changing role of feedback during learning in decision-making.

Neuroimage. 2007 Oct 1;37(4):1474-86. doi: 10.1016/j.neuroimage.2007.07.012. Epub 2007 Jul 24.

引用本文的文献

Impaired arbitration between reward-related decision-making strategies in Alcohol Users compared to Alcohol Non-Users: a computational modeling study.

NPP Digit Psychiatry Neurosci. 2025;3(1):1. doi: 10.1038/s44277-024-00023-8. Epub 2025 Jan 3.

Belief Updating in Subclinical and Clinical Delusions.

Schizophr Bull Open. 2022 Dec 14;4(1):sgac074. doi: 10.1093/schizbullopen/sgac074. eCollection 2023 Jan.

A Competition of Critics in Human Decision-Making.

Comput Psychiatr. 2021 Aug 12;5(1):81-101. doi: 10.5334/cpsy.64. eCollection 2021.

Temporally organized representations of reward and risk in the human brain.

Nat Commun. 2024 Mar 9;15(1):2162. doi: 10.1038/s41467-024-46094-1.

Predictions about reward outcomes in rhesus monkeys.

Behav Neurosci. 2024 Feb;138(1):43-58. doi: 10.1037/bne0000573. Epub 2023 Dec 7.

Gambling on an empty stomach: Hunger modulates preferences for learned but not described risks.

Brain Behav. 2023 May;13(5):e2978. doi: 10.1002/brb3.2978. Epub 2023 Apr 5.

Front Aging Neurosci. 2023 Mar 6;15:1078455. doi: 10.3389/fnagi.2023.1078455. eCollection 2023.

Development of a novel computational model for the Balloon Analogue Risk Task: The Exponential-Weight Mean-Variance Model.

J Math Psychol. 2021 Jun;102. doi: 10.1016/j.jmp.2021.102532. Epub 2021 Apr 21.

Bipolar oscillations between positive and negative mood states in a computational model of Basal Ganglia.

Cogn Neurodyn. 2020 Apr;14(2):181-202. doi: 10.1007/s11571-019-09564-7. Epub 2019 Nov 20.

Born for fairness: evidence of genetic contribution to a neural basis of fairness intuition.

Soc Cogn Affect Neurosci. 2019 May 31;14(5):539-548. doi: 10.1093/scan/nsz031.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

人类强化学习过程中风险预测误差的神经关联

Neural correlates of risk prediction error during reinforcement learning in humans.

作者信息

d'Acremont Mathieu, Lu Zhong-Lin, Li Xiangrui, Van der Linden Martial, Bechara Antoine

机构信息

National Centre of Competence in Research (NCCR) in Affective Sciences, University of Geneva, Geneva, Switzerland.

出版信息

Neuroimage. 2009 Oct 1;47(4):1929-39. doi: 10.1016/j.neuroimage.2009.04.096. Epub 2009 May 13.

DOI:10.1016/j.neuroimage.2009.04.096

PMID:19442744

Abstract

摘要

人类强化学习过程中风险预测误差的神经关联

Neural correlates of risk prediction error during reinforcement learning in humans.

作者信息

机构信息

出版信息

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

人类强化学习过程中风险预测误差的神经关联

Neural correlates of risk prediction error during reinforcement learning in humans.

作者信息

机构信息

出版信息

相似文献

引用本文的文献