Herd Seth, Krueger Kai, Nair Ananta, Mollick Jessica, O'Reilly Randall
eCortex, Inc., Boulder, CO, USA.
University of Colorado, Boulder, CO, USA.
Cogn Affect Behav Neurosci. 2021 Feb;21(1):35-57. doi: 10.3758/s13415-020-00842-0. Epub 2021 Jan 6.
We present a theory and neural network model of the neural mechanisms underlying human decision-making. We propose a detailed model of the interaction between brain regions, under a proposer-predictor-actor-critic framework. This theory is based on detailed animal data and theories of action-selection. Those theories are adapted to serial operation to bridge levels of analysis and explain human decision-making. Task-relevant areas of cortex propose a candidate plan using fast, model-free, parallel neural computations. Other areas of cortex and medial temporal lobe can then predict likely outcomes of that plan in this situation. This optional prediction- (or model-) based computation can produce better accuracy and generalization, at the expense of speed. Next, linked regions of basal ganglia act to accept or reject the proposed plan based on its reward history in similar contexts. If that plan is rejected, the process repeats to consider a new option. The reward-prediction system acts as a critic to determine the value of the outcome relative to expectations and produce dopamine as a training signal for cortex and basal ganglia. By operating sequentially and hierarchically, the same mechanisms previously proposed for animal action-selection could explain the most complex human plans and decisions. We discuss explanations of model-based decisions, habitization, and risky behavior based on the computational model.
我们提出了一种关于人类决策背后神经机制的理论和神经网络模型。我们在提议者-预测者-行动者-批评者框架下,提出了一个大脑区域间相互作用的详细模型。该理论基于详细的动物数据和行动选择理论。这些理论经过调整以适应串行操作,从而跨越分析层次并解释人类决策。与任务相关的皮质区域使用快速、无模型的并行神经计算提出候选计划。然后,皮质的其他区域和内侧颞叶可以预测该计划在这种情况下可能产生的结果。这种基于预测(或模型)的可选计算能够以速度为代价产生更高的准确性和泛化能力。接下来,基底神经节的相关区域根据该计划在类似情境中的奖励历史来决定接受或拒绝该提议计划。如果该计划被拒绝,过程将重复以考虑新的选项。奖励预测系统作为批评者,确定结果相对于预期的价值,并产生多巴胺作为皮质和基底神经节的训练信号。通过顺序和分层操作,先前为动物行动选择提出的相同机制可以解释最复杂的人类计划和决策。我们基于该计算模型讨论了对基于模型的决策、习惯化和风险行为的解释。