Battista Aldo, Padoa-Schioppa Camillo, Wang Xiao-Jing
Center for Neural Science, New York University, New York, NY, USA.
Department of Neuroscience, Washington University in St. Louis, St. Louis, MO, USA.
bioRxiv. 2025 Mar 13:2025.03.13.643098. doi: 10.1101/2025.03.13.643098.
Value-guided decisions are at the core of reinforcement learning and neuroeconomics, yet the basic computations they require remain poorly understood at the mechanistic level. For instance, how does the brain implement the multiplication of reward magnitude by probability to yield an expected value? Where within a neural circuit is the indifference point for comparing reward types encoded? How do learned values generalize to novel options? Here, we introduce a biologically plausible model that adheres to Dale's law and is trained on five choice tasks, offering potential answers to these questions. The model captures key neurophysiological observations from the orbitofrontal cortex of monkeys and generalizes to novel offer values. Using a single network model to solve diverse tasks, we identified compositional neural representations-quantified via task variance analysis and corroborated by curriculum learning. This work provides testable predictions that probe the neural basis of decision making and its disruption in neuropsychiatric disorders.
价值引导的决策是强化学习和神经经济学的核心,但在机械层面上,它们所需的基本计算仍知之甚少。例如,大脑如何通过概率对奖励幅度进行乘法运算以产生预期值?在神经回路的哪个位置编码了用于比较奖励类型的无差异点?习得的价值如何推广到新的选项?在这里,我们引入了一个符合戴尔定律的具有生物学合理性的模型,并在五个选择任务上进行训练,为这些问题提供了潜在答案。该模型捕捉了来自猴子眶额皮质的关键神经生理学观察结果,并推广到新的报价价值。通过使用单一网络模型解决各种任务,我们确定了通过任务方差分析量化并经课程学习证实的组合神经表征。这项工作提供了可测试的预测,以探究决策的神经基础及其在神经精神疾病中的破坏情况。