Mechanisms of reinforcement learning and decision making in the primate dorsolateral prefrontal cortex.

Authors

Lee Daeyeol, Seo Hyojung

Affiliation

Department of Neurobiology, Yale University School of Medicine, New Haven, CT 06510, USA.

Publication information

Ann N Y Acad Sci. 2007 May;1104:108-22. doi: 10.1196/annals.1390.007. Epub 2007 Mar 8.

Abstract

To a first approximation, decision making is a process of optimization in which the decision maker tries to maximize the desirability of the outcomes resulting from chosen actions. Estimates of desirability are referred to as utilities or value functions, and they must be continually revised through experience according to the discrepancies between the predicted and obtained rewards. Reinforcement learning theory prescribes various algorithms for updating value functions and can parsimoniously account for the results of numerous behavioral, neurophysiological, and imaging studies in humans and other primates. In this article, we first discuss relative merits of various decision-making tasks used in neurophysiological studies of decision making in nonhuman primates. We then focus on how reinforcement learning theory can shed new light on the function of the primate dorsolateral prefrontal cortex. Similar to the findings from other brain areas, such as cingulate cortex and basal ganglia, activity in the dorsolateral prefrontal cortex often signals the value of expected reward and actual outcome. Thus, the dorsolateral prefrontal cortex is likely to be a part of the broader network involved in adaptive decision making. In addition, reward-related activity in the dorsolateral prefrontal cortex is influenced by the animal's choices and other contextual information, and therefore may provide a neural substrate by which the animals can flexibly modify their decision-making strategies according to the demands of specific tasks.
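As an illustration of the value-updating idea described in the abstract, the following minimal sketch revises a value estimate in proportion to the discrepancy between predicted and obtained reward (a simple delta rule) and converts values into choice probabilities with a softmax rule. This is not the authors' model or analysis: the function names, the learning rate, the inverse temperature, and the two-option task with fixed reward probabilities are all assumptions made for the example.

```python
import math
import random


def update_value(value, reward, learning_rate=0.1):
    """Delta-rule update: shift the estimate toward the obtained reward.

    The learning rate of 0.1 is an arbitrary illustrative choice.
    """
    prediction_error = reward - value  # obtained minus predicted reward
    return value + learning_rate * prediction_error


def softmax_choice_probs(values, inverse_temperature=3.0):
    """Turn value estimates into choice probabilities (softmax rule)."""
    highest = max(values)  # subtract the max for numerical stability
    exps = [math.exp(inverse_temperature * (v - highest)) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def simulate(reward_probs, n_trials=1000, seed=0):
    """Simulate a hypothetical task with one reward probability per option."""
    rng = random.Random(seed)
    values = [0.0] * len(reward_probs)
    for _ in range(n_trials):
        probs = softmax_choice_probs(values)
        action = rng.choices(range(len(values)), weights=probs)[0]
        reward = 1.0 if rng.random() < reward_probs[action] else 0.0
        # Only the chosen option's value is revised on each trial.
        values[action] = update_value(values[action], reward)
    return values


if __name__ == "__main__":
    # Hypothetical two-option task: option 0 pays off more often.
    print(simulate([0.8, 0.2]))
```

In this sketch only the chosen option's estimate changes on each trial, so the estimates track the recent reward history of each option rather than a fixed average.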
