Langdon Angela J, Song Mingyu, Niv Yael
Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, 08544, United States.
Princeton Neuroscience Institute and Department of Psychology, Princeton University, Princeton, NJ, 08544, United States.
Behav Processes. 2019 Oct;167:103891. doi: 10.1016/j.beproc.2019.103891. Epub 2019 Aug 2.
We review the abstract concept of a 'state' - an internal representation posited by reinforcement learning theories to be used by an agent, whether animal, human or artificial, to summarize the features of the external and internal environment that are relevant for future behavior on a particular task. Armed with this summary representation, an agent can make decisions and perform actions to interact effectively with the world. Here, we review recent findings from the neurobiological and behavioral literature to ask: 'what is a state?' with respect to the internal representations that organize learning and decision making across a range of tasks. We find that state representations include information beyond a straightforward summary of the immediate cues in the environment, providing timing or contextual information from the recent or more distant past, which allows these additional factors to influence decision making and other goal-directed behaviors in complex and perhaps unexpected ways.
我们回顾了“状态”这一抽象概念——强化学习理论假定的一种内部表征,供主体(无论是动物、人类还是人工智能)用于总结外部和内部环境中与特定任务未来行为相关的特征。有了这种总结性表征,主体就能做出决策并采取行动,以便与世界进行有效互动。在此,我们回顾神经生物学和行为学文献中的最新发现,以探讨:就组织一系列任务中的学习和决策的内部表征而言,“什么是状态?”我们发现,状态表征所包含的信息不仅仅是对环境中即时线索的直接总结,还提供来自最近或更久远过去的时间或情境信息,这使得这些额外因素能够以复杂且可能意想不到的方式影响决策及其他目标导向行为。