Department of Neuroimaging, Institute of Psychiatry, Psychology & Neuroscience, King's College London, London, UK; Department of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA.
Department of Humanities and Social Sciences, California Institute of Technology, Pasadena, CA, USA; Department of Psychology, University of Maryland, College Park, MD, USA; Brain and Behavior Institute, University of Maryland, College Park, MD, USA.
Cell Rep. 2023 Aug 29;42(8):113008. doi: 10.1016/j.celrep.2023.113008. Epub 2023 Aug 22.
In social environments, survival can depend upon inferring and adapting to other agents' goal-directed behavior. However, it remains unclear how humans achieve this, despite the fact that many decisions must account for complex, dynamic agents acting according to their own goals. Here, we use a predator-prey task (total n = 510) to demonstrate that humans exploit an interactive cognitive map of the social environment to infer other agents' preferences and simulate their future behavior, providing for flexible, generalizable responses. A model-based inverse reinforcement learning model explained participants' inferences about threatening agents' preferences, with participants using this inferred knowledge to enact generalizable, model-based behavioral responses. Using tree-search planning models, we then found that behavior was best explained by a planning algorithm that incorporated simulations of the threat's goal-directed behavior. Our results indicate that humans use a cognitive map to determine other agents' preferences, facilitating generalized predictions of their behavior and effective responses.
在社交环境中,生存可能取决于推断和适应其他主体的目标导向行为。然而,尽管许多决策必须考虑到复杂的、动态的主体根据自己的目标行事,但人类如何做到这一点仍不清楚。在这里,我们使用一个捕食者-猎物任务(总人数 n=510)来证明人类利用社交环境的互动认知地图来推断其他主体的偏好,并模拟他们未来的行为,从而实现灵活、可推广的反应。基于模型的逆向强化学习模型解释了参与者对威胁主体偏好的推断,参与者利用这种推断的知识来实施可推广的、基于模型的行为反应。然后,我们使用树搜索规划模型发现,行为最好用一种包含威胁目标导向行为模拟的规划算法来解释。我们的研究结果表明,人类使用认知地图来确定其他主体的偏好,从而促进对其行为的普遍预测和有效反应。