Tabula rasa agents display emergent in-group behavior.

Authors

Köster Raphael, Duéñez-Guzmán Edgar A, Cunningham William A, Leibo Joel Z

Affiliations

Google DeepMind, London EC4A 3TW, United Kingdom.

Department of Psychology, University of Toronto, Toronto, ON M5S 3G3, Canada.

Publication information

Proc Natl Acad Sci U S A. 2025 Jun 24;122(25):e2319947121. doi: 10.1073/pnas.2319947121. Epub 2025 Jun 16.

DOI:10.1073/pnas.2319947121

PMID:40523182

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC12207463/

Abstract

Theories of group bias often posit an internal preparedness to bias one's cognition to favor the in-group (often envisioned as a product of evolution). In contrast, other theories suggest that group biases can emerge from nonspecialized cognitive processes. These perspectives have historically been difficult to disambiguate given that observed behavior can often be attributed to innate processes, even when groups are experimentally assigned. Here, we use modern techniques from the field of AI that allow us to ask what group biases can be expected from a learning agent that is a pure blank slate without any intrinsic social biases, and whose lifetime of experiences can be tightly controlled. This is possible because deep reinforcement-learning agents learn to convert raw sensory input (i.e., pixels) to reward-driven action, a unique feature among cognitive models. We find that blank slate agents do develop group biases based on arbitrary group differences (i.e., color). We show that the bias develops as a result of familiarity of experience and depends on the visual patterns becoming associated with reward through interaction. The bias that artificial agents display is not a static reflection of the bias in their stream of experiences. In this minimal environment, the bias can be overcome given enough positive experiences, although unlearning the bias takes longer than acquiring it. Further, we show how this style of tabula rasa group behavior model can be used to test fine-grained predictions of psychological theories.
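As a rough illustration of the familiarity mechanism described in the abstract, the following toy sketch tracks one learned value per color. It is not the authors' model: the paper uses deep reinforcement-learning agents that learn from pixels, whereas this is a tabular value learner with a hypothetical learning rate, hypothetical color names, and an assumed reward of 1.0 for every interaction regardless of color. It only shows that a blank-slate learner exposed to one color first ends up valuing that color more, and that later positive experiences with the other color shrink the gap. It does not attempt to reproduce the reported finding that unlearning takes longer than acquisition.

```python
"""Toy illustration only: a tabular value learner, not the paper's deep RL agents."""

LEARNING_RATE = 0.1  # hypothetical step size, chosen for illustration


def update(values, color, reward, lr=LEARNING_RATE):
    """Move the value estimate for `color` a fraction `lr` toward `reward`."""
    values[color] += lr * (reward - values[color])


def preference(values, a="red", b="blue"):
    """Difference in learned value between two colors (positive favors `a`)."""
    return values[a] - values[b]


def run(familiar_steps=50, contact_steps=50):
    """Return the value gap after familiarization and after later positive contact."""
    # Blank slate: no color starts with any learned value.
    values = {"red": 0.0, "blue": 0.0}

    # Phase 1 (assumed): the agent only ever meets "red" partners,
    # and every interaction is rewarded.
    for _ in range(familiar_steps):
        update(values, "red", 1.0)
    bias_after_familiarization = preference(values)

    # Phase 2 (assumed): the agent now has equally rewarding
    # interactions with "blue" partners.
    for _ in range(contact_steps):
        update(values, "blue", 1.0)
    bias_after_contact = preference(values)

    return bias_after_familiarization, bias_after_contact


if __name__ == "__main__":
    before, after = run()
    print(f"value gap after familiarization: {before:.3f}")
    print(f"value gap after positive contact: {after:.3f}")
```

In this sketch both colors are rewarded identically, so any gap in learned value comes only from the order and amount of experience, which is the sense in which the preference here is driven by familiarity rather than by any built-in difference between groups.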

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/3d91/12207463/4cc1842e27f9/pnas.2319947121fig01.jpg
