Tian Jinhua, Xie Hailun, Hu Siyuan, Liu Jia
Beijing Key Laboratory of Applied Experimental Psychology, Faculty of Psychology, Beijing Normal University, Beijing, China.
Department of Psychology & Tsinghua Laboratory of Brain and Intelligence, Tsinghua University, Beijing, China.
Front Comput Neurosci. 2021 Mar 10;15:620281. doi: 10.3389/fncom.2021.620281. eCollection 2021.
The increasingly popular application of AI risks amplifying social bias, such as classifying non-white faces as animals. Recent research has largely attributed this bias to the training data used. However, the underlying mechanism is poorly understood, and strategies to rectify the bias therefore remain unsettled. Here, we examined a typical deep convolutional neural network (DCNN), VGG-Face, which was trained on a face dataset containing more white faces than black and Asian faces. The transfer learning result showed significantly better performance in identifying white faces, resembling a well-known social bias in humans, the other-race effect (ORE). To test whether the effect resulted from the imbalance of face images, we retrained VGG-Face on a dataset containing more Asian faces and found a reverse ORE: the newly trained VGG-Face identified Asian faces more accurately than white faces. Moreover, when the number of Asian and white faces in the dataset was matched, the DCNN showed no bias. To further examine how imbalanced image input leads to the ORE, we performed a representational similarity analysis on VGG-Face's activations. We found that when the dataset contained more white faces, the representation of white faces was more distinct, indexed by smaller in-group similarity and larger representational Euclidean distance. That is, white faces were scattered more sparsely in VGG-Face's representational face space than the other faces. Importantly, the distinctiveness of faces was positively correlated with identification accuracy, which explains the ORE observed in VGG-Face. In summary, our study reveals the mechanism underlying the ORE in DCNNs and provides a novel approach to studying AI ethics.
In addition, the multidimensional face-representation theory discovered in humans also applied to DCNNs, suggesting that future studies apply more cognitive theories to understand DCNNs' behavior.
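The two distinctiveness indices named in the abstract (in-group similarity and mean representational Euclidean distance) can be sketched over a matrix of DCNN activations. Below is a minimal NumPy illustration, not the authors' actual pipeline: the embeddings `white_emb` and `asian_emb` are random stand-ins for penultimate-layer VGG-Face activations, and the group sizes and dimensionality are arbitrary.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical stand-ins for penultimate-layer face embeddings:
# one row per face identity, one column per activation unit.
white_emb = rng.normal(size=(50, 128))
asian_emb = rng.normal(size=(50, 128))

def in_group_similarity(emb):
    """Mean pairwise Pearson correlation within a group.

    Lower values mean the group's faces are represented more
    distinctly from one another.
    """
    corr = np.corrcoef(emb)                  # faces x faces correlation matrix
    iu = np.triu_indices_from(corr, k=1)     # unique off-diagonal pairs
    return corr[iu].mean()

def mean_pairwise_distance(emb):
    """Mean pairwise Euclidean distance within a group.

    Higher values mean the faces are scattered more sparsely
    in the representational face space.
    """
    diff = emb[:, None, :] - emb[None, :, :]   # all pairwise differences
    dist = np.sqrt((diff ** 2).sum(axis=-1))   # faces x faces distance matrix
    iu = np.triu_indices_from(dist, k=1)
    return dist[iu].mean()

for name, emb in [("white", white_emb), ("asian", asian_emb)]:
    print(name, in_group_similarity(emb), mean_pairwise_distance(emb))
```

On the paper's logic, the over-represented race would show a smaller in-group similarity and a larger mean pairwise distance than the under-represented race, and these indices would correlate positively with identification accuracy.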