Center for Information Technology Policy, Princeton University, Princeton, NJ, USA.
Department of Computer Science, University of Bath, Bath BA2 7AY, UK.
Science. 2017 Apr 14;356(6334):183-186. doi: 10.1126/science.aal4230.
Machine learning is a means to derive artificial intelligence by discovering patterns in existing data. Here, we show that applying machine learning to ordinary human language results in human-like semantic biases. We replicated a spectrum of known biases, as measured by the Implicit Association Test, using a widely used, purely statistical machine-learning model trained on a standard corpus of text from the World Wide Web. Our results indicate that text corpora contain recoverable and accurate imprints of our historic biases, whether morally neutral as toward insects or flowers, problematic as toward race or gender, or even simply veridical, reflecting the status quo distribution of gender with respect to careers or first names. Our methods hold promise for identifying and addressing sources of bias in culture, including technology.
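The abstract describes measuring IAT-style biases in a statistical language model. One common way such associations are quantified in word embeddings is a cosine-similarity differential between target word sets (e.g., flowers vs. insects) and attribute sets (e.g., pleasant vs. unpleasant), summarized as an effect size. The sketch below is illustrative only: the 2-D "embeddings" and word sets are toy assumptions, not the paper's actual corpus-trained vectors.

```python
import numpy as np

def cosine(a, b):
    """Cosine similarity between two embedding vectors."""
    return float(np.dot(a, b) / (np.linalg.norm(a) * np.linalg.norm(b)))

def association(w, A, B):
    """Mean similarity of word w to attribute set A minus attribute set B."""
    return np.mean([cosine(w, a) for a in A]) - np.mean([cosine(w, b) for b in B])

def effect_size(X, Y, A, B):
    """Cohen's-d-style effect size of differential association
    between target sets X, Y and attribute sets A, B."""
    x_assoc = [association(x, A, B) for x in X]
    y_assoc = [association(y, A, B) for y in Y]
    return (np.mean(x_assoc) - np.mean(y_assoc)) / np.std(x_assoc + y_assoc, ddof=1)

# Toy 2-D vectors: "flowers" lie near the "pleasant" direction,
# "insects" near the "unpleasant" direction.
flowers    = [np.array([1.0, 0.1]), np.array([0.9, 0.2])]
insects    = [np.array([0.1, 1.0]), np.array([0.2, 0.9])]
pleasant   = [np.array([1.0, 0.0])]
unpleasant = [np.array([0.0, 1.0])]

# A large positive effect size indicates flowers are more strongly
# associated with pleasantness than insects are.
print(effect_size(flowers, insects, pleasant, unpleasant))
```

With real embeddings the same computation recovers the biases the abstract reports; here the geometry is constructed by hand, so the positive effect size is guaranteed by design.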