How do we use language? Shared patterns in the frequency of word use across 17 world languages.

Affiliation

School of Biological Sciences, University of Reading, Reading, UK.

Publication information

Philos Trans R Soc Lond B Biol Sci. 2011 Apr 12;366(1567):1101-7. doi: 10.1098/rstb.2010.0315.

Abstract

We present data from 17 languages on the frequency with which a common set of words is used in everyday language. The languages are drawn from six language families representing 65 per cent of the world's 7000 languages. Our data were collected from linguistic corpora that record frequencies of use for the 200 meanings in the widely used Swadesh fundamental vocabulary. Our interest is to assess evidence for shared patterns of language use around the world, and for the relationship of language use to rates of lexical replacement, defined as the replacement of a word by a new unrelated or non-cognate word. Frequencies of use for words in the Swadesh list range from just a few per million words of speech to 191 000 or more. The average inter-correlation among languages in the frequency of use across the 200 words is 0.73 (p < 0.0001). The first principal component of these data accounts for 70 per cent of the variance in frequency of use. Elsewhere, we have shown that frequently used words in the Indo-European languages tend to be more conserved, and that this relationship holds separately for different parts of speech. A regression model combining the principal factor loadings derived from the worldwide sample along with their part of speech predicts 46 per cent of the variance in the rates of lexical replacement in the Indo-European languages. This suggests that Indo-European lexical replacement rates might be broadly representative of worldwide rates of change. Evidence for this speculation comes from using the same factor loadings and part-of-speech categories to predict a word's position in a list of 110 words ranked from slowest to most rapidly evolving among 14 of the world's language families. This regression model accounts for 30 per cent of the variance. Our results point to a remarkable regularity in the way that human speakers use language, and hint that the words for a shared set of meanings have been slowly evolving and others more rapidly evolving throughout human history.
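
The two summary statistics quoted in the abstract (the average inter-correlation among languages and the share of variance carried by the first principal component) can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the data are synthetic rather than the corpus frequencies used in the paper, the function names are hypothetical, and the principal components are taken from the correlation matrix of log-scale frequencies, which may differ from the authors' exact procedure.

```python
import numpy as np


def mean_pairwise_correlation(x: np.ndarray) -> float:
    """Average Pearson correlation over all distinct pairs of columns.

    x has shape (n_meanings, n_languages); each column is one language.
    """
    corr = np.corrcoef(x, rowvar=False)           # (n_languages, n_languages)
    upper = np.triu_indices(corr.shape[0], k=1)   # distinct pairs only
    return float(corr[upper].mean())


def first_pc_variance_share(x: np.ndarray) -> float:
    """Share of total variance on the first principal component.

    Assumption: PCA is run on the correlation matrix, so every language
    contributes one unit of variance regardless of corpus size.
    """
    corr = np.corrcoef(x, rowvar=False)
    eigvals = np.linalg.eigvalsh(corr)            # returned in ascending order
    return float(eigvals[-1] / eigvals.sum())


# Synthetic stand-in for the real data: 200 meanings by 17 languages.
# Each meaning gets a shared log-frequency plus language-specific noise.
rng = np.random.default_rng(0)
n_meanings, n_languages = 200, 17
shared = rng.normal(size=n_meanings)
noise = rng.normal(scale=0.6, size=(n_meanings, n_languages))
log_freq = shared[:, None] + noise

mean_r = mean_pairwise_correlation(log_freq)
pc1_share = first_pc_variance_share(log_freq)

if __name__ == "__main__":
    print(f"mean inter-correlation: {mean_r:.2f}")
    print(f"variance on first PC:   {pc1_share:.2f}")
```

In this toy setup a single shared factor drives both numbers, which is the kind of regularity the abstract describes: when languages agree on which meanings are frequent, pairwise correlations are high and one component captures most of the variance.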




