Khishigsuren Temuulen, Regier Terry, Vylomova Ekaterina, Kemp Charles
School of Psychological Sciences, University of Melbourne, Parkville, VIC 3010, Australia.
Department of Linguistics, Cognitive Science Program, University of California, Berkeley, CA 94720.
Proc Natl Acad Sci U S A. 2025 Apr 15;122(15):e2417304122. doi: 10.1073/pnas.2417304122. Epub 2025 Apr 10.
Claims about lexical elaboration (e.g. Mongolian has many horse-related terms) are widespread in the scholarly and popular literature. Here, we show that computational analyses of bilingual dictionaries can be used to test claims about lexical elaboration at scale. We validate our approach by introducing BILA, a dataset including 1,574 bilingual dictionaries, and showing that it confirms 147 out of 163 previous claims from the literature. We then identify previously unreported examples of lexical elaboration, and analyze how lexical elaboration is influenced by ecological and cultural variables. Claims about lexical elaboration are sometimes dismissed as either obvious or fanciful, but our work suggests that large-scale computational approaches to the topic can produce nonobvious and well-grounded insights into language and culture.
关于词汇精细化的说法(例如蒙古语中有许多与马相关的词汇)在学术文献和通俗文学中广泛存在。在此,我们表明双语词典的计算分析可用于大规模检验关于词汇精细化的说法。我们通过引入BILA(一个包含1574本双语词典的数据集)来验证我们的方法,并表明它证实了文献中先前163个说法中的147个。然后,我们识别出先前未报道的词汇精细化示例,并分析词汇精细化是如何受到生态和文化变量影响的。关于词汇精细化的说法有时会被认为要么显而易见要么异想天开,但我们的研究表明,针对该主题的大规模计算方法可以对语言和文化产生非显而易见且有充分依据的见解。