List Johann-Mattis, Forkel Robert
Department of Linguistic and Cultural Evolution, Max Planck Institute for Evolutionary Anthropology, Leipzig, Thüringen, 04103, Germany.
Open Res Eur. 2022 Mar 23;1:79. doi: 10.12688/openreseurope.13843.3. eCollection 2021.
Although lexical borrowing is an important aspect of language evolution, there have been few attempts to automate the identification of borrowings in lexical datasets. Moreover, none of the solutions which have been proposed so far identify borrowings across multiple languages. This study proposes a new method for the task and tests it on a newly compiled large comparative dataset of 48 South-East Asian languages from Southern China. The method yields very promising results, while it is conceptually straightforward and easy to apply. This makes the approach a perfect candidate for computer-assisted exploratory studies on lexical borrowing in contact areas.
虽然词汇借用是语言演变的一个重要方面,但很少有人尝试在词汇数据集中自动识别借用词。此外,到目前为止提出的所有解决方案都无法识别多种语言中的借用词。本研究针对该任务提出了一种新方法,并在一个新编制的来自中国南方的48种东南亚语言的大型比较数据集上进行了测试。该方法产生了非常有前景的结果,而且在概念上简单易懂且易于应用。这使得该方法成为接触地区词汇借用的计算机辅助探索性研究的理想选择。