Laboratoire Dynamique Du Language (DDL) UMR5596, Université Lumière Lyon 2, Lyon, France.
PLoS One. 2021 Jun 30;16(6):e0253546. doi: 10.1371/journal.pone.0253546. eCollection 2021.
While it is generally accepted that language and speech have genetic foundations, and that the widespread inter-individual variation observed in many of their aspects is partly driven by variation in genes, it is much less clear if differences between languages may also be partly rooted in our genes. One such proposal is that the population frequencies of the so-called "derived" alleles of two genes involved in brain growth and development, ASPM and Microcephalin, are related to the probability of speaking a tone language or not. The original study introducing this proposal used a cross-linguistic statistical approach, showing that these associations are "special" when compared with many other possible relationships between genetic variants and linguistic features. Recent experimental evidence supports strongly a negative effect of the "derived" allele of ASPM on tone perception and/or processing within individuals, but failed to find any effect for Microcephalin. Motivated by these experimental findings, I conduct here a cross-linguistic statistical test, using a larger and updated dataset of 175 samples from 129 unique (meta)populations, and a battery of methods including mixed-effects regression (Bayesian and maximum-likelihood), mediation and path analysis, decision trees and random forests, using permutations and restricted sampling to control for the confounding effects of genealogy (language families) and contact (macroareas). Overall, the results support a negative weak effect of ASPM-D against the presence of tone above and beyond the strong confounding influences of genealogy and contact, but they suggest that the original association between tone and MCPH1 might have been a false positive, explained by differences between populations and languages within and outside Africa. Thus, these cross-linguistic population-scale statistical results are fully consonant with the inter-individual-level experimental results, and suggest that the observed linguistic diversity may be, at least in some cases, partly driven by genetic diversity.
虽然人们普遍认为语言和言语具有遗传基础,并且它们在许多方面表现出的广泛个体间差异部分是由基因变异驱动的,但语言之间的差异是否也可能部分源于我们的基因,这一点就远不那么清楚了。有一种观点认为,参与大脑生长和发育的两个基因——ASPM 和 Microcephalin 的所谓“衍生”等位基因的种群频率,与是否说声调语言的概率有关。提出这一观点的原始研究使用了一种跨语言的统计方法,表明与遗传变异和语言特征之间的许多其他可能关系相比,这些关联是“特殊的”。最近的实验证据强烈支持 ASPM 的“衍生”等位基因对个体内声调感知和/或处理有负面影响,但未能发现 Microcephalin 有任何影响。受这些实验结果的启发,我在这里进行了一项跨语言的统计测试,使用了一个更大、更新的数据集,其中包含来自 129 个独特(元)群体的 175 个样本,以及一系列方法,包括混合效应回归(贝叶斯和最大似然)、中介和路径分析、决策树和随机森林,使用置换和限制抽样来控制谱系(语言家族)和接触(宏观区域)的混杂影响。总的来说,结果支持了 ASPM-D 对声调存在的负弱影响,超出了谱系和接触的强烈混杂影响,但它们表明,声调与 MCPH1 之间的原始关联可能是一个假阳性,这可以用非洲内外的人群和语言之间的差异来解释。因此,这些跨语言的人群规模统计结果与个体水平的实验结果完全一致,并表明观察到的语言多样性至少在某些情况下可能部分是由遗传多样性驱动的。