Chaabouni Rahma, Kharitonov Eugene, Dupoux Emmanuel, Baroni Marco
Facebook AI Research, 75002 Paris, France;
Cognitive Machine Learning, ENS - EHESS - PSL Research University - CNRS - INRIA, 75012 Paris, France.
Proc Natl Acad Sci U S A. 2021 Mar 23;118(12). doi: 10.1073/pnas.2016569118.
Words categorize the semantic fields they refer to in ways that maximize communication accuracy while minimizing complexity. Focusing on the well-studied color domain, we show that artificial neural networks trained with deep-learning techniques to play a discrimination game develop communication systems whose distribution on the accuracy/complexity plane closely matches that of human languages. The observed variation among emergent color-naming systems is explained by different degrees of discriminative need, of the sort that might also characterize different human communities. Like human languages, emergent systems show a preference for relatively low-complexity solutions, even at the cost of imperfect communication. We demonstrate next that the nature of the emergent systems crucially depends on communication being discrete (as is human word usage). When continuous message passing is allowed, emergent systems become more complex and eventually less efficient. Our study suggests that efficient semantic categorization is a general property of discrete communication systems, not limited to human language. It suggests moreover that it is exactly the discrete nature of such systems that, acting as a bottleneck, pushes them toward low complexity and optimal efficiency.
词汇对它们所指的语义领域进行分类的方式,能在将复杂性降至最低的同时,最大限度地提高交流准确性。以研究充分的颜色领域为例,我们表明,通过深度学习技术训练来玩辨别游戏的人工神经网络会开发出交流系统,其在准确性/复杂性平面上的分布与人类语言的分布紧密匹配。新兴颜色命名系统中观察到的差异,可由不同程度的辨别需求来解释,这种需求也可能是不同人类群体的特征。与人类语言一样,新兴系统表现出对相对低复杂性解决方案的偏好,即使以不完美的交流为代价。接下来我们证明,新兴系统的性质关键取决于交流是离散的(就像人类使用单词一样)。当允许连续传递信息时,新兴系统会变得更加复杂,最终效率更低。我们的研究表明,高效的语义分类是离散交流系统的普遍属性,并不局限于人类语言。此外,研究表明,正是这种系统的离散性质,作为一个瓶颈,将它们推向低复杂性和最佳效率。