Lestrade Sander
Centre for Language Studies, Radboud University, Nijmegen, The Netherlands.
PLoS One. 2017 Aug 9;12(8):e0181987. doi: 10.1371/journal.pone.0181987. eCollection 2017.
In spite of decades of theorizing, the origins of Zipf's law remain elusive. I propose that a Zipfian distribution follows straightforwardly from the interaction of syntax (word classes differing in class size) and semantics (words having to be sufficiently specific to be distinctive and sufficiently general to be reusable). These factors are independently motivated, well-established ingredients of a natural-language system. A computational model shows that neither ingredient suffices on its own to produce a Zipfian distribution, and that the results deviate from the Zipfian ideal only in the same way as natural language itself does.
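For readers unfamiliar with the term, a Zipfian distribution is one in which a word's frequency is roughly inversely proportional to its frequency rank, so a log-log plot of frequency against rank is close to a straight line with slope near -1. The following is a minimal sketch of how that property can be checked on a list of tokens. It is not the computational model described in the abstract; the function name `zipf_slope` and the toy corpus are assumptions made purely for illustration.

```python
from collections import Counter
import math


def zipf_slope(tokens):
    """Least-squares slope of log(frequency) against log(rank).

    Illustrative helper (hypothetical name, not from the paper).
    A slope near -1 is what an ideal Zipfian distribution would give.
    """
    # Frequencies sorted from most to least frequent; rank 1 is the top word.
    freqs = sorted(Counter(tokens).values(), reverse=True)
    if len(freqs) < 2:
        raise ValueError("need at least two distinct word types")

    xs = [math.log(rank) for rank in range(1, len(freqs) + 1)]
    ys = [math.log(freq) for freq in freqs]

    # Ordinary least-squares fit of ys on xs.
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Toy corpus (assumed data): the word at rank r occurs about 1000 / r times.
toy_tokens = []
for rank in range(1, 51):
    toy_tokens.extend([f"w{rank}"] * round(1000 / rank))

slope = zipf_slope(toy_tokens)
print(f"fitted log-log slope: {slope:.3f}")
```

On the toy corpus the fitted slope comes out close to -1 by construction. Natural-language corpora typically depart from the straight line at the highest and lowest ranks, which is the kind of deviation the abstract refers to.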