Zipf's Law Arises Naturally When There Are Underlying, Unobserved Variables.

Authors

Aitchison Laurence, Corradi Nicola, Latham Peter E

Affiliations

Gatsby Computational Neuroscience Unit, University College London, London, United Kingdom.

Weill Medical College, Cornell University, New York, New York, United States of America.

Publication information

PLoS Comput Biol. 2016 Dec 20;12(12):e1005110. doi: 10.1371/journal.pcbi.1005110. eCollection 2016 Dec.

DOI:10.1371/journal.pcbi.1005110

PMID:27997544

Original article: https://pmc.ncbi.nlm.nih.gov/articles/PMC5172588/

Abstract

Zipf's law, which states that the probability of an observation is inversely proportional to its rank, has been observed in many domains. While there are models that explain Zipf's law in each of them, those explanations are typically domain specific. Recently, methods from statistical physics were used to show that a fairly broad class of models does provide a general explanation of Zipf's law. This explanation rests on the observation that real-world data is often generated from underlying causes, known as latent variables. Those latent variables mix together multiple models that do not obey Zipf's law, giving a model that does. Here we extend that work both theoretically and empirically. Theoretically, we provide a far simpler and more intuitive explanation of Zipf's law, which at the same time considerably extends the class of models to which this explanation can apply. Furthermore, we also give methods for verifying whether this explanation applies to a particular dataset. Empirically, these advances allowed us to extend this explanation to important classes of data, including word frequencies (the first domain in which Zipf's law was discovered), data with variable sequence length, and multi-neuron spiking activity.
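The claim that mixing non-Zipfian models over a latent variable can yield a Zipfian one can be checked on a toy case. The sketch below is an illustration only, not the authors' method or dataset: it assumes N independent binary units that share one firing probability p, and compares a fixed p with a latent p drawn uniformly from [0, 1]. For the uniform case, the probability of any pattern with k active units is the Beta integral 1 / ((N + 1) * C(N, k)). All function names, N = 40, and p = 0.1 are hypothetical choices made for this example.

```python
import math

def rank_curve(n, log_prob_of_count):
    """Return (log rank, log probability) points, one per group of patterns.

    All C(n, k) binary patterns with k active units share one probability,
    so each group is summarized by the middle rank it occupies.
    """
    groups = [(log_prob_of_count(k), math.comb(n, k)) for k in range(n + 1)]
    groups.sort(key=lambda g: g[0], reverse=True)  # most probable first
    points, seen = [], 0
    for log_p, size in groups:
        mid_rank = seen + (size + 1) / 2  # average rank within the group
        points.append((math.log(mid_rank), log_p))
        seen += size
    return points

def zipf_spread(points):
    """Range of log P + log rank; exactly 0 under a perfect Zipf's law."""
    sums = [x + y for x, y in points]
    return max(sums) - min(sums)

def log_prob_fixed(n, p):
    """No latent variable: every unit is active with the same fixed p."""
    return lambda k: k * math.log(p) + (n - k) * math.log(1 - p)

def log_prob_mixed(n):
    """Latent p ~ Uniform(0, 1), integrated out analytically (assumed prior)."""
    return lambda k: -math.log(n + 1) - math.log(math.comb(n, k))

N = 40  # hypothetical number of units
spread_fixed = zipf_spread(rank_curve(N, log_prob_fixed(N, 0.1)))
spread_mixed = zipf_spread(rank_curve(N, log_prob_mixed(N)))
print(f"fixed p: spread = {spread_fixed:.2f}")
print(f"latent p: spread = {spread_mixed:.2f}")
```

A small spread means log probability falls off with slope close to -1 in log rank, which is what Zipf's law requires. In this toy setting the fixed-p model deviates far more than the mixed model, consistent with the abstract's description.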

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ac76/5172588/b9d80cdb1dbd/pcbi.1005110.g001.jpg

Similar articles

Zipf's law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort.

Psychon Bull Rev. 2023 Feb;30(1):77-101. doi: 10.3758/s13423-022-02142-9. Epub 2022 Jul 15.

Zipf's law leads to Heaps' law: analyzing their relation in finite-size systems.

PLoS One. 2010 Dec 2;5(12):e14139. doi: 10.1371/journal.pone.0014139.

Zipf's word frequency law in natural language: a critical review and future directions.

Psychon Bull Rev. 2014 Oct;21(5):1112-30. doi: 10.3758/s13423-014-0585-6.

Zipf's law and criticality in multivariate data without fine-tuning.

Phys Rev Lett. 2014 Aug 8;113(6):068102. doi: 10.1103/PhysRevLett.113.068102. Epub 2014 Aug 7.

Random texts do not exhibit the real Zipf's law-like rank distribution.

PLoS One. 2010 Mar 9;5(3):e9411. doi: 10.1371/journal.pone.0009411.

Zipf's Law for Word Frequencies: Word Forms versus Lemmas in Long Texts.

PLoS One. 2015 Jul 9;10(7):e0129031. doi: 10.1371/journal.pone.0129031. eCollection 2015.

Zipf's law holds for phrases, not words.

Sci Rep. 2015 Aug 11;5:12209. doi: 10.1038/srep12209.

The evolution of the exponent of Zipf's law in language ontogeny.

PLoS One. 2013;8(3):e53227. doi: 10.1371/journal.pone.0053227. Epub 2013 Mar 13.

Zipf's Law of Abbreviation holds for individual characters across a broad range of writing systems.

Cognition. 2023 Sep;238:105527. doi: 10.1016/j.cognition.2023.105527. Epub 2023 Jun 24.

Cited by

The recovery of parabolic avalanches in spatially subsampled neuronal networks at criticality.

Sci Rep. 2024 Aug 20;14(1):19329. doi: 10.1038/s41598-024-70014-4.

Neural criticality from effective latent variables.

Elife. 2024 Mar 12;12:RP89337. doi: 10.7554/eLife.89337.

The recovery of parabolic avalanches in spatially subsampled neuronal networks at criticality.

bioRxiv. 2024 Jun 28:2024.02.26.582056. doi: 10.1101/2024.02.26.582056.

Extrinsic vs Intrinsic Criticality in Systems with Many Components.

ArXiv. 2023 Sep 25:arXiv:2309.13898v1.

Inferring couplings in networks across order-disorder phase transitions.

Phys Rev Res. 2022 Jun-Aug;4(2). doi: 10.1103/physrevresearch.4.023240. Epub 2022 Jun 24.

Quasiuniversal scaling in mouse-brain neuronal activity stems from edge-of-instability critical dynamics.

Proc Natl Acad Sci U S A. 2023 Feb 28;120(9):e2208998120. doi: 10.1073/pnas.2208998120. Epub 2023 Feb 24.

Neural criticality from effective latent variables.

ArXiv. 2023 Oct 13:arXiv:2301.00759v3.

Short- and Long-Range Connections Differentially Modulate the Dynamics and State of Small-World Networks.

Front Comput Neurosci. 2022 Jan 25;15:783474. doi: 10.3389/fncom.2021.783474. eCollection 2021.

Empirical evidence for concerted evolution in the 18S rDNA region of the planktonic diatom genus Chaetoceros.

Sci Rep. 2021 Jan 12;11(1):807. doi: 10.1038/s41598-020-80829-6.

Optimal Encoding in Stochastic Latent-Variable Models.

Entropy (Basel). 2020 Jun 28;22(7):714. doi: 10.3390/e22070714.

References

Signatures of criticality arise from random subsampling in simple population models.

PLoS Comput Biol. 2017 Oct 3;13(10):e1005718. doi: 10.1371/journal.pcbi.1005718. eCollection 2017 Oct.

Fluctuating fitness shapes the clone-size distribution of immune repertoires.

Proc Natl Acad Sci U S A. 2016 Jan 12;113(2):274-9. doi: 10.1073/pnas.1512977112. Epub 2015 Dec 28.

Thermodynamics and signatures of criticality in a network of neurons.

Proc Natl Acad Sci U S A. 2015 Sep 15;112(37):11508-13. doi: 10.1073/pnas.1514188112. Epub 2015 Sep 1.

Zipf's law and criticality in multivariate data without fine-tuning.

Phys Rev Lett. 2014 Aug 8;113(6):068102. doi: 10.1103/PhysRevLett.113.068102. Epub 2014 Aug 7.

On criticality in high-dimensional data.

Neural Comput. 2014 Jul;26(7):1329-39. doi: 10.1162/NECO_a_00607. Epub 2014 Apr 7.

Hierarchical model of natural images and the origin of scale invariance.

Proc Natl Acad Sci U S A. 2013 Feb 19;110(8):3071-6. doi: 10.1073/pnas.1222618110. Epub 2013 Feb 4.

A virtual retina for studying population coding.

PLoS One. 2013;8(1):e53363. doi: 10.1371/journal.pone.0053363. Epub 2013 Jan 14.

Retinal prosthetic strategy with the capacity to restore normal vision.

Proc Natl Acad Sci U S A. 2012 Sep 11;109(37):15012-7. doi: 10.1073/pnas.1207035109. Epub 2012 Aug 13.

Common input explains higher-order correlations and entropy in a simple model of neural population activity.

Phys Rev Lett. 2011 May 20;106(20):208102. doi: 10.1103/PhysRevLett.106.208102. Epub 2011 May 17.

Emergence of Zipf's law in the evolution of communication.

Phys Rev E Stat Nonlin Soft Matter Phys. 2011 Mar;83(3 Pt 2):036115. doi: 10.1103/PhysRevE.83.036115. Epub 2011 Mar 28.
