• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

论语言规律的物理起源及言语中的对数正态性。

On the physical origin of linguistic laws and lognormality in speech.

作者信息

Torre Iván G, Luque Bartolo, Lacasa Lucas, Kello Christopher T, Hernández-Fernández Antoni

机构信息

Departamento de Matemática Aplicada, ETSIAE, Universidad Politécnica de Madrid, Plaza Cardenal Cisneros, 28040 Madrid, Spain.

Cognitive and Information Sciences, University of California Merced, 5200 North Lake Road Merced, 95343 CA, USA.

出版信息

R Soc Open Sci. 2019 Aug 21;6(8):191023. doi: 10.1098/rsos.191023. eCollection 2019 Aug.

DOI:10.1098/rsos.191023
PMID:31598263
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6731709/
Abstract

Physical manifestations of linguistic units include sources of variability due to factors of speech production which are by definition excluded from counts of linguistic symbols. In this work, we examine whether linguistic laws hold with respect to the physical manifestations of linguistic units in spoken English. The data we analyse come from a phonetically transcribed database of acoustic recordings of spontaneous speech known as the Buckeye Speech corpus. First, we verify with unprecedented accuracy that acoustically transcribed durations of linguistic units at several scales comply with a lognormal distribution, and we quantitatively justify this 'lognormality law' using a stochastic generative model. Second, we explore the four classical linguistic laws (Zipf's Law, Herdan's Law, Brevity Law and Menzerath-Altmann's Law (MAL)) in oral communication, both in physical units and in symbolic units measured in the speech transcriptions, and find that the validity of these laws is typically stronger when using physical units than in their symbolic counterpart. Additional results include (i) coining a Herdan's Law in physical units, (ii) a precise mathematical formulation of Brevity Law, which we show to be connected to optimal compression principles in information theory and allows to formulate and validate yet another law which we call the size-rank law or (iii) a mathematical derivation of MAL which also highlights an additional regime where the law is inverted. Altogether, these results support the hypothesis that statistical laws in language have a physical origin.

摘要

语言单位的物理表现包括由于语音产生因素导致的变异性来源,根据定义,这些因素被排除在语言符号计数之外。在这项工作中,我们研究语言规律对于英语口语中语言单位的物理表现是否成立。我们分析的数据来自一个名为“七叶树语音语料库”的自发语音声学记录的语音转录数据库。首先,我们以前所未有的准确性验证了几个尺度上语言单位的声学转录时长符合对数正态分布,并使用随机生成模型对这一“对数正态性定律”进行了定量论证。其次,我们在口语交流中探索了四条经典语言规律(齐普夫定律、赫尔丹定律、简洁定律和门泽拉斯 - 阿尔特曼定律(MAL)),无论是在物理单位还是在语音转录中测量的符号单位中,并发现使用物理单位时这些定律的有效性通常比其符号对应物更强。其他结果包括:(i)在物理单位中创造了赫尔丹定律;(ii)简洁定律的精确数学公式,我们证明它与信息论中的最优压缩原理相关,并允许制定和验证另一条我们称为大小 - 秩定律的定律;或者(iii)MAL的数学推导,这也突出了该定律反转的另一种情况。总之,这些结果支持了语言中的统计规律具有物理起源这一假设。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/548c4f3ef7e9/rsos191023-g13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/e622874a629b/rsos191023-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/1096f8287235/rsos191023-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/77d91c5234a9/rsos191023-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/016303a901b7/rsos191023-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/6564db067bc9/rsos191023-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/8f5c61837312/rsos191023-g6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/9e5429a3c0ea/rsos191023-g7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/a1d3faa5a2bb/rsos191023-g8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/86e867a6c407/rsos191023-g9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/6bf82109a71f/rsos191023-g10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/07090a4e9bf8/rsos191023-g11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/353d225fe6e6/rsos191023-g12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/548c4f3ef7e9/rsos191023-g13.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/e622874a629b/rsos191023-g1.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/1096f8287235/rsos191023-g2.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/77d91c5234a9/rsos191023-g3.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/016303a901b7/rsos191023-g4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/6564db067bc9/rsos191023-g5.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/8f5c61837312/rsos191023-g6.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/9e5429a3c0ea/rsos191023-g7.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/a1d3faa5a2bb/rsos191023-g8.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/86e867a6c407/rsos191023-g9.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/6bf82109a71f/rsos191023-g10.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/07090a4e9bf8/rsos191023-g11.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/353d225fe6e6/rsos191023-g12.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/8496/6731709/548c4f3ef7e9/rsos191023-g13.jpg

相似文献

1
On the physical origin of linguistic laws and lognormality in speech.论语言规律的物理起源及言语中的对数正态性。
R Soc Open Sci. 2019 Aug 21;6(8):191023. doi: 10.1098/rsos.191023. eCollection 2019 Aug.
2
Do bats' social vocalizations conform to Zipf's law and the Menzerath-Altmann law?蝙蝠的社交发声是否符合齐普夫定律和门泽拉斯-阿尔特曼定律?
iScience. 2024 Jun 28;27(7):110401. doi: 10.1016/j.isci.2024.110401. eCollection 2024 Jul 19.
3
Linguistic laws of brevity: conformity in Indri indri.简明的语言规律:印度长尾猕猴的一致性。
Anim Cogn. 2021 Jul;24(4):897-906. doi: 10.1007/s10071-021-01495-3. Epub 2021 Mar 8.
4
Can Menzerath's law be a criterion of complexity in communication?门泽尔定律能否成为通信复杂性的一个标准?
PLoS One. 2021 Aug 20;16(8):e0256133. doi: 10.1371/journal.pone.0256133. eCollection 2021.
5
The Brevity Law as a Scaling Law, and a Possible Origin of Zipf's Law for Word Frequencies.作为标度律的简洁法则以及词频齐普夫定律的一种可能起源。
Entropy (Basel). 2020 Feb 17;22(2):224. doi: 10.3390/e22020224.
6
Linguistic law-like compression strategies emerge to maximize coding efficiency in marmoset vocal communication.语言类法则压缩策略的出现是为了在狨猴的声音交流中最大化编码效率。
Proc Biol Sci. 2023 Sep 27;290(2007):20231503. doi: 10.1098/rspb.2023.1503.
7
Do penguins' vocal sequences conform to linguistic laws?企鹅的叫声序列是否符合语言规律?
Biol Lett. 2020 Feb;16(2):20190589. doi: 10.1098/rsbl.2019.0589. Epub 2020 Feb 5.
8
Scaling Laws for Phonotactic Complexity in Spoken English Language Data.英语口语中韵律构词复杂度的标度定律。
Lang Speech. 2021 Sep;64(3):693-704. doi: 10.1177/0023830920944445. Epub 2020 Aug 1.
9
Zipf's law revisited: Spoken dialog, linguistic units, parameters, and the principle of least effort.再探齐夫定律:口语对话、语言单位、参数和省力原则。
Psychon Bull Rev. 2023 Feb;30(1):77-101. doi: 10.3758/s13423-022-02142-9. Epub 2022 Jul 15.
10
The Compressed Vocabulary of Microbial Life.微生物生命的精简词汇表。
Front Microbiol. 2021 Jul 7;12:655990. doi: 10.3389/fmicb.2021.655990. eCollection 2021.

引用本文的文献

1
Does Zipf's law of abbreviation shape birdsong?齐普夫缩写定律塑造鸟鸣吗?
PLoS Comput Biol. 2025 Aug 13;21(8):e1013228. doi: 10.1371/journal.pcbi.1013228. eCollection 2025 Aug.
2
Vocal efficiency in crows.乌鸦的发声效率
Anim Cogn. 2025 Jul 31;28(1):75. doi: 10.1007/s10071-025-01985-8.
3
Zipf's law in China's local government work reports: A 21-year study using natural language processing and regression analysis.中国地方政府工作报告中的齐普夫定律:一项运用自然语言处理和回归分析的21年研究。

本文引用的文献

1
Linguistic laws in chimpanzee gestural communication.黑猩猩手势交际中的语言规律
Proc Biol Sci. 2019 Feb 13;286(1896):20182900. doi: 10.1098/rspb.2018.2900.
2
Effects of Surprisal and Entropy on Vowel Duration in Japanese.意外性和熵对日语元音时长的影响。
Lang Speech. 2019 Mar;62(1):80-114. doi: 10.1177/0023830917737331. Epub 2017 Nov 5.
3
Zipf's Law of Abbreviation and the Principle of Least Effort: Language users optimise a miniature lexicon for efficient communication.齐普夫缩写定律与省力原则:语言使用者优化一个小型词汇库以实现高效交流。
PLoS One. 2025 May 20;20(5):e0324713. doi: 10.1371/journal.pone.0324713. eCollection 2025.
4
Language-like efficiency in whale communication.鲸鱼交流中类似语言的高效性。
Sci Adv. 2025 Feb 7;11(6):eads6014. doi: 10.1126/sciadv.ads6014. Epub 2025 Feb 5.
5
Spot the bot: the inverse problems of NLP.找出机器人:自然语言处理的逆问题。
PeerJ Comput Sci. 2024 Dec 9;10:e2550. doi: 10.7717/peerj-cs.2550. eCollection 2024.
6
Inter-call intervals, but not call durations, adhere to Menzerath's Law in the submissive vocal bouts of meerkats.在狐獴的顺从性发声回合中,叫声间隔(而非叫声时长)遵循门泽拉斯定律。
R Soc Open Sci. 2024 Dec 11;11(12):241351. doi: 10.1098/rsos.241351. eCollection 2024 Dec.
7
Word Length in Political Public Speaking: Distribution and Time Evolution.政治公开演讲中的词汇长度:分布与时间演变
Entropy (Basel). 2024 Feb 21;26(3):180. doi: 10.3390/e26030180.
8
Linguistic law-like compression strategies emerge to maximize coding efficiency in marmoset vocal communication.语言类法则压缩策略的出现是为了在狨猴的声音交流中最大化编码效率。
Proc Biol Sci. 2023 Sep 27;290(2007):20231503. doi: 10.1098/rspb.2023.1503.
9
A multiplex analysis of phonological and orthographic networks.音位和正字法网络的多重分析。
PLoS One. 2022 Sep 15;17(9):e0274617. doi: 10.1371/journal.pone.0274617. eCollection 2022.
10
Compression principle and Zipf's Law of brevity in infochemical communication.信息素通讯中的压缩原理和齐夫简短律。
Biol Lett. 2022 Jul;18(7):20220162. doi: 10.1098/rsbl.2022.0162. Epub 2022 Jul 27.
Cognition. 2017 Aug;165:45-52. doi: 10.1016/j.cognition.2017.05.001. Epub 2017 May 8.
4
Emergence of linguistic laws in human voice.人类语音中的语言规律的出现。
Sci Rep. 2017 Mar 8;7:43862. doi: 10.1038/srep43862.
5
Linguistic research in the empirical paradigm as outlined by Mario Bunge.如马里奥·邦格所概述的实证范式中的语言学研究。
Springerplus. 2016 Jul 26;5(1):1183. doi: 10.1186/s40064-016-2684-5. eCollection 2016.
6
Gelada vocal sequences follow Menzerath's linguistic law.狮尾狒的发声序列遵循门泽拉斯语言法则。
Proc Natl Acad Sci U S A. 2016 May 10;113(19):E2750-8. doi: 10.1073/pnas.1522072113. Epub 2016 Apr 18.
7
Log-Log Convexity of Type-Token Growth in Zipf's Systems.齐夫系统中 T 型标记增长的对数-对数凸性。
Phys Rev Lett. 2015 Jun 12;114(23):238701. doi: 10.1103/PhysRevLett.114.238701. Epub 2015 Jun 9.
8
Text mixing shapes the anatomy of rank-frequency distributions.文本混合塑造了等级-频率分布的结构。
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 May;91(5):052811. doi: 10.1103/PhysRevE.91.052811. Epub 2015 May 20.
9
Scaling and universality in the human voice.人类声音中的标度律和普遍性。
J R Soc Interface. 2015 Apr 6;12(105). doi: 10.1098/rsif.2014.1344.
10
The log-dynamic brain: how skewed distributions affect network operations.对数动态大脑:偏态分布如何影响网络运作。
Nat Rev Neurosci. 2014 Apr;15(4):264-78. doi: 10.1038/nrn3687. Epub 2014 Feb 26.