Suppr超能文献

刻画不同语言的典型信息曲线。

Characterizing the Typical Information Curves of Diverse Languages.

作者信息

Klafka Josef, Yurovsky Daniel

机构信息

Department of Psychology, Carnegie Mellon University, 5000 Forbes Ave, Pittsburgh, PA 15213, USA.

出版信息

Entropy (Basel). 2021 Oct 2;23(10):1300. doi: 10.3390/e23101300.

Abstract

Optimal coding theories of language predict that speakers will keep the amount of information in their utterances relatively uniform under the constraints imposed by their language, but how much do these constraints influence information structure, and how does this influence vary across languages? We present a novel method for characterizing the information structure of sentences across a diverse set of languages. While the structure of English is broadly consistent with the shape predicted by optimal coding, many languages are not consistent with this prediction. We proceed to show that the characteristic information curves of languages are partly related to a variety of typological features from phonology to word order. These results present an important step in the direction of exploring upper bounds for the extent to which linguistic codes can be optimal for communication.

摘要

语言的最优编码理论预测,在语言所施加的限制条件下,说话者会使他们话语中的信息量相对保持一致,但这些限制对信息结构有多大影响,以及这种影响在不同语言之间如何变化?我们提出了一种新颖的方法来刻画多种不同语言中句子的信息结构。虽然英语的结构大致与最优编码预测的形式一致,但许多语言并不符合这一预测。我们进而表明,语言的特征信息曲线部分与从音系学到词序的各种类型学特征相关。这些结果朝着探索语言编码在何种程度上能够实现最优交流的上限迈出了重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/dbd4/8534556/68ae2e0dbc0e/entropy-23-01300-g001.jpg

相似文献

1
Characterizing the Typical Information Curves of Diverse Languages.
Entropy (Basel). 2021 Oct 2;23(10):1300. doi: 10.3390/e23101300.
3
Cross-linguistic gestures reflect typological universals: a subject-initial, verb-final bias in speakers of diverse languages.
Cognition. 2015 Mar;136:215-21. doi: 10.1016/j.cognition.2014.11.022. Epub 2014 Dec 11.
4
The extent and degree of utterance-final word lengthening in spontaneous speech from 10 languages.
Linguist Vanguard. 2021 Feb 9;7(1):20190063. doi: 10.1515/lingvan-2019-0063. eCollection 2021 Jan 1.
5
Non-Arbitrariness in Mapping Word Form to Meaning: Cross-Linguistic Formal Markers of Word Concreteness.
Cogn Sci. 2017 May;41(4):1071-1089. doi: 10.1111/cogs.12361. Epub 2016 Mar 14.
6
Adaptive Communication: Languages with More Non-Native Speakers Tend to Have Fewer Word Forms.
PLoS One. 2015 Jun 17;10(6):e0128254. doi: 10.1371/journal.pone.0128254. eCollection 2015.
7
Linguistic typology of motion events in visual narratives.
Cogn Semiot. 2022 Oct 17;15(2):197-222. doi: 10.1515/cogsem-2022-2013. eCollection 2022 Nov.
8
Learning structural alternations: What guides learners' generalization?
Cognition. 2021 Oct;215:104828. doi: 10.1016/j.cognition.2021.104828. Epub 2021 Jul 8.
9
Sociolinguistic Typology and Sign Languages.
Front Psychol. 2018 Feb 21;9:200. doi: 10.3389/fpsyg.2018.00200. eCollection 2018.
10
Consistency in Motion Event Encoding Across Languages.
Front Psychol. 2021 Mar 30;12:625153. doi: 10.3389/fpsyg.2021.625153. eCollection 2021.

引用本文的文献

1
Word frequency and cognitive effort in turns-at-talk: turn structure affects processing load in natural conversation.
Front Psychol. 2024 Jun 5;15:1208029. doi: 10.3389/fpsyg.2024.1208029. eCollection 2024.
2
Information distribution patterns in naturalistic dialogue differ across languages.
Psychon Bull Rev. 2024 Aug;31(4):1723-1734. doi: 10.3758/s13423-024-02452-0. Epub 2024 Jan 24.

本文引用的文献

1
childes-db: A flexible and reproducible interface to the child language data exchange system.
Behav Res Methods. 2019 Aug;51(4):1928-1941. doi: 10.3758/s13428-018-1176-7.
2
Compression and communication in the cultural evolution of linguistic structure.
Cognition. 2015 Aug;141:87-102. doi: 10.1016/j.cognition.2015.03.016. Epub 2015 May 14.
3
The Now-or-Never bottleneck: A fundamental constraint on language.
Behav Brain Sci. 2016 Jan;39:e62. doi: 10.1017/S0140525X1500031X. Epub 2015 Apr 14.
4
Language evolution can be shaped by the structure of the world.
Cogn Sci. 2014 May-Jun;38(4):775-93. doi: 10.1111/cogs.12102. Epub 2014 Jan 24.
5
An integrated theory of language production and comprehension.
Behav Brain Sci. 2013 Aug;36(4):329-47. doi: 10.1017/S0140525X12001495. Epub 2013 Jun 24.
6
The effect of word predictability on reading time is logarithmic.
Cognition. 2013 Sep;128(3):302-19. doi: 10.1016/j.cognition.2013.02.013. Epub 2013 Jun 6.
7
Info/information theory: speakers choose shorter words in predictive contexts.
Cognition. 2013 Feb;126(2):313-8. doi: 10.1016/j.cognition.2012.09.010. Epub 2012 Oct 30.
8
Kinship categories across languages reflect general communicative principles.
Science. 2012 May 25;336(6084):1049-54. doi: 10.1126/science.1218811.
9
Word lengths are optimized for efficient communication.
Proc Natl Acad Sci U S A. 2011 Mar 1;108(9):3526-9. doi: 10.1073/pnas.1012551108. Epub 2011 Jan 28.
10
Thirty years and counting: finding meaning in the N400 component of the event-related brain potential (ERP).
Annu Rev Psychol. 2011;62:621-47. doi: 10.1146/annurev.psych.093008.131123.

文献AI研究员

20分钟写一篇综述,助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型,支持多种主流文档格式。

立即体验