Seeking Temporal Predictability in Speech: Comparing Statistical Approaches on 18 World Languages.

Author information

Jadoul Yannick, Ravignani Andrea, Thompson Bill, Filippi Piera, de Boer Bart

Affiliation

Artificial Intelligence Lab, Vrije Universiteit Brussel, Brussels, Belgium.

Publication information

Front Hum Neurosci. 2016 Dec 2;10:586. doi: 10.3389/fnhum.2016.00586. eCollection 2016.

DOI:10.3389/fnhum.2016.00586

PMID:27994544

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC5133256/

Abstract

Temporal regularities in speech, such as interdependencies in the timing of speech events, are thought to scaffold early acquisition of the building blocks in speech. By providing on-line clues to the location and duration of upcoming syllables, temporal structure may aid segmentation and clustering of continuous speech into separable units. This hypothesis tacitly assumes that learners exploit predictability in the temporal structure of speech. Existing measures of speech timing tend to focus on first-order regularities among adjacent units, and are overly sensitive to idiosyncrasies in the data they describe. Here, we compare several statistical methods on a sample of 18 languages, testing whether syllable occurrence is predictable over time. Rather than looking for differences between languages, we aim to find, across languages and using clearly defined acoustic (rather than orthographic) measures, temporal predictability in the speech signal that could be exploited by a language learner. First, we analyse distributional regularities using two novel techniques: a Bayesian ideal learner analysis, and a simple distributional measure. Second, we model temporal structure (regularities arising in an ordered series of syllable timings), testing the hypothesis that non-adjacent temporal structures may explain the gap between subjectively perceived temporal regularities and the absence of universally accepted lower-order objective measures. Together, our analyses provide limited evidence for predictability at different time scales, though higher-order predictability is difficult to reliably infer. We conclude that temporal predictability in speech may well arise from a combination of individually weak perceptual cues at multiple structural levels, but is challenging to pinpoint.
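
To make the contrast between adjacent-unit and non-adjacent regularities concrete, the following is a minimal sketch and not the authors' analysis. It assumes syllable timing is represented as a list of inter-onset intervals in seconds. It computes the normalized Pairwise Variability Index (nPVI), a widely used first-order measure that compares only neighbouring intervals, and a plain sample autocorrelation at a chosen lag as one simple way to look at dependencies between non-adjacent intervals. The function names and the example data are hypothetical.

```python
from statistics import mean


def npvi(intervals):
    """First-order measure: mean normalized difference between adjacent intervals, times 100."""
    if len(intervals) < 2:
        raise ValueError("need at least two intervals")
    terms = [
        abs(a - b) / ((a + b) / 2)
        for a, b in zip(intervals, intervals[1:])
    ]
    return 100 * mean(terms)


def lag_autocorrelation(intervals, lag):
    """Sample autocorrelation of the interval series at the given lag (lag >= 1)."""
    n = len(intervals)
    if not 0 < lag < n:
        raise ValueError("lag must satisfy 0 < lag < len(intervals)")
    mu = mean(intervals)
    variance_sum = sum((x - mu) ** 2 for x in intervals)
    if variance_sum == 0:
        raise ValueError("autocorrelation is undefined for a constant series")
    covariance_sum = sum(
        (intervals[i] - mu) * (intervals[i + lag] - mu)
        for i in range(n - lag)
    )
    return covariance_sum / variance_sum


if __name__ == "__main__":
    # Hypothetical inter-onset intervals (seconds) with a short-long alternation.
    example = [0.1, 0.3, 0.1, 0.3, 0.1, 0.3]
    print("nPVI:", npvi(example))
    print("lag-1 autocorrelation:", lag_autocorrelation(example, 1))
    print("lag-2 autocorrelation:", lag_autocorrelation(example, 2))
```

In this toy series the nPVI is high because every adjacent pair differs, while the lag-2 autocorrelation is positive because every second interval repeats. A purely adjacent-unit measure reports the variability but not the regular alternation behind it, which is the kind of gap the abstract attributes to first-order measures.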

