Peelle Jonathan E, Davis Matthew H
Center for Cognitive Neuroscience and Department of Neurology, University of Pennsylvania Philadelphia, PA, USA.
Front Psychol. 2012 Sep 6;3:320. doi: 10.3389/fpsyg.2012.00320. eCollection 2012.
A key feature of speech is the quasi-regular rhythmic information contained in its slow amplitude modulations. In this article we review the information conveyed by speech rhythm, and the role of ongoing brain oscillations in listeners' processing of this content. Our starting point is the fact that speech is inherently temporal, and that rhythmic information conveyed by the amplitude envelope contains important markers for place and manner of articulation, segmental information, and speech rate. Behavioral studies demonstrate that amplitude envelope information is relied upon by listeners and plays a key role in speech intelligibility. Extending behavioral findings, data from neuroimaging - particularly electroencephalography (EEG) and magnetoencephalography (MEG) - point to phase locking by ongoing cortical oscillations to low-frequency information (~4-8 Hz) in the speech envelope. This phase modulation effectively encodes a prediction of when important events (such as stressed syllables) are likely to occur, and acts to increase sensitivity to these relevant acoustic cues. We suggest a framework through which such neural entrainment to speech rhythm can explain effects of speech rate on word and segment perception (i.e., that the perception of phonemes and words in connected speech is influenced by preceding speech rate). Neuroanatomically, acoustic amplitude modulations are processed largely bilaterally in auditory cortex, with intelligible speech resulting in differential recruitment of left-hemisphere regions. Notable among these is lateral anterior temporal cortex, which we propose functions in a domain-general fashion to support ongoing memory and integration of meaningful input. Together, the reviewed evidence suggests that low-frequency oscillations in the acoustic speech signal form the foundation of a rhythmic hierarchy supporting spoken language, mirrored by phase-locked oscillations in the human brain.
言语的一个关键特征是其缓慢幅度调制中包含的准规则节奏信息。在本文中,我们回顾了言语节奏所传达的信息,以及大脑持续振荡在听众处理这些内容中的作用。我们的出发点是,言语本质上具有时间性,并且幅度包络所传达的节奏信息包含了发音部位和方式、音段信息以及语速的重要标记。行为研究表明,听众依赖幅度包络信息,并且该信息在言语可懂度中起关键作用。扩展行为研究结果,神经成像数据——尤其是脑电图(EEG)和脑磁图(MEG)——表明大脑皮层的持续振荡与言语包络中的低频信息(约4 - 8赫兹)存在锁相。这种相位调制有效地编码了对重要事件(如重读音节)可能发生时间的预测,并提高了对这些相关声学线索的敏感度。我们提出了一个框架,通过该框架,这种对言语节奏的神经夹带可以解释语速对单词和音段感知的影响(即连贯言语中音素和单词的感知受先前语速的影响)。从神经解剖学角度来看,听觉皮层在很大程度上双侧处理声学幅度调制,可懂度言语会导致左半球区域的不同激活。其中值得注意的是外侧前颞叶皮层,我们认为它以一种领域通用的方式发挥作用,以支持对有意义输入的持续记忆和整合。总之,所回顾的证据表明,声学言语信号中的低频振荡构成了支持口语的节奏层次结构的基础,而人脑的锁相振荡则与之相对应。