Karunathilake I M Dushyanthi, Brodbeck Christian, Bhattasali Shohini, Resnik Philip, Simon Jonathan Z
Department of Electrical and Computer Engineering, University of Maryland, College Park, MD, 20742.
Department of Computing and Software, McMaster University, Hamilton, ON, Canada.
bioRxiv. 2024 Dec 10:2024.02.02.578603. doi: 10.1101/2024.02.02.578603.
When we listen to speech, our brain's neurophysiological responses "track" its acoustic features, but it is less well understood how these auditory responses are enhanced by linguistic content. Here, we recorded magnetoencephalography (MEG) responses while subjects listened to four types of continuous-speech-like passages: speech-envelope modulated noise, English-like non-words, scrambled words, and a narrative passage. Temporal response function (TRF) analysis provides strong neural evidence for the emergent features of speech processing in cortex, from acoustics to higher-level linguistics, as incremental steps in neural speech processing. Critically, we show a stepwise hierarchical progression of progressively higher order features over time, reflected in both bottom-up (early) and top-down (late) processing stages. Linguistically driven top-down mechanisms take the form of late N400-like responses, suggesting a central role of predictive coding mechanisms at multiple levels. As expected, the neural processing of lower-level acoustic feature responses is bilateral or right lateralized, with left lateralization emerging only for lexical-semantic features. Finally, our results identify potential neural markers, linguistic level late responses, derived from TRF components modulated by linguistic content, suggesting that these markers are indicative of speech rather than mere speech perception.
当我们听言语时,大脑的神经生理反应会“追踪”其声学特征,但对于这些听觉反应如何通过语言内容得到增强,我们的了解还比较少。在这里,我们在受试者聆听四种类似连续言语的段落时记录了脑磁图(MEG)反应:言语包络调制噪声、类似英语的非词、打乱顺序的词以及一篇叙述性段落。时间响应函数(TRF)分析为皮质中言语处理的新兴特征提供了有力的神经证据,从声学特征到高级语言学特征,这些特征是神经言语处理中的渐进步骤。至关重要的是,我们展示了随着时间推移逐渐出现的更高阶特征的逐步分层进展,这在自下而上(早期)和自上而下(晚期)的处理阶段均有体现。语言驱动的自上而下机制表现为类似晚期N400的反应,这表明预测编码机制在多个层面发挥着核心作用。正如预期的那样,较低级声学特征反应的神经处理是双侧的或右侧化的,只有词汇语义特征的处理会出现左侧化。最后,我们的研究结果确定了潜在的神经标记,即由语言内容调制的TRF成分所产生的语言水平晚期反应,这表明这些标记指示的是言语,而不仅仅是言语感知。