Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China; Department of Electrical and Electronic Engineering, The University of Hong Kong, Hong Kong Special Administrative Region, China.
Department of Electrical and Electronic Engineering, Southern University of Science and Technology, Shenzhen, China; School of Biomedical Engineering, Shenzhen University, Shenzhen, China.
Hear Res. 2019 Nov;383:107808. doi: 10.1016/j.heares.2019.107808. Epub 2019 Oct 4.
Previous behavioral and neurophysiological studies indicated that the use of an appropriate segmentation method to parse speech streams into meaningful chunks is of vital importance for the examination of sentence perception and intelligibility. Researchers have recently proposed speech segmentation methods employing the relative root-mean-square (RMS) intensity to separate sentences into segments with distinct intelligibility information. However, the effects of different RMS-level segments containing distinct intelligibility information on neural oscillations are not clear. Using scalp-recorded electroencephalography (EEG) data, we investigated the hypothesis that perceptual responses to different RMS-level-dependent speech segments would have distinct EEG characteristics derived from the power values at each frequency band and the relationship between acoustics and neural oscillations at different response time and spatial distribution. We analyzed the EEG power and synchronized neural oscillations corresponding to auditory temporal fluctuations when subjects listened to Mandarin sentences with only high-RMS-level segments and only middle-RMS-level segments preserved, respectively. The results showed significantly stronger EEG spectral power in the delta and theta bands for high-RMS-level stimuli compared with middle-RMS-level stimuli, indicating that the former carry more speech-parsing information at the syllabic level. Differences in neural synchronization were also found between the high- and middle-RMS-level stimuli, allowing for the derivation of intelligibility indices for cortical responses corresponding to different RMS-level segments. These findings suggest that both high- and middle-RMS-level segments drive delta and theta rhythms to track stimuli, and that neural oscillations employ different tracking patterns for these two segment types during auditory sentence processing. Moreover, they suggest that neural oscillations can serve as effective indices for the identification of reliable intelligibility factors in RMS-level-dependent stimuli.
先前的行为和神经生理学研究表明,使用适当的分段方法将语音流分割成语义上有意义的片段,对于考察句子感知和可理解性至关重要。研究人员最近提出了使用相对均方根(RMS)强度的语音分段方法,将句子分割成具有不同可理解性信息的片段。然而,不同包含不同可理解性信息的 RMS 水平片段对神经振荡的影响尚不清楚。我们使用头皮记录的脑电图(EEG)数据,研究了以下假设:对不同 RMS 水平相关的语音片段的感知反应将具有来自每个频带的功率值和不同响应时间和空间分布的声学和神经振荡之间的关系的独特 EEG 特征。我们分析了当受试者分别聆听仅保留高 RMS 水平片段和仅保留中 RMS 水平片段的普通话句子时,与听觉时间波动相对应的 EEG 功率和同步神经振荡。结果表明,与中 RMS 水平刺激相比,高 RMS 水平刺激在 delta 和 theta 频段的 EEG 谱功率明显更强,这表明前者在音节水平上携带更多的语音分段信息。在高 RMS 水平和中 RMS 水平刺激之间也发现了神经同步的差异,这允许为对应于不同 RMS 水平片段的皮质反应导出可理解性指数。这些发现表明,高 RMS 水平和中 RMS 水平片段都可以驱动 delta 和 theta 节律来跟踪刺激,并且在听觉句子处理过程中,神经振荡对这两种片段类型采用不同的跟踪模式。此外,它们表明神经振荡可以作为 RMS 水平相关刺激中可靠可理解性因素的有效指标。