Cervantes Constantino Francisco, Caputi Ángel
Instituto de Investigaciones Biológicas Clemente Estable, Department of Integrative and Computational Neurosciences, Av. Italia 3318, Montevideo, 11.600, Uruguay.
Facultad de Psicología, Universidad de la República.
Cereb Cortex. 2024 Dec 3;34(12). doi: 10.1093/cercor/bhae472.
A social scene is particularly informative when people are distinguishable. To understand somebody amid a "cocktail party" chatter, we automatically index their voice. This ability is underpinned by parallel processing of vocal spectral contours from speech sounds, but it has not yet been established how this occurs in the brain's cortex. We investigate single-trial neural tracking of slow frequency modulations in speech using electroencephalography. Participants briefly listened to unfamiliar single speakers, and in addition, they performed a cocktail party comprehension task. Quantified through stimulus reconstruction methods, robust tracking was found in neural responses to slow (delta-theta range) modulations of frequency contours in the fourth and fifth formant band, equivalent to the 3.5-5 KHz audible range. The spectral spacing between neighboring instantaneous frequency contours (ΔF), which also yields indexical information from the vocal tract, was similarly decodable. Moreover, EEG evidence of listeners' spectral tracking abilities predicted their chances of succeeding at selective listening when faced with two-speaker speech mixtures. In summary, the results indicate that the communicating brain can rely on locking of cortical rhythms to major changes led by upper resonances of the vocal tract. Their corresponding articulatory mechanics hence continuously issue a fundamental credential for listeners to target in real time.
当人们能够被区分开来时,社交场景会特别具有信息量。为了在“鸡尾酒会”的闲聊中理解某个人,我们会自动对他们的声音进行索引。这种能力是由对语音声音的声谱轮廓进行并行处理来支撑的,但大脑皮层中这一过程是如何发生的尚未明确。我们使用脑电图来研究语音中低频调制的单次试验神经追踪。参与者短暂聆听不熟悉的单一说话者讲话,此外,他们还进行了一个鸡尾酒会理解任务。通过刺激重建方法进行量化,发现在对第四和第五共振峰频段中频率轮廓的缓慢(δ-θ范围)调制的神经反应中存在稳健的追踪,这相当于3.5 - 5千赫兹的可听范围。相邻瞬时频率轮廓之间的频谱间距(ΔF)同样是可解码的,它也能从声道中产生索引信息。此外,听众频谱追踪能力的脑电图证据预测了他们在面对两个说话者的语音混合时选择性聆听成功的几率。总之,结果表明,进行交流的大脑可以依赖皮层节律与由声道上部共振引起的主要变化同步。因此,它们相应的发音机制不断为听众实时定位提供基本依据。