Department of Electronic & Electrical Engineering, Trinity College, The University of Dublin, Dublin, Ireland.
Department of Mechanical, Manufacturing & Biomedical Engineering, Trinity College, The University of Dublin, Dublin, Ireland.
PLoS Comput Biol. 2021 Sep 17;17(9):e1009358. doi: 10.1371/journal.pcbi.1009358. eCollection 2021 Sep.
The human brain tracks amplitude fluctuations of both speech and music, which reflects acoustic processing in addition to the encoding of higher-order features and one's cognitive state. Comparing neural tracking of speech and music envelopes can elucidate stimulus-general mechanisms, but direct comparisons are confounded by differences in their envelope spectra. Here, we use a novel method of frequency-constrained reconstruction of stimulus envelopes using EEG recorded during passive listening. We expected to see music reconstruction match speech in a narrow range of frequencies, but instead we found that speech was reconstructed better than music for all frequencies we examined. Additionally, models trained on all stimulus types performed as well or better than the stimulus-specific models at higher modulation frequencies, suggesting a common neural mechanism for tracking speech and music. However, speech envelope tracking at low frequencies, below 1 Hz, was associated with increased weighting over parietal channels, which was not present for the other stimuli. Our results highlight the importance of low-frequency speech tracking and suggest an origin from speech-specific processing in the brain.
人类大脑会追踪语音和音乐的幅度波动,这反映了除了对更高阶特征的编码以及认知状态的编码之外的声学处理。比较语音和音乐包络的神经追踪可以阐明刺激的一般机制,但由于其包络频谱的差异,直接比较会受到混淆。在这里,我们使用一种新的方法,利用被动聆听期间记录的 EEG 对刺激包络进行频率约束重建。我们预计音乐重建会在较窄的频率范围内与语音匹配,但相反,我们发现对于我们检查的所有频率,语音的重建都优于音乐。此外,在更高的调制频率下,所有刺激类型的训练模型的表现与特定刺激模型一样好或更好,这表明跟踪语音和音乐的神经机制是通用的。然而,低于 1Hz 的低频语音包络跟踪与顶叶通道上的权重增加有关,而其他刺激则没有。我们的结果强调了低频语音跟踪的重要性,并暗示其起源于大脑中特定于语音的处理。