Sánchez-Costa Thaiz, Carboni Alejandra, Cervantes Constantino Francisco
Master's Program in Cognitive Sciences, Universidad de la República, Uruguay; Centro de Investigación Básica en Psicología, Facultad de Psicología, Universidad de la República, Uruguay.
Centro de Investigación Básica en Psicología, Facultad de Psicología, Universidad de la República, Uruguay; Instituto de Fundamentos y Métodos en Psicología, Facultad de Psicología, Universidad de la República, Uruguay.
Cortex. 2025 Aug;189:1-19. doi: 10.1016/j.cortex.2025.05.003. Epub 2025 May 20.
When the brain focuses on a conversation in a noisy environment, it exploits past experience to prioritize relevant elements from the auditory scene. This prompts the question of what changes occur in the selective neural processing of speech mixtures as listeners garner prior experience about single speech objects. In three different priming experiments, we quantified cortical selection of temporal landmarks from continuous speech, applying the temporal response function (TRF) method to single-trial electroencephalography (EEG) recordings. The designs specifically addressed how attention interacts with exact (Experiment 1), voice (Experiment 2a), or message (Experiment 2b) content priming of the target or background speakers in cortical responses to speech. Our results demonstrate that, during multispeaker listening, attentional gains typical of cortical responses under speech selection are met with attenuations as a consequence of prior experience. The changes were observed at the P2 processing stage (220-320 msec) of speech envelope onset processing and were specific to responses to primed speech targets (Experiment 1). Suppressions at stages earlier than the P2, or under partial priming conditions (Experiments 2a and 2b), were not observed. An exploratory analysis suggests the observed P2 reduction predicts listeners' ability to report target words, consistent with this component encoding in part temporal prediction error about onset edge cues exclusive to target speech. Our results show that at this late and definitive stage of selective attention, the auditory system may test the evidence for its own predictive model of the noise-invariant speech stream. Precise inference of its temporal structure is bound to tag all checkpoints where auditory evidence can be most reliably connected into higher-order representations of continuous speech.
当大脑在嘈杂环境中专注于一段对话时,它会利用过去的经验从听觉场景中筛选出相关元素。这就引出了一个问题:随着听众积累了关于单个语音对象的先前经验,语音混合的选择性神经处理会发生哪些变化?在三个不同的启动实验中,我们通过对单次试验脑电图(EEG)记录应用时间响应函数(TRF)方法,量化了从连续语音中对时间标记的皮层选择。这些设计专门探讨了在对语音的皮层反应中,注意力如何与目标或背景说话者的精确(实验1)、声音(实验2a)或信息(实验2b)内容启动相互作用。我们的结果表明,在多说话者聆听过程中,由于先前经验的影响,语音选择下典型的皮层反应中的注意力增益会减弱。这种变化在语音包络起始处理的P2处理阶段(220 - 320毫秒)被观察到,并且特定于对启动的语音目标的反应(实验1)。在早于P2的阶段或部分启动条件下(实验2a和2b)未观察到抑制现象。一项探索性分析表明,观察到的P2降低预示着听众报告目标单词的能力,这与该成分部分编码关于目标语音独有的起始边缘线索的时间预测误差一致。我们的结果表明,在这个选择性注意力的晚期和决定性阶段,听觉系统可能会检验其自身关于噪声不变语音流的预测模型的证据。对其时间结构的精确推断必然会标记出所有能将听觉证据最可靠地连接到连续语音的高阶表示的检查点。