Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA; Department of Electrical Engineering, Columbia University, New York, NY 10027, USA.
Mortimer B. Zuckerman Mind Brain Behavior Institute, Columbia University, New York, NY 10027, USA; Donders Institute for Brain Cognition and Behavior, Radboud University, Nijmegen, the Netherlands; Maastricht Centre for Systems Biology, Faculty of Science and Engineering, Maastricht University, Maastricht, the Netherlands.
Curr Biol. 2022 Sep 26;32(18):3971-3986.e4. doi: 10.1016/j.cub.2022.07.047. Epub 2022 Aug 15.
How the human auditory cortex represents spatially separated simultaneous talkers and how talkers' locations and voices modulate the neural representations of attended and unattended speech are unclear. Here, we measured the neural responses from electrodes implanted in neurosurgical patients as they performed single-talker and multi-talker speech perception tasks. We found that spatial separation between talkers caused a preferential encoding of the contralateral speech in Heschl's gyrus (HG), planum temporale (PT), and superior temporal gyrus (STG). Location and spectrotemporal features were encoded in different aspects of the neural response. Specifically, the talker's location changed the mean response level, whereas the talker's spectrotemporal features altered the variation of the response around its baseline. These components were differentially modulated by the attended talker's voice or location, which improved the population decoding of attended speech features. Attentional modulation due to the talker's voice appeared only in auditory areas with longer latencies, but attentional modulation due to location was present throughout. Our results show that spatial multi-talker speech perception relies upon a separable pre-attentive neural representation, which could be further tuned by top-down attention to the location and voice of the talker.