Department of Electrical Engineering (ESAT), Stadius Center for Dynamical Systems, Signal Processing and Data Analytics, KU Leuven, Kasteelpark Arenberg 10, B-3001 Leuven, Belgium. Department of Neurosciences, ExpORL, KU Leuven, Herestraat 49 bus 721, B-3000 Leuven, Belgium.
J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12.
A listener's neural responses can be decoded to identify the speaker the person is attending to in a cocktail party environment. Such auditory attention detection methods have the potential to provide noise suppression algorithms in hearing devices with information about the listener's attention. A challenge is the effect of noise and other acoustic conditions that can reduce the attention detection accuracy. Specifically, noise can impact the ability of the person to segregate the sound sources and perform selective attention, as well as the external signal processing necessary to decode the attention effectively. The aim of this work is to systematically analyze the effect of noise level and speaker position on attention decoding accuracy.
28 subjects participated in the experiment. Auditory stimuli consisted of stories narrated by different speakers from two different locations, along with surrounding multi-talker background babble. EEG signals of the subjects were recorded while they focused on one story and ignored the other. The strength of the babble noise as well as the spatial separation between the two speakers were varied between presentations. Spatio-temporal decoders were trained for each subject, and applied to decode attention of the subjects from every 30 s segment of data. Behavioral speech recognition thresholds were obtained for the different speaker separations.
Both the background noise level and the angular separation between speakers affected attention decoding accuracy. Remarkably, attention decoding performance was seen to increase with the inclusion of moderate background noise (versus no noise), while across the different noise conditions performance dropped significantly with increasing noise level. We also observed that decoding accuracy improved with increasing speaker separation, exhibiting the advantage of spatial release from masking. Furthermore, the effect of speaker separation on the decoding accuracy became stronger when the background noise level increased. A significant correlation between speech intelligibility and attention decoding accuracy was found across conditions.
This work shows how the background noise level and relative positions of competing talkers impact attention decoding accuracy. It indicates in which circumstances a neuro-steered noise suppression system may need to operate, in function of acoustic conditions. It also indicates the boundary conditions for the operation of EEG-based attention detection systems in neuro-steered hearing prostheses.
通过解码听者的神经反应,可以识别出在鸡尾酒会环境中听者正在关注的说话者。这种听觉注意力检测方法有可能为听力设备提供噪声抑制算法,并提供关于听者注意力的信息。一个挑战是噪声和其他声学条件的影响,这些条件可能会降低注意力检测的准确性。具体来说,噪声会影响人分离声源和进行选择性注意的能力,以及有效地解码注意力所需的外部信号处理。这项工作的目的是系统地分析噪声水平和说话者位置对注意力解码准确性的影响。
28 名受试者参与了实验。听觉刺激由来自两个不同位置的不同说话者讲述的故事以及周围的多说话者背景噪音组成。当受试者专注于一个故事并忽略另一个故事时,记录他们的 EEG 信号。在呈现过程中,改变背景噪声的强度以及两个说话者之间的空间分离。为每个受试者训练时空解码器,并将其应用于从每 30 秒数据段解码受试者的注意力。对于不同的说话者分离,获得了言语识别的行为阈值。
背景噪声水平和说话者之间的角度分离都影响注意力解码的准确性。值得注意的是,与没有噪声相比,注意力解码性能随着适度背景噪声的增加而提高,而在不同的噪声条件下,随着噪声水平的增加,性能显著下降。我们还观察到,随着说话者分离的增加,解码准确性提高,表现出空间掩蔽释放的优势。此外,当背景噪声水平增加时,说话者分离对解码准确性的影响变得更强。在各种条件下,都发现了言语可懂度和注意力解码准确性之间的显著相关性。
这项工作展示了背景噪声水平和竞争说话者的相对位置如何影响注意力解码的准确性。它表明了在何种情况下神经引导的噪声抑制系统需要根据声学条件进行操作。它还指示了基于 EEG 的注意力检测系统在神经引导听力假体中的操作的边界条件。