School of Engineering, Trinity College Dublin, University of Dublin, Dublin, Ireland. Trinity Centre for Bioengineering, Trinity College Dublin, Dublin, Ireland.
J Neural Eng. 2019 Jun;16(3):036017. doi: 10.1088/1741-2552/ab0cf1. Epub 2019 Mar 5.
It has been shown that attentional selection in a simple dichotic listening paradigm can be decoded offline by reconstructing the stimulus envelope from single-trial neural response data. Here, we test the efficacy of this approach in an environment with non-stationary talkers. We then look beyond the envelope reconstructions themselves and consider whether incorporating the decoder values-which reflect the weightings applied to the multichannel EEG data at different time lags and scalp locations when reconstructing the stimulus envelope-can improve decoding performance.
High-density EEG was recorded as subjects attended to one of two talkers. The two speech streams were filtered using HRTFs, and the talkers were alternated between the left and right locations at varying intervals to simulate a dynamic environment. We trained spatio-temporal decoders mapping from EEG data to the attended and unattended stimulus envelopes. We then decoded auditory attention by (1) using the attended decoder to reconstruct the envelope and (2) exploiting the fact that decoder weightings themselves contain signatures of attention, resulting in consistent patterns across subjects that can be classified.
The previously established decoding approach was found to be effective even with non-stationary talkers. Signatures of attentional selection and attended direction were found in the spatio-temporal structure of the decoders and were consistent across subjects. The inclusion of decoder weights into the decoding algorithm resulted in significantly improved decoding accuracies (from 61.07% to 65.31% for 4 s windows). An attempt was made to include alpha power lateralization as another feature to improve decoding, although this was unsuccessful at the single-trial level.
This work suggests that the spatial-temporal decoder weights can be utilised to improve decoding. More generally, looking beyond envelope reconstruction and incorporating other signatures of attention is an avenue that should be explored to improve selective auditory attention decoding.
已经证明,在简单的双耳分听范式中,通过从单试次神经反应数据中重建刺激包络,可以对注意力选择进行离线解码。在这里,我们在非稳态说话者的环境中测试这种方法的效果。然后,我们超越包络重建本身,考虑是否可以通过纳入解码器值来提高解码性能,解码器值反映了在重建刺激包络时对多通道 EEG 数据在不同时间延迟和头皮位置应用的权重。
当受试者关注两个说话者中的一个时,记录高密度 EEG。使用 HRTFs 对两个语音流进行滤波,并以不同的间隔将说话者交替到左右位置,以模拟动态环境。我们训练了从 EEG 数据映射到注意力和非注意力刺激包络的时空解码器。然后,我们通过以下两种方式进行听觉注意力解码:(1)使用注意力解码器重建包络;(2)利用解码器权重本身包含注意力特征的事实,从而在不同的受试者中产生一致的模式,可以进行分类。
即使对于非稳态说话者,先前建立的解码方法也被证明是有效的。在解码器的时空结构中发现了注意力选择和注意力方向的特征,并且在受试者中是一致的。将解码器权重纳入解码算法中,解码精度显著提高(4 秒窗口从 61.07%提高到 65.31%)。尽管在单试次水平上不成功,但尝试将α波侧化作为另一个特征来提高解码。
这项工作表明,时空解码器权重可用于提高解码。更一般地说,超越包络重建并纳入其他注意力特征是一个应该探索的途径,以提高选择性听觉注意力解码。