鸡尾酒会场景中目标说话人的 EEG 解码：关于说话人位置动态切换的考虑因素。

EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location.

机构信息

School of Engineering, Trinity College Dublin, University of Dublin, Dublin, Ireland. Trinity Centre for Bioengineering, Trinity College Dublin, Dublin, Ireland.

出版信息

J Neural Eng. 2019 Jun;16(3):036017. doi: 10.1088/1741-2552/ab0cf1. Epub 2019 Mar 5.

DOI:10.1088/1741-2552/ab0cf1

PMID:30836345

Abstract

OBJECTIVE

It has been shown that attentional selection in a simple dichotic listening paradigm can be decoded offline by reconstructing the stimulus envelope from single-trial neural response data. Here, we test the efficacy of this approach in an environment with non-stationary talkers. We then look beyond the envelope reconstructions themselves and consider whether incorporating the decoder values-which reflect the weightings applied to the multichannel EEG data at different time lags and scalp locations when reconstructing the stimulus envelope-can improve decoding performance.

APPROACH

High-density EEG was recorded as subjects attended to one of two talkers. The two speech streams were filtered using HRTFs, and the talkers were alternated between the left and right locations at varying intervals to simulate a dynamic environment. We trained spatio-temporal decoders mapping from EEG data to the attended and unattended stimulus envelopes. We then decoded auditory attention by (1) using the attended decoder to reconstruct the envelope and (2) exploiting the fact that decoder weightings themselves contain signatures of attention, resulting in consistent patterns across subjects that can be classified.

MAIN RESULTS

The previously established decoding approach was found to be effective even with non-stationary talkers. Signatures of attentional selection and attended direction were found in the spatio-temporal structure of the decoders and were consistent across subjects. The inclusion of decoder weights into the decoding algorithm resulted in significantly improved decoding accuracies (from 61.07% to 65.31% for 4 s windows). An attempt was made to include alpha power lateralization as another feature to improve decoding, although this was unsuccessful at the single-trial level.

SIGNIFICANCE

This work suggests that the spatial-temporal decoder weights can be utilised to improve decoding. More generally, looking beyond envelope reconstruction and incorporating other signatures of attention is an avenue that should be explored to improve selective auditory attention decoding.

摘要

目的

已经证明，在简单的双耳分听范式中，通过从单试次神经反应数据中重建刺激包络，可以对注意力选择进行离线解码。在这里，我们在非稳态说话者的环境中测试这种方法的效果。然后，我们超越包络重建本身，考虑是否可以通过纳入解码器值来提高解码性能，解码器值反映了在重建刺激包络时对多通道 EEG 数据在不同时间延迟和头皮位置应用的权重。

方法

当受试者关注两个说话者中的一个时，记录高密度 EEG。使用 HRTFs 对两个语音流进行滤波，并以不同的间隔将说话者交替到左右位置，以模拟动态环境。我们训练了从 EEG 数据映射到注意力和非注意力刺激包络的时空解码器。然后，我们通过以下两种方式进行听觉注意力解码：（1）使用注意力解码器重建包络；（2）利用解码器权重本身包含注意力特征的事实，从而在不同的受试者中产生一致的模式，可以进行分类。

主要结果

即使对于非稳态说话者，先前建立的解码方法也被证明是有效的。在解码器的时空结构中发现了注意力选择和注意力方向的特征，并且在受试者中是一致的。将解码器权重纳入解码算法中，解码精度显著提高（4 秒窗口从 61.07%提高到 65.31%）。尽管在单试次水平上不成功，但尝试将α波侧化作为另一个特征来提高解码。

意义

这项工作表明，时空解码器权重可用于提高解码。更一般地说，超越包络重建并纳入其他注意力特征是一个应该探索的途径，以提高选择性听觉注意力解码。

相似文献

EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location.

J Neural Eng. 2019 Jun;16(3):036017. doi: 10.1088/1741-2552/ab0cf1. Epub 2019 Mar 5.

Noise-robust cortical tracking of attended speech in real-world acoustic scenes.

Neuroimage. 2017 Aug 1;156:435-444. doi: 10.1016/j.neuroimage.2017.04.026. Epub 2017 Apr 13.

Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG.

Neuroimage. 2020 Jan 15;205:116283. doi: 10.1016/j.neuroimage.2019.116283. Epub 2019 Oct 17.

EEG-based auditory attention detection: boundary conditions for background noise and speaker positions.

J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12.

The effect of head-related filtering and ear-specific decoding bias on auditory attention detection.

J Neural Eng. 2016 Oct;13(5):056014. doi: 10.1088/1741-2560/13/5/056014. Epub 2016 Sep 13.

EEG-based auditory attention decoding using speech-level-based segmented computational models.

J Neural Eng. 2021 May 25;18(4). doi: 10.1088/1741-2552/abfeba.

Look at me when I'm talking to you: Selective attention at a multisensory cocktail party can be decoded using stimulus reconstruction and alpha power modulations.

Eur J Neurosci. 2019 Oct;50(8):3282-3295. doi: 10.1111/ejn.14425. Epub 2019 May 17.

Decoding the attended speech stream with multi-channel EEG: implications for online, daily-life applications.

J Neural Eng. 2015 Aug;12(4):046007. doi: 10.1088/1741-2560/12/4/046007. Epub 2015 Jun 2.

Congruent audiovisual speech enhances auditory attention decoding with EEG.

J Neural Eng. 2019 Nov 6;16(6):066033. doi: 10.1088/1741-2552/ab4340.

Envelope responses in single-trial EEG indicate attended speaker in a 'cocktail party'.

J Neural Eng. 2014 Aug;11(4):046015. doi: 10.1088/1741-2560/11/4/046015. Epub 2014 Jun 25.

引用本文的文献

Neural Speech Tracking during Selective Attention: A Spatially Realistic Audiovisual Study.

eNeuro. 2025 Jun 24;12(6). doi: 10.1523/ENEURO.0132-24.2025. Print 2025 Jun.

Individual Differences in Cognition and Perception Predict Neural Processing of Speech in Noise for Audiometrically Normal Listeners.

eNeuro. 2025 Apr 29;12(4). doi: 10.1523/ENEURO.0381-24.2025. Print 2025 Apr.

Selective Auditory Attention Detection Using Combined Transformer and Convolutional Graph Neural Networks.

Bioengineering (Basel). 2024 Nov 30;11(12):1216. doi: 10.3390/bioengineering11121216.

Top-down modulation of dichotic listening affects interhemispheric connectivity: an electroencephalography study.

Front Neurosci. 2024 Sep 12;18:1424746. doi: 10.3389/fnins.2024.1424746. eCollection 2024.

Neural alpha oscillations index context-driven perception of ambiguous vowel sequences.

iScience. 2023 Nov 14;26(12):108457. doi: 10.1016/j.isci.2023.108457. eCollection 2023 Dec 15.

A Speech-Level-Based Segmented Model to Decode the Dynamic Auditory Attention States in the Competing Speaker Scenes.

Front Neurosci. 2022 Feb 10;15:760611. doi: 10.3389/fnins.2021.760611. eCollection 2021.

EEG alpha and pupil diameter reflect endogenous auditory attention switching and listening effort.

Eur J Neurosci. 2022 Mar;55(5):1262-1277. doi: 10.1111/ejn.15616. Epub 2022 Feb 16.

Behavioral Account of Attended Stream Enhances Neural Tracking.

Front Neurosci. 2021 Dec 13;15:674112. doi: 10.3389/fnins.2021.674112. eCollection 2021.

Attention Differentially Affects Acoustic and Phonetic Feature Encoding in a Multispeaker Environment.

J Neurosci. 2022 Jan 26;42(4):682-691. doi: 10.1523/JNEUROSCI.1455-20.2021. Epub 2021 Dec 10.

Dynamic selective auditory attention detection using RNN and reinforcement learning.

Sci Rep. 2021 Jul 29;11(1):15497. doi: 10.1038/s41598-021-94876-0.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

鸡尾酒会场景中目标说话人的 EEG 解码：关于说话人位置动态切换的考虑因素。

EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location.

机构信息

出版信息

OBJECTIVE

APPROACH

MAIN RESULTS

SIGNIFICANCE

目的

方法

主要结果

意义

相似文献

引用本文的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

相似文献

引用本文的文献