Machine learning for decoding listeners' attention from electroencephalography evoked by continuous speech.

Author information

de Taillez Tobias, Kollmeier Birger, Meyer Bernd T

Affiliations

Medizinische Physik and Cluster of Excellence Hearing4all, Carl von Ossietzky Universität, Oldenburg, 26129, Germany.

Publication information

Eur J Neurosci. 2020 Mar;51(5):1234-1241. doi: 10.1111/ejn.13790. Epub 2018 Jan 4.

DOI: 10.1111/ejn.13790

PMID: 29205588

Abstract

Previous research has shown that it is possible to predict which speaker is attended in a multispeaker scene by analyzing a listener's electroencephalography (EEG) activity. In this study, existing linear models that learn the mapping from neural activity to an attended speech envelope are replaced by a non-linear neural network (NN). The proposed architecture takes into account the temporal context of the estimated envelope and is evaluated using EEG data obtained from 20 normal-hearing listeners who focused on one speaker in a two-speaker setting. The network is optimized with respect to the frequency range and the temporal segmentation of the EEG input, as well as the cost function used to estimate the model parameters. To identify the salient cues involved in auditory attention, a relevance algorithm is applied that highlights the electrode signals most important for attention decoding. In contrast to linear approaches, the NN profits from a wider EEG frequency range (1-32 Hz) and achieves a performance seven times higher than the linear baseline. Relevant EEG activations following the speech stimulus after 170 ms at physiologically plausible locations were found. This was not observed when the model was trained on the unattended speaker. Our findings therefore indicate that non-linear NNs can provide insight into physiological processes by analyzing EEG activity.
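
The decision step behind this kind of envelope-based attention decoding can be sketched as follows. The abstract does not spell out the decision rule, so this sketch assumes the comparison commonly used in this line of work: an envelope reconstructed from EEG is correlated with the envelope of each speaker, and the speaker with the higher correlation is taken as the attended one. The functions `pearson` and `decode_attention`, the synthetic envelopes, and the noise level are all hypothetical illustrations, not the authors' implementation, and the reconstruction itself (linear model or NN) is replaced here by a noisy copy of one envelope.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def decode_attention(reconstructed, envelope_a, envelope_b):
    """Pick the speaker whose envelope correlates best with the reconstruction.

    Assumed decision rule (not stated in the abstract): the higher
    correlation wins. Returns the label and both correlations.
    """
    r_a = pearson(reconstructed, envelope_a)
    r_b = pearson(reconstructed, envelope_b)
    label = "A" if r_a > r_b else "B"
    return label, r_a, r_b


# Synthetic stand-ins for two independent speech envelopes (non-negative).
rng = random.Random(0)
n_samples = 2000
envelope_a = [abs(rng.gauss(0.0, 1.0)) for _ in range(n_samples)]
envelope_b = [abs(rng.gauss(0.0, 1.0)) for _ in range(n_samples)]

# Hypothetical decoder output: a noisy, attenuated copy of speaker A's
# envelope, standing in for whatever model maps EEG to an envelope.
reconstructed = [0.5 * a + rng.gauss(0.0, 1.0) for a in envelope_a]

label, r_a, r_b = decode_attention(reconstructed, envelope_a, envelope_b)
print("decoded speaker:", label)
print("r(A) =", round(r_a, 3), " r(B) =", round(r_b, 3))
```

In a real pipeline, `reconstructed` would come from a trained model applied to a window of EEG, and decoding accuracy would be the fraction of windows for which the chosen label matches the speaker the listener was instructed to attend to.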


Similar articles

Congruent audiovisual speech enhances auditory attention decoding with EEG.

J Neural Eng. 2019 Nov 6;16(6):066033. doi: 10.1088/1741-2552/ab4340.

Effects of directional sound processing and listener's motivation on EEG responses to continuous noisy speech: Do normal-hearing and aided hearing-impaired listeners differ?

Hear Res. 2019 Jun;377:260-270. doi: 10.1016/j.heares.2019.04.005. Epub 2019 Apr 11.

EEG-based auditory attention detection: boundary conditions for background noise and speaker positions.

J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12.

Effects of Sensorineural Hearing Loss on Cortical Synchronization to Competing Speech during Selective Attention.

J Neurosci. 2020 Mar 18;40(12):2562-2572. doi: 10.1523/JNEUROSCI.1936-19.2020. Epub 2020 Feb 24.

EEG-based auditory attention decoding using speech-level-based segmented computational models.

J Neural Eng. 2021 May 25;18(4). doi: 10.1088/1741-2552/abfeba.

Robust decoding of the speech envelope from EEG recordings through deep neural networks.

J Neural Eng. 2022 Jul 6;19(4). doi: 10.1088/1741-2552/ac7976.

Impact of Different Acoustic Components on EEG-Based Auditory Attention Decoding in Noisy and Reverberant Conditions.

IEEE Trans Neural Syst Rehabil Eng. 2019 Apr;27(4):652-663. doi: 10.1109/TNSRE.2019.2903404. Epub 2019 Mar 7.

Noise-robust cortical tracking of attended speech in real-world acoustic scenes.

Neuroimage. 2017 Aug 1;156:435-444. doi: 10.1016/j.neuroimage.2017.04.026. Epub 2017 Apr 13.

Neural tracking to go: auditory attention decoding and saliency detection with mobile EEG.

J Neural Eng. 2022 Jan 6;18(6). doi: 10.1088/1741-2552/ac42b5.

Cited by

Contrastive representation learning with transformers for robust auditory EEG decoding.

Sci Rep. 2025 Aug 6;15(1):28744. doi: 10.1038/s41598-025-13646-4.

A Brain-Computer Interface for Improving Auditory Attention in Multi-Talker Environments.

bioRxiv. 2025 Mar 13:2025.03.13.641661. doi: 10.1101/2025.03.13.641661.

Cognitive component of auditory attention to natural speech events.

Front Hum Neurosci. 2025 Jan 6;18:1460139. doi: 10.3389/fnhum.2024.1460139. eCollection 2024.

Auditory-GAN: deep learning framework for improved auditory spatial attention detection.

PeerJ Comput Sci. 2024 Oct 30;10:e2394. doi: 10.7717/peerj-cs.2394. eCollection 2024.

Classifying coherent versus nonsense speech perception from EEG using linguistic speech features.

Sci Rep. 2024 Aug 14;14(1):18922. doi: 10.1038/s41598-024-69568-0.

Convolutional neural networks can identify brain interactions involved in decoding spatial auditory attention.

PLoS Comput Biol. 2024 Aug 8;20(8):e1012376. doi: 10.1371/journal.pcbi.1012376. eCollection 2024 Aug.

Decoding of the speech envelope from EEG using the VLAAI deep neural network.

Sci Rep. 2023 Jan 16;13(1):812. doi: 10.1038/s41598-022-27332-2.

Decoding the cognitive states of attention and distraction in a real-life setting using EEG.

Sci Rep. 2022 Nov 30;12(1):20649. doi: 10.1038/s41598-022-24417-w.

Decoding Object-Based Auditory Attention from Source-Reconstructed MEG Alpha Oscillations.

J Neurosci. 2021 Oct 13;41(41):8603-8617. doi: 10.1523/JNEUROSCI.0583-21.2021. Epub 2021 Aug 24.

Extracting the Auditory Attention in a Dual-Speaker Scenario From EEG Using a Joint CNN-LSTM Model.

Front Physiol. 2021 Aug 2;12:700655. doi: 10.3389/fphys.2021.700655. eCollection 2021.
