Geravanchizadeh Masoud, Shaygan Asl Amir, Danishvar Sebelan
Faculty of Electrical & Computer Engineering, University of Tabriz, Tabriz 51666-15813, Iran.
College of Engineering, Design and Physical Sciences, Brunel University London, London UB8 3PH, UK.
Bioengineering (Basel). 2024 Nov 30;11(12):1216. doi: 10.3390/bioengineering11121216.
Attention is one of many human cognitive functions that are essential in everyday life. Given our limited processing capacity, attention helps us focus only on what matters. Focusing attention on one speaker in an environment with many speakers is a critical ability of the human auditory system. This paper proposes a new end-to-end method based on a combined transformer and graph convolutional neural network (TraGCNN) that can effectively detect auditory attention from electroencephalograms (EEGs). This approach eliminates the need for manual feature extraction, which is often time-consuming and subjective. Here, EEG signals are first converted to graphs. We then extract attention information from these graphs using spatial and temporal approaches. Finally, our models are trained with these data. Our model can detect auditory attention in both the spatial and temporal domains. The EEG input is first processed by transformer layers to obtain a sequential representation of the EEG based on attention onsets. Then, a family of graph convolutional layers is used to find the most active electrodes using the spatial positions of the electrodes. Finally, the corresponding EEG features of the active electrodes are fed into graph attention layers to detect auditory attention. The Fuglsang 2020 dataset is used in the experiments to train and test the proposed and baseline systems. Compared with state-of-the-art attention classification methods from the literature, the new TraGCNN approach yields the highest classification accuracy (80.12%). Additionally, the proposed model outperforms our previous graph-based model across different lengths of EEG segments. The new TraGCNN approach is advantageous because attention detection is achieved from the EEG signals of subjects without requiring speech stimuli, as is the case with conventional auditory attention detection methods.
Furthermore, evaluating the proposed model on EEG segments of different lengths shows that it is faster than our previous graph-based detection method in terms of computational complexity. The findings of this study have important implications for the understanding and assessment of auditory attention, which is crucial for many applications, such as brain-computer interface (BCI) systems, speech separation, and neuro-steered hearing aid development.
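The three-stage pipeline described above (transformer layers over the EEG time axis, graph convolution over the spatial electrode layout, and a graph-attention readout for classification) can be sketched in NumPy. This is a minimal illustrative forward pass only: all dimensions, random weights, and the toy electrode adjacency below are assumptions for demonstration, not values or details taken from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Hypothetical sizes (not from the paper): C electrodes, T samples, d features.
C, T, d = 8, 16, 4

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def self_attention(X, Wq, Wk, Wv):
    """Single-head scaled dot-product attention over the time axis
    (the transformer stage that yields a sequential EEG representation)."""
    Q, K, V = X @ Wq, X @ Wk, X @ Wv
    scores = softmax(Q @ K.T / np.sqrt(Q.shape[-1]))
    return scores @ V

def gcn_layer(H, A, W):
    """One graph-convolution step over the electrode graph:
    symmetric normalisation of adjacency A, feature mixing, ReLU."""
    A_hat = A + np.eye(A.shape[0])                  # add self-loops
    d_inv_sqrt = np.diag(1.0 / np.sqrt(A_hat.sum(axis=1)))
    return np.maximum(d_inv_sqrt @ A_hat @ d_inv_sqrt @ H @ W, 0.0)

def gat_pool(H, a):
    """Toy graph-attention readout: score each electrode, then return
    the attention-weighted sum of electrode features."""
    alpha = softmax(H @ a)                          # weights over electrodes
    return alpha @ H                                # graph-level embedding

# --- forward pass for one EEG segment -------------------------------
X = rng.standard_normal((T, C))                     # time steps x electrodes
Wq, Wk, Wv = (rng.standard_normal((C, C)) * 0.1 for _ in range(3))
Z = self_attention(X, Wq, Wk, Wv)                   # (T, C) temporal stage

# Toy symmetric electrode adjacency (a real model would build this
# from the spatial positions of the electrodes on the scalp).
A = np.triu((rng.random((C, C)) < 0.3).astype(float), 1)
A = A + A.T

H = gcn_layer(Z.T, A, rng.standard_normal((T, d)) * 0.1)   # (C, d) spatial stage
g = gat_pool(H, rng.standard_normal(d))                    # (d,) readout
probs = softmax(g @ (rng.standard_normal((d, 2)) * 0.1))   # attended-speaker probs
```

The final softmax produces a two-class posterior (e.g., attended speaker left vs. right); a trained model would learn all of the weight matrices end-to-end rather than sampling them randomly.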