School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China.
School of Information Science and Engineering, East China University of Science and Technology, Shanghai 200237, China.
Hear Res. 2024 Nov;453:109104. doi: 10.1016/j.heares.2024.109104. Epub 2024 Aug 14.
Auditory spatial attention detection (ASAD) seeks to determine which speaker in a surround sound field a listener is focusing on based on the one's brain biosignals. Although existing studies have achieved ASAD from a single-trial electroencephalogram (EEG), the huge inter-subject variability makes them generally perform poorly in cross-subject scenarios. Besides, most ASAD methods do not take full advantage of topological relationships between EEG channels, which are crucial for high-quality ASAD. Recently, some advanced studies have introduced graph-based brain topology modeling into ASAD, but how to calculate edge weights in a graph to better capture actual brain connectivity is worthy of further investigation. To address these issues, we propose a new ASAD method in this paper. First, we model a multi-channel EEG segment as a graph, where differential entropy serves as the node feature, and a static adjacency matrix is generated based on inter-channel mutual information to quantify brain functional connectivity. Then, different subjects' EEG graphs are encoded into a shared embedding space through a total variation graph neural network. Meanwhile, feature distribution alignment based on multi-kernel maximum mean discrepancy is adopted to learn subject-invariant patterns. Note that we align EEG embeddings of different subjects to reference distributions rather than align them to each other for the purpose of privacy preservation. A series of experiments on open datasets demonstrate that the proposed model outperforms state-of-the-art ASAD models in cross-subject scenarios with relatively low computational complexity, and feature distribution alignment improves the generalizability of the proposed model to a new subject.
听觉空间注意检测(ASAD)旨在根据听者的大脑生物信号确定其在环绕声场中关注的是哪个说话者。尽管现有研究已经从单次脑电图(EEG)中实现了 ASAD,但巨大的个体间可变性使得它们在跨个体场景中的表现普遍较差。此外,大多数 ASAD 方法并没有充分利用 EEG 通道之间的拓扑关系,而这些关系对于高质量的 ASAD 至关重要。最近,一些先进的研究已经将基于图的脑拓扑建模引入 ASAD 中,但是如何在图中计算边权重以更好地捕捉实际的脑连接性值得进一步研究。针对这些问题,我们在本文中提出了一种新的 ASAD 方法。首先,我们将多通道 EEG 段建模为一个图,其中差分熵作为节点特征,并且根据通道间互信息生成静态邻接矩阵以量化脑功能连接。然后,通过全变分图神经网络将不同个体的 EEG 图编码到共享嵌入空间中。同时,采用基于多核最大均值差异的特征分布对齐来学习与主体无关的模式。请注意,我们将不同主体的 EEG 嵌入与参考分布对齐,而不是将它们彼此对齐,以保护隐私。在公开数据集上进行的一系列实验表明,所提出的模型在跨个体场景中表现优于最先进的 ASAD 模型,具有相对较低的计算复杂度,并且特征分布对齐提高了所提出的模型对新主体的泛化能力。