• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

听觉注意力识别方法教程。

A Tutorial on Auditory Attention Identification Methods.

作者信息

Alickovic Emina, Lunner Thomas, Gustafsson Fredrik, Ljung Lennart

机构信息

Department of Electrical Engineering, Linkoping University, Linkoping, Sweden.

Eriksholm Research Centre, Oticon A/S, Snekkersten, Denmark.

出版信息

Front Neurosci. 2019 Mar 19;13:153. doi: 10.3389/fnins.2019.00153. eCollection 2019.

DOI:10.3389/fnins.2019.00153
PMID:30941002
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6434370/
Abstract

Auditory attention identification methods attempt to identify the sound source of a listener's interest by analyzing measurements of electrophysiological data. We present a tutorial on the numerous techniques that have been developed in recent decades, and we present an overview of current trends in multivariate correlation-based and model-based learning frameworks. The focus is on the use of linear relations between electrophysiological and audio data. The way in which these relations are computed differs. For example, canonical correlation analysis (CCA) finds a linear subset of electrophysiological data that best correlates to audio data and a similar subset of audio data that best correlates to electrophysiological data. Model-based (encoding and decoding) approaches focus on either of these two sets. We investigate the similarities and differences between these linear model philosophies. We focus on (1) correlation-based approaches (CCA), (2) encoding/decoding models based on dense estimation, and (3) (adaptive) encoding/decoding models based on sparse estimation. The specific focus is on sparsity-driven adaptive encoding models and comparing the methodology in state-of-the-art models found in the auditory literature. Furthermore, we outline the main signal processing pipeline for how to identify the attended sound source in a cocktail party environment from the raw electrophysiological data with all the necessary steps, complemented with the necessary MATLAB code and the relevant references for each step. Our main aim is to compare the methodology of the available methods, and provide numerical illustrations to some of them to get a feeling for their potential. A thorough performance comparison is outside the scope of this tutorial.

摘要

听觉注意力识别方法试图通过分析电生理数据测量值来识别听众感兴趣的声源。我们提供了一个关于近几十年来所开发的众多技术的教程,并概述了基于多变量相关性和基于模型的学习框架的当前趋势。重点在于电生理数据与音频数据之间线性关系的应用。计算这些关系的方式有所不同。例如,典型相关分析(CCA)会找到与音频数据最佳相关的电生理数据线性子集以及与电生理数据最佳相关的类似音频数据子集。基于模型(编码和解码)的方法则侧重于这两组中的任意一组。我们研究这些线性模型理念之间的异同。我们重点关注:(1)基于相关性的方法(CCA);(2)基于密集估计的编码/解码模型;以及(3)基于稀疏估计的(自适应)编码/解码模型。具体重点是稀疏性驱动的自适应编码模型,并比较听觉文献中现有模型的方法。此外,我们概述了在鸡尾酒会环境中如何从原始电生理数据识别被关注声源的主要信号处理流程,包括所有必要步骤,并辅以必要的MATLAB代码以及每个步骤的相关参考文献。我们的主要目的是比较现有方法的方法,并对其中一些方法提供数值示例,以感受它们的潜力。本教程不涉及全面的性能比较。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/369b/6434370/64e8702066ab/fnins-13-00153-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/369b/6434370/c17359860ae2/fnins-13-00153-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/369b/6434370/64e8702066ab/fnins-13-00153-g0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/369b/6434370/c17359860ae2/fnins-13-00153-g0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/369b/6434370/64e8702066ab/fnins-13-00153-g0002.jpg

相似文献

1
A Tutorial on Auditory Attention Identification Methods.听觉注意力识别方法教程。
Front Neurosci. 2019 Mar 19;13:153. doi: 10.3389/fnins.2019.00153. eCollection 2019.
2
Deep Canonical Correlation Analysis For Decoding The Auditory Brain.用于解码听觉脑的深度典型相关分析
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:3505-3508. doi: 10.1109/EMBC44109.2020.9176208.
3
EEG-based auditory attention detection: boundary conditions for background noise and speaker positions.基于脑电图的听觉注意力检测:背景噪声和说话人位置的边界条件。
J Neural Eng. 2018 Dec;15(6):066017. doi: 10.1088/1741-2552/aae0a6. Epub 2018 Sep 12.
4
Auditory Attention Detection via Cross-Modal Attention.通过跨模态注意力进行听觉注意力检测
Front Neurosci. 2021 Jul 21;15:652058. doi: 10.3389/fnins.2021.652058. eCollection 2021.
5
Where is the cocktail party? Decoding locations of attended and unattended moving sound sources using EEG.鸡尾酒会在哪里?使用 EEG 解码有注意和无注意移动声源的位置。
Neuroimage. 2020 Jan 15;205:116283. doi: 10.1016/j.neuroimage.2019.116283. Epub 2019 Oct 17.
6
A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding.听觉注意力解码的前向和后向模型中正则化方法的比较
Front Neurosci. 2018 Aug 7;12:531. doi: 10.3389/fnins.2018.00531. eCollection 2018.
7
Macromolecular crowding: chemistry and physics meet biology (Ascona, Switzerland, 10-14 June 2012).大分子拥挤现象:化学与物理邂逅生物学(瑞士阿斯科纳,2012年6月10日至14日)
Phys Biol. 2013 Aug;10(4):040301. doi: 10.1088/1478-3975/10/4/040301. Epub 2013 Aug 2.
8
Real-Time Tracking of Selective Auditory Attention From M/EEG: A Bayesian Filtering Approach.基于脑电信号的选择性听觉注意力实时追踪:一种贝叶斯滤波方法。
Front Neurosci. 2018 May 1;12:262. doi: 10.3389/fnins.2018.00262. eCollection 2018.
9
Cortical Auditory Attention Decoding During Music and Speech Listening.音乐和言语聆听过程中的皮质听觉注意解码。
IEEE Trans Neural Syst Rehabil Eng. 2023;31:2903-2911. doi: 10.1109/TNSRE.2023.3291239. Epub 2023 Jul 12.
10
An LMMSE-based Estimation of Temporal Response Function in Auditory Attention Decoding.基于线性混合模型最小均方误差的听觉注意力解码中时间响应函数估计
Annu Int Conf IEEE Eng Med Biol Soc. 2020 Jul;2020:2837-2840. doi: 10.1109/EMBC44109.2020.9175866.

引用本文的文献

1
Age-Related Differences in EEG-Based Speech Reception Threshold Estimation Using Scalp and Ear-EEG.使用头皮脑电图和耳部脑电图基于脑电图的言语接受阈值估计中的年龄相关差异。
Trends Hear. 2025 Jan-Dec;29:23312165251372462. doi: 10.1177/23312165251372462. Epub 2025 Aug 25.
2
Non-Invasive Brain Stimulation and Artificial Intelligence in Communication Neuroprosthetics: A Bidirectional Approach for Speech and Hearing Impairments.通信神经假体中的非侵入性脑刺激与人工智能:一种针对言语和听力障碍的双向方法。
Brain Sci. 2025 Apr 25;15(5):449. doi: 10.3390/brainsci15050449.
3
A Brain-Computer Interface for Improving Auditory Attention in Multi-Talker Environments.

本文引用的文献

1
EEG-based detection of the locus of auditory attention with convolutional neural networks.基于 EEG 的卷积神经网络对听觉注意力焦点的检测。
Elife. 2021 Apr 30;10:e56481. doi: 10.7554/eLife.56481.
2
Comparison of Two-Talker Attention Decoding from EEG with Nonlinear Neural Networks and Linear Methods.两种从 EEG 中解码双说话人注意力的方法比较:非线性神经网络与线性方法。
Sci Rep. 2019 Aug 8;9(1):11538. doi: 10.1038/s41598-019-47795-0.
3
Decoding of selective attention to continuous speech from the human auditory brainstem response.从人类听觉脑干反应中解码对连续语音的选择性注意。
一种用于在多说话者环境中改善听觉注意力的脑机接口。
bioRxiv. 2025 Mar 13:2025.03.13.641661. doi: 10.1101/2025.03.13.641661.
4
Speech Reception Threshold Estimation via EEG-Based Continuous Speech Envelope Reconstruction.基于脑电图的连续语音包络重构的言语接受阈值估计
Eur J Neurosci. 2025 Mar;61(6):e70083. doi: 10.1111/ejn.70083.
5
Improving Tracking of Selective Attention in Hearing Aid Users: The Role of Noise Reduction and Nonlinearity Compensation.改善助听器使用者选择性注意力的追踪:降噪与非线性补偿的作用。
eNeuro. 2025 Feb 19;12(2). doi: 10.1523/ENEURO.0275-24.2025. Print 2025 Feb.
6
Improving auditory attention decoding by classifying intracranial responses to glimpsed and masked acoustic events.通过对瞥见和掩蔽声学事件的颅内反应进行分类来改善听觉注意力解码。
Imaging Neurosci (Camb). 2024;2. doi: 10.1162/imag_a_00148. Epub 2024 Apr 26.
7
Cognitive component of auditory attention to natural speech events.对自然言语事件听觉注意的认知成分。
Front Hum Neurosci. 2025 Jan 6;18:1460139. doi: 10.3389/fnhum.2024.1460139. eCollection 2024.
8
Selective Auditory Attention Detection Using Combined Transformer and Convolutional Graph Neural Networks.基于Transformer和卷积图神经网络组合的选择性听觉注意力检测
Bioengineering (Basel). 2024 Nov 30;11(12):1216. doi: 10.3390/bioengineering11121216.
9
Novel bias-reduced coherence measure for EEG-based speech tracking in listeners with hearing impairment.用于听力受损听众基于脑电图的语音跟踪的新型偏差减少相干测量方法。
Front Neurosci. 2024 Nov 6;18:1415397. doi: 10.3389/fnins.2024.1415397. eCollection 2024.
10
Level-Dependent Subcortical Electroencephalography Responses to Continuous Speech.皮层下脑电活动对连续语音的水平依赖性反应。
eNeuro. 2024 Aug 27;11(8). doi: 10.1523/ENEURO.0135-24.2024. Print 2024 Aug.
Neuroimage. 2019 Oct 15;200:1-11. doi: 10.1016/j.neuroimage.2019.06.029. Epub 2019 Jun 15.
4
Predicting individual speech intelligibility from the cortical tracking of acoustic- and phonetic-level speech representations.从皮质追踪声学和语音水平的语音表示来预测个体言语可懂度。
Hear Res. 2019 Sep 1;380:1-9. doi: 10.1016/j.heares.2019.05.006. Epub 2019 May 28.
5
EEG-assisted Modulation of Sound Sources in the Auditory Scene.脑电图辅助对听觉场景中声源的调制
Biomed Signal Process Control. 2018 Jan;39:263-270. doi: 10.1016/j.bspc.2017.08.008. Epub 2017 Aug 16.
6
Towards reconstructing intelligible speech from the human auditory cortex.从人类听觉皮层重建可理解的语音。
Sci Rep. 2019 Jan 29;9(1):874. doi: 10.1038/s41598-018-37359-z.
7
Multiway canonical correlation analysis of brain data.多变量典型相关分析在脑数据中的应用。
Neuroimage. 2019 Feb 1;186:728-740. doi: 10.1016/j.neuroimage.2018.11.026. Epub 2018 Nov 27.
8
Neural tracking of the speech envelope in cochlear implant users.人工耳蜗使用者语音包络的神经追踪。
J Neural Eng. 2019 Feb;16(1):016003. doi: 10.1088/1741-2552/aae6b9. Epub 2018 Nov 16.
9
The effect of miniaturization and galvanic separation of EEG sensor devices in an auditory attention detection task.脑电图传感器设备的小型化和电流分离在听觉注意力检测任务中的效果。
Annu Int Conf IEEE Eng Med Biol Soc. 2018 Jul;2018:77-80. doi: 10.1109/EMBC.2018.8512212.
10
Late cortical tracking of ignored speech facilitates neural selectivity in acoustically challenging conditions.忽视语音的后期皮质追踪有助于在听觉挑战性条件下实现神经选择性。
Neuroimage. 2019 Feb 1;186:33-42. doi: 10.1016/j.neuroimage.2018.10.057. Epub 2018 Oct 24.