Robust decoding of selective auditory attention from MEG in a competing-speaker environment via state-space modeling.

Authors

Akram Sahar, Presacco Alessandro, Simon Jonathan Z, Shamma Shihab A, Babadi Behtash

Affiliations

Department of Electrical and Computer Engineering, University of Maryland, College Park, MD 20742, USA; Institute for Systems Research, University of Maryland, College Park, MD 20742, USA.

Department of Hearing and Speech Science, University of Maryland, College Park, MD 20742, USA.

Publication information

Neuroimage. 2016 Jan 1;124(Pt A):906-917. doi: 10.1016/j.neuroimage.2015.09.048. Epub 2015 Oct 4.

DOI: 10.1016/j.neuroimage.2015.09.048

PMID: 26436490

Full text: https://pmc.ncbi.nlm.nih.gov/articles/PMC4652844/

Abstract

The underlying mechanism of how the human brain solves the cocktail party problem is largely unknown. Recent neuroimaging studies, however, suggest salient temporal correlations between the auditory neural response and the attended auditory object. Using magnetoencephalography (MEG) recordings of the neural responses of human subjects, we propose a decoding approach for tracking the attentional state while subjects are selectively listening to one of the two speech streams embedded in a competing-speaker environment. We develop a biophysically-inspired state-space model to account for the modulation of the neural response with respect to the attentional state of the listener. The constructed decoder is based on a maximum a posteriori (MAP) estimate of the state parameters via the Expectation Maximization (EM) algorithm. Using only the envelope of the two speech streams as covariates, the proposed decoder enables us to track the attentional state of the listener with a temporal resolution of the order of seconds, together with statistical confidence intervals. We evaluate the performance of the proposed model using numerical simulations and experimentally measured evoked MEG responses from the human brain. Our analysis reveals considerable performance gains provided by the state-space model in terms of temporal resolution, computational complexity and decoding accuracy.
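
The abstract does not give the model equations, so the following is only a minimal illustrative sketch of the general idea of state-space smoothing of an attentional state, not the authors' decoder. It assumes a scalar random-walk state (the log-odds of attending to speaker 1) observed through a hypothetical per-window "attention marker" with Gaussian noise, and it fixes both noise variances by hand instead of estimating them with EM. The function name, parameters, and marker values are all assumptions made for illustration.

```python
import math


def smooth_attention(markers, obs_var=1.0, state_var=0.1):
    """Kalman filter plus RTS smoother for a scalar random-walk state.

    markers: hypothetical per-window evidence values, positive when the
             neural response looks more like speaker 1 than speaker 2.
    Returns one (probability, lower, upper) tuple per window, where the
    probability is the logistic transform of the smoothed state and the
    bounds are an approximate 90% interval.
    """
    n = len(markers)
    mean_p, var_p = [], []   # one-step predictions
    mean_f, var_f = [], []   # filtered estimates

    # Forward pass: assumed prior N(0, 10) on the state before window 1.
    m, v = 0.0, 10.0
    for y in markers:
        mp, vp = m, v + state_var        # random-walk prediction
        gain = vp / (vp + obs_var)       # Kalman gain
        m = mp + gain * (y - mp)         # measurement update
        v = (1.0 - gain) * vp
        mean_p.append(mp)
        var_p.append(vp)
        mean_f.append(m)
        var_f.append(v)

    # Backward pass: Rauch-Tung-Striebel fixed-interval smoother.
    mean_s, var_s = mean_f[:], var_f[:]
    for k in range(n - 2, -1, -1):
        c = var_f[k] / var_p[k + 1]
        mean_s[k] = mean_f[k] + c * (mean_s[k + 1] - mean_p[k + 1])
        var_s[k] = var_f[k] + c * c * (var_s[k + 1] - var_p[k + 1])

    def logistic(z):
        return 1.0 / (1.0 + math.exp(-z))

    out = []
    for mu, var in zip(mean_s, var_s):
        half = 1.645 * math.sqrt(var)    # approx. 90% Gaussian interval
        out.append((logistic(mu), logistic(mu - half), logistic(mu + half)))
    return out


if __name__ == "__main__":
    # Made-up markers: attention switches from speaker 1 to speaker 2.
    for p, lo, hi in smooth_attention([2.0] * 10 + [-2.0] * 10):
        print(f"P(attend speaker 1) = {p:.2f}  [{lo:.2f}, {hi:.2f}]")
```

Smoothing over the whole recording lets each window borrow evidence from its neighbours, which is one way a state-space formulation can yield per-window estimates with confidence intervals. A fuller treatment would also estimate the variances from the data, for example with EM as the abstract describes.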

Similar articles

Dynamic Estimation of the Auditory Temporal Response Function From MEG in Competing-Speaker Environments.

IEEE Trans Biomed Eng. 2017 Aug;64(8):1896-1905. doi: 10.1109/TBME.2016.2628884. Epub 2016 Nov 15.

Real-Time Tracking of Selective Auditory Attention From M/EEG: A Bayesian Filtering Approach.

Front Neurosci. 2018 May 1;12:262. doi: 10.3389/fnins.2018.00262. eCollection 2018.

The effect of head-related filtering and ear-specific decoding bias on auditory attention detection.

J Neural Eng. 2016 Oct;13(5):056014. doi: 10.1088/1741-2560/13/5/056014. Epub 2016 Sep 13.

The effects of selective attention and speech acoustics on neural speech-tracking in a multi-talker scene.

Cortex. 2015 Jul;68:144-54. doi: 10.1016/j.cortex.2014.12.014. Epub 2015 Jan 7.

EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location.

J Neural Eng. 2019 Jun;16(3):036017. doi: 10.1088/1741-2552/ab0cf1. Epub 2019 Mar 5.

Decoding of selective attention to continuous speech from the human auditory brainstem response.

Neuroimage. 2019 Oct 15;200:1-11. doi: 10.1016/j.neuroimage.2019.06.029. Epub 2019 Jun 15.

Noise-robust cortical tracking of attended speech in real-world acoustic scenes.

Neuroimage. 2017 Aug 1;156:435-444. doi: 10.1016/j.neuroimage.2017.04.026. Epub 2017 Apr 13.

The encoding of auditory objects in auditory cortex: insights from magnetoencephalography.

Int J Psychophysiol. 2015 Feb;95(2):184-90. doi: 10.1016/j.ijpsycho.2014.05.005. Epub 2014 May 16.

Cortical Processing of Arithmetic and Simple Sentences in an Auditory Attention Task.

J Neurosci. 2021 Sep 22;41(38):8023-8039. doi: 10.1523/JNEUROSCI.0269-21.2021. Epub 2021 Aug 16.

Cited by

Temporal coherence shapes cortical responses to speech mixtures in a ferret cocktail party.

Commun Biol. 2024 Oct 25;7(1):1392. doi: 10.1038/s42003-024-07096-3.

Convolutional neural networks can identify brain interactions involved in decoding spatial auditory attention.

PLoS Comput Biol. 2024 Aug 8;20(8):e1012376. doi: 10.1371/journal.pcbi.1012376. eCollection 2024 Aug.

Temporal Coherence Shapes Cortical Responses to Speech Mixtures in a Ferret Cocktail Party.

bioRxiv. 2024 Jun 11:2024.05.21.595171. doi: 10.1101/2024.05.21.595171.

A GRU-CNN model for auditory attention detection using microstate and recurrence quantification analysis.

Sci Rep. 2024 Apr 17;14(1):8861. doi: 10.1038/s41598-024-58886-y.

EEG alpha and pupil diameter reflect endogenous auditory attention switching and listening effort.

Eur J Neurosci. 2022 Mar;55(5):1262-1277. doi: 10.1111/ejn.15616. Epub 2022 Feb 16.

Behavioral Account of Attended Stream Enhances Neural Tracking.

Front Neurosci. 2021 Dec 13;15:674112. doi: 10.3389/fnins.2021.674112. eCollection 2021.

Dynamic selective auditory attention detection using RNN and reinforcement learning.

Sci Rep. 2021 Jul 29;11(1):15497. doi: 10.1038/s41598-021-94876-0.

EEG-based detection of the locus of auditory attention with convolutional neural networks.

Elife. 2021 Apr 30;10:e56481. doi: 10.7554/eLife.56481.

Identification of Auditory Object-Specific Attention from Single-Trial Electroencephalogram Signals via Entropy Measures and Machine Learning.

Entropy (Basel). 2018 May 21;20(5):386. doi: 10.3390/e20050386.

Three New Outcome Measures That Tap Into Cognitive Processes Required for Real-Life Communication.

Ear Hear. 2020 Nov/Dec;41 Suppl 1(Suppl 1):39S-47S. doi: 10.1097/AUD.0000000000000941.
