• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于 RNN 和强化学习的动态选择性听觉注意力检测。

Dynamic selective auditory attention detection using RNN and reinforcement learning.

机构信息

Faculty of Electrical & Computer Engineering, University of Tabriz, 51666-15813, Tabriz, Iran.

出版信息

Sci Rep. 2021 Jul 29;11(1):15497. doi: 10.1038/s41598-021-94876-0.

DOI:10.1038/s41598-021-94876-0
PMID:34326401
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8322190/
Abstract

The cocktail party phenomenon describes the ability of the human brain to focus auditory attention on a particular stimulus while ignoring other acoustic events. Selective auditory attention detection (SAAD) is an important issue in the development of brain-computer interface systems and cocktail party processors. This paper proposes a new dynamic attention detection system to process the temporal evolution of the input signal. The proposed dynamic SAAD is modeled as a sequential decision-making problem, which is solved by recurrent neural network (RNN) and reinforcement learning methods of Q-learning and deep Q-learning. Among different dynamic learning approaches, the evaluation results show that the deep Q-learning approach with RNN as agent provides the highest classification accuracy (94.2%) with the least detection delay. The proposed SAAD system is advantageous, in the sense that the detection of attention is performed dynamically for the sequential inputs. Also, the system has the potential to be used in scenarios, where the attention of the listener might be switched in time in the presence of various acoustic events.

摘要

鸡尾酒会现象描述了人类大脑将听觉注意力集中在特定刺激上而忽略其他声音事件的能力。选择性听觉注意力检测(SAAD)是脑机接口系统和鸡尾酒会处理器发展中的一个重要问题。本文提出了一种新的动态注意力检测系统来处理输入信号的时间演变。所提出的动态 SAAD 被建模为一个序列决策问题,通过递归神经网络(RNN)和 Q-learning 以及深度 Q-learning 等强化学习方法来解决。在不同的动态学习方法中,评估结果表明,使用 RNN 作为智能体的深度 Q-learning 方法提供了最高的分类准确率(94.2%)和最小的检测延迟。所提出的 SAAD 系统具有优势,因为它可以对顺序输入进行动态的注意力检测。此外,该系统有可能用于在存在各种声音事件的情况下,听众的注意力可能会随时间切换的场景。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/580a11bc62b4/41598_2021_94876_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/9222335a81fc/41598_2021_94876_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/cb052a846646/41598_2021_94876_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/90d37491a5dd/41598_2021_94876_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/4c4c25459f6f/41598_2021_94876_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/3efdbce2d37a/41598_2021_94876_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/580a11bc62b4/41598_2021_94876_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/9222335a81fc/41598_2021_94876_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/cb052a846646/41598_2021_94876_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/90d37491a5dd/41598_2021_94876_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/4c4c25459f6f/41598_2021_94876_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/3efdbce2d37a/41598_2021_94876_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/5ce7/8322190/580a11bc62b4/41598_2021_94876_Fig6_HTML.jpg

相似文献

1
Dynamic selective auditory attention detection using RNN and reinforcement learning.基于 RNN 和强化学习的动态选择性听觉注意力检测。
Sci Rep. 2021 Jul 29;11(1):15497. doi: 10.1038/s41598-021-94876-0.
2
Selective auditory attention detection based on effective connectivity by single-trial EEG.基于单试 EEG 的有效连通性的选择性听觉注意力检测。
J Neural Eng. 2020 Apr 17;17(2):026021. doi: 10.1088/1741-2552/ab7c8d.
3
A Midbrain Inspired Recurrent Neural Network Model for Robust Change Detection.基于中脑的鲁棒性变化检测递归神经网络模型。
J Neurosci. 2022 Nov 2;42(44):8262-8283. doi: 10.1523/JNEUROSCI.0164-22.2022. Epub 2022 Sep 19.
4
Auditory attention tracking states in a cocktail party environment can be decoded by deep convolutional neural networks.鸡尾酒会环境中的听觉注意跟踪状态可以通过深度卷积神经网络进行解码。
J Neural Eng. 2020 Jun 12;17(3):036013. doi: 10.1088/1741-2552/ab92b2.
5
Visual Analytics for RNN-Based Deep Reinforcement Learning.基于 RNN 的深度强化学习的可视化分析。
IEEE Trans Vis Comput Graph. 2022 Dec;28(12):4141-4155. doi: 10.1109/TVCG.2021.3076749. Epub 2022 Oct 26.
6
Generalized Recurrent Neural Network accommodating Dynamic Causal Modeling for functional MRI analysis.广义循环神经网络适应功能磁共振成像分析的动态因果建模。
Neuroimage. 2018 Sep;178:385-402. doi: 10.1016/j.neuroimage.2018.05.042. Epub 2018 May 18.
7
Inferring Mechanisms of Auditory Attentional Modulation with Deep Neural Networks.基于深度神经网络推断听觉注意力调制的机制
Neural Comput. 2022 Oct 7;34(11):2273-2293. doi: 10.1162/neco_a_01537.
8
A Temporal Deep Q Learning for Optimal Load Balancing in Software-Defined Networks.用于软件定义网络中最优负载均衡的时态深度Q学习
Sensors (Basel). 2024 Feb 14;24(4):1216. doi: 10.3390/s24041216.
9
Hyperspectral Image Features Classification Using Deep Learning Recurrent Neural Networks.基于深度学习循环神经网络的高光谱图像特征分类。
J Med Syst. 2019 Jun 4;43(7):216. doi: 10.1007/s10916-019-1347-9.
10
ECG-based multi-class arrhythmia detection using spatio-temporal attention-based convolutional recurrent neural network.基于心电图的多类心律失常检测:使用基于时空注意力的卷积循环神经网络
Artif Intell Med. 2020 Jun;106:101856. doi: 10.1016/j.artmed.2020.101856. Epub 2020 May 11.

引用本文的文献

1
A Brain-Computer Interface for Improving Auditory Attention in Multi-Talker Environments.一种用于在多说话者环境中改善听觉注意力的脑机接口。
bioRxiv. 2025 Mar 13:2025.03.13.641661. doi: 10.1101/2025.03.13.641661.
2
Selective Auditory Attention Detection Using Combined Transformer and Convolutional Graph Neural Networks.基于Transformer和卷积图神经网络组合的选择性听觉注意力检测
Bioengineering (Basel). 2024 Nov 30;11(12):1216. doi: 10.3390/bioengineering11121216.
3
Alpha band modulation caused by selective attention to music enables EEG classification.

本文引用的文献

1
Identification of Auditory Object-Specific Attention from Single-Trial Electroencephalogram Signals via Entropy Measures and Machine Learning.通过熵测度和机器学习从单次试验脑电图信号中识别听觉对象特异性注意力
Entropy (Basel). 2018 May 21;20(5):386. doi: 10.3390/e20050386.
2
Selective auditory attention detection based on effective connectivity by single-trial EEG.基于单试 EEG 的有效连通性的选择性听觉注意力检测。
J Neural Eng. 2020 Apr 17;17(2):026021. doi: 10.1088/1741-2552/ab7c8d.
3
Comparison of Two-Talker Attention Decoding from EEG with Nonlinear Neural Networks and Linear Methods.
由对音乐的选择性注意引起的阿尔法波段调制可实现脑电图分类。
Cogn Neurodyn. 2024 Jun;18(3):1005-1020. doi: 10.1007/s11571-023-09955-x. Epub 2023 Apr 7.
4
A GRU-CNN model for auditory attention detection using microstate and recurrence quantification analysis.基于微状态和递归定量分析的使用 GRU-CNN 模型进行听觉注意检测。
Sci Rep. 2024 Apr 17;14(1):8861. doi: 10.1038/s41598-024-58886-y.
5
Attention to audiovisual speech shapes neural processing through feedback-feedforward loops between different nodes of the speech network.听觉-视觉言语会通过言语网络不同节点之间的反馈-前馈回路来影响神经处理过程。
PLoS Biol. 2024 Mar 11;22(3):e3002534. doi: 10.1371/journal.pbio.3002534. eCollection 2024 Mar.
6
Validation of cost-efficient EEG experimental setup for neural tracking in an auditory attention task.验证用于听觉注意任务中神经跟踪的具有成本效益的 EEG 实验设置。
Sci Rep. 2023 Dec 19;13(1):22682. doi: 10.1038/s41598-023-49990-6.
7
Neural network ensemble model for prediction of erythrocyte sedimentation rate (ESR) using partial least squares regression.基于偏最小二乘回归的红细胞沉降率(ESR)神经网络集成模型预测。
Sci Rep. 2022 Nov 15;12(1):19618. doi: 10.1038/s41598-022-23174-0.
8
How to discern external acoustic waves in a piezoelectric neuron under noise?如何在噪声下辨别压电神经元中的外部声波?
J Biol Phys. 2022 Sep;48(3):339-353. doi: 10.1007/s10867-022-09611-1. Epub 2022 Aug 10.
9
A Speech-Level-Based Segmented Model to Decode the Dynamic Auditory Attention States in the Competing Speaker Scenes.一种基于语音水平的分段模型,用于解码竞争说话者场景中的动态听觉注意状态。
Front Neurosci. 2022 Feb 10;15:760611. doi: 10.3389/fnins.2021.760611. eCollection 2021.
10
EEG alpha and pupil diameter reflect endogenous auditory attention switching and listening effort.脑电图阿尔法和瞳孔直径反映了内源性听觉注意的转换和聆听努力。
Eur J Neurosci. 2022 Mar;55(5):1262-1277. doi: 10.1111/ejn.15616. Epub 2022 Feb 16.
两种从 EEG 中解码双说话人注意力的方法比较:非线性神经网络与线性方法。
Sci Rep. 2019 Aug 8;9(1):11538. doi: 10.1038/s41598-019-47795-0.
4
EEG-assisted Modulation of Sound Sources in the Auditory Scene.脑电图辅助对听觉场景中声源的调制
Biomed Signal Process Control. 2018 Jan;39:263-270. doi: 10.1016/j.bspc.2017.08.008. Epub 2017 Aug 16.
5
EEG decoding of the target speaker in a cocktail party scenario: considerations regarding dynamic switching of talker location.鸡尾酒会场景中目标说话人的 EEG 解码:关于说话人位置动态切换的考虑因素。
J Neural Eng. 2019 Jun;16(3):036017. doi: 10.1088/1741-2552/ab0cf1. Epub 2019 Mar 5.
6
Object-based attention in complex, naturalistic auditory streams.基于对象的注意力在复杂的、自然主义的听觉流中。
Sci Rep. 2019 Feb 27;9(1):2854. doi: 10.1038/s41598-019-39166-6.
7
A Comparison of Regularization Methods in Forward and Backward Models for Auditory Attention Decoding.听觉注意力解码的前向和后向模型中正则化方法的比较
Front Neurosci. 2018 Aug 7;12:531. doi: 10.3389/fnins.2018.00531. eCollection 2018.
8
Real-Time Tracking of Selective Auditory Attention From M/EEG: A Bayesian Filtering Approach.基于脑电信号的选择性听觉注意力实时追踪:一种贝叶斯滤波方法。
Front Neurosci. 2018 May 1;12:262. doi: 10.3389/fnins.2018.00262. eCollection 2018.
9
Machine learning for decoding listeners' attention from electroencephalography evoked by continuous speech.利用机器学习从连续语音诱发的脑电图中解码听众的注意力
Eur J Neurosci. 2020 Mar;51(5):1234-1241. doi: 10.1111/ejn.13790. Epub 2018 Jan 4.
10
Neural decoding of attentional selection in multi-speaker environments without access to clean sources.多说话人环境中无法访问干净源时的注意力选择的神经解码。
J Neural Eng. 2017 Oct;14(5):056001. doi: 10.1088/1741-2552/aa7ab4. Epub 2017 Aug 4.