• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用深度神经网络将 EEG 与连续语音相关联:综述。

Relating EEG to continuous speech using deep neural networks: a review.

机构信息

KU Leuven, Department of Neurosciences, ExpORL, Leuven, Belgium.

KU Leuven, Department of Electrical Engineering (ESAT), PSI, Leuven, Belgium.

出版信息

J Neural Eng. 2023 Aug 3;20(4). doi: 10.1088/1741-2552/ace73f.

DOI:10.1088/1741-2552/ace73f
PMID:37442115
Abstract

When a person listens to continuous speech, a corresponding response is elicited in the brain and can be recorded using electroencephalography (EEG). Linear models are presently used to relate the EEG recording to the corresponding speech signal. The ability of linear models to find a mapping between these two signals is used as a measure of neural tracking of speech. Such models are limited as they assume linearity in the EEG-speech relationship, which omits the nonlinear dynamics of the brain. As an alternative, deep learning models have recently been used to relate EEG to continuous speech.This paper reviews and comments on deep-learning-based studies that relate EEG to continuous speech in single- or multiple-speakers paradigms. We point out recurrent methodological pitfalls and the need for a standard benchmark of model analysis.We gathered 29 studies. The main methodological issues we found are biased cross-validations, data leakage leading to over-fitted models, or disproportionate data size compared to the model's complexity. In addition, we address requirements for a standard benchmark model analysis, such as public datasets, common evaluation metrics, and good practices for the match-mismatch task.We present a review paper summarizing the main deep-learning-based studies that relate EEG to speech while addressing methodological pitfalls and important considerations for this newly expanding field. Our study is particularly relevant given the growing application of deep learning in EEG-speech decoding.

摘要

当一个人听连续的语音时,大脑中会产生相应的反应,可以用电极脑电图(EEG)记录下来。目前,线性模型用于将 EEG 记录与相应的语音信号相关联。线性模型在这两个信号之间找到映射的能力被用作语音神经跟踪的度量。这些模型是有限的,因为它们假设 EEG-语音关系的线性,而忽略了大脑的非线性动力学。作为替代方案,深度学习模型最近已被用于将 EEG 与连续语音相关联。本文回顾并评论了基于深度学习的研究,这些研究在单人和多人说话者范式中,将 EEG 与连续语音相关联。我们指出了递归方法上的陷阱,以及对模型分析的标准基准的需求。我们收集了 29 项研究。我们发现的主要方法问题是有偏差的交叉验证、导致过度拟合模型的数据泄露,或者与模型的复杂性相比,数据量不成比例。此外,我们还讨论了标准基准模型分析的要求,例如公共数据集、通用评估指标以及匹配不匹配任务的良好实践。我们提交了一篇综述论文,总结了将 EEG 与语音相关联的主要基于深度学习的研究,同时解决了这个新扩展领域的方法学陷阱和重要考虑因素。鉴于深度学习在 EEG 语音解码中的应用越来越多,我们的研究特别相关。

相似文献

1
Relating EEG to continuous speech using deep neural networks: a review.利用深度神经网络将 EEG 与连续语音相关联:综述。
J Neural Eng. 2023 Aug 3;20(4). doi: 10.1088/1741-2552/ace73f.
2
Robust decoding of the speech envelope from EEG recordings through deep neural networks.通过深度神经网络从 EEG 记录中稳健地解码语音包络。
J Neural Eng. 2022 Jul 6;19(4). doi: 10.1088/1741-2552/ac7976.
3
Beyond linear neural envelope tracking: a mutual information approach.超越线性神经包络跟踪:一种互信息方法。
J Neural Eng. 2023 Mar 9;20(2). doi: 10.1088/1741-2552/acbe1d.
4
ADT Network: A Novel Nonlinear Method for Decoding Speech Envelopes From EEG Signals.ADT 网络:一种从 EEG 信号中解码语音包络的新型非线性方法。
Trends Hear. 2024 Jan-Dec;28:23312165241282872. doi: 10.1177/23312165241282872.
5
Classifying coherent versus nonsense speech perception from EEG using linguistic speech features.使用语言语音特征对 EEG 中的连贯语音与无意义语音感知进行分类。
Sci Rep. 2024 Aug 14;14(1):18922. doi: 10.1038/s41598-024-69568-0.
6
EEG-based classification of imagined digits using a recurrent neural network.基于脑电图的循环神经网络对想象数字的分类
J Neural Eng. 2023 Apr 28;20(2). doi: 10.1088/1741-2552/acc976.
7
Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG.机器和深度学习方法在解码想象语音 EEG 中的超参数优化评估。
Sensors (Basel). 2020 Aug 17;20(16):4629. doi: 10.3390/s20164629.
8
Deep learning for electroencephalogram (EEG) classification tasks: a review.深度学习在脑电图(EEG)分类任务中的应用:综述。
J Neural Eng. 2019 Jun;16(3):031001. doi: 10.1088/1741-2552/ab0ab5. Epub 2019 Feb 26.
9
Predicting speech intelligibility from EEG in a non-linear classification paradigm.基于非线性分类范式的脑电语音可懂度预测。
J Neural Eng. 2021 Nov 15;18(6). doi: 10.1088/1741-2552/ac33e9.
10
EEG-based auditory attention decoding using speech-level-based segmented computational models.基于脑电的听觉注意解码,使用基于语音分段的计算模型。
J Neural Eng. 2021 May 25;18(4). doi: 10.1088/1741-2552/abfeba.

引用本文的文献

1
Contrastive representation learning with transformers for robust auditory EEG decoding.用于稳健听觉脑电信号解码的基于Transformer的对比表征学习
Sci Rep. 2025 Aug 6;15(1):28744. doi: 10.1038/s41598-025-13646-4.
2
Vocal Emotion Perception and Musicality-Insights from EEG Decoding.语音情感感知与音乐性——来自脑电图解码的见解
Sensors (Basel). 2025 Mar 8;25(6):1669. doi: 10.3390/s25061669.
3
Speech Reception Threshold Estimation via EEG-Based Continuous Speech Envelope Reconstruction.基于脑电图的连续语音包络重构的言语接受阈值估计
Eur J Neurosci. 2025 Mar;61(6):e70083. doi: 10.1111/ejn.70083.
4
Classifying coherent versus nonsense speech perception from EEG using linguistic speech features.使用语言语音特征对 EEG 中的连贯语音与无意义语音感知进行分类。
Sci Rep. 2024 Aug 14;14(1):18922. doi: 10.1038/s41598-024-69568-0.