• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用头皮脑电图(EEG)对公开言语进行连续和离散解码。

Continuous and discrete decoding of overt speech with scalp electroencephalography (EEG).

作者信息

Craik Alexander, Dial Heather, L Contreras-Vidal Jose

机构信息

Department of Electrical and Computer Engineering, University of Houston, Houston, TX, United States of America.

NSF IUCRC BRAIN, University of Houston, Houston, TX, United States of America.

出版信息

J Neural Eng. 2025 Mar 14;22(2). doi: 10.1088/1741-2552/ad8d0a.

DOI:10.1088/1741-2552/ad8d0a
PMID:39476487
Abstract

. Neurological disorders affecting speech production adversely impact quality of life for over 7 million individuals in the US. Traditional speech interfaces like eye-tracking devices and P300 spellers are slow and unnatural for these patients. An alternative solution, speech brain-computer interfaces (BCIs), directly decodes speech characteristics, offering a more natural communication mechanism. This research explores the feasibility of decoding speech features using non-invasive EEG.. Nine neurologically intact participants were equipped with a 63-channel EEG system with additional sensors to eliminate eye artifacts. Participants read aloud sentences selected for phonetic similarity to the English language. Deep learning models, including Convolutional Neural Networks and Recurrent Neural Networks with and without attention modules, were optimized with a focus on minimizing trainable parameters and utilizing small input window sizes for real-time application. These models were employed for discrete and continuous speech decoding tasks.. Statistically significant participant-independent decoding performance was achieved for discrete classes and continuous characteristics of the produced audio signal. A frequency sub-band analysis highlighted the significance of certain frequency bands (delta, theta, gamma) for decoding performance, and a perturbation analysis was used to identify crucial channels. Assessed channel selection methods did not significantly improve performance, suggesting a distributed representation of speech information encoded in the EEG signals. Leave-One-Out training demonstrated the feasibility of utilizing common speech neural correlates, reducing data collection requirements from individual participants.. These findings contribute significantly to the development of EEG-enabled speech synthesis by demonstrating the feasibility of decoding both discrete and continuous speech features from EEG signals, even in the presence of EMG artifacts. By addressing the challenges of EMG interference and optimizing deep learning models for speech decoding, this study lays a strong foundation for EEG-based speech BCIs.

摘要

在美国,影响言语产生的神经系统疾病对700多万人的生活质量产生了不利影响。传统的语音接口,如眼动追踪设备和P300拼写器,对这些患者来说既缓慢又不自然。另一种解决方案是语音脑机接口(BCI),它直接解码语音特征,提供了一种更自然的交流机制。本研究探讨了使用非侵入性脑电图解码语音特征的可行性。九名神经功能正常的参与者配备了一个63通道的脑电图系统,并附加了传感器以消除眼部伪影。参与者大声朗读为与英语语音相似性而选择的句子。深度学习模型,包括带有和不带有注意力模块的卷积神经网络和循环神经网络,经过优化,重点是最小化可训练参数并利用小输入窗口大小以实现实时应用。这些模型被用于离散和连续语音解码任务。对于所产生音频信号的离散类别和连续特征,实现了具有统计学意义的与参与者无关的解码性能。频率子带分析突出了某些频段(δ、θ、γ)对解码性能的重要性,并使用微扰分析来识别关键通道。评估的通道选择方法并没有显著提高性能,这表明脑电图信号中编码的语音信息具有分布式表示。留一法训练证明了利用常见语音神经关联的可行性,减少了对单个参与者的数据收集要求。这些发现通过证明即使在存在肌电图伪影的情况下也能从脑电图信号中解码离散和连续语音特征的可行性,为基于脑电图的语音合成发展做出了重大贡献。通过解决肌电图干扰的挑战并优化用于语音解码的深度学习模型,本研究为基于脑电图的语音脑机接口奠定了坚实的基础。

相似文献

1
Continuous and discrete decoding of overt speech with scalp electroencephalography (EEG).利用头皮脑电图(EEG)对公开言语进行连续和离散解码。
J Neural Eng. 2025 Mar 14;22(2). doi: 10.1088/1741-2552/ad8d0a.
2
Identification of perceived sentences using deep neural networks in EEG.利用深度神经网络识别 EEG 中的感知句子。
J Neural Eng. 2024 Oct 30;21(5). doi: 10.1088/1741-2552/ad88a3.
3
ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data.ConTraNet:一种混合网络,用于在有限的训练数据下提高 EEG 和 EMG 信号的分类。
Comput Biol Med. 2024 Jan;168:107649. doi: 10.1016/j.compbiomed.2023.107649. Epub 2023 Nov 2.
4
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
5
A Bimodal Deep Learning Architecture for EEG-fNIRS Decoding of Overt and Imagined Speech.一种用于 EEG-fNIRS 解码言语出声和想象的双模深度学习架构。
IEEE Trans Biomed Eng. 2022 Jun;69(6):1983-1994. doi: 10.1109/TBME.2021.3132861. Epub 2022 May 19.
6
Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals.验证深度神经网络用于从 EEG 信号中在线解码运动想象运动。
Sensors (Basel). 2019 Jan 8;19(1):210. doi: 10.3390/s19010210.
7
Towards imagined speech: Identification of brain states from EEG signals for BCI-based communication systems.迈向想象中的言语:基于 EEG 信号的脑机接口通信系统中的脑状态识别。
Behav Brain Res. 2025 Feb 4;477:115295. doi: 10.1016/j.bbr.2024.115295. Epub 2024 Oct 18.
8
Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network.从分布式语言网络中解码自然连续语音的发音和语音成分。
J Neural Eng. 2023 Aug 14;20(4). doi: 10.1088/1741-2552/ace9fb.
9
Transformer-based neural speech decoding from surface and depth electrode signals.基于Transformer的从表面和深度电极信号进行神经语音解码
J Neural Eng. 2025 Jan 28;22(1):016017. doi: 10.1088/1741-2552/adab21.
10
Decoding lower-limb kinematic parameters during pedaling tasks using deep learning approaches and EEG.使用深度学习方法和脑电图解码踏板运动任务期间的下肢运动学参数。
Med Biol Eng Comput. 2024 Dec;62(12):3763-3779. doi: 10.1007/s11517-024-03147-3. Epub 2024 Jul 19.

引用本文的文献

1
Open access dataset integrating behavioral and EEG measures in Chinese spoken word production.整合汉语口语产生中行为和脑电图测量的开放获取数据集。
Sci Data. 2025 Aug 2;12(1):1348. doi: 10.1038/s41597-025-05671-9.