
Decoding imagined speech from EEG signals using hybrid-scale spatial-temporal dilated convolution network.

Affiliation

Key Laboratory of Intelligent Perception and Image Understanding of Ministry of Education, School of Artificial Intelligence, Xidian University, Xi'an, People's Republic of China.

Publication information

J Neural Eng. 2021 Aug 11;18(4). doi: 10.1088/1741-2552/ac13c0.

DOI: 10.1088/1741-2552/ac13c0
PMID: 34256357
Abstract

Directly decoding imagined speech from electroencephalogram (EEG) signals has attracted much interest in brain-computer interface applications, because it provides a natural and intuitive communication method for locked-in patients. Several methods have been applied to imagined speech decoding, but how to construct spatial-temporal dependencies and capture long-range contextual cues in EEG signals to better decode imagined speech should be considered.

In this study, we propose a novel model called hybrid-scale spatial-temporal dilated convolution network (HS-STDCN) for EEG-based imagined speech recognition. HS-STDCN integrates feature learning from temporal and spatial information into a unified end-to-end model. To characterize the temporal dependencies of the EEG sequences, we adopted a hybrid-scale temporal convolution layer to capture temporal information at multiple levels. A depthwise spatial convolution layer was then designed to construct intrinsic spatial relationships of EEG electrodes, which can produce a spatial-temporal representation of the input EEG data. Based on the spatial-temporal representation, dilated convolution layers were further employed to learn long-range discriminative features for the final classification.

To evaluate the proposed method, we compared the HS-STDCN with other existing methods on our collected dataset. The HS-STDCN achieved an averaged classification accuracy of 54.31% for decoding eight imagined words, which is significantly better than other methods at a significance level of 0.05.

The proposed HS-STDCN model provided an effective approach to make use of both the temporal and spatial dependencies of the input EEG signals for imagined speech recognition. We also visualized the word semantic differences to analyze the impact of word semantics on imagined speech recognition, investigated the important regions in the decoding process, and explored the use of fewer electrodes to achieve comparable performance.
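
The three stages named in the abstract (hybrid-scale temporal convolution, depthwise spatial convolution over electrodes, and dilated convolution) can be illustrated with a minimal pure-Python sketch. This is not the authors' implementation: the function names, kernel values, electrode count, and dilation rates below are all hypothetical, and nonlinearities, normalization, pooling, and the classifier are omitted. It only shows how the data shapes change and why stacking dilated layers widens the temporal context.

```python
import random


def conv1d(signal, kernel, dilation=1):
    """'Valid' 1-D cross-correlation of one sequence with one kernel."""
    span = (len(kernel) - 1) * dilation  # distance between first and last tap
    return [
        sum(w * signal[t + k * dilation] for k, w in enumerate(kernel))
        for t in range(len(signal) - span)
    ]


def hybrid_scale_temporal(eeg, kernels):
    """Filter every electrode with kernels of several lengths (scales).

    eeg: list of electrodes, each a list of samples.
    Returns one feature map per kernel, each still indexed by electrode.
    Outputs are cropped to a common length so the scales line up.
    """
    longest = max(len(k) for k in kernels)
    out_len = len(eeg[0]) - longest + 1
    return [[conv1d(ch, k)[:out_len] for ch in eeg] for k in kernels]


def depthwise_spatial(feature_maps, spatial_weights):
    """Collapse the electrode axis, using a separate weight vector per map."""
    fused = []
    for fmap, weights in zip(feature_maps, spatial_weights):
        length = len(fmap[0])
        fused.append([
            sum(w * ch[t] for w, ch in zip(weights, fmap))
            for t in range(length)
        ])
    return fused


def dilated_stack(sequence, kernel, dilations):
    """Apply the same kernel repeatedly with increasing dilation."""
    for d in dilations:
        sequence = conv1d(sequence, kernel, dilation=d)
    return sequence


def receptive_field(kernel_size, dilations):
    """Input samples seen by one output of a stack of dilated layers."""
    return 1 + sum((kernel_size - 1) * d for d in dilations)


# Hypothetical toy input: 4 electrodes, 64 samples of random noise.
random.seed(0)
n_electrodes, n_samples = 4, 64
eeg = [[random.gauss(0, 1) for _ in range(n_samples)]
       for _ in range(n_electrodes)]

kernels = [[1 / 3] * 3, [1 / 7] * 7]            # two temporal scales (assumed)
maps = hybrid_scale_temporal(eeg, kernels)       # 2 maps x 4 electrodes x 58
spatial = [[1 / n_electrodes] * n_electrodes for _ in kernels]
fused = depthwise_spatial(maps, spatial)         # 2 sequences of length 58
dilations = [1, 2, 4]                            # assumed dilation rates
features = [dilated_stack(seq, [0.5, 0.0, 0.5], dilations) for seq in fused]

print(len(features), len(features[0]), receptive_field(3, dilations))
```

In this toy setting, three layers with kernel size 3 and dilations 1, 2, and 4 let each output sample depend on 15 input samples, whereas three undilated layers of the same size would cover only 7. That is the sense in which dilated convolution helps capture long-range context without a matching growth in parameters.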


Similar articles

1
Decoding imagined speech from EEG signals using hybrid-scale spatial-temporal dilated convolution network.
J Neural Eng. 2021 Aug 11;18(4). doi: 10.1088/1741-2552/ac13c0.
2
A Spatio-Temporal Capsule Neural Network with Self-Correlation Routing for EEG Decoding of Semantic Concepts of Imagination and Perception Tasks.
Sensors (Basel). 2024 Sep 15;24(18):5988. doi: 10.3390/s24185988.
3
A Bimodal Deep Learning Architecture for EEG-fNIRS Decoding of Overt and Imagined Speech.
IEEE Trans Biomed Eng. 2022 Jun;69(6):1983-1994. doi: 10.1109/TBME.2021.3132861. Epub 2022 May 19.
4
Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG.
Sensors (Basel). 2020 Aug 17;20(16):4629. doi: 10.3390/s20164629.
5
[Convolutional neural network based on temporal-spatial feature learning for motor imagery electroencephalogram signal decoding].
Sheng Wu Yi Xue Gong Cheng Xue Za Zhi. 2021 Feb 25;38(1):1-9. doi: 10.7507/1001-5515.202007006.
6
Resting state EEG assisted imagined vowel phonemes recognition by native and non-native speakers using brain connectivity measures.
Phys Eng Sci Med. 2024 Sep;47(3):939-954. doi: 10.1007/s13246-024-01417-w. Epub 2024 Apr 22.
7
EEG-based classification of imagined digits using a recurrent neural network.
J Neural Eng. 2023 Apr 28;20(2). doi: 10.1088/1741-2552/acc976.
8
ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data.
Comput Biol Med. 2024 Jan;168:107649. doi: 10.1016/j.compbiomed.2023.107649. Epub 2023 Nov 2.
9
Imagined Speech Classification Using EEG and Deep Learning.
Bioengineering (Basel). 2023 May 26;10(6):649. doi: 10.3390/bioengineering10060649.
10
A Temporal Dependency Learning CNN With Attention Mechanism for MI-EEG Decoding.
IEEE Trans Neural Syst Rehabil Eng. 2023;31:3188-3200. doi: 10.1109/TNSRE.2023.3299355. Epub 2023 Aug 9.

Cited by

1
Vocal tasks-based EEG and speech signal analysis in children with neurodevelopmental disorders: a multimodal investigation.
Cogn Neurodyn. 2024 Oct;18(5):2387-2403. doi: 10.1007/s11571-024-10096-y. Epub 2024 Mar 20.
2
Enhancing generalized anxiety disorder diagnosis precision: MSTCNN model utilizing high-frequency EEG signals.
Front Psychiatry. 2023 Dec 21;14:1310323. doi: 10.3389/fpsyt.2023.1310323. eCollection 2023.
3
Multiclass classification of imagined speech EEG using noise-assisted multivariate empirical mode decomposition and multireceptive field convolutional neural network.
Front Hum Neurosci. 2023 Aug 10;17:1186594. doi: 10.3389/fnhum.2023.1186594. eCollection 2023.
4
The Role of Artificial Intelligence in Decoding Speech from EEG Signals: A Scoping Review.
Sensors (Basel). 2022 Sep 15;22(18):6975. doi: 10.3390/s22186975.