• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

重新思考内心言语解码的方法和算法并使其具有可重复性。

Rethinking the Methods and Algorithms for Inner Speech Decoding and Making Them Reproducible.

作者信息

Simistira Liwicki Foteini, Gupta Vibha, Saini Rajkumar, De Kanjar, Liwicki Marcus

机构信息

Embedded Intelligent Systems LAB, Machine Learning, Department of Computer Science, Electrical and Space Engineering, Luleå University of Technology, 97187 Luleå, Sweden;

出版信息

NeuroSci. 2022 Apr 19;3(2):226-244. doi: 10.3390/neurosci3020017. eCollection 2022 Jun.

DOI:10.3390/neurosci3020017
PMID:39483370
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11523721/
Abstract

This study focuses on the automatic decoding of inner speech using noninvasive methods, such as Electroencephalography (EEG). While inner speech has been a research topic in philosophy and psychology for half a century, recent attempts have been made to decode nonvoiced spoken words by using various brain-computer interfaces. The main shortcomings of existing work are reproducibility and the availability of data and code. In this work, we investigate various methods (using Convolutional Neural Network (CNN), Gated Recurrent Unit (GRU), Long Short-Term Memory Networks (LSTM)) for the detection task of five vowels and six words on a publicly available EEG dataset. The main contributions of this work are (1) subject dependent vs. subject-independent approaches, (2) the effect of different preprocessing steps (Independent Component Analysis (ICA), down-sampling and filtering), and (3) word classification (where we achieve state-of-the-art performance on a publicly available dataset). Overall we achieve a performance accuracy of 35.20% and 29.21% when classifying five vowels and six words, respectively, in a publicly available dataset, using our tuned iSpeech-CNN architecture. All of our code and processed data are publicly available to ensure reproducibility. As such, this work contributes to a deeper understanding and reproducibility of experiments in the area of inner speech detection.

摘要

本研究聚焦于使用非侵入性方法(如脑电图(EEG))对内心言语进行自动解码。虽然内心言语在哲学和心理学领域已成为一个研究课题达半个世纪之久,但最近已有尝试通过使用各种脑机接口来解码无声说出的单词。现有工作的主要缺点在于可重复性以及数据和代码的可用性。在这项工作中,我们在一个公开可用的EEG数据集上,研究了多种方法(使用卷积神经网络(CNN)、门控循环单元(GRU)、长短期记忆网络(LSTM))用于五个元音和六个单词的检测任务。这项工作的主要贡献在于:(1)依赖个体与独立于个体的方法;(2)不同预处理步骤(独立成分分析(ICA)、下采样和滤波)的效果;(3)单词分类(我们在一个公开可用数据集上达到了当前最优性能)。总体而言,使用我们调优后的iSpeech - CNN架构,在一个公开可用数据集中对五个元音和六个单词进行分类时,我们分别实现了35.20%和29.21%的性能准确率。我们所有的代码和处理后的数据都公开可用,以确保可重复性。因此,这项工作有助于更深入地理解内心言语检测领域的实验并实现其可重复性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/5cbf3480231e/neurosci-03-00017-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/9e7a95bcf0b4/neurosci-03-00017-g0A4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/458876df164d/neurosci-03-00017-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/5cbf3480231e/neurosci-03-00017-g004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/9e7a95bcf0b4/neurosci-03-00017-g0A4.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/458876df164d/neurosci-03-00017-g001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/271e/11523721/5cbf3480231e/neurosci-03-00017-g004.jpg

相似文献

1
Rethinking the Methods and Algorithms for Inner Speech Decoding and Making Them Reproducible.重新思考内心言语解码的方法和算法并使其具有可重复性。
NeuroSci. 2022 Apr 19;3(2):226-244. doi: 10.3390/neurosci3020017. eCollection 2022 Jun.
2
Validating Deep Neural Networks for Online Decoding of Motor Imagery Movements from EEG Signals.验证深度神经网络用于从 EEG 信号中在线解码运动想象运动。
Sensors (Basel). 2019 Jan 8;19(1):210. doi: 10.3390/s19010210.
3
Deep learning networks based decision fusion model of EEG and fNIRS for classification of cognitive tasks.基于深度学习网络的脑电图和功能近红外光谱用于认知任务分类的决策融合模型
Cogn Neurodyn. 2024 Aug;18(4):1489-1506. doi: 10.1007/s11571-023-09986-4. Epub 2023 Jun 30.
4
Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG.机器和深度学习方法在解码想象语音 EEG 中的超参数优化评估。
Sensors (Basel). 2020 Aug 17;20(16):4629. doi: 10.3390/s20164629.
5
Decoding Algorithm of Motor Imagery Electroencephalogram Signal Based on CLRNet Network Model.基于 CLRNet 网络模型的运动想象脑电信号解码算法。
Sensors (Basel). 2023 Sep 6;23(18):7694. doi: 10.3390/s23187694.
6
ConTraNet: A hybrid network for improving the classification of EEG and EMG signals with limited training data.ConTraNet:一种混合网络,用于在有限的训练数据下提高 EEG 和 EMG 信号的分类。
Comput Biol Med. 2024 Jan;168:107649. doi: 10.1016/j.compbiomed.2023.107649. Epub 2023 Nov 2.
7
A Parallel Feature Fusion Network Combining GRU and CNN for Motor Imagery EEG Decoding.一种结合门控循环单元(GRU)和卷积神经网络(CNN)的用于运动想象脑电信号解码的并行特征融合网络
Brain Sci. 2022 Sep 13;12(9):1233. doi: 10.3390/brainsci12091233.
8
Recognition of EEG Signals from Imagined Vowels Using Deep Learning Methods.基于深度学习方法的想象元音脑电信号识别。
Sensors (Basel). 2021 Sep 29;21(19):6503. doi: 10.3390/s21196503.
9
Exploring inter-trial coherence for inner speech classification in EEG-based brain-computer interface.探索基于 EEG 的脑机接口中内言语分类的试验间相干性。
J Neural Eng. 2024 Apr 26;21(2). doi: 10.1088/1741-2552/ad3f50.
10
Attention-based convolutional neural network with multi-modal temporal information fusion for motor imagery EEG decoding.基于注意力的卷积神经网络与多模态时间信息融合在运动想象 EEG 解码中的应用。
Comput Biol Med. 2024 Jun;175:108504. doi: 10.1016/j.compbiomed.2024.108504. Epub 2024 Apr 24.

引用本文的文献

1
Neural signals, machine learning, and the future of inner speech recognition.神经信号、机器学习与内心言语识别的未来
Front Hum Neurosci. 2025 Jul 10;19:1637174. doi: 10.3389/fnhum.2025.1637174. eCollection 2025.
2
Decoding imagined speech with delay differential analysis.利用延迟微分分析解码想象中的言语。
Front Hum Neurosci. 2024 May 17;18:1398065. doi: 10.3389/fnhum.2024.1398065. eCollection 2024.

本文引用的文献

1
Thinking out loud, an open-access EEG-based BCI dataset for inner speech recognition.大声思考:基于 EEG 的可公开获取的用于言语内部识别的脑机接口数据集。
Sci Data. 2022 Feb 14;9(1):52. doi: 10.1038/s41597-022-01147-2.
2
ECG signal classification based on deep CNN and BiLSTM.基于深度卷积神经网络和双向长短时记忆网络的心电图信号分类。
BMC Med Inform Decis Mak. 2021 Dec 28;21(1):365. doi: 10.1186/s12911-021-01736-y.
3
Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.从神经活动的微创记录中实时合成想象中的语音过程。
Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
4
Decoding Covert Speech From EEG-A Comprehensive Review.从脑电图中解码隐蔽语音——全面综述
Front Neurosci. 2021 Apr 29;15:642251. doi: 10.3389/fnins.2021.642251. eCollection 2021.
5
MEG Sensor Selection for Neural Speech Decoding.用于神经语音解码的脑磁图(MEG)传感器选择
IEEE Access. 2020;8:182320-182337. doi: 10.1109/access.2020.3028831. Epub 2020 Oct 6.
6
Evaluation of Hyperparameter Optimization in Machine and Deep Learning Methods for Decoding Imagined Speech EEG.机器和深度学习方法在解码想象语音 EEG 中的超参数优化评估。
Sensors (Basel). 2020 Aug 17;20(16):4629. doi: 10.3390/s20164629.
7
Decoding Imagined and Spoken Phrases From Non-invasive Neural (MEG) Signals.从无创神经(脑磁图)信号中解码想象和说出的短语。
Front Neurosci. 2020 Apr 7;14:290. doi: 10.3389/fnins.2020.00290. eCollection 2020.
8
Classification of Overt and Covert Speech for Near-Infrared Spectroscopy-Based Brain Computer Interface.基于近红外光谱的脑机接口的显性和隐性语音分类。
Sensors (Basel). 2018 Sep 7;18(9):2989. doi: 10.3390/s18092989.
9
Decoding Inner Speech Using Electrocorticography: Progress and Challenges Toward a Speech Prosthesis.使用皮层脑电图解码内心言语:言语假体的进展与挑战
Front Neurosci. 2018 Jun 21;12:422. doi: 10.3389/fnins.2018.00422. eCollection 2018.
10
EEGNet: a compact convolutional neural network for EEG-based brain-computer interfaces.EEGNet:一种基于 EEG 的脑机接口用的紧凑卷积神经网络。
J Neural Eng. 2018 Oct;15(5):056013. doi: 10.1088/1741-2552/aace8c. Epub 2018 Jun 22.