• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

使用基于口语训练的 Transformer 对 ECoG 中的隐蔽语音进行解码的可行性。

Feasibility of decoding covert speech in ECoG with a Transformer trained on overt speech.

机构信息

Department of Electronic and Information Engineering, Tokyo University of Agriculture and Technology, 2-24-16 Naka-cho, Koganei-shi, Tokyo, 184-8588, Japan.

Department of Neurosurgery, Juntendo University School of Medicine, 2-1-1 Hongo, Bunkyo-ku, Tokyo, 113-8421, Japan.

出版信息

Sci Rep. 2024 May 20;14(1):11491. doi: 10.1038/s41598-024-62230-9.

DOI:10.1038/s41598-024-62230-9
PMID:38769115
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11106343/
Abstract

Several attempts for speech brain-computer interfacing (BCI) have been made to decode phonemes, sub-words, words, or sentences using invasive measurements, such as the electrocorticogram (ECoG), during auditory speech perception, overt speech, or imagined (covert) speech. Decoding sentences from covert speech is a challenging task. Sixteen epilepsy patients with intracranially implanted electrodes participated in this study, and ECoGs were recorded during overt speech and covert speech of eight Japanese sentences, each consisting of three tokens. In particular, Transformer neural network model was applied to decode text sentences from covert speech, which was trained using ECoGs obtained during overt speech. We first examined the proposed Transformer model using the same task for training and testing, and then evaluated the model's performance when trained with overt task for decoding covert speech. The Transformer model trained on covert speech achieved an average token error rate (TER) of 46.6% for decoding covert speech, whereas the model trained on overt speech achieved a TER of 46.3% . Therefore, the challenge of collecting training data for covert speech can be addressed using overt speech. The performance of covert speech can improve by employing several overt speeches.

摘要

已经有一些尝试使用侵入性测量方法(如脑电描记术)进行语音脑机接口(BCI)的研究,以解码语音感知、口语或想象(隐蔽)语音中的音素、亚词、单词或句子。从隐蔽语音中解码句子是一项具有挑战性的任务。本研究纳入了 16 名颅内植入电极的癫痫患者,在他们进行口头和隐蔽日语句子(每个句子包含三个语料)的语音时记录脑电描记术。特别是,应用了基于 Transformer 的神经网络模型来从隐蔽语音中解码文本来训练该模型,使用在口头语音中获得的脑电描记术来进行训练。我们首先使用相同的训练和测试任务来检验所提出的 Transformer 模型,然后评估该模型在使用口头任务进行隐蔽语音解码时的性能。在隐蔽语音上进行训练的 Transformer 模型在解码隐蔽语音时的平均语料错误率(TER)为 46.6%,而在口头语音上进行训练的模型的 TER 为 46.3%。因此,可以使用口头语音来解决隐蔽语音训练数据收集的挑战。通过使用多个口头语音,可以提高隐蔽语音的性能。

相似文献

1
Feasibility of decoding covert speech in ECoG with a Transformer trained on overt speech.使用基于口语训练的 Transformer 对 ECoG 中的隐蔽语音进行解码的可行性。
Sci Rep. 2024 May 20;14(1):11491. doi: 10.1038/s41598-024-62230-9.
2
Decoding covert speech for intuitive control of brain-computer interfaces based on single-trial EEG: a feasibility study.基于单次试验脑电图的脑机接口直观控制中隐蔽语音解码:一项可行性研究。
IEEE Int Conf Rehabil Robot. 2019 Jun;2019:689-693. doi: 10.1109/ICORR.2019.8779499.
3
Imagined speech can be decoded from low- and cross-frequency intracranial EEG features.想象中的言语可以从低频率和跨频率颅内 EEG 特征中解码出来。
Nat Commun. 2022 Jan 10;13(1):48. doi: 10.1038/s41467-021-27725-3.
4
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
5
Machine translation of cortical activity to text with an encoder-decoder framework.基于编解码器框架的皮质活动文本机器翻译。
Nat Neurosci. 2020 Apr;23(4):575-582. doi: 10.1038/s41593-020-0608-8. Epub 2020 Mar 30.
6
Spatio-Temporal Progression of Cortical Activity Related to Continuous Overt and Covert Speech Production in a Reading Task.阅读任务中与连续公开和隐蔽言语产生相关的皮层活动的时空进展
PLoS One. 2016 Nov 22;11(11):e0166872. doi: 10.1371/journal.pone.0166872. eCollection 2016.
7
Decoding spectrotemporal features of overt and covert speech from the human cortex.从人类大脑皮层解码公开和隐蔽言语的频谱时间特征。
Front Neuroeng. 2014 May 27;7:14. doi: 10.3389/fneng.2014.00014. eCollection 2014.
8
A Bimodal Deep Learning Architecture for EEG-fNIRS Decoding of Overt and Imagined Speech.一种用于 EEG-fNIRS 解码言语出声和想象的双模深度学习架构。
IEEE Trans Biomed Eng. 2022 Jun;69(6):1983-1994. doi: 10.1109/TBME.2021.3132861. Epub 2022 May 19.
9
Imagined speech increases the hemodynamic response and functional connectivity of the dorsal motor cortex.想象中的言语会增加大脑背侧运动皮质的血流动力学反应和功能连接。
J Neural Eng. 2021 Oct 7;18(5). doi: 10.1088/1741-2552/ac25d9.
10
Speech decoding from a small set of spatially segregated minimally invasive intracranial EEG electrodes with a compact and interpretable neural network.使用紧凑且可解释的神经网络从一小组空间隔离的微创颅内脑电图电极进行语音解码。
J Neural Eng. 2022 Nov 24;19(6). doi: 10.1088/1741-2552/aca1e1.

本文引用的文献

1
EEG temporal-spatial transformer for person identification.用于人员识别的 EEG 时空变换。
Sci Rep. 2022 Aug 23;12(1):14378. doi: 10.1038/s41598-022-18502-3.
2
A Transformer-Based Approach Combining Deep Learning Network and Spatial-Temporal Information for Raw EEG Classification.基于深度学习网络和时空信息的Transformer 结合方法用于原始 EEG 分类。
IEEE Trans Neural Syst Rehabil Eng. 2022;30:2126-2136. doi: 10.1109/TNSRE.2022.3194600. Epub 2022 Aug 4.
3
Decoding selective auditory attention with EEG using a transformer model.
使用变压器模型对 EEG 进行选择性听觉注意力解码。
Methods. 2022 Aug;204:410-417. doi: 10.1016/j.ymeth.2022.04.009. Epub 2022 Apr 18.
4
Imagined speech can be decoded from low- and cross-frequency intracranial EEG features.想象中的言语可以从低频率和跨频率颅内 EEG 特征中解码出来。
Nat Commun. 2022 Jan 10;13(1):48. doi: 10.1038/s41467-021-27725-3.
5
Gated Transformer for Decoding Human Brain EEG Signals.门控转换器用于解码人类脑电信号。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:125-130. doi: 10.1109/EMBC46164.2021.9630210.
6
Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.从神经活动的微创记录中实时合成想象中的语音过程。
Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
7
Brain2Char: a deep architecture for decoding text from brain recordings.脑到字符:一种从脑记录中解码文本的深度架构。
J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.
8
Decoding Brain Representations by Multimodal Learning of Neural Activity and Visual Features.通过对神经活动和视觉特征的多模态学习来解码大脑表征。
IEEE Trans Pattern Anal Mach Intell. 2021 Nov;43(11):3833-3849. doi: 10.1109/TPAMI.2020.2995909. Epub 2021 Oct 1.
9
Machine translation of cortical activity to text with an encoder-decoder framework.基于编解码器框架的皮质活动文本机器翻译。
Nat Neurosci. 2020 Apr;23(4):575-582. doi: 10.1038/s41593-020-0608-8. Epub 2020 Mar 30.
10
Real-time decoding of question-and-answer speech dialogue using human cortical activity.使用人类大脑皮层活动实时解码问答式语音对话。
Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.