• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

脑到字符:一种从脑记录中解码文本的深度架构。

Brain2Char: a deep architecture for decoding text from brain recordings.

机构信息

Center of Integrative Neurosciences, University of California, San Francisco, CA, United States of America.

These authors contributed equally to this work.

出版信息

J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.

DOI:10.1088/1741-2552/abc742
PMID:33142282
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC9591243/
Abstract

Decoding language representations directly from the brain can enable new brain-computer interfaces (BCIs) for high bandwidth human-human and human-machine communication. Clinically, such technologies can restore communication in people with neurological conditions affecting their ability to speak.. In this study, we propose a novel deep network architecture Brain2Char, for directly decoding text (specifically character sequences) from direct brain recordings (called electrocorticography, ECoG). Brain2Char framework combines state-of-the-art deep learning modules-3D Inception layers for multiband spatiotemporal feature extraction from neural data and bidirectional recurrent layers, dilated convolution layers followed by language model weighted beam search to decode character sequences, and optimizing a connectionist temporal classification loss. Additionally, given the highly non-linear transformations that underlie the conversion of cortical function to character sequences, we perform regularizations on the network's latent representations motivated by insights into cortical encoding of speech production and artifactual aspects specific to ECoG data acquisition. To do this, we impose auxiliary losses on latent representations for articulatory movements, speech acoustics and session specific non-linearities.In three (out of four) participants reported here, Brain2Char achieves 10.6%, 8.5%, and 7.0% word error rates respectively on vocabulary sizes ranging from 1200 to 1900 words.These results establish a newon decoding text fromand demonstrate the potential of Brain2Char as a high-performance communication BCI.

摘要

直接从大脑中解码语言表示可以为高带宽的人机和人人通信启用新的脑机接口 (BCI)。从临床角度来看,此类技术可以恢复影响说话能力的神经状况患者的沟通能力。在这项研究中,我们提出了一种新颖的深度网络架构 Brain2Char,用于直接从直接的大脑记录(称为皮层电图,ECoG)中解码文本(特别是字符序列)。Brain2Char 框架结合了最先进的深度学习模块-3D Inception 层,用于从神经数据中提取多频带时空特征,以及双向递归层、扩张卷积层,然后是语言模型加权波束搜索来解码字符序列,并优化连接时间分类损失。此外,鉴于皮质功能转换为字符序列的高度非线性变换,我们根据对言语产生的皮质编码的深入了解以及 ECoG 数据采集特有的人为方面,对网络的潜在表示进行正则化。为此,我们对发音运动、语音声学和特定于会话的非线性的潜在表示施加辅助损失。在报告的四个参与者中的三个中,Brain2Char 在词汇量从 1200 到 1900 个单词的范围内分别实现了 10.6%、8.5%和 7.0%的单词错误率。这些结果建立了一个新的从解码文本,并展示了 Brain2Char 作为高性能通信 BCI 的潜力。

相似文献

1
Brain2Char: a deep architecture for decoding text from brain recordings.脑到字符:一种从脑记录中解码文本的深度架构。
J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.
2
The Potential for a Speech Brain-Computer Interface Using Chronic Electrocorticography.利用慢性皮层脑电图实现语音脑-机接口的潜力
Neurotherapeutics. 2019 Jan;16(1):144-165. doi: 10.1007/s13311-018-00692-2.
3
Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.脑机接口:应用于语音解码和合成以增强交流。
Neurotherapeutics. 2022 Jan;19(1):263-273. doi: 10.1007/s13311-022-01190-2. Epub 2022 Jan 31.
4
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.基于优化深度学习模型的感觉运动脑活动的直接语音重建。
J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be.
5
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
6
Decoding speech using the timing of neural signal modulation.利用神经信号调制的时间来解码语音。
Annu Int Conf IEEE Eng Med Biol Soc. 2016 Aug;2016:1532-1535. doi: 10.1109/EMBC.2016.7591002.
7
Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network.从分布式语言网络中解码自然连续语音的发音和语音成分。
J Neural Eng. 2023 Aug 14;20(4). doi: 10.1088/1741-2552/ace9fb.
8
High-resolution neural recordings improve the accuracy of speech decoding.高分辨率神经记录提高了语音解码的准确性。
Nat Commun. 2023 Nov 6;14(1):6938. doi: 10.1038/s41467-023-42555-1.
9
A high-performance speech neuroprosthesis.高性能言语神经假体
Nature. 2023 Aug;620(7976):1031-1036. doi: 10.1038/s41586-023-06377-x. Epub 2023 Aug 23.
10
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.使用慢性植入脑-机接口对肌萎缩性侧索硬化症患者进行在线语音合成。
Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.

引用本文的文献

1
Improved evaluation of waveform reconstruction in speech decoding based on invasive brain-computer interfaces.基于侵入式脑机接口的语音解码中波形重建的改进评估
Imaging Neurosci (Camb). 2025 Sep 10;3. doi: 10.1162/IMAG.a.146. eCollection 2025.
2
VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.VocalMind:一个用于有声、哑剧和想象中的声调语言语音的立体定向脑电图数据集。
Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.
3
Whole-brain dynamics of articulatory, acoustic and semantic speech representations.

本文引用的文献

1
Machine translation of cortical activity to text with an encoder-decoder framework.基于编解码器框架的皮质活动文本机器翻译。
Nat Neurosci. 2020 Apr;23(4):575-582. doi: 10.1038/s41593-020-0608-8. Epub 2020 Mar 30.
2
Toward a Speech Neuroprosthesis.迈向言语神经假体。
JAMA. 2020 Feb 4;323(5):413-414. doi: 10.1001/jama.2019.19813.
3
Speech synthesis from neural decoding of spoken sentences.基于语音解码的语音合成
发音、声学和语义语音表征的全脑动力学。
Commun Biol. 2025 Mar 13;8(1):432. doi: 10.1038/s42003-025-07862-x.
4
Transformer-based neural speech decoding from surface and depth electrode signals.基于Transformer的从表面和深度电极信号进行神经语音解码
J Neural Eng. 2025 Jan 28;22(1):016017. doi: 10.1088/1741-2552/adab21.
5
Iterative alignment discovery of speech-associated neural activity.语音相关神经活动的迭代对齐发现。
J Neural Eng. 2024 Aug 28;21(4):046056. doi: 10.1088/1741-2552/ad663c.
6
Temporal-spatial cross attention network for recognizing imagined characters.用于识别想象字符的时空交叉注意网络。
Sci Rep. 2024 Jul 4;14(1):15432. doi: 10.1038/s41598-024-59263-5.
7
ChineseEEG: A Chinese Linguistic Corpora EEG Dataset for Semantic Alignment and Neural Decoding.中文 EEG:用于语义对齐和神经解码的中文语言语料库 EEG 数据集。
Sci Data. 2024 May 29;11(1):550. doi: 10.1038/s41597-024-03398-7.
8
Feasibility of decoding covert speech in ECoG with a Transformer trained on overt speech.使用基于口语训练的 Transformer 对 ECoG 中的隐蔽语音进行解码的可行性。
Sci Rep. 2024 May 20;14(1):11491. doi: 10.1038/s41598-024-62230-9.
9
The speech neuroprosthesis.言语神经假体。
Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.
10
Subject-Agnostic Transformer-Based Neural Speech Decoding from Surface and Depth Electrode Signals.基于表面和深度电极信号的与主题无关的基于Transformer的神经语音解码
bioRxiv. 2024 Sep 25:2024.03.11.584533. doi: 10.1101/2024.03.11.584533.
Nature. 2019 Apr;568(7753):493-498. doi: 10.1038/s41586-019-1119-1. Epub 2019 Apr 24.
4
Speech synthesis from ECoG using densely connected 3D convolutional neural networks.使用密集连接的 3D 卷积神经网络进行脑电信号合成。
J Neural Eng. 2019 Jun;16(3):036019. doi: 10.1088/1741-2552/ab0c59. Epub 2019 Mar 4.
5
Towards reconstructing intelligible speech from the human auditory cortex.从人类听觉皮层重建可理解的语音。
Sci Rep. 2019 Jan 29;9(1):874. doi: 10.1038/s41598-018-37359-z.
6
Differential Representation of Articulatory Gestures and Phonemes in Precentral and Inferior Frontal Gyri.前中央回和下额前回中发音动作和音位的差异表达。
J Neurosci. 2018 Nov 14;38(46):9803-9813. doi: 10.1523/JNEUROSCI.1206-18.2018. Epub 2018 Sep 26.
7
Encoding of Articulatory Kinematic Trajectories in Human Speech Sensorimotor Cortex.人类言语运动感觉皮质中发音运动轨迹的编码。
Neuron. 2018 Jun 6;98(5):1042-1054.e4. doi: 10.1016/j.neuron.2018.04.031. Epub 2018 May 17.
8
Toward a universal decoder of linguistic meaning from brain activation.迈向基于大脑激活的语言意义通用解码器。
Nat Commun. 2018 Mar 6;9(1):963. doi: 10.1038/s41467-018-03068-4.
9
Decoder calibration with ultra small current sample set for intracortical brain-machine interface.用于脑机接口的超小电流样本集的解码器校准。
J Neural Eng. 2018 Apr;15(2):026019. doi: 10.1088/1741-2552/aaa8a4.
10
High performance communication by people with paralysis using an intracortical brain-computer interface.瘫痪患者使用皮层内脑机接口进行的高效通信。
Elife. 2017 Feb 21;6:e18554. doi: 10.7554/eLife.18554.