• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从神经活动的微创记录中实时合成想象中的语音过程。

Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.

机构信息

Cognitive Systems Lab, University of Bremen, Bremen, Germany.

Department of Neurosurgery, School of Mental Health and Neurosciences, Maastricht University, Maastricht, The Netherlands.

出版信息

Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.

DOI:10.1038/s42003-021-02578-0
PMID:34556793
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC8460739/
Abstract

Speech neuroprosthetics aim to provide a natural communication channel to individuals who are unable to speak due to physical or neurological impairments. Real-time synthesis of acoustic speech directly from measured neural activity could enable natural conversations and notably improve quality of life, particularly for individuals who have severely limited means of communication. Recent advances in decoding approaches have led to high quality reconstructions of acoustic speech from invasively measured neural activity. However, most prior research utilizes data collected during open-loop experiments of articulated speech, which might not directly translate to imagined speech processes. Here, we present an approach that synthesizes audible speech in real-time for both imagined and whispered speech conditions. Using a participant implanted with stereotactic depth electrodes, we were able to reliably generate audible speech in real-time. The decoding models rely predominately on frontal activity suggesting that speech processes have similar representations when vocalized, whispered, or imagined. While reconstructed audio is not yet intelligible, our real-time synthesis approach represents an essential step towards investigating how patients will learn to operate a closed-loop speech neuroprosthesis based on imagined speech.

摘要

言语神经修复旨在为因身体或神经损伤而无法说话的个体提供一种自然的交流渠道。直接从测量到的神经活动实时合成声学语音,可以实现自然对话,并显著提高生活质量,特别是对于那些交流手段严重受限的个体。解码方法的最新进展已经能够从侵入性测量的神经活动中重建出高质量的声学语音。然而,大多数先前的研究都利用了在有意义的语音实验中收集的数据,这些数据可能不能直接转化为想象中的语音过程。在这里,我们提出了一种可以实时合成想象和低语语音的方法。使用植入立体定向深度电极的参与者,我们能够实时可靠地生成可听见的语音。解码模型主要依赖于额部活动,这表明当发声、低语或想象时,言语过程具有相似的表示。虽然重建的音频还不能理解,但我们的实时合成方法代表了朝着研究患者如何基于想象中的语音来学习操作闭环言语神经修复的重要一步。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/aab3873ea693/42003_2021_2578_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/fec7ec6c405b/42003_2021_2578_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/f79e1d2ea691/42003_2021_2578_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/41b2ab9b09a2/42003_2021_2578_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/4ec571107282/42003_2021_2578_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/96ecfa2e4890/42003_2021_2578_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/aab3873ea693/42003_2021_2578_Fig6_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/fec7ec6c405b/42003_2021_2578_Fig1_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/f79e1d2ea691/42003_2021_2578_Fig2_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/41b2ab9b09a2/42003_2021_2578_Fig3_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/4ec571107282/42003_2021_2578_Fig4_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/96ecfa2e4890/42003_2021_2578_Fig5_HTML.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ad15/8460739/aab3873ea693/42003_2021_2578_Fig6_HTML.jpg

相似文献

1
Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.从神经活动的微创记录中实时合成想象中的语音过程。
Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
2
Speech Synthesis from Stereotactic EEG using an Electrode Shaft Dependent Multi-Input Convolutional Neural Network Approach.基于电极轴相关多输入卷积神经网络的立体脑电图语音合成。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:6045-6048. doi: 10.1109/EMBC46164.2021.9629711.
3
The speech neuroprosthesis.言语神经假体。
Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.
4
Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria.神经假体用于解码无言语症瘫痪患者的言语。
N Engl J Med. 2021 Jul 15;385(3):217-227. doi: 10.1056/NEJMoa2027540.
5
The nested hierarchy of overt, mouthed, and imagined speech activity evident in intracranial recordings.颅内记录中明显存在的外显、口头和想象言语活动的嵌套层次结构。
Neuroimage. 2023 Apr 1;269:119913. doi: 10.1016/j.neuroimage.2023.119913. Epub 2023 Jan 31.
6
An Accurate and Rapidly Calibrating Speech Neuroprosthesis.一种精确且快速校准的语音神经假体。
N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.
7
Continuous synthesis of artificial speech sounds from human cortical surface recordings during silent speech production.在安静的言语产生过程中,从人类皮质表面记录中连续合成人工言语声音。
J Neural Eng. 2023 Jul 27;20(4). doi: 10.1088/1741-2552/ace7f6.
8
Imagined speech can be decoded from low- and cross-frequency intracranial EEG features.想象中的言语可以从低频率和跨频率颅内 EEG 特征中解码出来。
Nat Commun. 2022 Jan 10;13(1):48. doi: 10.1038/s41467-021-27725-3.
9
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.使用慢性植入脑-机接口对肌萎缩性侧索硬化症患者进行在线语音合成。
Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.
10
Opportunities, pitfalls and trade-offs in designing protocols for measuring the neural correlates of speech.设计测量言语神经相关指标的方案时的机遇、陷阱与权衡。
Neurosci Biobehav Rev. 2022 Sep;140:104783. doi: 10.1016/j.neubiorev.2022.104783. Epub 2022 Jul 27.

引用本文的文献

1
Inner speech in motor cortex and implications for speech neuroprostheses.运动皮层中的内部言语及其对言语神经假体的意义。
Cell. 2025 Jul 9. doi: 10.1016/j.cell.2025.06.015.
2
An instantaneous voice-synthesis neuroprosthesis.一种即时语音合成神经假体。
Nature. 2025 Jun 12. doi: 10.1038/s41586-025-09127-3.
3
Encoding of speech modes and loudness in ventral precentral gyrus.腹侧中央前回中语音模式和响度的编码。

本文引用的文献

1
Observation and assessment of acoustic contamination of electrophysiological brain signals during speech production and sound perception.观察和评估言语产生和声音感知期间电生理脑信号的声学干扰。
J Neural Eng. 2020 Oct 15;17(5):056028. doi: 10.1088/1741-2552/abb25e.
2
Machine translation of cortical activity to text with an encoder-decoder framework.基于编解码器框架的皮质活动文本机器翻译。
Nat Neurosci. 2020 Apr;23(4):575-582. doi: 10.1038/s41593-020-0608-8. Epub 2020 Mar 30.
3
The Potential of Stereotactic-EEG for Brain-Computer Interfaces: Current Progress and Future Directions.
bioRxiv. 2025 May 31:2025.05.30.657105. doi: 10.1101/2025.05.30.657105.
4
Synthesizing intelligible utterances from EEG of imagined speech.从想象言语的脑电图中合成可理解的话语。
Front Neurosci. 2025 Apr 17;19:1565848. doi: 10.3389/fnins.2025.1565848. eCollection 2025.
5
Invasive Brain-Computer Interface for Communication: A Scoping Review.用于交流的侵入性脑机接口:一项范围综述
Brain Sci. 2025 Mar 24;15(4):336. doi: 10.3390/brainsci15040336.
6
Acoustic Inspired Brain-to-Sentence Decoder for Logosyllabic Language.用于标识音节语言的声学启发式脑到句子解码器
Cyborg Bionic Syst. 2025 Apr 29;6:0257. doi: 10.34133/cbsystems.0257. eCollection 2025.
7
VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.VocalMind:一个用于有声、哑剧和想象中的声调语言语音的立体定向脑电图数据集。
Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.
8
A streaming brain-to-voice neuroprosthesis to restore naturalistic communication.一种用于恢复自然交流的流式脑到语音神经假体。
Nat Neurosci. 2025 Apr;28(4):902-912. doi: 10.1038/s41593-025-01905-6. Epub 2025 Mar 31.
9
Whole-brain dynamics of articulatory, acoustic and semantic speech representations.发音、声学和语义语音表征的全脑动力学。
Commun Biol. 2025 Mar 13;8(1):432. doi: 10.1038/s42003-025-07862-x.
10
Learning to operate an imagined speech Brain-Computer Interface involves the spatial and frequency tuning of neural activity.学习操作想象中的语音脑机接口涉及神经活动的空间和频率调谐。
Commun Biol. 2025 Feb 20;8(1):271. doi: 10.1038/s42003-025-07464-7.
立体定向脑电图在脑机接口中的潜力:当前进展与未来方向。
Front Neurosci. 2020 Feb 27;14:123. doi: 10.3389/fnins.2020.00123. eCollection 2020.
4
Speech-related dorsal motor cortex activity does not interfere with iBCI cursor control.与言语相关的背侧运动皮层活动不会干扰 iBCI 光标控制。
J Neural Eng. 2020 Feb 5;17(1):016049. doi: 10.1088/1741-2552/ab5b72.
5
Generating Natural, Intelligible Speech From Brain Activity in Motor, Premotor, and Inferior Frontal Cortices.从运动皮层、运动前区皮层和额下回的脑活动中生成自然、可理解的语音。
Front Neurosci. 2019 Nov 22;13:1267. doi: 10.3389/fnins.2019.01267. eCollection 2019.
6
Neural ensemble dynamics in dorsal motor cortex during speech in people with paralysis.人在瘫痪时说话时背侧运动皮层中的神经集合动力学。
Elife. 2019 Dec 10;8:e46015. doi: 10.7554/eLife.46015.
7
Stability of a chronic implanted brain-computer interface in late-stage amyotrophic lateral sclerosis.晚期肌萎缩侧索硬化症慢性植入式脑机接口的稳定性。
Clin Neurophysiol. 2019 Oct;130(10):1798-1803. doi: 10.1016/j.clinph.2019.07.020. Epub 2019 Jul 27.
8
Real-time decoding of question-and-answer speech dialogue using human cortical activity.使用人类大脑皮层活动实时解码问答式语音对话。
Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.
9
Principled BCI Decoder Design and Parameter Selection Using a Feedback Control Model.基于反馈控制模型的有原则的脑机接口解码器设计与参数选择。
Sci Rep. 2019 Jun 20;9(1):8881. doi: 10.1038/s41598-019-44166-7.
10
Speech synthesis from neural decoding of spoken sentences.基于语音解码的语音合成
Nature. 2019 Apr;568(7753):493-498. doi: 10.1038/s41586-019-1119-1. Epub 2019 Apr 24.