• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

基于电极轴相关多输入卷积神经网络的立体脑电图语音合成。

Speech Synthesis from Stereotactic EEG using an Electrode Shaft Dependent Multi-Input Convolutional Neural Network Approach.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:6045-6048. doi: 10.1109/EMBC46164.2021.9629711.

DOI:10.1109/EMBC46164.2021.9629711
PMID:34892495
Abstract

Neurological disorders can lead to significant impairments in speech communication and, in severe cases, cause the complete loss of the ability to speak. Brain-Computer Interfaces have shown promise as an alternative communication modality by directly transforming neural activity of speech processes into a textual or audible representations. Previous studies investigating such speech neuroprostheses relied on electrocorticography (ECoG) or microelectrode arrays that acquire neural signals from superficial areas on the cortex. While both measurement methods have demonstrated successful speech decoding, they do not capture activity from deeper brain structures and this activity has therefore not been harnessed for speech-related BCIs. In this study, we bridge this gap by adapting a previously presented decoding pipeline for speech synthesis based on ECoG signals to implanted depth electrodes (sEEG). For this purpose, we propose a multi-input convolutional neural network that extracts speech-related activity separately for each electrode shaft and estimates spectral coefficients to reconstruct an audible waveform. We evaluate our approach on open-loop data from 5 patients who conducted a recitation task of Dutch utterances. We achieve correlations of up to 0.80 between original and reconstructed speech spectrograms, which are significantly above chance level for all patients (p < 0.001). Our results indicate that sEEG can yield similar speech decoding performance to prior ECoG studies and is a promising modality for speech BCIs.

摘要

神经紊乱会导致言语交流出现严重障碍,在严重的情况下,甚至会完全丧失言语能力。脑机接口作为一种替代的交流方式,已经显示出了很大的潜力,它可以直接将言语过程的神经活动转化为文本或可听的表示。以前的研究依赖于皮层表面的脑电(ECoG)或微电极阵列来获取神经信号,这些研究调查了这种言语神经假体。虽然这两种测量方法都成功地进行了言语解码,但它们都无法捕捉到来自大脑深层结构的活动,因此这些活动尚未被用于与言语相关的脑机接口。在这项研究中,我们通过将基于 ECoG 信号的语音合成的解码管道改编为植入的深部电极(sEEG)来弥补这一差距。为此,我们提出了一种多输入卷积神经网络,该网络可以为每个电极轴分别提取与语音相关的活动,并估计频谱系数以重建可听的波形。我们在 5 名患者的开环数据上评估了我们的方法,这些患者进行了荷兰语发音的背诵任务。我们实现了高达 0.80 的原始和重建语音频谱图之间的相关性,这对于所有患者都显著高于随机水平(p < 0.001)。我们的结果表明,sEEG 可以产生与之前的 ECoG 研究相似的言语解码性能,是一种很有前途的言语脑机接口模式。

相似文献

1
Speech Synthesis from Stereotactic EEG using an Electrode Shaft Dependent Multi-Input Convolutional Neural Network Approach.基于电极轴相关多输入卷积神经网络的立体脑电图语音合成。
Annu Int Conf IEEE Eng Med Biol Soc. 2021 Nov;2021:6045-6048. doi: 10.1109/EMBC46164.2021.9629711.
2
Speech synthesis from ECoG using densely connected 3D convolutional neural networks.使用密集连接的 3D 卷积神经网络进行脑电信号合成。
J Neural Eng. 2019 Jun;16(3):036019. doi: 10.1088/1741-2552/ab0c59. Epub 2019 Mar 4.
3
Speech decoding from a small set of spatially segregated minimally invasive intracranial EEG electrodes with a compact and interpretable neural network.使用紧凑且可解释的神经网络从一小组空间隔离的微创颅内脑电图电极进行语音解码。
J Neural Eng. 2022 Nov 24;19(6). doi: 10.1088/1741-2552/aca1e1.
4
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
5
A Review of Motor Brain-Computer Interfaces Using Intracranial Electroencephalography Based on Surface Electrodes and Depth Electrodes.基于表面电极和深部电极的颅内脑电图的运动脑-机接口综述
IEEE Trans Neural Syst Rehabil Eng. 2024;32:2408-2431. doi: 10.1109/TNSRE.2024.3421551. Epub 2024 Jul 4.
6
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.使用慢性植入脑-机接口对肌萎缩性侧索硬化症患者进行在线语音合成。
Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.
7
The Potential for a Speech Brain-Computer Interface Using Chronic Electrocorticography.利用慢性皮层脑电图实现语音脑-机接口的潜力
Neurotherapeutics. 2019 Jan;16(1):144-165. doi: 10.1007/s13311-018-00692-2.
8
Stability of ECoG high gamma signals during speech and implications for a speech BCI system in an individual with ALS: a year-long longitudinal study.脑电高 gamma 信号在言语期间的稳定性及其对 ALS 个体言语脑机接口系统的影响:一项为期一年的纵向研究。
J Neural Eng. 2024 Jul 12;21(4). doi: 10.1088/1741-2552/ad5c02.
9
Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.从神经活动的微创记录中实时合成想象中的语音过程。
Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
10
Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.脑机接口:应用于语音解码和合成以增强交流。
Neurotherapeutics. 2022 Jan;19(1):263-273. doi: 10.1007/s13311-022-01190-2. Epub 2022 Jan 31.

引用本文的文献

1
VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.VocalMind:一个用于有声、哑剧和想象中的声调语言语音的立体定向脑电图数据集。
Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.
2
Speech decoding using cortical and subcortical electrophysiological signals.利用皮层和皮层下电生理信号进行语音解码。
Front Neurosci. 2024 Feb 29;18:1345308. doi: 10.3389/fnins.2024.1345308. eCollection 2024.
3
Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis.
个体严重的肢体和言语瘫痪中使用言语神经假体实现可泛化的拼写
Nat Commun. 2022 Nov 8;13(1):6510. doi: 10.1038/s41467-022-33611-3.
4
Dataset of Speech Production in intracranial.Electroencephalography.颅内脑电图语音产生数据集。
Sci Data. 2022 Jul 22;9(1):434. doi: 10.1038/s41597-022-01542-9.