• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

从颅内脑数据中实现自然语言解码。

Towards Naturalistic Speech Decoding from Intracranial Brain Data.

出版信息

Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3100-3104. doi: 10.1109/EMBC48229.2022.9871301.

DOI:10.1109/EMBC48229.2022.9871301
PMID:36085779
Abstract

Speech decoding from brain activity can enable development of brain-computer interfaces (BCIs) to restore naturalistic communication in paralyzed patients. Previous work has focused on development of decoding models from isolated speech data with a clean background and multiple repetitions of the material. In this study, we describe a novel approach to speech decoding that relies on a generative adversarial neural network (GAN) to reconstruct speech from brain data recorded during a naturalistic speech listening task (watching a movie). We compared the GAN-based approach, where reconstruction was done from the compressed latent representation of sound decoded from the brain, with several baseline models that reconstructed sound spectrogram directly. We show that the novel approach provides more accurate reconstructions compared to the baselines. These results underscore the potential of GAN models for speech decoding in naturalistic noisy environments and further advancing of BCIs for naturalistic communication. Clinical Relevance - This study presents a novel speech decoding paradigm that combines advances in deep learning, speech synthesis and neural engineering, and has the potential to advance the field of BCI for severely paralyzed individuals.

摘要

从大脑活动中进行语音解码可以开发脑机接口(BCI),以恢复瘫痪患者的自然语言交流。以前的研究主要集中在开发从背景干净且材料有多次重复的孤立语音数据中解码模型。在这项研究中,我们描述了一种新颖的语音解码方法,该方法依赖于生成对抗神经网络(GAN),从自然语言聆听任务(观看电影)期间记录的大脑数据中重建语音。我们比较了基于 GAN 的方法,其中从大脑解码的声音的压缩潜在表示中进行重建,以及直接重建声音频谱图的几种基线模型。我们表明,与基线相比,新方法提供了更准确的重建。这些结果强调了 GAN 模型在自然噪声环境中的语音解码中的潜力,并进一步推进了用于自然语言交流的 BCI。临床相关性-本研究提出了一种新颖的语音解码范式,结合了深度学习、语音合成和神经工程的进步,有潜力推进严重瘫痪患者的 BCI 领域。

相似文献

1
Towards Naturalistic Speech Decoding from Intracranial Brain Data.从颅内脑数据中实现自然语言解码。
Annu Int Conf IEEE Eng Med Biol Soc. 2022 Jul;2022:3100-3104. doi: 10.1109/EMBC48229.2022.9871301.
2
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.基于优化深度学习模型的感觉运动脑活动的直接语音重建。
J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be.
3
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
4
Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network.从分布式语言网络中解码自然连续语音的发音和语音成分。
J Neural Eng. 2023 Aug 14;20(4). doi: 10.1088/1741-2552/ace9fb.
5
Brain-Computer Interface: Applications to Speech Decoding and Synthesis to Augment Communication.脑机接口:应用于语音解码和合成以增强交流。
Neurotherapeutics. 2022 Jan;19(1):263-273. doi: 10.1007/s13311-022-01190-2. Epub 2022 Jan 31.
6
Decoding Speech from Single Trial MEG Signals Using Convolutional Neural Networks and Transfer Learning.使用卷积神经网络和迁移学习从单次试验脑磁图信号中解码语音
Annu Int Conf IEEE Eng Med Biol Soc. 2019 Jul;2019:5531-5535. doi: 10.1109/EMBC.2019.8857874.
7
Towards reconstructing intelligible speech from the human auditory cortex.从人类听觉皮层重建可理解的语音。
Sci Rep. 2019 Jan 29;9(1):874. doi: 10.1038/s41598-018-37359-z.
8
Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs.基于双曲神经网络的高效语音音素解码在高性能语音脑机接口中的应用。
IEEE Trans Neural Syst Rehabil Eng. 2024;32:3432-3441. doi: 10.1109/TNSRE.2024.3457313. Epub 2024 Sep 18.
9
Brain2Char: a deep architecture for decoding text from brain recordings.脑到字符:一种从脑记录中解码文本的深度架构。
J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.
10
Decoding and synthesizing tonal language speech from brain activity.从大脑活动中解码和合成声调语言的语音。
Sci Adv. 2023 Jun 9;9(23):eadh0478. doi: 10.1126/sciadv.adh0478.

引用本文的文献

1
Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.基于优化深度学习模型的感觉运动脑活动的直接语音重建。
J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be.