A streaming brain-to-voice neuroprosthesis to restore naturalistic communication.

Authors

Littlejohn Kaylo T, Cho Cheol Jun, Liu Jessie R, Silva Alexander B, Yu Bohan, Anderson Vanessa R, Kurtz-Miott Cady M, Brosler Samantha, Kashyap Anshul P, Hallinan Irina P, Shah Adit, Tu-Chan Adelyn, Ganguly Karunesh, Moses David A, Chang Edward F, Anumanchipalli Gopala K

Affiliations

Department of Electrical Engineering and Computer Sciences, University of California, Berkeley, Berkeley, CA, USA.

Department of Neurological Surgery, University of California, San Francisco, San Francisco, CA, USA.

Publication information

Nat Neurosci. 2025 Apr;28(4):902-912. doi: 10.1038/s41593-025-01905-6. Epub 2025 Mar 31.

DOI:10.1038/s41593-025-01905-6

PMID:40164740

Abstract

Natural spoken communication happens instantaneously. Speech delays longer than a few seconds can disrupt the natural flow of conversation. This makes it difficult for individuals with paralysis to participate in meaningful dialogue, potentially leading to feelings of isolation and frustration. Here we used high-density surface recordings of the speech sensorimotor cortex in a clinical trial participant with severe paralysis and anarthria to drive a continuously streaming naturalistic speech synthesizer. We designed and used deep learning recurrent neural network transducer models to achieve online large-vocabulary intelligible fluent speech synthesis personalized to the participant's preinjury voice with neural decoding in 80-ms increments. Offline, the models demonstrated implicit speech detection capabilities and could continuously decode speech indefinitely, enabling uninterrupted use of the decoder and further increasing speed. Our framework also successfully generalized to other silent-speech interfaces, including single-unit recordings and electromyography. Our findings introduce a speech-neuroprosthetic paradigm to restore naturalistic spoken communication to people with paralysis.

Similar articles

A streaming brain-to-voice neuroprosthesis to restore naturalistic communication.

Nat Neurosci. 2025 Apr;28(4):902-912. doi: 10.1038/s41593-025-01905-6. Epub 2025 Mar 31.

A high-performance neuroprosthesis for speech decoding and avatar control.

Nature. 2023 Aug;620(7976):1037-1046. doi: 10.1038/s41586-023-06443-4. Epub 2023 Aug 23.

A high-performance speech neuroprosthesis.

Nature. 2023 Aug;620(7976):1031-1036. doi: 10.1038/s41586-023-06377-x. Epub 2023 Aug 23.

Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria.

N Engl J Med. 2021 Jul 15;385(3):217-227. doi: 10.1056/NEJMoa2027540.

Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids.

Neuroimage. 2018 Oct 15;180(Pt A):301-311. doi: 10.1016/j.neuroimage.2017.10.011. Epub 2017 Oct 7.

An Accurate and Rapidly Calibrating Speech Neuroprosthesis.

N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.

Direct speech reconstruction from sensorimotor brain activity with optimized deep learning models.

J Neural Eng. 2023 Sep 20;20(5):056010. doi: 10.1088/1741-2552/ace8be.

An instantaneous voice synthesis neuroprosthesis.

bioRxiv. 2024 Sep 20:2024.08.14.607690. doi: 10.1101/2024.08.14.607690.

Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis.

Nat Commun. 2022 Nov 8;13(1):6510. doi: 10.1038/s41467-022-33611-3.

Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.

Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.

Cited by

Implications of shared motor and perceptual activations on the sensorimotor cortex for neuroprosthetic decoding.

J Neural Eng. 2025 Aug 7;22(4):046039. doi: 10.1088/1741-2552/adf50e.

Error encoding in human speech motor cortex.

bioRxiv. 2025 Jun 8:2025.06.07.658426. doi: 10.1101/2025.06.07.658426.

Towards Predictive Communication: The Fusion of Large Language Models and Brain-Computer Interface.

Sensors (Basel). 2025 Jun 26;25(13):3987. doi: 10.3390/s25133987.

China pours money into brain chips that give paralysed people more control.

Nature. 2025 Jul;643(8072):613-614. doi: 10.1038/d41586-025-02098-5.

Encoding of speech modes and loudness in ventral precentral gyrus.

bioRxiv. 2025 May 31:2025.05.30.657105. doi: 10.1101/2025.05.30.657105.

Non-Invasive Brain Stimulation and Artificial Intelligence in Communication Neuroprosthetics: A Bidirectional Approach for Speech and Hearing Impairments.

Brain Sci. 2025 Apr 25;15(5):449. doi: 10.3390/brainsci15050449.

References

Speech-induced suppression and vocal feedback sensitivity in human cortex.

Elife. 2024 Sep 10;13:RP94198. doi: 10.7554/eLife.94198.

An Accurate and Rapidly Calibrating Speech Neuroprosthesis.

N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.

A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.

Nat Biomed Eng. 2024 Aug;8(8):977-991. doi: 10.1038/s41551-024-01207-5. Epub 2024 May 20.

The speech neuroprosthesis.

Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.

Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.

Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.

High-resolution neural recordings improve the accuracy of speech decoding.

Nat Commun. 2023 Nov 6;14(1):6938. doi: 10.1038/s41467-023-42555-1.

A high-performance neuroprosthesis for speech decoding and avatar control.

Nature. 2023 Aug;620(7976):1037-1046. doi: 10.1038/s41586-023-06443-4. Epub 2023 Aug 23.

A high-performance speech neuroprosthesis.

Nature. 2023 Aug;620(7976):1031-1036. doi: 10.1038/s41586-023-06377-x. Epub 2023 Aug 23.

Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis.

Nat Commun. 2022 Nov 8;13(1):6510. doi: 10.1038/s41467-022-33611-3.

Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.

Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
