• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种即时语音合成神经假体。

An instantaneous voice synthesis neuroprosthesis.

作者信息

Wairagkar Maitreyee, Card Nicholas S, Singer-Clark Tyler, Hou Xianda, Iacobacci Carrina, Hochberg Leigh R, Brandman David M, Stavisky Sergey D

机构信息

Department of Neurological Surgery, University of California Davis, Davis, CA.

Department of Biomedical Engineering, University of California Davis, Davis, CA.

出版信息

bioRxiv. 2024 Sep 20:2024.08.14.607690. doi: 10.1101/2024.08.14.607690.

DOI:10.1101/2024.08.14.607690
PMID:39229047
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC11370360/
Abstract

Brain computer interfaces (BCIs) have the potential to restore communication to people who have lost the ability to speak due to neurological disease or injury. BCIs have been used to translate the neural correlates of attempted speech into text. However, text communication fails to capture the nuances of human speech such as prosody, intonation and immediately hearing one's own voice. Here, we demonstrate a "brain-to-voice" neuroprosthesis that instantaneously synthesizes voice with closed-loop audio feedback by decoding neural activity from 256 microelectrodes implanted into the ventral precentral gyrus of a man with amyotrophic lateral sclerosis and severe dysarthria. We overcame the challenge of lacking ground-truth speech for training the neural decoder and were able to accurately synthesize his voice. Along with phonemic content, we were also able to decode paralinguistic features from intracortical activity, enabling the participant to modulate his BCI-synthesized voice in real-time to change intonation, emphasize words, and sing short melodies. These results demonstrate the feasibility of enabling people with paralysis to speak intelligibly and expressively through a BCI.

摘要

脑机接口(BCIs)有潜力为因神经疾病或损伤而丧失说话能力的人恢复沟通能力。脑机接口已被用于将尝试说话的神经关联转化为文本。然而,文本交流无法捕捉人类语音的细微差别,如韵律、语调以及即时听到自己的声音。在此,我们展示了一种“脑到语音”神经假体,通过对植入一名患有肌萎缩侧索硬化症和严重构音障碍男子的腹侧中央前回的256个微电极的神经活动进行解码,利用闭环音频反馈即时合成语音。我们克服了缺乏用于训练神经解码器的真实语音这一挑战,并能够准确合成他的声音。除了音素内容,我们还能够从皮质内活动中解码副语言特征,使参与者能够实时调节其脑机接口合成的语音,以改变语调、强调单词并演唱简短旋律。这些结果证明了让瘫痪患者通过脑机接口清晰且富有表现力地说话的可行性。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/55e95c69b318/nihpp-2024.08.14.607690v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/b894e761dbfa/nihpp-2024.08.14.607690v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/e7c31a8964eb/nihpp-2024.08.14.607690v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/6cf61429bb97/nihpp-2024.08.14.607690v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/b735283a4291/nihpp-2024.08.14.607690v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/07141bb8c456/nihpp-2024.08.14.607690v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/827d624f4a1b/nihpp-2024.08.14.607690v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/93241c226113/nihpp-2024.08.14.607690v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/050129b95675/nihpp-2024.08.14.607690v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/c89fc3ed00fa/nihpp-2024.08.14.607690v2-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/e608bca5c55b/nihpp-2024.08.14.607690v2-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/824f9864debd/nihpp-2024.08.14.607690v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/c857c75201f8/nihpp-2024.08.14.607690v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/9fab41f6580e/nihpp-2024.08.14.607690v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/55e95c69b318/nihpp-2024.08.14.607690v2-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/b894e761dbfa/nihpp-2024.08.14.607690v2-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/e7c31a8964eb/nihpp-2024.08.14.607690v2-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/6cf61429bb97/nihpp-2024.08.14.607690v2-f0007.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/b735283a4291/nihpp-2024.08.14.607690v2-f0008.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/07141bb8c456/nihpp-2024.08.14.607690v2-f0009.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/827d624f4a1b/nihpp-2024.08.14.607690v2-f0010.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/93241c226113/nihpp-2024.08.14.607690v2-f0011.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/050129b95675/nihpp-2024.08.14.607690v2-f0012.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/c89fc3ed00fa/nihpp-2024.08.14.607690v2-f0013.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/e608bca5c55b/nihpp-2024.08.14.607690v2-f0014.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/824f9864debd/nihpp-2024.08.14.607690v2-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/c857c75201f8/nihpp-2024.08.14.607690v2-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/9fab41f6580e/nihpp-2024.08.14.607690v2-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/ccaf/11421060/55e95c69b318/nihpp-2024.08.14.607690v2-f0004.jpg

相似文献

1
An instantaneous voice synthesis neuroprosthesis.一种即时语音合成神经假体。
bioRxiv. 2024 Sep 20:2024.08.14.607690. doi: 10.1101/2024.08.14.607690.
2
An instantaneous voice-synthesis neuroprosthesis.一种即时语音合成神经假体。
Nature. 2025 Jun 12. doi: 10.1038/s41586-025-09127-3.
3
Effectiveness of voice rehabilitation on vocalisation in postlaryngectomy patients: a systematic review.喉切除术后患者的嗓音康复对发声效果的影响:系统评价。
Int J Evid Based Healthc. 2010 Dec;8(4):256-8. doi: 10.1111/j.1744-1609.2010.00177.x.
4
Interventions for dysarthria due to stroke and other adult-acquired, non-progressive brain injury.针对中风及其他成人后天性、非进行性脑损伤所致构音障碍的干预措施。
Cochrane Database Syst Rev. 2017 Jan 25;1(1):CD002088. doi: 10.1002/14651858.CD002088.pub3.
5
Interventions for childhood apraxia of speech.儿童言语失用症的干预措施。
Cochrane Database Syst Rev. 2018 May 30;5(5):CD006278. doi: 10.1002/14651858.CD006278.pub3.
6
Cost-effectiveness of using prognostic information to select women with breast cancer for adjuvant systemic therapy.利用预后信息为乳腺癌患者选择辅助性全身治疗的成本效益
Health Technol Assess. 2006 Sep;10(34):iii-iv, ix-xi, 1-204. doi: 10.3310/hta10340.
7
Health professionals' experience of teamwork education in acute hospital settings: a systematic review of qualitative literature.医疗专业人员在急症医院环境中团队合作教育的经验:对定性文献的系统综述
JBI Database System Rev Implement Rep. 2016 Apr;14(4):96-137. doi: 10.11124/JBISRIR-2016-1843.
8
Interventions for recruiting smokers into cessation programmes.将吸烟者纳入戒烟计划的干预措施。
Cochrane Database Syst Rev. 2012 Dec 12;12(12):CD009187. doi: 10.1002/14651858.CD009187.pub2.
9
Speech imagery brain-computer interfaces: a systematic literature review.言语意象脑机接口:系统文献综述
J Neural Eng. 2025 Jun 26;22(3). doi: 10.1088/1741-2552/ade28e.
10
A systematic review of speech, language and communication interventions for children with Down syndrome from 0 to 6 years.对0至6岁唐氏综合征儿童言语、语言和沟通干预措施的系统评价。
Int J Lang Commun Disord. 2022 Mar;57(2):441-463. doi: 10.1111/1460-6984.12699. Epub 2022 Feb 22.

本文引用的文献

1
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models.StyleTTS 2:通过风格扩散和与大型语音语言模型的对抗训练实现接近人类水平的文本转语音
Adv Neural Inf Process Syst. 2023 Dec;36:19594-19621. Epub 2023 Dec 10.
2
An Accurate and Rapidly Calibrating Speech Neuroprosthesis.一种精确且快速校准的语音神经假体。
N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.
3
Speech decoding from stereo-electroencephalography (sEEG) signals using advanced deep learning methods.
使用先进的深度学习方法从立体脑电图(sEEG)信号中进行语音解码。
J Neural Eng. 2024 Jun 27;21(3). doi: 10.1088/1741-2552/ad593a.
4
The speech neuroprosthesis.言语神经假体。
Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.
5
Online speech synthesis using a chronically implanted brain-computer interface in an individual with ALS.使用慢性植入脑-机接口对肌萎缩性侧索硬化症患者进行在线语音合成。
Sci Rep. 2024 Apr 26;14(1):9617. doi: 10.1038/s41598-024-60277-2.
6
BRAND: a platform for closed-loop experiments with deep network models.BRAND:一个用于深度网络模型闭环实验的平台。
J Neural Eng. 2024 Apr 17;21(2):026046. doi: 10.1088/1741-2552/ad3b3a.
7
Preparatory activity and the expansive null-space.预备活动与扩张零空间。
Nat Rev Neurosci. 2024 Apr;25(4):213-236. doi: 10.1038/s41583-024-00796-z. Epub 2024 Mar 5.
8
Intraoperative cortical stimulation mapping with laryngeal electromyography for the localization of human laryngeal motor cortex.经喉肌电图的术中皮质刺激映射定位人类喉运动皮质。
J Neurosurg. 2024 Jan 5;141(1):268-277. doi: 10.3171/2023.10.JNS231023. Print 2024 Jul 1.
9
Neural control of lexical tone production in human laryngeal motor cortex.人类喉 Motor 皮质中词汇音产生的神经控制。
Nat Commun. 2023 Oct 30;14(1):6917. doi: 10.1038/s41467-023-42175-9.
10
A high-performance neuroprosthesis for speech decoding and avatar control.一种用于语音解码和化身控制的高性能神经假体。
Nature. 2023 Aug;620(7976):1037-1046. doi: 10.1038/s41586-023-06443-4. Epub 2023 Aug 23.