• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

用于标识音节语言的声学启发式脑到句子解码器

Acoustic Inspired Brain-to-Sentence Decoder for Logosyllabic Language.

作者信息

Feng Chen, Cao Lu, Wu Di, Zhang En, Wang Ting, Jiang Xiaowei, Chen Jinbo, Wu Hui, Lin Siyu, Hou Qiming, Zhu Junming, Yang Jie, Sawan Mohamad, Zhang Yue

机构信息

Department of Neurosurgery, The Second Affiliated Hospital of Zhejiang University School of Medicine, Hangzhou, Zhejiang Province, China.

School of Engineering, Westlake University, Hangzhou, Zhejiang Province, China.

出版信息

Cyborg Bionic Syst. 2025 Apr 29;6:0257. doi: 10.34133/cbsystems.0257. eCollection 2025.

DOI:10.34133/cbsystems.0257
PMID:40302941
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC12038182/
Abstract

Recent advances in brain-computer interfaces (BCIs) have demonstrated the potential to decode language from brain activity into sound or text, which has predominantly focused on alphabetic languages, such as English. However, logosyllabic languages, such as Mandarin Chinese, present marked challenges for establishing decoders that cover all characters, due to its unique syllable structures, extended character sets (e.g., over 50,000 characters for Mandarin Chinese), and complex mappings between characters and syllables, thus hindering practical applications. Here, we leverage the acoustic features of Mandarin Chinese syllables, constructing prediction models for syllable components (initials, tones, and finals), and decode speech-related stereoelectroencephalography (sEEG) signals into coherent Chinese sentences. The results demonstrate a high sentence-level offline decoding performance with a median character accuracy of 71.00% over the full spectrum of characters in the best participant. We also verified that incorporating acoustic-related features into the design of prediction models substantially enhances the accuracy of initials, tones, and finals. Moreover, our findings revealed that effective speech decoding also involves subcortical structures like the thalamus in addition to traditional language-related brain regions. Overall, we established a brain-to-sentence decoder for logosyllabic languages over full character set with a large intracranial electroencephalography dataset.

摘要

脑机接口(BCI)的最新进展已证明从大脑活动解码语言为声音或文本的潜力,这主要集中在字母语言上,如英语。然而,像汉语这样的语素音节文字语言,由于其独特的音节结构、庞大的字符集(例如,汉语有超过50,000个字符)以及字符与音节之间复杂的映射关系,在建立涵盖所有字符的解码器方面面临显著挑战,从而阻碍了实际应用。在此,我们利用汉语普通话音节的声学特征,构建音节成分(声母、声调、韵母)的预测模型,并将与语音相关的立体脑电图(sEEG)信号解码为连贯的中文句子。结果表明,在最佳参与者的所有字符全谱上,句子级离线解码性能较高,中位数字符准确率为71.00%。我们还验证了将与声学相关的特征纳入预测模型设计中可显著提高声母、声调及韵母的准确率。此外,我们的研究结果表明,有效的语音解码除了涉及传统的与语言相关的脑区外,还涉及丘脑等皮层下结构。总体而言,我们利用一个大型颅内脑电图数据集建立了一种针对语素音节文字语言全字符集的脑到句子解码器。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/9ea514bad0a6/cbsystems.0257.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/fc82508196b2/cbsystems.0257.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/155838c39361/cbsystems.0257.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/87f4e495fd32/cbsystems.0257.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/a7029522d42a/cbsystems.0257.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/9ea514bad0a6/cbsystems.0257.fig.005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/fc82508196b2/cbsystems.0257.fig.001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/155838c39361/cbsystems.0257.fig.002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/87f4e495fd32/cbsystems.0257.fig.003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/a7029522d42a/cbsystems.0257.fig.004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fb8d/12038182/9ea514bad0a6/cbsystems.0257.fig.005.jpg

相似文献

1
Acoustic Inspired Brain-to-Sentence Decoder for Logosyllabic Language.用于标识音节语言的声学启发式脑到句子解码器
Cyborg Bionic Syst. 2025 Apr 29;6:0257. doi: 10.34133/cbsystems.0257. eCollection 2025.
2
A brain-to-text framework for decoding natural tonal sentences.一种用于解码自然声调句子的脑到文本框架。
Cell Rep. 2024 Nov 26;43(11):114924. doi: 10.1016/j.celrep.2024.114924. Epub 2024 Oct 31.
3
Speech decoding using cortical and subcortical electrophysiological signals.利用皮层和皮层下电生理信号进行语音解码。
Front Neurosci. 2024 Feb 29;18:1345308. doi: 10.3389/fnins.2024.1345308. eCollection 2024.
4
A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.一种由两种语言之间共享的皮质发音表征驱动的双语言语神经假体。
Nat Biomed Eng. 2024 Aug;8(8):977-991. doi: 10.1038/s41551-024-01207-5. Epub 2024 May 20.
5
VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.VocalMind:一个用于有声、哑剧和想象中的声调语言语音的立体定向脑电图数据集。
Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.
6
Syllable language models for Mandarin speech recognition: exploiting character language models.基于音节的汉语语音识别语言模型:利用汉字语言模型。
J Acoust Soc Am. 2013 Jan;133(1):519-28. doi: 10.1121/1.4768800.
7
Decoding articulatory and phonetic components of naturalistic continuous speech from the distributed language network.从分布式语言网络中解码自然连续语音的发音和语音成分。
J Neural Eng. 2023 Aug 14;20(4). doi: 10.1088/1741-2552/ace9fb.
8
[Design of standard voice sample text for subjective auditory perceptual evaluation of voice disorders].[用于嗓音障碍主观听觉感知评估的标准语音样本文本设计]
Zhonghua Er Bi Yan Hou Tou Jing Wai Ke Za Zhi. 2010 Sep;45(9):719-22.
9
Commonalities and differences in the neural representations of English, Portuguese, and Mandarin sentences: When knowledge of the brain-language mappings for two languages is better than one.英语、葡萄牙语和汉语句子的神经表征中的共性与差异:当两种语言的脑-语言映射知识比一种语言的更好时。
Brain Lang. 2017 Dec;175:77-85. doi: 10.1016/j.bandl.2017.09.007. Epub 2017 Oct 15.
10
Protocol to perform offline ECoG brain-to-text decoding for natural tonal sentences.用于自然声调句子的离线脑电皮层脑到文本解码协议。
STAR Protoc. 2025 Mar 21;6(1):103650. doi: 10.1016/j.xpro.2025.103650. Epub 2025 Feb 21.

本文引用的文献

1
High-resolution neural recordings improve the accuracy of speech decoding.高分辨率神经记录提高了语音解码的准确性。
Nat Commun. 2023 Nov 6;14(1):6938. doi: 10.1038/s41467-023-42555-1.
2
Neural control of lexical tone production in human laryngeal motor cortex.人类喉 Motor 皮质中词汇音产生的神经控制。
Nat Commun. 2023 Oct 30;14(1):6917. doi: 10.1038/s41467-023-42175-9.
3
A high-performance neuroprosthesis for speech decoding and avatar control.一种用于语音解码和化身控制的高性能神经假体。
Nature. 2023 Aug;620(7976):1037-1046. doi: 10.1038/s41586-023-06443-4. Epub 2023 Aug 23.
4
A high-performance speech neuroprosthesis.高性能言语神经假体
Nature. 2023 Aug;620(7976):1031-1036. doi: 10.1038/s41586-023-06377-x. Epub 2023 Aug 23.
5
Decoding and synthesizing tonal language speech from brain activity.从大脑活动中解码和合成声调语言的语音。
Sci Adv. 2023 Jun 9;9(23):eadh0478. doi: 10.1126/sciadv.adh0478.
6
Generalizable spelling using a speech neuroprosthesis in an individual with severe limb and vocal paralysis.个体严重的肢体和言语瘫痪中使用言语神经假体实现可泛化的拼写
Nat Commun. 2022 Nov 8;13(1):6510. doi: 10.1038/s41467-022-33611-3.
7
Dataset of Speech Production in intracranial.Electroencephalography.颅内脑电图语音产生数据集。
Sci Data. 2022 Jul 22;9(1):434. doi: 10.1038/s41597-022-01542-9.
8
Spelling interface using intracortical signals in a completely locked-in patient enabled via auditory neurofeedback training.通过听觉神经反馈训练,使完全闭锁综合征患者能够使用皮质内信号进行拼写。
Nat Commun. 2022 Mar 22;13(1):1236. doi: 10.1038/s41467-022-28859-8.
9
Real-time synthesis of imagined speech processes from minimally invasive recordings of neural activity.从神经活动的微创记录中实时合成想象中的语音过程。
Commun Biol. 2021 Sep 23;4(1):1055. doi: 10.1038/s42003-021-02578-0.
10
Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria.神经假体用于解码无言语症瘫痪患者的言语。
N Engl J Med. 2021 Jul 15;385(3):217-227. doi: 10.1056/NEJMoa2027540.