• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

一种用于解码自然声调句子的脑到文本框架。

A brain-to-text framework for decoding natural tonal sentences.

机构信息

Department of Neurosurgery, Huashan Hospital, Shanghai Medical College, Fudan University, Shanghai 200040, China; Shanghai Key Laboratory of Brain Function Restoration and Neural Regeneration, Shanghai 200040, China; National Center for Neurological Disorders, Huashan Hospital, Shanghai Medical College, Fudan University, Shanghai 200040, China.

School of Biomedical Engineering, ShanghaiTech University, Shanghai 201210, China; State Key Laboratory of Advanced Medical Materials and Devices, ShanghaiTech University, Shanghai 201210, China.

出版信息

Cell Rep. 2024 Nov 26;43(11):114924. doi: 10.1016/j.celrep.2024.114924. Epub 2024 Oct 31.

DOI:10.1016/j.celrep.2024.114924
PMID:39485790
Abstract

Speech brain-computer interfaces (BCIs) directly translate brain activity into speech sound and text. Despite successful applications in non-tonal languages, the distinct syllabic structures and pivotal lexical information conveyed through tonal nuances present challenges in BCI decoding for tonal languages like Mandarin Chinese. Here, we designed a brain-to-text framework to decode Mandarin sentences from invasive neural recordings. Our framework dissects speech onset, base syllables, and lexical tones, integrating them with contextual information through Bayesian likelihood and a Viterbi decoder. The results demonstrate accurate tone and syllable decoding during naturalistic speech production. The overall word error rate (WER) for 10 offline-decoded tonal sentences with a vocabulary of 40 high-frequency Chinese characters is 21% (chance: 95.3%) averaged across five participants, and tone decoding accuracy reaches 93% (chance: 25%), surpassing previous intracranial Mandarin tonal syllable decoders. This study provides a robust and generalizable approach for brain-to-text decoding of continuous tonal speech sentences.

摘要

语音脑-机接口(BCI)直接将脑活动转化为语音和文本。尽管在非声调语言中成功应用,但声调语言(如普通话)独特的音节结构和通过声调细微差别传达的关键词汇信息给 BCI 解码带来了挑战。在这里,我们设计了一个从侵入性神经记录中解码汉语句子的脑到文本框架。我们的框架分解了语音起始、基础音节和词汇声调,并通过贝叶斯似然和维特比解码器将它们与上下文信息结合起来。结果表明,在自然语言产生过程中可以准确地解码声调。在五位参与者的平均情况下,对词汇量为 40 个高频汉字的 10 个离线解码声调句子的整体单词错误率(WER)为 21%(机会:95.3%),声调解码准确率达到 93%(机会:25%),超过了以前的颅内普通话声调音节解码器。这项研究为连续声调语音句子的脑到文本解码提供了一种稳健且可推广的方法。

相似文献

1
A brain-to-text framework for decoding natural tonal sentences.一种用于解码自然声调句子的脑到文本框架。
Cell Rep. 2024 Nov 26;43(11):114924. doi: 10.1016/j.celrep.2024.114924. Epub 2024 Oct 31.
2
Protocol to perform offline ECoG brain-to-text decoding for natural tonal sentences.用于自然声调句子的离线脑电皮层脑到文本解码协议。
STAR Protoc. 2025 Mar 21;6(1):103650. doi: 10.1016/j.xpro.2025.103650. Epub 2025 Feb 21.
3
Decoding and synthesizing tonal language speech from brain activity.从大脑活动中解码和合成声调语言的语音。
Sci Adv. 2023 Jun 9;9(23):eadh0478. doi: 10.1126/sciadv.adh0478.
4
Acoustic Inspired Brain-to-Sentence Decoder for Logosyllabic Language.用于标识音节语言的声学启发式脑到句子解码器
Cyborg Bionic Syst. 2025 Apr 29;6:0257. doi: 10.34133/cbsystems.0257. eCollection 2025.
5
Decoding lexical tones and vowels in imagined tonal monosyllables using fNIRS signals.使用功能近红外光谱(fNIRS)信号解码想象中的单音节声调中的声调与元音。
J Neural Eng. 2022 Nov 10;19(6). doi: 10.1088/1741-2552/ac9e1d.
6
Electrophysiological evidence for the integral nature of tone in Mandarin spoken word recognition.电生理证据表明,汉语普通话语音识别中,声调具有整体性。
Neuropsychologia. 2019 Aug;131:325-332. doi: 10.1016/j.neuropsychologia.2019.05.031. Epub 2019 Jun 8.
7
A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.一种由两种语言之间共享的皮质发音表征驱动的双语言语神经假体。
Nat Biomed Eng. 2024 Aug;8(8):977-991. doi: 10.1038/s41551-024-01207-5. Epub 2024 May 20.
8
VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.VocalMind:一个用于有声、哑剧和想象中的声调语言语音的立体定向脑电图数据集。
Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.
9
Brain2Char: a deep architecture for decoding text from brain recordings.脑到字符:一种从脑记录中解码文本的深度架构。
J Neural Eng. 2020 Dec 16;17(6). doi: 10.1088/1741-2552/abc742.
10
Neural correlates of intonation and lexical tone in tonal and non-tonal language speakers.语调与音高在声调语言与非声调语言使用者大脑中的神经关联。
Hum Brain Mapp. 2020 May;41(7):1842-1858. doi: 10.1002/hbm.24916. Epub 2020 Jan 20.

引用本文的文献

1
Improved evaluation of waveform reconstruction in speech decoding based on invasive brain-computer interfaces.基于侵入式脑机接口的语音解码中波形重建的改进评估
Imaging Neurosci (Camb). 2025 Sep 10;3. doi: 10.1162/IMAG.a.146. eCollection 2025.
2
Protocol to perform offline ECoG brain-to-text decoding for natural tonal sentences.用于自然声调句子的离线脑电皮层脑到文本解码协议。
STAR Protoc. 2025 Mar 21;6(1):103650. doi: 10.1016/j.xpro.2025.103650. Epub 2025 Feb 21.