• 文献检索
  • 文档翻译
  • 深度研究
  • 学术资讯
  • Suppr Zotero 插件Zotero 插件
  • 邀请有礼
  • 套餐&价格
  • 历史记录
应用&插件
Suppr Zotero 插件Zotero 插件浏览器插件Mac 客户端Windows 客户端微信小程序
定价
高级版会员购买积分包购买API积分包
服务
文献检索文档翻译深度研究API 文档MCP 服务
关于我们
关于 Suppr公司介绍联系我们用户协议隐私条款
关注我们

Suppr 超能文献

核心技术专利:CN118964589B侵权必究
粤ICP备2023148730 号-1Suppr @ 2026

文献检索

告别复杂PubMed语法,用中文像聊天一样搜索,搜遍4000万医学文献。AI智能推荐,让科研检索更轻松。

立即免费搜索

文件翻译

保留排版,准确专业,支持PDF/Word/PPT等文件格式,支持 12+语言互译。

免费翻译文档

深度研究

AI帮你快速写综述,25分钟生成高质量综述,智能提取关键信息,辅助科研写作。

立即免费体验

利用功能性言语运动皮层的信号对所有美式英语音素进行直接分类。

Direct classification of all American English phonemes using signals from functional speech motor cortex.

作者信息

Mugler Emily M, Patton James L, Flint Robert D, Wright Zachary A, Schuele Stephan U, Rosenow Joshua, Shih Jerry J, Krusienski Dean J, Slutzky Marc W

机构信息

Bioengineering, University of Illinois at Chicago, 851 S. Morgan Street, Chicago, IL 60607, USA.

出版信息

J Neural Eng. 2014 Jun;11(3):035015. doi: 10.1088/1741-2560/11/3/035015. Epub 2014 May 19.

DOI:10.1088/1741-2560/11/3/035015
PMID:24836588
原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC4097188/
Abstract

OBJECTIVE

Although brain-computer interfaces (BCIs) can be used in several different ways to restore communication, communicative BCI has not approached the rate or efficiency of natural human speech. Electrocorticography (ECoG) has precise spatiotemporal resolution that enables recording of brain activity distributed over a wide area of cortex, such as during speech production. In this study, we sought to decode elements of speech production using ECoG.

APPROACH

We investigated words that contain the entire set of phonemes in the general American accent using ECoG with four subjects. Using a linear classifier, we evaluated the degree to which individual phonemes within each word could be correctly identified from cortical signal.

MAIN RESULTS

We classified phonemes with up to 36% accuracy when classifying all phonemes and up to 63% accuracy for a single phoneme. Further, misclassified phonemes follow articulation organization described in phonology literature, aiding classification of whole words. Precise temporal alignment to phoneme onset was crucial for classification success.

SIGNIFICANCE

We identified specific spatiotemporal features that aid classification, which could guide future applications. Word identification was equivalent to information transfer rates as high as 3.0 bits s(-1) (33.6 words min(-1)), supporting pursuit of speech articulation for BCI control.

摘要

目的

尽管脑机接口(BCI)可以通过多种不同方式用于恢复交流,但用于交流的BCI尚未达到自然人类言语的速度或效率。皮层脑电图(ECoG)具有精确的时空分辨率,能够记录分布在广泛皮质区域的脑活动,例如在言语产生过程中。在本研究中,我们试图使用ECoG解码言语产生的要素。

方法

我们使用ECoG对四名受试者进行研究,调查包含通用美国口音中所有音素集的单词。使用线性分类器,我们评估了每个单词内各个音素能够从皮质信号中被正确识别的程度。

主要结果

对所有音素进行分类时,我们对音素的分类准确率高达36%,对单个音素的分类准确率高达63%。此外,错误分类的音素遵循语音学文献中描述的发音组织,有助于对整个单词进行分类。与音素起始的精确时间对齐对分类成功至关重要。

意义

我们识别出有助于分类的特定时空特征,这可以指导未来的应用。单词识别相当于高达3.0比特每秒(33.6个单词每分钟)的信息传输速率,支持为实现BCI控制而对言语发音进行的探索。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/581f21a99d90/nihms-600070-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/537f3f3d9284/nihms-600070-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/17ca303e0b81/nihms-600070-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/51d0f1b051e0/nihms-600070-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/02a2638b30a5/nihms-600070-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/3c9df29d6183/nihms-600070-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/581f21a99d90/nihms-600070-f0006.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/537f3f3d9284/nihms-600070-f0001.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/17ca303e0b81/nihms-600070-f0002.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/51d0f1b051e0/nihms-600070-f0003.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/02a2638b30a5/nihms-600070-f0004.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/3c9df29d6183/nihms-600070-f0005.jpg
https://cdn.ncbi.nlm.nih.gov/pmc/blobs/fa25/4097188/581f21a99d90/nihms-600070-f0006.jpg

相似文献

1
Direct classification of all American English phonemes using signals from functional speech motor cortex.利用功能性言语运动皮层的信号对所有美式英语音素进行直接分类。
J Neural Eng. 2014 Jun;11(3):035015. doi: 10.1088/1741-2560/11/3/035015. Epub 2014 May 19.
2
Cortical encoding of phonemic context during word production.单词生成过程中语音语境的皮层编码。
Annu Int Conf IEEE Eng Med Biol Soc. 2014;2014:6790-3. doi: 10.1109/EMBC.2014.6945187.
3
Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids.利用高密度 ECoG 网格从感觉运动皮层解码口语音素。
Neuroimage. 2018 Oct 15;180(Pt A):301-311. doi: 10.1016/j.neuroimage.2017.10.011. Epub 2017 Oct 7.
4
Effective Phoneme Decoding With Hyperbolic Neural Networks for High-Performance Speech BCIs.基于双曲神经网络的高效语音音素解码在高性能语音脑机接口中的应用。
IEEE Trans Neural Syst Rehabil Eng. 2024;32:3432-3441. doi: 10.1109/TNSRE.2024.3457313. Epub 2024 Sep 18.
5
Improved phoneme-based myoelectric speech recognition.基于音素的改进型肌电语音识别。
IEEE Trans Biomed Eng. 2009 Aug;56(8):2016-23. doi: 10.1109/TBME.2009.2024079. Epub 2009 Jun 16.
6
Syllable-based speech recognition using EMG.基于肌电图的音节语音识别。
Annu Int Conf IEEE Eng Med Biol Soc. 2010;2010:4699-702. doi: 10.1109/IEMBS.2010.5626426.
7
Myoelectric signal classification for phoneme-based speech recognition.用于基于音素的语音识别的肌电信号分类
IEEE Trans Biomed Eng. 2007 Apr;54(4):694-9. doi: 10.1109/TBME.2006.889175.
8
EEG classification in a single-trial basis for vowel speech perception using multivariate empirical mode decomposition.基于多变量经验模态分解的单次试验元音语音感知脑电分类
J Neural Eng. 2014 Jun;11(3):036010. doi: 10.1088/1741-2560/11/3/036010. Epub 2014 May 8.
9
Speech sound classification and detection of articulation disorders with support vector machines and wavelets.基于支持向量机和小波的语音分类及构音障碍检测
Conf Proc IEEE Eng Med Biol Soc. 2006;2006:2199-202. doi: 10.1109/IEMBS.2006.259499.
10
Generalizing neural signal-to-text brain-computer interfaces.推广神经信号到文本的脑机接口。
Biomed Phys Eng Express. 2021 Apr 30;7(3). doi: 10.1088/2057-1976/abf6ab.

引用本文的文献

1
A streaming brain-to-voice neuroprosthesis to restore naturalistic communication.一种用于恢复自然交流的流式脑到语音神经假体。
Nat Neurosci. 2025 Apr;28(4):902-912. doi: 10.1038/s41593-025-01905-6. Epub 2025 Mar 31.
2
Convergent vocal representations in parrot and human forebrain motor networks.鹦鹉和人类前脑运动网络中的趋同语音表征。
Nature. 2025 Apr;640(8058):427-434. doi: 10.1038/s41586-025-08695-8. Epub 2025 Mar 19.
3
Decoding speech intent from non-frontal cortical areas.从非额叶皮质区域解码言语意图。
J Neural Eng. 2025 Feb 13;22(1):016024. doi: 10.1088/1741-2552/adaa20.
4
Transcranial Magnetic Stimulation Facilitates Neural Speech Decoding.经颅磁刺激促进神经语音解码。
Brain Sci. 2024 Sep 2;14(9):895. doi: 10.3390/brainsci14090895.
5
Cortical sites critical to language function act as connectors between language subnetworks.对语言功能至关重要的皮质位点充当语言子网之间的连接点。
Nat Commun. 2024 Sep 16;15(1):7897. doi: 10.1038/s41467-024-51839-z.
6
Iterative alignment discovery of speech-associated neural activity.语音相关神经活动的迭代对齐发现。
J Neural Eng. 2024 Aug 28;21(4):046056. doi: 10.1088/1741-2552/ad663c.
7
An Accurate and Rapidly Calibrating Speech Neuroprosthesis.一种精确且快速校准的语音神经假体。
N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.
8
Decoding imagined speech with delay differential analysis.利用延迟微分分析解码想象中的言语。
Front Hum Neurosci. 2024 May 17;18:1398065. doi: 10.3389/fnhum.2024.1398065. eCollection 2024.
9
A bilingual speech neuroprosthesis driven by cortical articulatory representations shared between languages.一种由两种语言之间共享的皮质发音表征驱动的双语言语神经假体。
Nat Biomed Eng. 2024 Aug;8(8):977-991. doi: 10.1038/s41551-024-01207-5. Epub 2024 May 20.
10
The speech neuroprosthesis.言语神经假体。
Nat Rev Neurosci. 2024 Jul;25(7):473-492. doi: 10.1038/s41583-024-00819-9. Epub 2024 May 14.

本文引用的文献

1
Functional organization of human sensorimotor cortex for speech articulation.人类运动言语感知皮层的功能组织。
Nature. 2013 Mar 21;495(7441):327-32. doi: 10.1038/nature11911. Epub 2013 Feb 20.
2
An electrocorticographic brain interface in an individual with tetraplegia.一名四肢瘫痪患者的脑电皮层接口。
PLoS One. 2013;8(2):e55344. doi: 10.1371/journal.pone.0055344. Epub 2013 Feb 6.
3
Spontaneously reactivated patterns in frontal and temporal lobe predict semantic clustering during memory search.额叶和颞叶中自发重新激活的模式预测记忆搜索过程中的语义聚类。
J Neurosci. 2012 Jun 27;32(26):8871-8. doi: 10.1523/JNEUROSCI.5321-11.2012.
4
Accurate decoding of reaching movements from field potentials in the absence of spikes.在没有尖峰的情况下,从场电位中准确解码运动。
J Neural Eng. 2012 Aug;9(4):046006. doi: 10.1088/1741-2560/9/4/046006. Epub 2012 Jun 25.
5
Local field potentials allow accurate decoding of muscle activity.局部场电位可实现肌肉活动的精确解码。
J Neurophysiol. 2012 Jul;108(1):18-24. doi: 10.1152/jn.00832.2011. Epub 2012 Apr 11.
6
Decoding vowels and consonants in spoken and imagined words using electrocorticographic signals in humans.使用人类脑电信号解码口语和想象单词中的元音和辅音。
J Neural Eng. 2011 Aug;8(4):046028. doi: 10.1088/1741-2560/8/4/046028. Epub 2011 Jul 13.
7
Classification of intended phoneme production from chronic intracortical microelectrode recordings in speech-motor cortex.根据言语运动皮层慢性皮层内微电极记录对预期音素产生进行分类。
Front Neurosci. 2011 May 12;5:65. doi: 10.3389/fnins.2011.00065. eCollection 2011.
8
Decoding the rat forelimb movement direction from epidural and intracortical field potentials.从硬膜外和皮质内场电位解码大鼠前肢运动方向。
J Neural Eng. 2011 Jun;8(3):036013. doi: 10.1088/1741-2560/8/3/036013. Epub 2011 Apr 21.
9
Using the electrocorticographic speech network to control a brain-computer interface in humans.利用脑电言语网络控制人类脑机接口
J Neural Eng. 2011 Jun;8(3):036004. doi: 10.1088/1741-2560/8/3/036004. Epub 2011 Apr 7.
10
A high-speed BCI based on code modulation VEP.基于码调制 VEP 的高速脑机接口。
J Neural Eng. 2011 Apr;8(2):025015. doi: 10.1088/1741-2560/8/2/025015. Epub 2011 Mar 24.