使用人类大脑皮层活动实时解码问答式语音对话。

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

机构信息

Department of Neurological Surgery and the Center for Integrative Neuroscience at UC San Francisco, 675 Nelson Rising Lane, San Francisco, CA, 94158, USA.

出版信息

Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.

DOI:10.1038/s41467-019-10994-4

PMID:31363096

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6667454/

Abstract

Natural communication often occurs in dialogue, differentially engaging auditory and sensorimotor brain regions during listening and speaking. However, previous attempts to decode speech directly from the human brain typically consider listening or speaking tasks in isolation. Here, human participants listened to questions and responded aloud with answers while we used high-density electrocorticography (ECoG) recordings to detect when they heard or said an utterance and to then decode the utterance's identity. Because certain answers were only plausible responses to certain questions, we could dynamically update the prior probabilities of each answer using the decoded question likelihoods as context. We decode produced and perceived utterances with accuracy rates as high as 61% and 76%, respectively (chance is 7% and 20%). Contextual integration of decoded question likelihoods significantly improves answer decoding. These results demonstrate real-time decoding of speech in an interactive, conversational setting, which has important implications for patients who are unable to communicate.

摘要

自然交流通常发生在对话中，在听和说的过程中，听觉和运动感觉大脑区域会有不同程度的参与。然而，以前尝试直接从人类大脑中解码言语的方法通常将听或说的任务孤立起来考虑。在这里，人类参与者听问题并大声回答，而我们使用高密度脑电图 (ECoG) 记录来检测他们何时听到或说出一个语句，然后解码该语句的身份。因为某些答案只是对某些问题的合理回答，所以我们可以使用解码的问题可能性作为上下文动态更新每个答案的先验概率。我们分别以高达 61%和 76%的准确率解码产生和感知的语句（机会为 7%和 20%）。解码的问题可能性的上下文集成显著提高了答案的解码。这些结果证明了在交互式对话环境中实时解码言语的能力，这对无法交流的患者具有重要意义。

https://cdn.ncbi.nlm.nih.gov/pmc/blobs/e3aa/6667454/9f783111e049/41467_2019_10994_Fig1_HTML.jpg

相似文献

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.

Decoding spoken phonemes from sensorimotor cortex with high-density ECoG grids.

Neuroimage. 2018 Oct 15;180(Pt A):301-311. doi: 10.1016/j.neuroimage.2017.10.011. Epub 2017 Oct 7.

Coarse behavioral context decoding.

J Neural Eng. 2019 Feb;16(1):016021. doi: 10.1088/1741-2552/aaee9c. Epub 2018 Nov 6.

Three- and four-dimensional mapping of speech and language in patients with epilepsy.

Brain. 2017 May 1;140(5):1351-1370. doi: 10.1093/brain/awx051.

The influence of prior pronunciations on sensorimotor cortex activity patterns during vowel production.

J Neural Eng. 2018 Dec;15(6):066025. doi: 10.1088/1741-2552/aae329. Epub 2018 Sep 21.

Repeated Vowel Production Affects Features of Neural Activity in Sensorimotor Cortex.

Brain Topogr. 2019 Jan;32(1):97-110. doi: 10.1007/s10548-018-0673-4. Epub 2018 Sep 20.

Decoding hand gestures from primary somatosensory cortex using high-density ECoG.

Neuroimage. 2017 Feb 15;147:130-142. doi: 10.1016/j.neuroimage.2016.12.004. Epub 2016 Dec 5.

The use of intracranial recordings to decode human language: Challenges and opportunities.

Brain Lang. 2019 Jun;193:73-83. doi: 10.1016/j.bandl.2016.06.003. Epub 2016 Jul 1.

Activity associated with speech articulation measured through direct cortical recordings.

Brain Lang. 2017 Jun;169:1-7. doi: 10.1016/j.bandl.2017.01.013. Epub 2017 Feb 23.

Neuroprosthesis for Decoding Speech in a Paralyzed Person with Anarthria.

N Engl J Med. 2021 Jul 15;385(3):217-227. doi: 10.1056/NEJMoa2027540.

引用本文的文献

Natural sounds can be reconstructed from human neuroimaging data using deep neural network representation.

PLoS Biol. 2025 Jul 23;23(7):e3003293. doi: 10.1371/journal.pbio.3003293. eCollection 2025 Jul.

Acoustic Inspired Brain-to-Sentence Decoder for Logosyllabic Language.

Cyborg Bionic Syst. 2025 Apr 29;6:0257. doi: 10.34133/cbsystems.0257. eCollection 2025.

VocalMind: A Stereotactic EEG Dataset for Vocalized, Mimed, and Imagined Speech in Tonal Language.

Sci Data. 2025 Apr 19;12(1):657. doi: 10.1038/s41597-025-04741-2.

Multisensory naturalistic decoding with high-density diffuse optical tomography.

Neurophotonics. 2025 Jan;12(1):015002. doi: 10.1117/1.NPh.12.1.015002. Epub 2025 Jan 23.

Transformer-based neural speech decoding from surface and depth electrode signals.

J Neural Eng. 2025 Jan 28;22(1):016017. doi: 10.1088/1741-2552/adab21.

Supplementary motor area in speech initiation: A large-scale intracranial EEG evaluation of stereotyped word articulation.

iScience. 2024 Dec 4;28(1):111531. doi: 10.1016/j.isci.2024.111531. eCollection 2025 Jan 17.

Real-time detection of spoken speech from unlabeled ECoG signals: A pilot study with an ALS participant.

medRxiv. 2024 Sep 22:2024.09.18.24313755. doi: 10.1101/2024.09.18.24313755.

Diffusion model-based image generation from rat brain activity.

PLoS One. 2024 Sep 6;19(9):e0309709. doi: 10.1371/journal.pone.0309709. eCollection 2024.

An Accurate and Rapidly Calibrating Speech Neuroprosthesis.

N Engl J Med. 2024 Aug 15;391(7):609-618. doi: 10.1056/NEJMoa2314132.

Neural Decoding of Spontaneous Overt and Intended Speech.

J Speech Lang Hear Res. 2024 Nov 7;67(11):4216-4225. doi: 10.1044/2024_JSLHR-24-00046. Epub 2024 Aug 6.

本文引用的文献

Differential Representation of Articulatory Gestures and Phonemes in Precentral and Inferior Frontal Gyri.

J Neurosci. 2018 Nov 14;38(46):9803-9813. doi: 10.1523/JNEUROSCI.1206-18.2018. Epub 2018 Sep 26.

The Control of Vocal Pitch in Human Laryngeal Motor Cortex.

Cell. 2018 Jun 28;174(1):21-31.e9. doi: 10.1016/j.cell.2018.05.016.

Encoding of Articulatory Kinematic Trajectories in Human Speech Sensorimotor Cortex.

Neuron. 2018 Jun 6;98(5):1042-1054.e4. doi: 10.1016/j.neuron.2018.04.031. Epub 2018 May 17.

Human Sensorimotor Cortex Control of Directly Measured Vocal Tract Movements during Vowel Production.

J Neurosci. 2018 Mar 21;38(12):2955-2966. doi: 10.1523/JNEUROSCI.2382-17.2018. Epub 2018 Feb 8.

NAPLIB: AN OPEN SOURCE TOOLBOX FOR REAL-TIME AND OFFLINE NEURAL ACOUSTIC PROCESSING.

Proc IEEE Int Conf Acoust Speech Signal Process. 2017 Mar;2017:846-850. doi: 10.1109/ICASSP.2017.7952275. Epub 2017 Jun 19.

Real-time classification of auditory sentences using evoked cortical activity in humans.

J Neural Eng. 2018 Jun;15(3):036005. doi: 10.1088/1741-2552/aaab6f. Epub 2018 Jan 30.

Semi-automated Anatomical Labeling and Inter-subject Warping of High-Density Intracranial Recording Electrodes in Electrocorticography.

Front Neuroinform. 2017 Oct 31;11:62. doi: 10.3389/fninf.2017.00062. eCollection 2017.

Chronic ambulatory electrocorticography from human speech cortex.

Neuroimage. 2017 Jun;153:273-282. doi: 10.1016/j.neuroimage.2017.04.008. Epub 2017 Apr 7.

High performance communication by people with paralysis using an intracortical brain-computer interface.

Elife. 2017 Feb 21;6:e18554. doi: 10.7554/eLife.18554.

Functional and Quantitative MRI Mapping of Somatomotor Representations of Human Supralaryngeal Vocal Tract.

Cereb Cortex. 2017 Jan 1;27(1):265-278. doi: 10.1093/cercor/bhw393.

文献AI研究员

20分钟写一篇综述，助力文献阅读效率提升50倍。

立即体验

用中文搜PubMed

大模型驱动的PubMed中文搜索引擎

马上搜索

文档翻译

学术文献翻译模型，支持多种主流文档格式。

立即体验

使用人类大脑皮层活动实时解码问答式语音对话。

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

机构信息

Department of Neurological Surgery and the Center for Integrative Neuroscience at UC San Francisco, 675 Nelson Rising Lane, San Francisco, CA, 94158, USA.

出版信息

Nat Commun. 2019 Jul 30;10(1):3096. doi: 10.1038/s41467-019-10994-4.

DOI:10.1038/s41467-019-10994-4

PMID:31363096

原文链接:https://pmc.ncbi.nlm.nih.gov/articles/PMC6667454/

Abstract

摘要

使用人类大脑皮层活动实时解码问答式语音对话。

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献

文献AI研究员

用中文搜PubMed

文档翻译

Suppr 超能文献

使用人类大脑皮层活动实时解码问答式语音对话。

Real-time decoding of question-and-answer speech dialogue using human cortical activity.

机构信息

出版信息

相似文献

引用本文的文献

本文引用的文献