Martin Stéphanie, Brunner Peter, Holdgraf Chris, Heinze Hans-Jochen, Crone Nathan E, Rieger Jochem, Schalk Gerwin, Knight Robert T, Pasley Brian N
Helen Wills Neuroscience Institute, University of California, Berkeley, CA, USA; Department of Bioengineering, École Polytechnique Fédérale de Lausanne, Lausanne, Switzerland.
New York State Department of Health, Wadsworth Center, Albany, NY, USA; Department of Neurology, Albany Medical College, Albany, NY, USA.
Front Neuroeng. 2014 May 27;7:14. doi: 10.3389/fneng.2014.00014. eCollection 2014.
Auditory perception and auditory imagery have been shown to activate overlapping brain regions. We hypothesized that these phenomena also share a common underlying neural representation. To assess this, we used intracranial electrocorticography (ECoG) recordings from epilepsy patients performing an overt (out-loud) or a covert (silent) reading task. In these tasks, short stories scrolled across a video screen in two conditions: subjects read the same stories both aloud (overt) and silently (covert). In a control condition, subjects remained in a resting state. We first built a high gamma (70-150 Hz) neural decoding model to reconstruct spectrotemporal auditory features of self-generated overt speech. We then evaluated whether this same model could reconstruct auditory speech features in the covert speech condition. Two speech feature spaces were tested: a spectrogram and a modulation-based representation. For the overt condition, reconstruction accuracy was evaluated as the correlation between original and predicted speech features, and was significant in each subject (p < 10⁻⁵; paired two-sample t-test). For the covert speech condition, dynamic time warping was first used to realign the covert speech reconstruction with the corresponding original speech from the overt condition. Reconstruction accuracy was then evaluated as the correlation between original and reconstructed speech features, and was compared to the accuracy obtained from reconstructions in the baseline control condition. Reconstruction accuracy for the covert condition was significantly better than for the control condition (p < 0.005; paired two-sample t-test). The superior temporal gyrus and the pre- and postcentral gyri provided the most reconstruction information. The relationship between overt and covert speech reconstruction depended on anatomy.
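The evaluation described above scores a reconstruction by correlating original and predicted speech features. A minimal sketch of that scoring step, using synthetic stand-in data rather than the authors' spectrograms or code, might look like this (the array shapes and the per-frequency averaging convention are assumptions for illustration):

```python
# Sketch: reconstruction accuracy as the correlation between original and
# predicted spectrogram features. All data here are synthetic stand-ins.
import numpy as np
from scipy.stats import pearsonr

rng = np.random.default_rng(0)

n_times, n_freqs = 1000, 32  # hypothetical spectrogram shape (time x frequency)
original = rng.standard_normal((n_times, n_freqs))
# a "predicted" spectrogram: the original plus noise, so it is partly correlated
predicted = original + 0.8 * rng.standard_normal((n_times, n_freqs))

# correlate each frequency channel over time, then average across channels
r_per_freq = np.array([pearsonr(original[:, f], predicted[:, f])[0]
                       for f in range(n_freqs)])
mean_r = r_per_freq.mean()
```

In a real analysis, per-subject accuracies like `mean_r` would then be compared against a null or control distribution (here, the resting-state condition) with a paired test.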
These results provide evidence that auditory representations of covert speech can be reconstructed from models that are built from an overt speech data set, supporting a partially shared neural substrate.
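The dynamic time warping step used to realign covert reconstructions with the overt reference can be sketched with a textbook dynamic-programming DTW on 1-D sequences. This is not the authors' implementation, and the signals below are synthetic stand-ins for the overt reference and the covert reconstruction:

```python
# Sketch: classic DTW alignment of two 1-D sequences (synthetic data).
import numpy as np

def dtw_path(x, y):
    """Return the DTW warping path between 1-D sequences x and y
    as a list of index pairs (i, j)."""
    n, m = len(x), len(y)
    cost = np.full((n + 1, m + 1), np.inf)
    cost[0, 0] = 0.0
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            d = abs(x[i - 1] - y[j - 1])  # local distance
            cost[i, j] = d + min(cost[i - 1, j],      # insertion
                                 cost[i, j - 1],      # deletion
                                 cost[i - 1, j - 1])  # match
    # trace the optimal path back from (n, m) to (1, 1)
    path, i, j = [], n, m
    while i > 0 and j > 0:
        path.append((i - 1, j - 1))
        step = np.argmin([cost[i - 1, j - 1], cost[i - 1, j], cost[i, j - 1]])
        if step == 0:
            i, j = i - 1, j - 1
        elif step == 1:
            i -= 1
        else:
            j -= 1
    return path[::-1]

# hypothetical example: realign a delayed copy onto a reference signal
t = np.linspace(0, 2 * np.pi, 50)
overt = np.sin(t)            # stands in for the overt-condition reference
covert = np.sin(t - 0.5)     # stands in for the covert reconstruction
path = dtw_path(overt, covert)
aligned = covert[[j for _, j in path]]  # covert samples warped to overt time
```

After warping, the realigned sequence can be correlated with the reference exactly as in the overt-condition evaluation.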